Changes

Jump to: navigation, search

Working with MARC

5,562 bytes added, 22:26, 7 November 2023
Syntax highlight for bat command
== Working MARC stands for Machine Readable Cataloging, and many folks in the code4lib community find themselves working with Marc ==MARC records at some point. This page is meant to be a round-up of the tools for working with MARC. If you want a general introduction to the standard, [http://en.wikipedia.org/wiki/MARC_standards the Wikipedia article] is a good place to start. MARC data is usually expressed either in ISO 2709 ("binary") form or MARCXML form.
MaRC stands for Machine Readable Cataloging, and many folks in the code4lib community find themselves working with marc records at some point. This page is meant to be a round-up of the tools for working with marc. If you want a general introduction to the standard, [http://en.wikipedia.org/wiki/MARC_standards the wikipedia article] is a good place to start.__TOC__
=== Desktop tools ===
MarcEdit http://people.oregonstate.edu/~reeset/marcedit/html/index.php
=== Getting Marc Indexed for Search Engines =Reference information ==
==== * [http://www.loc.gov/marc/ Library of Congress MARC in Solr ====Standards Pages] (covers MARC21; formerly USMARC)* [http://www.bl.uk/bibliographic/ukmarc.html British Library UKMARC Pages]* [http://archive.ifla.org/VI/3/p1996-1/sec-uni.htm IFLA UNIMARC Pages]* [http://www.oclc.org/content/bibformats/en.html OCLC MARC Pages]
SolrMarc http://code.google.com/p/solrmarc/== Desktop tools ==
Solr * [http://lucenemarcedit.apachereeset.orgnet/solrMarcEdit]
==== Marc in Zebra ====* [http://csharpmarc.net/ C# MARC Editor]: is a simple and light weight MARC Editor for Windows
Getting Started with Zebra * [http://wikiwww.code4libauto-graphics.orgcom/index.phpdownload/Getting_Started_with_ZebraSHOWMARC.EXE Showmarc]: is a DOS program that will show all the MARC fields used and how many times each is used.
Zebra * [http://wwwrossjohnson.zebrahomemail.com.au/MARCRTP/ MARC Record Translation Program] (MARC RTP) is a command line utility that shows fields and subfields used in a collection of MARC records and then converts, and selectively imports, into databases built with general-purpose applications.
=== Marc Programming Libraries ===* The [http://www.loc.gov/marc/marc-functional-analysis/tool.html FRBR Display Tool] takes a file of MARC records and creats XML and HTML files arranged using the Functional Requirements for Bibliographic Records principles.
==== Java ====* [http://mysite.du.edu/~ttyler/freeware/marcxgen.htm MarcXGen] is a Marc URL extractor and HTML generator. Useful for link checking MARC records.
marc4j * [http://marc4jwww.tigrisloc.orggov/marc/makrbrkr.html MARCMaker and MARCBreaker] are DOS programs by the Library of Congress for converting MARC records to a text format and back.
==== Perl ====* [http://www.bl.uk/bibliographic/usemarcon.html USEMARCON] is a multi-platform rule-based MARC record manipulation program. It is a command-line utility but there is also a GUI for it.
You can find many packages for working with == Getting Marc in Perl of them at CPAN: http://search.cpan.org/search?queryIndexed for Search Engines =marc&mode=all. Most of them fall under the umbrella of the MARC/Perl project on Sourceforge: http://marcpm.sourceforge.net/.
==== PHP =MARC in Solr ===
File_MARC (Pear package) http* SolrMarc https://peargithub.php.net/packagecom/File_MARCsolrmarc/solrmarc
==== Python ====* Solr http://lucene.apache.org/solr
pymarc * Catmandu http://pypi.pythonlibrecat.org/pypi/pymarc(provides also loading into ElasticSearch, MongoDB and others)
==== Ruby =MARC in Zebra ===
ruby-marc * Getting Started with Zebra http://rubyforgewiki.code4lib.org/projects/marcindex.php/Getting_Started_with_Zebra
* Zebra http://wikiwww.code4libindexdata.orgcom/index.php/Ruby-marc - some notes and recipes for processing MARC files in rubyzebra
=== Getting Sample Data ===
One common question is where to get sample marc records for testing or playing around with. If you work at a library, chances are good that you can get some records out of your ILS (go ask your systems librarian if you don't know how to do this yourself). If you don't work in a library, you can get marc records from the internet archive at [http://www.archive.org/details/marcrecords http://www.archive.org/details/marcrecords]. == MARC Programming Libraries ==
{| class="wikitable sortable"|-valign="top"! Project !! Language !! class="unsortable" | Links !! class="unsortable" | Notes|-valign="top"| MARC4J || Java || http://marc4j.tigris.org/ || |-valign="top"| javamarc || Java || http://github.com/billdueber/javamarc || Fork of MARC4J|-valign="top"| MARC/Perl || Perl || http://marcpm.sf.net || Umbrella project; see also [http://search.cpan.org/search?query=marc&mode=all CPAN]|-valign="top"| pymarc || Python || http://github.com/edsu/pymarc/ || |-valign="top"| File_MARC || PHP || http://pear.php.net/package/File_MARC/ || PEAR package; sanctioned fork of PHP-MARC|-valign="top"| PHP-MARC || PHP || http://www.emilda.org/index.php?q=php-marc || Abandoned(?); served as basis for File_MARC|-valign="top"| ruby-marc || Ruby || http://rubyforge.org/projects/marc/ <br/> http://wiki.code4lib.org/index.php/Ruby-marc |||-valign="top" | enhanced-marc || Ruby || http://github.com/rsinger/enhanced-marc || Convenience methods for ruby-marc|-valign="top" | marc21 || Scheme || http://code.google.com/p/marc21 |||-valign="top"| marcerl || Erlang || svn://pubserv.oclc.org/marcerl|| Very alpha code|-valign="top"| Scala-MARC || Scala || http://github.com/achelous/Scala-MARC || |-valign="top"| MARC Library (SobekCM) || C# || http://sourceforge.net/projects/marclibrary/ || Implemented in .NET 4.0 with LINQ and streams with Z39.50 support|-valign="top"| CSharp MARC || C# || http://csharpmarc.net || Based upon File_MARC Pear packaged for PHP, but restyled for use in .NET|-valign="top"| MARC.NET || C# || http://github.com/willkurt/MARC.NET || basic start, not thoroughly 'real world' tested|-valign="top"| marc_record.js || JavaScript || http://www.pusc.it/bib/mel/marc_record.js (dead link) || Part of [http://www.pusc.it/bib/mel/ MARC Editor Lite] (dead link)|-valign="top"| marcjs || JavaScript (node) || https://github.com/fredericd/marcjs || |-valign="top"| USEMARCON || C++ || http://www.nationallibrary.fi/libraries/format/usemarcon.html || A rule-based MARC record conversion library|-valign="top"| clj-marc || Clojure || http://github.com/phochste/clj-marc || Basic MARC21 and Aleph500 sequential export parser|-valign="top"| MARC4J.Net || C# || https://github.com/mxurshid/MARC4J.Net || https://www.nuget.org/packages/MARC4J.Net|-valian="top"| marc4js || JavaScript (Node.js) || https://github.com/jiaola/marc4js || Read/transform/write records with Node stream api. Handles MARC8 and UTF8.|-valign="top"| marcli || Go || https://github.com/hectorcorrea/marcli || Command line to parse MARC files|-valign="top"| fml || Go || https://github.com/MITLibraries/fml || MARC 21 library for Go |} A feed of commit messages and release announcements from many of the projects listed above can be found at http://pipes.yahoo.com/gmcharlt/marctoolchanges. == Utilities and Frameworks =={| class="wikitable sortable"|-valign="top"! Project !! Language !! class="unsortable" | Links !! class="unsortable" | Notes|-valign="top"| MarcXimiL || Python || http://marcximil.sourceforge.net/ || Bibliographic Similarity Analysis Framework |-valign="top"| Catmandu || Perl || http://librecat.org || An ETL-framework to extract, transform and load MARC (and other formats) from/to various databases, indexes|-valign="top"| Syntax highlight for bat command || YAML || https://gist.github.com/pabloab/0a532be47e5cf9e12e396cd79d7feabc || Sublime syntax for MARC line mode files (yaz-marcdump), used by syntect and therefore by [https://github.com/sharkdp/bat bat] |} == Getting Sample Data == One common question is where to get sample MARC records for testing or playing around with. If you work at a library, chances are good that you can get some records out of your ILS (go ask your systems librarian if you don't know how to do this yourself). If you don't work in a library, you can get [http://www.archive.org/details/ol_data MARC bibliographic records from the Internet Archive]. You can also get [http://www.hathitrust.org/data MARCXML data for titles in HathiTrust through OAI-PMH]. There is a nascent movement within the code4lib community to establish a test set of problematic marc MARC records, especially records that are representative of the kinds of weirdness that is encountered in real libraries. It is hoped that this could eventually become a test corpus against which to run various marc MARC processing implementations. For more information, watch [http://www.archive.org/details/MARCTHULU Simon Spero's excellent talk from Code4LibCon 2010]. MARC records for authority data are more common. The [http://www.getty.edu/research/conducting_research/vocabularies/download.html Getty Vocabularies] makes both the The Art & Architecture Thesaurus (AAT) and The Union List of Artist Names (ULAN) freely available. The [http://www.library.northwestern.edu/public/gsafd/ Guidelines On Subject Access To Individual Works Of Fiction, Drama, Etc.] records are available from Northwestern University. The [http://www.nlm.nih.gov/mesh/filelist.html Medical Subject Headings (MeSH)] are available in many formats, one of them being MARC. == Reporting on How MARC Has Been Used == [http://experimental.worldcat.org/marcusage/ MARC Usage in WorldCat] - A site that reports on how MARC has been used within the 300 million record WorldCat database
3
edits

Navigation menu