Changes

Jump to: navigation, search

Notes from Open Source Discovery Portal Camp

28 bytes added, 01:13, 26 February 2009
solr marc
=== Non-catalog content / digital repositories ===
Could we adapt [[SolrMARC ]] to also include [[SolrOAI]]? Yes, Bob, Naomi and Andrew all have ideas about how this could work. Sounds like this is the kernel of our kernel. [[Solr ]] already has a lot of functionality to allow for this. Do we want a couple of plugins, one for solr and one for [[OAI]]? Or do we want an app that handles both?
Lots of little data silos aren't going to work, we need everything in a local catalog. But that doesn't mean we should all try to be google. We still need well-defined collection development policies.
What about social data? [[SoPAC ]] is neat, and has an independent layer for saving social data.
We also talked about [[Blacklight ]] and the ways it brings in various data sources and handles behavior for different kinds of objects, e.g., [http://musicbrainz.org/ MusicBrainz] data for music items.
(interested in further development: Bob, Dennis, Peter, Naomi, Bess)
Q: How well is solr marc handling bad data these days?
Bob: I've been adding to [[marc4j ]] more permissive reading and error correction. It's also reporting errors as it finds them, to make it easier to find bad records. Request for writing to log files instead of standard out. How to handle records with bad leaders? Naomi has some marc test data. We need more test driven development.
Naomi is offering code for parsing OCLC numbers and LC numbers, she'll be working with Bob next week to get that into solrmarc.

Navigation menu