Difference between revisions of "HAMR: Human/Authority Metadata Reconciliation"
From Code4Lib
Revision as of 15:58, 7 February 2011
HAMR: Human/Authority Metadata Reconciliation
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle
A tool that helps a curator determine whether the fields of a metadata record are correct. HAMR takes a metadata record, locates any identifiers (e.g., DOI, PMID), and retrieves a copy of the record from an authoritative source (e.g., CrossRef, PubMed). It then displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded by how well it matches, so the curator can quickly identify discrepancies.
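The comparison and color-coding step could be sketched roughly as follows. This is only an illustration under stated assumptions: the thresholds, the color names, and the function names (`normalize`, `similarity`, `color_for`, `compare_records`) are all hypothetical, and records are assumed to be flat dicts mapping a field name to a single string.

```python
from difflib import SequenceMatcher

# Hypothetical thresholds: the page does not say how match quality maps to colors.
GREEN_THRESHOLD = 0.95
YELLOW_THRESHOLD = 0.70


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(value.lower().split())


def similarity(a, b):
    """Return a similarity ratio in [0, 1] using basic string matching."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def color_for(score):
    """Map a similarity score to a display color."""
    if score >= GREEN_THRESHOLD:
        return "green"
    if score >= YELLOW_THRESHOLD:
        return "yellow"
    return "red"


def compare_records(local, authority):
    """Compare every field that appears in either record.

    Returns {field: (score, color)}. A field missing from one side
    is reported with score 0.0 and color "red".
    """
    report = {}
    for field in sorted(set(local) | set(authority)):
        ours = local.get(field)
        theirs = authority.get(field)
        if ours is None or theirs is None:
            report[field] = (0.0, "red")
            continue
        score = similarity(ours, theirs)
        report[field] = (score, color_for(score))
    return report
```

A display layer would then render each field of the initial record next to its authoritative counterpart, using the returned color as the cell background.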
Focus:
- Dublin Core (maybe qualified)
- a framework that allows multiple authority sources
- NOT focusing on author names (ORCID is already working on this); we treat them as plain strings and do only basic string matching
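One way to read "a framework that allows multiple authority sources" is a small plug-in interface keyed by identifier type. The sketch below is an assumption about the design, not the project's actual code: `AuthoritySource`, `Registry`, and `StaticSource` are hypothetical names, and `StaticSource` is an in-memory stand-in used only to show the shape of the interface.

```python
from abc import ABC, abstractmethod


class AuthoritySource(ABC):
    """One authority (e.g. CrossRef, PubMed). Hypothetical interface."""

    # Identifier type this source can resolve, e.g. "doi" or "pmid".
    identifier_type = None

    @abstractmethod
    def fetch(self, identifier):
        """Return the authoritative record as a dict of Dublin Core
        fields, or None if the identifier is unknown."""


class Registry:
    """Keeps track of which sources can resolve which identifier type."""

    def __init__(self):
        self._sources = {}

    def register(self, source):
        self._sources.setdefault(source.identifier_type, []).append(source)

    def sources_for(self, identifier_type):
        return list(self._sources.get(identifier_type, []))


class StaticSource(AuthoritySource):
    """Stand-in source backed by an in-memory dict, for illustration only."""

    identifier_type = "doi"

    def __init__(self, records):
        self._records = records

    def fetch(self, identifier):
        return self._records.get(identifier)
```

Under this design, adding a new authority means writing one subclass with a `fetch` method and registering it; the comparison code never needs to know which source a record came from.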
Possible authority sources:
- pubmed
  - Sample PubMed query (in Java): DSpace PubMedPrefillStep.java (https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java), from Populate Metadata from PubMed (https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed)
    - See 'retrievePubmedXML()' in the above Java code for the actual call to PubMed
    - Mapping happens here: see pmid-to-dim.xsl (https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-pmid+dim.xsl) for a sample XSLT crosswalk that translates the PubMed format to qualified Dublin Core (the internal DSpace metadata format)
  - More examples of querying PubMed: http://www.my-whiteboard.com/how-to-automate-pubmed-search-using-perl-php-or-java/
- crossref
  - simply send the DOI to CrossRef and get JSON/XML back
    - http://api.labs.crossref.org/10.1111/j.1558-5646.2009.00626.x.json
    - http://api.labs.crossref.org/10.2307/1935157.xml
  - Java code that includes a lookup
- google scholar - does it have an API?
- mendeley - Mendeley API
- vivo
- bibapp
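The "locate identifiers, then send the DOI to CrossRef" step could look something like the sketch below. The URL pattern is copied from the two example links in the crossref entry above and may no longer be served. The regular expressions and the function names (`find_identifiers`, `crossref_url`) are assumptions made for illustration, and no network request is performed here.

```python
import re

# Base URL as it appears in the example links above; the service may be gone.
CROSSREF_LABS_BASE = "http://api.labs.crossref.org/"

# Assumed patterns: a DOI starts with "10.", a registrant code, and a slash;
# a PMID is only recognized when labeled, e.g. "PMID: 12345".
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")
PMID_PATTERN = re.compile(r"\bPMID:?\s*(\d+)", re.IGNORECASE)


def find_identifiers(record):
    """Scan every field value of a record (dict of strings) for DOIs and PMIDs."""
    found = {"doi": [], "pmid": []}
    for value in record.values():
        for doi in DOI_PATTERN.findall(value):
            # Heuristic: drop trailing punctuation picked up from prose.
            doi = doi.rstrip(".,;)")
            if doi not in found["doi"]:
                found["doi"].append(doi)
        for pmid in PMID_PATTERN.findall(value):
            if pmid not in found["pmid"]:
                found["pmid"].append(pmid)
    return found


def crossref_url(doi, fmt="json"):
    """Build a lookup URL following the pattern of the example links."""
    if fmt not in ("json", "xml"):
        raise ValueError("fmt must be 'json' or 'xml'")
    return CROSSREF_LABS_BASE + doi + "." + fmt
```

Each identifier found this way would be handed to the matching authority source, and the response mapped to Dublin Core before comparison.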