https://wiki.code4lib.org/api.php?action=feedcontributions&user=Tdonohue&feedformat=atomCode4Lib - User contributions [en]2024-03-28T08:16:12ZUser contributionsMediaWiki 1.26.2https://wiki.code4lib.org/index.php?title=Libraries_Sharing_Code&diff=36988Libraries Sharing Code2013-02-18T21:09:11Z<p>Tdonohue: /* Institutions on GitHub */</p>
<hr />
<div>A number of libraries have organizational repositories in GitHub. These can be very valuable and we attempt to collect them here.<br />
<br />
==Institutions on GitHub==<br />
* [https://github.com/gwu-libraries/ George Washington University Libraries]<br />
* [https://github.com/gvsulib Grand Valley State University Libraries]<br />
* [https://github.com/nypl/ The New York Public Library]<br />
* [https://github.com/NYULibraries NYU Libraries]<br />
* [https://github.com/psu-stewardship Penn State Digital Stewardship]<br />
* [https://github.com/organizations/RockefellerArchiveCenter Rockefeller Archive Center]<br />
* [https://github.com/adsabs/ SAO/NASA Astrophysics Data System]<br />
* [https://github.com/ucsdlib?tab=repositories UCSD Library]<br />
* [https://github.com/ualibraries The University of Arizona Libraries]<br />
* [https://github.com/ui-libraries University of Iowa Libraries]<br />
* [https://github.com/UNC-Libraries University of North Carolina at Chapel Hill Libraries]<br />
* [https://github.com/ndlib University of Notre Dame Hesburgh Libraries] (And [https://github.com/ndlibersa the CORAL stuff])<br />
* [https://github.com/yalemssa Manuscripts and Archives, Yale University Library]<br />
* [https://github.com/yorkulibraries York University Libraries]<br />
* [https://github.com/chattlibrary Chattanooga Public Library] (currently empty)<br />
* [https://github.com/DarienLibrary Darien (CT) Library] (currently empty)<br />
* [https://github.com/NCSU-Libraries NCSU Libraries] (currently empty, but see individuals below)<br />
* [https://github.com/pulibrary Princeton University Library]<br />
* [https://github.com/osulibraries/ Ohio State University Libraries]<br />
<br />
==Institutions with Non-GitHub open repositories==<br />
* University of Florida: SobekCM software [http://sourceforge.net/directory/?q=sobekcm Sourceforge], [http://code.google.com/p/sobekcm/ Google code], [http://ufdc.ufl.edu/software UFDC institutional site]<br />
* [http://eXtensibleCatalog.org eXtensible Catalog] repositories: [http://code.google.com/p/xcmetadataservicestoolkit/ Metadata Services Toolkit], [http://code.google.com/p/xcoaitoolkit/ OAI Toolkit], [http://code.google.com/p/xcncip2toolkit/ NCIP Toolkit] (all these on Google code), [http://drupal.org/project/xc Drupal Toolkit] (on drupal.org)<br />
<br />
==Other Library Organizations on GitHub==<br />
* [https://github.com/projectblacklight Project Blacklight]<br />
* [https://github.com/projecthydra Project Hydra]<br />
* [https://github.com/islandora Islandora]<br />
* [https://github.com/artefactual Artefactual]<br />
* [https://github.com/duraspace DuraSpace], along with projects: [https://github.com/dspace DSpace], [https://github.com/fcrepo/ Fedora], and [https://svn.duraspace.org/view/duracloud/ DuraCloud (not yet on GitHub)]<br />
<br />
==Individuals on GitHub==<br />
* [https://github.com/jronallo Jason Ronallo] North Carolina State University<br />
* [https://github.com/cazzerson Jason Casden] North Carolina State University</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=Libraries_Sharing_Code&diff=36987Libraries Sharing Code2013-02-18T20:59:06Z<p>Tdonohue: /* Other Library Organizations on GitHub */</p>
<hr />
<div>A number of libraries have organizational repositories in GitHub. These can be very valuable and we attempt to collect them here.<br />
<br />
==Institutions on GitHub==<br />
* [https://github.com/gwu-libraries/ George Washington University Libraries]<br />
* [https://github.com/gvsulib Grand Valley State University Libraries]<br />
* [https://github.com/nypl/ The New York Public Library]<br />
* [https://github.com/NYULibraries NYU Libraries]<br />
* [https://github.com/psu-stewardship Penn State Digital Stewardship]<br />
* [https://github.com/organizations/RockefellerArchiveCenter Rockefeller Archive Center]<br />
* [https://github.com/adsabs/ SAO/NASA Astrophysics Data System]<br />
* [https://github.com/ucsdlib?tab=repositories UCSD Library]<br />
* [https://github.com/ualibraries The University of Arizona Libraries]<br />
* [https://github.com/ui-libraries University of Iowa Libraries]<br />
* [https://github.com/UNC-Libraries University of North Carolina at Chapel Hill Libraries]<br />
* [https://github.com/ndlib University of Notre Dame Hesburgh Libraries] (And [https://github.com/ndlibersa the CORAL stuff])<br />
* [https://github.com/yalemssa Manuscripts and Archives, Yale University Library]<br />
* [https://github.com/yorkulibraries York University Libraries]<br />
* [https://github.com/chattlibrary Chattanooga Public Library] (currently empty)<br />
* [https://github.com/DarienLibrary Darien (CT) Library] (currently empty)<br />
* [https://github.com/NCSU-Libraries NCSU Libraries] (currently empty, but see individuals below)<br />
* [https://github.com/pulibrary Princeton University Library]<br />
<br />
==Institutions with Non-GitHub open repositories==<br />
* University of Florida: SobekCM software [http://sourceforge.net/directory/?q=sobekcm Sourceforge], [http://code.google.com/p/sobekcm/ Google code], [http://ufdc.ufl.edu/software UFDC institutional site]<br />
* [http://eXtensibleCatalog.org eXtensible Catalog] repositories: [http://code.google.com/p/xcmetadataservicestoolkit/ Metadata Services Toolkit], [http://code.google.com/p/xcoaitoolkit/ OAI Toolkit], [http://code.google.com/p/xcncip2toolkit/ NCIP Toolkit] (all these on Google code), [http://drupal.org/project/xc Drupal Toolkit] (on drupal.org)<br />
<br />
==Other Library Organizations on GitHub==<br />
* [https://github.com/projectblacklight Project Blacklight]<br />
* [https://github.com/projecthydra Project Hydra]<br />
* [https://github.com/islandora Islandora]<br />
* [https://github.com/artefactual Artefactual]<br />
* [https://github.com/duraspace DuraSpace], along with projects: [https://github.com/dspace DSpace], [https://github.com/fcrepo/ Fedora], and [https://svn.duraspace.org/view/duracloud/ DuraCloud (not yet on GitHub)]<br />
<br />
==Individuals on GitHub==<br />
* [https://github.com/jronallo Jason Ronallo] North Carolina State University<br />
* [https://github.com/cazzerson Jason Casden] North Carolina State University</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=Mentorship&diff=13893Mentorship2012-06-04T18:06:52Z<p>Tdonohue: </p>
<hr />
<div>We're looking to try out a mentorship program through the #code4lib channel on IRC Freenode. The idea is that experienced people who are willing to volunteer some time would provide 1-on-1 mentoring to those who would like to learn during an "Office hours" period, once a week. The idea is sort-of modelled on the Drupal project's [http://drupal.org/node/1242856 Core Office Hours]. If this sounds like something you'd like to contribute to, don't hesitate to add your name!<br />
<br />
{| border="1" cellspacing="0" cellpadding="5" align="center"<br />
! Mentor name<br />
! IRC name<br />
! Social<br />
! Willing to help with<br />
! Looking to learn<br />
! Office hours<br />
|-<br />
| [[User:Mdm|Matt McCollow]]<br />
| mdm_<br />
|<br />
| Javascript<br />
|<br />
| Fridays, 1-2 pm EST<br />
|-<br />
| [http://ruebot.net Nick Ruest]<br />
| ruebot<br />
| [http://twitter.com/ruebot Twitter] [https://plus.google.com/107824412266039153440 G+] <br />
| Islandora<br />
| Getting better with Solr<br />
| Fridays, 1-2 pm EST (pls ping in advance)<br />
|-<br />
| [http://jbfink.github.com John Fink]<br />
| jbfink<br />
|<br />
| git<br />
|<br />
| Fridays, 1-2pm EST<br />
|-<br />
| [http://lackoftalent.org/michael/ Mike Giarlo]<br />
| mjgiarlo<br />
| [http://twitter.com/mjgiarlo t] [http://gplus.to/mjgiarlo G+] [http://facebook.com/mjgiarlo fb]<br />
| Repository technology and architecture, Ruby, Git, Project Hydra, "Persistent identifiers"<br />
| <br />
| Fridays, 1-5 pm EST (pls ping in advance)<br />
|-<br />
| [http://kennison.name/ Brian Kennison]<br />
| briankenn<br />
|<br />
| xslt<br />
|<br />
| Fridays, 1-2pm EST (let me know what works for you)<br />
|-<br />
| [http://github.com/tdonohue/ Tim Donohue]<br />
| tdonohue <br />
| [http://twitter.com/timdonohue Twitter]<br />
| DSpace, repository technologies/integrations in general, SWORD protocol, digital preservation stuff<br />
|<br />
| [http://wiki.duraspace.org/display/~tdonohue/DSpace+Office+Hours Weds, 1-4pm ET in #duraspace channel on IRC Freenode]<br />
|-<br />
|}</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=Mentorship&diff=13888Mentorship2012-06-04T17:49:28Z<p>Tdonohue: </p>
<hr />
<div>We're looking to try out a mentorship program through the #code4lib channel on IRC Freenode. The idea is that experienced people who are willing to volunteer some time would provide 1-on-1 mentoring to those who would like to learn during an "Office hours" period, once a week. The idea is sort-of modelled on the Drupal project's [http://drupal.org/node/1242856 Core Office Hours]. If this sounds like something you'd like to contribute to, don't hesitate to add your name!<br />
<br />
{| border="1" cellspacing="0" cellpadding="5" align="center"<br />
! Mentor name<br />
! IRC name<br />
! Social<br />
! Willing to help with<br />
! Office hours<br />
|-<br />
| [[User:Mdm|Matt McCollow]]<br />
| mdm_<br />
|<br />
| Javascript<br />
| Fridays, 1-2 pm EST<br />
|-<br />
| [http://ruebot.net Nick Ruest]<br />
| ruebot<br />
| [http://twitter.com/ruebot Twitter] [https://plus.google.com/107824412266039153440 G+] <br />
| Islandora<br />
| Fridays, 1-2 pm EST (pls ping in advance)<br />
|-<br />
| [http://jbfink.github.com John Fink]<br />
| jbfink<br />
|<br />
| git<br />
| Fridays, 1-2pm EST<br />
|-<br />
| [http://lackoftalent.org/michael/ Mike Giarlo]<br />
| mjgiarlo<br />
|<br />
| Repository technology and architecture, Ruby, Git, Project Hydra, "Persistent identifiers"<br />
| Fridays, 1-2 pm EST (though pls ping in advance)<br />
|-<br />
| [http://kennison.name/ Brian Kennison]<br />
| briankenn<br />
|<br />
| xslt<br />
| Fridays, 1-2pm EST (let me know what works for you)<br />
|-<br />
| [http://github.com/tdonohue/ Tim Donohue]<br />
| tdonohue <br />
| [http://twitter.com/timdonohue Twitter]<br />
| DSpace, repository technologies/integrations in general<br />
| [http://wiki.duraspace.org/display/~tdonohue/DSpace+Office+Hours Weds, 1-4pm ET in #duraspace channel on IRC Freenode]<br />
|-<br />
|}</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7280HAMR: Human/Authority Metadata Reconciliation2011-02-07T21:13:12Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Narrowing the focus for today:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names ([http://www.orcid.org/ ORCID] is already working on this), except the fact that they are strings, and we'll do basic string matching<br />
* 1 to 1 matching. Even if you want to eventually match with multiple authorities, you'd only do one at a time<br />
<br />
Possible authority sources:<br />
* PubMed<br />
** Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefillStep.java] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
*** See 'retrievePubmedXML()' in above java code for actual call to PubMed<br />
*** Mapping happens here: See [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-pmid+dim.xsl pmid-to-dim.xsl] for a sample XSLT crosswalk to translate PubMed format to a qualified dublin core (internal DSpace metadata format)<br />
** More examples of querying PubMed: http://www.my-whiteboard.com/how-to-automate-pubmed-search-using-perl-php-or-java/<br />
* CrossRef<br />
** simply send the DOI to crossref, and get JSON/XML back<br />
*** http://api.labs.crossref.org/10.1111/j.1558-5646.2009.00626.x.json<br />
*** http://api.labs.crossref.org/10.2307/1935157.xml<br />
*** [http://code.google.com/p/dryad/source/browse/trunk/dryad/dspace/modules/doi/dspace-doi-webapp/src/main/java/org/dspace/doi/DOIServlet.java java code that includes a lookup]<br />
** [http://labs.crossref.org/site/crossref_metadata_search.html Metadata Search] -- send a text query, receive a list of matching records<br />
** [http://labs.crossref.org/site/quick_and_dirty_api_guide.html OpenURL search]<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]<br />
<br />
Thoughts / Questions:<br />
* Is there a way to do most/all of this via Javascript/AJAX/JQuery? Could it be a simple Javascript framework you could "drop" into any metadata editing interface?<br />
** Unfortunately, it seems this wouldn't work out. In order to perform querying of external authorities, they'd all need to support [http://en.wikipedia.org/wiki/JSON#JSONP JSONP] or similar (and they don't)<br />
<br />
== Code ==<br />
<br />
* [http://gitref.org/ quick reference for Git]<br />
* [https://github.com/ryscher/hamr Ryan's really stupid scratch implementation]<br />
<br />
== Output Spec ==<br />
<br />
* We will use a simple XML output consisting of paired (and possibly unpaired) values.<br />
* The root element will contain an attribute signifying the source of the authority metadata.<br />
* The <match> element will be used to pair values, with a strength attribute to signify the string distance.<br />
* Within each match element will be exactly 2 metadata elements with attributes signifying the source of each value: either the local input or the remote authority data.<br />
* An <nonmatch> element will be used for unpaired values.<br />
<br />
=== Sample Output ===<br />
<pre><br />
<hamr authority="PubMed"><br />
<match strength="100%"><br />
<creator src="input">Trojan, Tommy</creator><br />
<creator src="authority">Trojan, Tommy</creator><br />
</match><br />
<match strength="90%"><br />
<title src="input">Great American Article</title><br />
<title src="authority">Great American Article, The</title><br />
</match><br />
<nonmatch><br />
<subject src="input">Medical Stuff</subject><br />
</nonmatch><br />
<nonmatch><br />
<type src="authority">text</type><br />
</nonmatch><br />
</hamr><br />
</pre><br />
<br />
== Need to do ==<br />
<br />
# Implement metadata retrieval from authority ''(done for crossref in ryan's code)''<br />
# Design structure of plugins<br />
## crosswalk from authority format to simple dc<br />
# Design matching algorithm</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7252HAMR: Human/Authority Metadata Reconciliation2011-02-07T19:13:09Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Narrowing the focus for today:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names ([http://www.orcid.org/ ORCID] is already working on this), except the fact that they are strings, and we'll do basic string matching<br />
* 1 to 1 matching. Even if you want to eventually match with multiple authorities, you'd only do one at a time<br />
<br />
Possible authority sources:<br />
* PubMed<br />
** Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefillStep.java] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
*** See 'retrievePubmedXML()' in above java code for actual call to PubMed<br />
*** Mapping happens here: See [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-pmid+dim.xsl pmid-to-dim.xsl] for a sample XSLT crosswalk to translate PubMed format to a qualified dublin core (internal DSpace metadata format)<br />
** More examples of querying PubMed: http://www.my-whiteboard.com/how-to-automate-pubmed-search-using-perl-php-or-java/<br />
* CrossRef<br />
** simply send the DOI to crossref, and get JSON/XML back<br />
*** http://api.labs.crossref.org/10.1111/j.1558-5646.2009.00626.x.json<br />
*** http://api.labs.crossref.org/10.2307/1935157.xml<br />
*** [http://code.google.com/p/dryad/source/browse/trunk/dryad/dspace/modules/doi/dspace-doi-webapp/src/main/java/org/dspace/doi/DOIServlet.java java code that includes a lookup]<br />
** [http://labs.crossref.org/site/crossref_metadata_search.html Metadata Search] -- send a text query, receive a list of matching records<br />
** [http://labs.crossref.org/site/quick_and_dirty_api_guide.html OpenURL search]<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]<br />
<br />
Thoughts / Questions:<br />
* Is there a way to do most/all of this via Javascript/AJAX/JQuery? Could it be a simple Javascript framework you could "drop" into any metadata editing interface?<br />
<br />
== Code ==<br />
<br />
* [http://gitref.org/ quick reference for Git]<br />
* [https://github.com/ryscher/hamr Ryan's really stupid scratch implementation]<br />
<br />
== Need to do ==<br />
<br />
# Create basic code framework<br />
# Implement metadata retrieval from authority<br />
# Design matching algorithm</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7214HAMR: Human/Authority Metadata Reconciliation2011-02-07T16:04:43Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names ([http://www.orcid.org/ ORCID] is already working on this), except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed<br />
** Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefillStep.java] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
*** See 'retrievePubmedXML()' in above java code for actual call to PubMed<br />
*** Mapping happens here: See [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-pmid+dim.xsl pmid-to-dim.xsl] for a sample XSLT crosswalk to translate PubMed format to a qualified dublin core (internal DSpace metadata format)<br />
** More examples of querying PubMed: http://www.my-whiteboard.com/how-to-automate-pubmed-search-using-perl-php-or-java/<br />
* crossref<br />
** simply send the DOI to crossref, and get JSON/XML back<br />
*** http://api.labs.crossref.org/10.1111/j.1558-5646.2009.00626.x.json<br />
*** http://api.labs.crossref.org/10.2307/1935157.xml<br />
*** [http://code.google.com/p/dryad/source/browse/trunk/dryad/dspace/modules/doi/dspace-doi-webapp/src/main/java/org/dspace/doi/DOIServlet.java java code that includes a lookup]<br />
** [http://labs.crossref.org/site/crossref_metadata_search.html Metadata Search] -- send a text query, receive a list of matching records<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]<br />
<br />
Thoughts / Questions:<br />
* Is there a way to do most/all of this via Javascript/AJAX/JQuery? Could it be a simple Javascript framework you could "drop" into any metadata editing interface?</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7211HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:58:34Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names ([http://www.orcid.org/ ORCID] is already working on this), except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed<br />
** Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefillStep.java] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
*** See 'retrievePubmedXML()' in above java code for actual call to PubMed<br />
*** Mapping happens here: See [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-pmid+dim.xsl pmid-to-dim.xsl] for a sample XSLT crosswalk to translate PubMed format to a qualified dublin core (internal DSpace metadata format)<br />
** More examples of querying PubMed: http://www.my-whiteboard.com/how-to-automate-pubmed-search-using-perl-php-or-java/<br />
* crossref<br />
** simply send the DOI to crossref, and get JSON/XML back<br />
** http://api.labs.crossref.org/10.1111/j.1558-5646.2009.00626.x.json<br />
** http://api.labs.crossref.org/10.2307/1935157.xml<br />
** [http://code.google.com/p/dryad/source/browse/trunk/dryad/dspace/modules/doi/dspace-doi-webapp/src/main/java/org/dspace/doi/DOIServlet.java java code that includes a lookup]<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7203HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:49:58Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names ([http://www.orcid.org/ ORCID] is already working on this), except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed<br />
** Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefillStep.java] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
*** See 'retrievePubmedXML()' in above java code for actual call to PubMed<br />
*** See [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-pmid+dim.xsl pmid-to-dim.xsl] for a sample XSLT crosswalk to translate PubMed format to a qualified dublin core (internal DSpace metadata format)<br />
* crossref<br />
** simply send the DOI to crossref, and get JSON back<br />
** example: http://api.labs.crossref.org/10.1111/j.1558-5646.2009.00626.x.json<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7200HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:45:43Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names ([http://www.orcid.org/ ORCID] is already working on this), except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed - Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefill] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
* crossref<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7199HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:43:08Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names, except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed - Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefill] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
* crossref<br />
* google scholar - does it have an API?<br />
* [http://www.mendeley.com mendeley] - [http://dev.mendeley.com/ Mendeley API]<br />
* [http://vivoweb.org/ vivo]<br />
* [http://bibapp.org/ bibapp]</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7196HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:33:00Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names, except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed - Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefill] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed])<br />
* crossref<br />
* google scholar</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7195HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:32:47Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names, except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed - Sample pubmed query (in Java): [https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefill] (From [https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed]<br />
* crossref<br />
* google scholar</div>Tdonohuehttps://wiki.code4lib.org/index.php?title=HAMR:_Human/Authority_Metadata_Reconciliation&diff=7194HAMR: Human/Authority Metadata Reconciliation2011-02-07T15:32:26Z<p>Tdonohue: </p>
<hr />
<div>[[HAMR: Human/Authority Metadata Reconciliation]]<br />
<br />
Sean Chen, Tim Donohue, Joshua Gomez, Ranti Junus, Ryan Scherle<br />
<br />
A tool for a curator to determine whether the various fields of a metadata record are correct. Takes a metadata record, locates any identifiers (e.g., DOI, PMID). Retrieves a copy of the metadata record from an authoritative source (e.g., CrossRef, PubMed). Displays a human-readable page that compares fields in the initial record with fields in the authoritative record. Each field is color-coded based on how well it matches, so the curator can quickly identify discrepancies.<br />
<br />
<br />
Focus:<br />
* Dublin core (maybe qualified)<br />
* framework that allows multiple authority sources<br />
* NOT focusing on author names, except the fact that they are strings, and we'll do basic string matching<br />
<br />
Possible authority sources:<br />
* pubmed - Sample pubmed query (in Java): [[https://wiki.duraspace.org/display/DSPACE/PubMedPrefill-PubmedPrefillStep.java DSpace PubMedPrefill]] (From [[https://wiki.duraspace.org/display/DSPACE/PopulateMetadataFromPubMed Populate Metadata from PubMed]]<br />
* crossref<br />
* google scholar</div>Tdonohue