Difference between revisions of "Umlaut wishlist"

(moved some wishlist items to bottom of the page under "done or in progress")

Revision as of 21:26, 6 July 2008


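Desired or planned features.


  • Parsing of formatted references from an entry screen. Use the http://wing.comp.nus.edu.sg/parsCit/ package. Very interesting! Or a similar UCOP package: http://purl.net/net/egh/hmm-citation-extractor/

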
  • Rochester “Getting Users Fulltext” style code to skip right to the full text, skipping content-provider metadata pages.


  • UMich Mirlyn for metadata enrichment?
    http://webservices.itcs.umich.edu/mediawiki/MLibraryAPI/index.php/Mirlynapi:Home


  • xISBN/thingISBN use. (Some thought is required in how to integrate this while avoiding false positives). Bowker ISSN service for metadata enhancement. OCLC xISSN? Integrate preceding/succeeding title information from OPAC or xISSN?


  • Journal covers from Ulrich's via screen scraping (or a built-in Ulrich's/SerSol API?)


  • Connotea integration


  • Fetch ToC from LC. Screen scrape, I guess? Or Z39.50? Any other content from LC?


  • Link to Books In Print à la Notre Dame. Example:

http://www.library.nd.edu/eresources/findit/findit.cgi?doc_num=001939269&aleph_session=U5AVHRXD5QB1CGDFDSVJ9DSY2UA6QNCGVEU8EYRX9NNMIQ429Q-54668%22

  • BIP search URL?

http://www.booksinprint.com/merge_shared/Search/advsearch.asp%3FdateState%3DY%26txtAction%3D%26BooleanSearch%3D%26SType%3Dadv%26collection%3DBIP%26QueryMode%3DSimple%26ResultCount%3D25%26ResultTemplate%3Dmbbookresult_fl.hts%26navPage%3D1%26SrchFrm%3DAdv%26ScoreThreshold%3D0%26Criteria1%3DISBN%26CriteriaText1%3D0838935370


  • SFX plugin: Notice when the first title given is non-roman and, if so, look for a roman title to enhance the metadata with.


  • Cover images from Open Library? See http://johnmiedema.ca/openbook-wordpress-plugin/.


  • HIP and other OPAC searchers should pull ToC from MARC 505 when present. And 856s judged to be ToC links should be listed under ToC, not full text.


  • Fix Umlaut Referent to more easily allow multiple authors. An architectural change is necessary to get a lot of this stuff working right.


  • "Cited by" service. Scopus via screen scraping? (Scopus JavaScript API? http://www.scopus.com/scsearchapi/ ) ISI Web of Science is too hard to even screen scrape (the interface is such a mess), but Scopus looks doable. Google Scholar?


  • Enhance metadata so that full metadata is available for a RefWorks (etc.) export. Using: CrossRef? Metalib? Anything else?


  • A general-purpose response cache. Schema: date, service/source, key. Use for caching image URLs, ToC URLs from LC, etc.


  • Fix WorldCat registry auto-discovery.


  • Add a WorldCat search that uses the API, instead of screen scraping.


  • Switch OCA search to use OCA native APIs, instead of indexdata mirror index.


  • Fix unAPI in Umlaut. unAPI to RIS? For Zotero.


  • Change background processing to use the Spawn plugin instead of manual threading. Investigating using Spawn with fork instead of thread (Terry Reese on a limited pool of forks).


  • Crazy idea for an abstract interface/architecture to support querying web service apis that require client side javascript, like Google Books and Scopus.


  • Integrate my various local document delivery services into menu of options when full text isn’t available. More generally, a clear architecture for providing localized doc delivery services in addition to a single ILL link.


  • SFX adaptor: Add a "rollup" feature that pays attention to dates to avoid eliminating coverage.
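
One way the date-aware "rollup" could work, as a rough sketch only: drop a target when its coverage falls entirely inside the coverage of a target that is being kept, and otherwise keep it so no years are lost. The `Coverage` record, the year-based ranges, and the `rollup` function are hypothetical illustrations, not existing Umlaut or SFX code, and the sketch is in Python purely for brevity.

```python
from typing import List, NamedTuple, Optional


class Coverage(NamedTuple):
    """Hypothetical record for one full-text target's coverage."""
    target: str           # name of the target (illustrative only)
    start: int            # first year covered
    end: Optional[int]    # last year covered; None means "to present"


def _end(cov: Coverage) -> float:
    # Treat open-ended coverage as extending indefinitely.
    return float("inf") if cov.end is None else cov.end


def rollup(coverages: List[Coverage]) -> List[Coverage]:
    """Drop only targets whose date range is fully contained in a kept target."""
    kept: List[Coverage] = []
    # Earliest start first, widest range first, so containers are seen
    # before the ranges they contain.
    for cand in sorted(coverages, key=lambda c: (c.start, -_end(c))):
        contained = any(
            k.start <= cand.start and _end(k) >= _end(cand) for k in kept
        )
        if not contained:
            kept.append(cand)
    return kept


if __name__ == "__main__":
    targets = [
        Coverage("A", 1990, 2000),
        Coverage("B", 1995, 1998),   # inside A, safe to roll up
        Coverage("C", 1999, None),   # extends past A, must be kept
    ]
    for cov in rollup(targets):
        print(cov)
```

A real implementation would presumably also need to weigh embargoes, volume/issue thresholds, and local target preferences, which this sketch ignores.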

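
For the general purpose response cache item in the list above, a minimal sketch of the "date, service/source, key" schema might look like the following. It uses Python's built-in `sqlite3` only for illustration; the table name, the extra `value` column holding the cached URL, and the helper functions are all assumptions, not part of Umlaut.

```python
import sqlite3
import time


def open_cache(path=":memory:"):
    """Open (or create) a hypothetical response cache table."""
    db = sqlite3.connect(path)
    db.execute(
        """CREATE TABLE IF NOT EXISTS response_cache (
               service    TEXT NOT NULL,  -- e.g. a cover or ToC source
               key        TEXT NOT NULL,  -- e.g. an ISBN or ISSN
               fetched_at REAL NOT NULL,  -- the "date" in the schema
               value      TEXT,           -- assumed: the cached URL
               PRIMARY KEY (service, key)
           )"""
    )
    return db


def cache_put(db, service, key, value, now=None):
    """Store or overwrite the cached value for (service, key)."""
    now = time.time() if now is None else now
    db.execute(
        "INSERT OR REPLACE INTO response_cache "
        "(service, key, fetched_at, value) VALUES (?, ?, ?, ?)",
        (service, key, now, value),
    )
    db.commit()


def cache_get(db, service, key, max_age, now=None):
    """Return the cached value, or None if missing or older than max_age seconds."""
    now = time.time() if now is None else now
    row = db.execute(
        "SELECT fetched_at, value FROM response_cache "
        "WHERE service = ? AND key = ?",
        (service, key),
    ).fetchone()
    if row is None or now - row[0] > max_age:
        return None
    return row[1]
```

Keying on the (service, key) pair lets one table serve cover images, LC ToC links, and anything else, while the stored date lets each service choose its own expiry.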

done or in progress

  • Google Books search to complement the OCA and Gutenberg searches I’ve got--may or may not be possible with no Google Books API. Screen scrape? UMich OAI-PMH records?
  • UMich MBooks for fulltext (and search-inside)
    http://mirlyn.lib.umich.edu/cgi-bin/sdrsmd?id=1&oclc=16857172
    http://code.google.com/p/jquery-sdrsmd/
  • connection to OCLC Identities
    http://outgoing.typepad.com/outgoing/2008/06/linking-to-worl.html