Umlaut wishlist

2,785 bytes added, 16:22, 19 June 2012
[[Category:Umlaut]]
=WARNING: This is Outdated Documentation!!!!=
'''THIS IS OUTDATED DOCUMENTATION''' See new Umlaut documentation at http://github.com/team-umlaut/umlaut/wiki

----

Some actual current future plans:
* JournalTOCs ToC?
* Use OCLC xISBN to find HT and Internet Archive/OCA matches?
* Internet Archive -- use new OL/IA api, discover search-inside-the-book.
* WorldCat, use new api, link directly to nearest public library in 'see also' or elsewhere.
* CiteSeerX -- source of 'cited by' info, AND, most excitingly, open access pre-prints. But their Atom/RSS feeds (the only API I could find) don't seem to advertise enough info to actually use these features. Would need to talk to developer team -- possibly offer to help code? Also not entirely clear how big their corpus actually is, if it's worth it.
* Try screen-scraping Google Scholar (and maybe Microsoft Academic) to get the open access full text links they find. Also, there's a Springer API for open access content now. http://dev.springer.com/docs/Restful_operations
* When no full text is found, provide link to search on Google Scholar, or Bing Academic? Need to have sufficient metadata to create the search. Oct 2010 Library Technology Reports article has some ideas, I think.

'''old''' Desired or planned features.
* Check for similar articles from: http://biosemantics.org/jane/faq.php#api
* Full-text availability check from http://chroniclingamerica.loc.gov/ -- check by title/city, check by lccn (?), able to check particular dates/link to particular dates and/or pages of paper?
* Allow a service_response to have a tree relationship to children, so for instance alternate versions of a text can be attached as children of the main link, expandable by the user.
* http://export.arxiv.org/api_help/ !!!!
* PubMed Central full text lookup http://www.ncbi.nlm.nih.gov/entrez/query/static/esearch_help.html (SFX may already do this?)
* Journal ToC from CiteULike
* Parsing of formatted references from an entry screen. Use http://wing.comp.nus.edu.sg/parsCit/ package. Very interesting! Or a similar UCOP package: http://purl.net/net/egh/hmm-citation-extractor/ See list of such packages here under "Other Parsing Tools" http://freecite.library.brown.edu/
* LibraryThing open knowledge API for more data. http://www.librarything.com/blog/2008/08/free-web-services-api-to-common.php
* Connect to internet linked movie database on movies: http://www.linkedmdb.org/
* Add information about the conversation happening around an article with Scintilla if we have a URL, PMID or DOI (Alf at Scintilla would prefer us NOT to use the API for high-traffic. But we can copy his techniques internally to Umlaut. CrossRef and PubMed for "cited by" on DOI and PMID identifiers are a good idea. He has also reverse engineered the Scopus javascript api to allow server-side json access. http://hublog.hubmed.org/archives/001512.html): http://hublog.hubmed.org/archives/001609.html Unofficially it will return json: http://scintilla.nature.com/conversations?uri=info%3Adoi%2F10.1371%2Fjournal.pmed.0020124&format=json
* xISBN/thingISBN use. (Some thought is required in how to integrate this while avoiding false positives). Bowker ISSN service for metadata enhancement. OCLC xISSN? Integrate preceding/succeeding title information from OPAC or xISSN?
* LibraryLookup: http://xisbn.worldcat.org/liblook/index.htm At least until xISBN is baked in we could provide a link to this service. Increases the chances of finding a desired book in the catalog through work set grouping. Used by LibX.
http://xisbn.worldcat.org/liblook/resolve.htm?res_id=http://www.iucat.iu.edu&rft.isbn=0451530942&url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book
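The LibraryLookup link above is an ordinary OpenURL-style query string, so generating one per citation could look roughly like the sketch below. The function name `liblook_url` is made up for illustration; the resolver address and the four parameters are copied from the example URL. Unlike the hand-written example, `urlencode` percent-encodes the colons and slashes inside the values, and they decode back to the same parameters.

```python
from urllib.parse import urlencode

# Resolver endpoint taken from the example URL above.
LIBLOOK_RESOLVER = "http://xisbn.worldcat.org/liblook/resolve.htm"


def liblook_url(catalog_url: str, isbn: str) -> str:
    """Build a LibraryLookup link for one ISBN against one catalog (hypothetical helper)."""
    params = [
        ("res_id", catalog_url),                       # base URL of the target OPAC
        ("rft.isbn", isbn),                            # ISBN of the requested book
        ("url_ver", "Z39.88-2004"),                    # OpenURL version, as in the example
        ("rft_val_fmt", "info:ofi/fmt:kev:mtx:book"),  # book metadata format, as in the example
    ]
    return LIBLOOK_RESOLVER + "?" + urlencode(params)


link = liblook_url("http://www.iucat.iu.edu", "0451530942")
print(link)
```

Whether the resolver accepts other catalogs in `res_id` is not stated here, so treat that as something to check before relying on it.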
* Journal covers from Ulrich's via screen-scraping (or Ulrich's/sersol built in api?)
* SFX plugin: Notice when the first title given is non-roman and, if so, look for a roman title to enhance the metadata with.
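One possible way to do the non-roman check from the SFX plugin item, as a rough sketch only: it assumes the plugin can see the candidate titles as a plain list of strings, which is an assumption about the data and not how the SFX adaptor is known to expose them. The names `is_roman` and `roman_alternative` are hypothetical. A title counts as roman when every alphabetic character has a Unicode name starting with "LATIN".

```python
import unicodedata


def is_roman(title: str) -> bool:
    """True if every alphabetic character in the title is a Latin-script letter."""
    return all(
        unicodedata.name(ch, "").startswith("LATIN")
        for ch in title
        if ch.isalpha()  # digits, spaces and punctuation are ignored
    )


def roman_alternative(titles):
    """If the first title is non-roman, return the first roman one after it, else None."""
    if not titles or is_roman(titles[0]):
        return None  # nothing to enhance
    return next((t for t in titles[1:] if is_roman(t)), None)


print(roman_alternative(["Война и мир", "Voina i mir"]))
```

Accented Latin letters pass the check. Fullwidth Latin forms do not, which may or may not be what is wanted.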
 
 
* Cover images from Open Library? See http://johnmiedema.ca/openbook-wordpress-plugin/.
* Fix Umlaut Referent to more easily allow multiple authors. Architectural change necessary to get a lot of this stuff working right.
* "Cited by" service. Scopus via screen scraping? (Scopus JavaScript API? http://www.scopus.com/scsearchapi/ See also http://hublog.hubmed.org/archives/001512.html ) ISI Web of Science is too hard to even screen scrape because the interface is such a mess, but Scopus looks doable. Google Scholar?
* SFX adaptor: Add a "rollup" feature that pays attention to dates to avoid eliminating coverage.
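For the Scintilla item earlier in this list, the unofficial JSON URL is the conversations endpoint with a percent-encoded `info:doi/` URI. A minimal sketch of building it from a bare DOI follows. The helper name `conversations_url` is made up, and the endpoint and parameters are copied from the example URL, not from any documented API. The sketch only builds the URL and does not fetch it, since the Scintilla developer would prefer the API not be used for high traffic.

```python
from urllib.parse import quote

# Endpoint copied from the example URL in the wishlist; unofficial.
SCINTILLA_CONVERSATIONS = "http://scintilla.nature.com/conversations"


def conversations_url(doi: str) -> str:
    """Build the unofficial JSON conversations URL for a DOI (hypothetical helper)."""
    uri = "info:doi/" + doi
    # safe="" so that ":" and "/" inside the info URI are percent-encoded too
    return f"{SCINTILLA_CONVERSATIONS}?uri={quote(uri, safe='')}&format=json"


url = conversations_url("10.1371/journal.pmed.0020124")
print(url)
```

The same pattern might work for PMIDs with a different `info:` prefix, but nothing here confirms that.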
 
 
 
== done or in progress ==
 
* Google Books search to complement the OCA and Gutenberg searches I’ve got -- may or may not be possible with no Google Books API. Screen scrape? UMich OAI-PMH records?
 
* UMich MBooks for fulltext (and search-inside)
http://mirlyn.lib.umich.edu/cgi-bin/sdrsmd?id=1&oclc=16857172
http://code.google.com/p/jquery-sdrsmd/
 
* connection to OCLC Identities
http://outgoing.typepad.com/outgoing/2008/06/linking-to-worl.html
