Changes

2015 Prepared Talk Proposals

20,479 bytes added, 01:03, 8 November 2014

m

Protected "2015 Prepared Talk Proposals": Proposal deadline hit ([Edit=Allow only administrators] (indefinite) [Move=Allow only administrators] (indefinite))

* Bill Levay, wjlevay@gmail.com, Linked Jazz Project

How can we take semi-structured but messy metadata from a repository like CONTENTdm and transform it into rich linked data? Working with metadata from Tulane’s Hogan Jazz Archive Photography Collection, the Linked Jazz Project used Open Refine and Python scripts to tease out proper names, match them with name authority URIs, and specify FOAF relationships between musicians who appear together in photographs. Additional RDF triples were created for any dates associated with the photos, and for those images with place information we employed GeoNames URIs. Historical images and data that were siloed can now interact with other datasets, like Linked Jazz’s rich set of names and personal relationships, and can be visualized [~~link to come~~[http://linkedjazz.org/tulane/ see prototype visualization]] or otherwise presented on the web in any number of ways. I have not previously presented at a Code4Lib conference.

== Taking User Experience (UX) to new heights ==

== Technology on your Wrist: Cross-platform Smartwatch Development for Libraries ==

* [[User:sanderson|Steven Carl Anderson]], sanderson@bpl.org, [http://www.bpl.org Boston Public Library ] (no previously accepted prepared talks but have done lightning talks in the past)

I'll be the first to admit: smartwatches are unlikely to completely revolutionize how a library provides online services. But I believe they still represent an opportunity to further enhance existing library services and resources in a unique way.

Stanford is using Fedora4 for managing Open Annotations, via a middleware component called Triannon. Triannon receives the JSON data from the annotation client, and uses the Linked Data Platform API implementation in Fedora4 to create, retrieve, update and delete the constituent resources. Triannon could be easily modified to use other LDP implementations, or could be modified to work with linked data other than annotations.

== Hydras in the Wild: A survey of current projects ==

* Mark Bussey, mark@curationexperts.com, Data Curation Experts

You've seen the tutorials, but [https://github.com/projecthydra/hydra/wiki/Dive-into-Hydra Dive Into Hydra] seems to leave something wanting. What can you really do using the Hydra Framework? This talks looks at a number of current Hydra projects and highlights the design and functional features unique to each. Compare and contrast UX, design and functional capabilites from a range of hydra-based repositories including:

* Avalon for media discovery and distribution (Indiana and Northwestern Univiersities)

* HydraDam for media archive management(WGBH)

* HyHull for general Institutional Repository needs (University of Hull)

* T-DIL for slide library functions (Tufts University)

* Sufia & Scholarsphere as a bundled self-deposit IR solution (Pennsylvania State University)

* Curate & Worthwhile as general purpose repository platforms (Multiple Insititutions)

This will be a whirlwind tour aimed at providing ideas and inspiration for your own repository development project.

== Hydra Makeovers! ==

* Alicia Cozine, alicia@curationexperts.com, Data Curation Experts

* Patrick Feeley, pgf8@case.edu, Case Western Reserve University

Compare two Hydra-based applications with the systems they replaced. Marvel at the Before and After snapshots of functionality, speed, and look & feel.

* '''Digital Case 2.0''' is an institutional repository, complete with administrative tools, derivatives transcoding, flexible XML metadata storage, embargo and lease capability, faceted searching, and content viewers for texts/TEI, images, audio recordings, and videos. Digital Case 2.0 is based on worthwhile, an open-source IR starter gem.

* '''The Tufts Digital Image Library''' is a specialized tool for art and art history resources, offering image collections with user access controls, image ordering, collection nesting, drag-and-drop organization, slideshows, and export capability.

Both new systems are built on hydra, the open-source Ruby-on-Rails repository solution that incorporates Fedora for storage, Solr for indexing, and Blacklight for search optimization. Their beauty is not just skin-deep!

== Helping Google (and scholars, researchers, educators, & the public) find archival audio ==

We’ll explain our efforts to make the application usable by those with hearing impairments and mobility impairments, reactions from our first users, and challenges in working with the relatively new beacon technology.

== Distributed Remediation: Small tools for big problems: ==

* Matt Miller, matthewmiller@nypl.org, New York Public Library, NYPL Labs

Remediation of legacy data can be automated only so much. Certain essential cleanup tasks, such as aligning a name with the correct authorized version, is very difficult for a computer yet trivial for a person. While it is these types remediations that will allow an institution to take advantage and participate in the web of Linked Open Data, a wholly manual approach is unrealistic. However, by augmenting automated remediation with a light human touch we can quickly and efficiently reach our goals. This talk will look at tools and methods being developed at NYPL Labs to empower library staff and the public to help clean up our legacy metadata through collaborative remediation.

==VuFind + WorldCat: Open Source Discovery Meets Big Library Data==

* Karen A. Coombs, coombsk@oclc.org, OCLC

* Demian Katz, demian.katz@villanova.edu, Villanova University

Good collaboration is crucial to any integration. Our project, to integrate the open source discovery tool, VuFind, with results from the new WorldCat Discovery API, is no different. We want to exploit the flexibility of VuFind and add the depth and breadth of WorldCat and central index content. Plus, the project has the potential to eliminate the hurdle of exporting and indexing MARC records for 32 VuFind libraries.

While we are really excited about the UI enhancements, we are also curious to see how we can share our domain expertise between our two organizations to get the project done quickly. How will different perspectives shape our existing code bases? How will we adjust to working on the same code base simultaneously? And how can we best incorporate what we learn along the way?

There is also new territory to explore using the API: integrating bibliographic and article data into a single results set. And the overarching question: will we be able to take advantage of Linked Data in the WorldCat Discovery API to create some “glue” between records in existing VuFind indexes and third-party data providers?

This presentation will discuss our adventures in the OCLC WorldShare Platform/VuFind collaboration: the opportunities, challenges and results. You’ll learn what worked, what didn’t and how you can improve your own discovery interface integration project—no matter what provider or APIs you use.

== Book Reader Bingo: Which Page-Turner Should I Use? ==

* Eben English, eenglish [at] bpl.org, Boston Public Library

Another day, another library reinventing the book-viewer wheel. When will the madness end? This talk will explore the current landscape of book-viewer/page-turner applications for digital library systems with an eye towards helping you make the right decision for your project. We'll look at some the major players on the market (such as Internet Archive BookReader, Wellcome Player, Mirador, and WDL-Viewer to name a few) and compare them based on a number of criteria: feature sets, mobile/tablet friendliness, ease of integration, code health, test coverage, "market share" (number of implementers), and other important factors. We'll look at the results of ACTUAL USABILITY TESTS to see what features users REALLY want in a book-viewer, and how each app measures up.

We'll also discuss important recent trends (such as the IIIF Presentation API, ReadersFirst, and NYPL's Library Simplified initiative) that have the potential to shape the book-viewer development landscape in the immediate future. Which page-turner applications are best poised to adopt/integrate/leverage these emerging standards? Which will become obsolete? This talk has the answers you need.

== Rich Citations ==

* Adam Becker, abecker@plos.org, Public Library of Science

* Erik Hetzner, ehetzner@plos.org, Public Library of Science

Citations should connect you to the research you need. They should

link directly to the relevant papers and data. They should tell you

everything you need to know about how and where two pieces of research

are connected. They should be easy to use and they should never get in

the way of the reading experience. In short, citations should not

clumsily point to a mere chunk of plain undifferentiated text sitting

in a static list at the end of the paper. But with almost no

exceptions, citations in scientific papers do just that.

PLOS Labs has been working on a project to capture extra information

about citations. To this end, we have designed a metadata structure

describing the context of a paper's citations, called rich citations.

This structured information includes complete bibliographic

information for the cited items, location and context of the in-text

citation, co-citations, reference license and status (updated,

retracted, etc.), and more.

We have processed the complete PLOS corpus to extract this rich

citation metadata, and made this data available in an API located at

http://api.richcitations.org/.

We have also developed a JavaScript overlay to enhance a paper's view

using rich citations (http://alpha.richcitations.org). This overlay

uses the underlying rich citation metadata to allow the user to view

information about an in-text citation, quickly navigate between

citations, and sort and filter the paper's reference list.

In this talk, we will describe the rich citation metadata that we are

capturing. We will demonstrate how this metadata can enhance a

reader's experience of an article and how it can be used by

researchers to better understand how citations are used in the

scientific community. We will also discuss our ongoing plans to extend

this project to the wider literature beyond PLOS, and how interested

members of the community can help.

== Scriptaloging with ArchivesSpace and NodeJS ==

* Brian Hoffman, brianjhoffman@gmail.com

ArchivesSpace is a new web application for managing archival collections. It has a browser-based interface for entering and editing metadata, and can import data serialized as EAD, MARC, and several other formats. But there may be situations where neither of these are quite what you want. For instance, you may have a large folder of images that each need a digital object record; or you may want to export an EAD for every collection in your repository; or calculate the total extent of your collection; or execute a global search and replace; or batch-update barcodes, etc. You could write a plugin using ArchivesSpace’s plugin API, but that requires facility with Ruby as well as access to the environment where the application is running. A more lightweight approach is to access your data through ArchivesSpace’s powerful REST API, and process it using whatever scripting language you prefer. This talk will present some simple “scriptaloging” solutions that a moderately skilled programmer can use to automate data entry or import tasks using an extendable command line tool written in NodeJS (https://www.npmjs.org/package/as-cli) and loosely inspired by Drupal’s drush utilitly.

== Consuming Big Linked Open Data in Practice: Authority Shifts and Identifier Drift ==

* Kathryn Stine, katstine@berkeley.edu, UC Berkeley (first-time presenter)

* Stephanie Collett, stephanie.collett@ucop.edu, California Digital Library, UC (c4l 2012 presenter)

Increasingly, authoritative datasets of interest to libraries (subjects, names, classifications, etc.) are are available in bulk, exposed as linked open data. Unfettered access can allow libraries to aggregate, connect, and augment data in new ways that will benefit users. This talk will describe our exploratory experience integrating bulk data from the Virtual International Authority File (VIAF) into HathiTrust metadata to improve discovery and collection management.

Authoritative data is not static - datasets change with new contributions and re-clustering, resulting in new identifier relationships. We will describe the challenges this presents with accessing, processing, and syncing our metadata with a massive, complex linked dataset. We will talk about our technical approach to navigating an ecosystem of identifiers and mitigating cached identifier drift between systems as authority data shifts. We aim to spark conversation about data accessibility and the relationships between local, consortial, and authoritative metadata as the library community moves beyond “Hello, world” linked data examples to integrating this data at scale into existing systems.

== Your Chocolate is in My Peanut Butter! Mixing up Content and Presentation Layers to Build Smarter Books in Browsers with RDFa, Schema.org, and Linked Data Topics ==

* Jason A. Clark [@jaclark], Head, Library Informatics & Computing, Montana State University, jaclark@montana.edu

* Scott W. H. Young [@hei_scott], Digital Initiatives Librarian, Montana State University, swyoung@montana.edu

Common methods of publishing book content have focused on various implementations of existing technologies such as LaTeX, Markdown, and .epub. A common theme within this development has been the separation of presentation layers and content layers. What if there was another way? In responding to that question, we’ll look at our local @msulibrary prototype software funded by an IMLS Sparks! Innovation grant for presenting books inside of web browsers (https://github.com/jasonclark/bib-template). Our talk will focus on the tools and technologies of open web publishing. We’ll consider the strange and wonderful benefits of integrating the presentation layer and content layer using semantic RDFa HTML5 markup, and we’ll demonstrate how describing and displaying books within an open web model impacts discovery, eBook production, and machine-readability.

Our session will include:

* The benefits of using RDFa, Schema.org, and linked data models for book production

* How structured data models for book content can turn your webpage into your API

* Analyze the effects of this practice for machine-understanding, SEO, and UX

* Discuss the advantages and disadvantages of this model as it applies to a range of book genres, including web book prototypes for fiction and poetry.

Chocolate/Peanut Butter... RDFa/HTML5... Linked Data/Dbpedia Topics... "Great tastes that go great together."

== Integration/Collaboration: Playing Well With Others ==

* Sarah Romkey, sromkey@artefactual.com, Artefactual Systems

* Justin Simpson, jsimpson@artefactual.com, Artefactual Systems

There are many different software applications and systems being used in collecting institutions to handle all aspects of managing their digital collections. Within a single institution, it is possible to have several tools with overlapping functionality. Sometimes the biggest gains in functionality and productivity come from a focus on connecting these systems together.

In this talk we will present several examples of integration and collaboration that we have been involved in while working on the Archivematica project. Examples of integrations with both open source and proprietary systems, as well as examples of collaborative development processes.

== BYOB: Build Your Own Bootstrap ==

* Axa Mei Liauw, aliauw@princeton.edu, Princeton University Library

* Kevin Reiss, kr2@princeton.edu, Princeton University Library

Bootstrap is awesome! It allows for quick and easy front-end development without demanding too much time with the design of the user interface. A framework like Bootstrap can also help libraries address one of our top usability problems: wild inconsistency of styles and layouts across our own in-house applications and websites.

However, what if you do not want to compromise your creativity and make your project visually stand out? With enough Sass or Less knowledge it is possible to highly customize Bootstrap to fit your needs, but in the end you still end up with your design and code buying into Bootstrap's markup and design conventions, which are not always what you want. Sometimes you simply want to start with a clean slate and not fight the framework and use only the components and features you need.

In that case, build your own Bootstrap! In this talk we will discuss techniques for using some of the emerging tools from the front-end development and design communities to create a maintainable, modular in-house Sass toolkit that we are implementing within a variety of tools including Drupal, Rails applications, in house PHP applications, as well as within vendor solutions like Libguides and ILLiad. Some specific tools we will discuss are Grunt, Bower, and Sass tools like Singularity, Breakpoint, and Bourbon.

== Plz to let me download this ebook: an idea for better leisure reading access ==

* Lauren Ajamie, lajamie@nd.edu, University of Notre Dame

(I haven't presented at Code4Lib before)

The landscape of finding and getting fiction (and non-fiction for leisure reading) has changed dramatically in the past few years, and one of the most disruptive new developments could turn out to be Oyster, a "Netflix for books" ebook subscription. While somewhat expensive and with a developing selection of materials, the platform is amazingly easy to use, and makes finding and reading ebooks a two step process: find a book, start reading. Compare this to getting ebooks (or even print books) from your library, a process which could take multiple user names and up to a dozen steps. For both the future of libraries, and my own selfish desires, I want to make this better! This speculative talk will discuss an idea I have (that I will need help with) to make finding and borrowing leisure reading materials less frustrating, and will hopefully result in a library-wide conversation about the discovery of, and access to, leisure reading materials.

== Formats For All! The Future of the Archivematica Format Policy Registry ==

* Misty De Meo, mdemeo@artefactual.com, Artefactual Systems (first-time talk submitter)

Archivematica's Format Policy Registry (FPR) is an open-source preservation planning tool. The FPR is designed to abstract away many of the challenges involved in identifying file formats, as well as picking the right tools to use to perform tasks like metadata extraction, format transformation and normalization.

The first part of this talk will focus on the FPR's structure and its capabilities, but this isn't just a vendor pitch. The current version of the registry is only designed around Archivematica's usecases, but we want to bring the FPR to the world and make it work great for other software too. Share your format policies between Archivematica, Islandora, and Hydra! Share your file IDs between applications so you never misidentify a file again! Build new software without having to worry about the details of how to run external tools on files! The second part of the talk will focus on possible future developments, how the FPR might change, and what a community effort to bring the FPR forward might look like.

== PeerLibrary – Facilitating the global conversation on academic literature ==

* Timothy Quach, UC Berkeley

* Rachel Wolfsohn, UC Berkeley, rachel at peerlibrary.org

* Mitar Milutinovic, UC Berkeley, mitar.code4lib at peerlibrary.org

* Not presented or attended code4lib before

We present [https://peerlibrary.org/ PeerLibrary] as an example of how to intertwine various tools and methods to spread the conversation about academic publications. PeerLibrary is unique because of our collaborative annotation and discussions layer integrated with the in-browser PDF viewer. PeerLibrary provides tools to annotate and comment directly on the works. Users can highlight important information and take notes in the margins online, just as how one would mark up a physical research paper.

Our vision is a virtual space spans the horizontal of all academic knowledge, where individuals across the globe can connect and learn things only another human can help teach. PeerLibrary can be used as a system for depositing original work, archive navigation, to help organize conferences and journals, as a reference manager, a social network to follow what others are doing, for proposing/recommending connections, collaboration, reading, commenting, organizing or sharing papers. PeerLibrary is a platform that synthesizes an online repository and those who utilize it. Our longer-term goal is for PeerLibrary to become a web-first standards-based publication platform. We believe that all stages of research should be collaborative, from the idea, to experiments, conclusions and publication (constant feedback).

To achieve this vision we are working with partners. Such as [http://www.contentmine.org/ ContentMine], to push facts they mine in academic publications into commons available through PeerLibrary. We use Internet Archive to permanently store open access publications and public annotations for future generations. We are using code developed at projects [http://hypothes.is/ Hypothes.is] and [http://annotateit.org/ Annotator] to ensure compatibility with other annotation projects and platforms. We are also using the [http://meteor.com/ Meteor framework], [http://lens.elifesciences.org/ eLife Lens], [https://github.com/guardian/scribe The Guardian Scribe] and [https://github.com/mozilla/pdf.js Mozilla PDF.js]. Our presentation will be an exploration of how these organizations are contributing to the Open Scholarship initiative.

With no subscription or registration needed, anybody can easily access the annotations in PeerLibrary. By eliminating the need to print files, the research process is contained in one online medium, creating a greater sense of organization. Furthermore, note-taking tools encourage active analysis, a recording of insights and questions while reading. Users can choose to keep their highlights and annotations private, or they can make them public to engage in an open online discussion. We believe that the collection of tools provided in PeerLibrary will make education more inclusive.

See it live here: https://peerlibrary.org/

Project presentation: https://vimeo.com/93085636

Screencast: https://vimeo.com/109787685