Changes

Jump to: navigation, search

2011 Preconference Proposals

7,099 bytes added, 05:45, 12 April 2011
m
Protected "2011 Preconference Proposals" [edit=autoconfirmed:move=autoconfirmed]
== Proposals for 2011 Code4LibCon Preconferences ==
Proposals will close Friday November 19 so we can finalize the list and add them to registration!
'''Please include a "Contact/Responsible Individual" name and email address so we know who is willing to put on the proposed precon.'''
==Full Day==
== Text mining = CURATEcamp Hackfest ===* Description: This workshop will describe and demonstrate the principles Want to hack/design/plan/document on a team of text mining and other digital humanities computing techniques. With the advent of so much full text content available people who enjoy learning by creating? Interested in librariesdigital curation? Well, and with the increasing ease in which people can find content, the question to ask one's self this hackfest is, "What do I do for you. Not familiar with all the concept of this contenta hackfest?" Or, as Gregory Crane said, See Roy Tennant's [http://www.dliblibraryjournal.orgcom/dlibarticle/march06/crane/03craneCA332564.html "What do you do with a million books?Where Librarians Go To Hack"] Text mining, visualization, concordancing are some of and the answers -- process page for making sense of large the [http://access2010.lib.umanitoba.ca/node/3 Access 2010 Hackfest]. I propose a full text corpora -- something often called "distant reading". Participants will go away day hackfest with a better understanding of what focus on the domains of digital humanities are curation, preservation, and repositories -- think stuff like CDL's microservices, Hydra, Fedora, etc. Here's how they can applied it works, roughly: we assemble in the morning and do some whiteboarding, mostly to gauge folks' interests and jot down project ideas; then we separate into teams and hack on stuff for the rest of the day and present our progress at the end. Not a library settingcode hacker? No worries; all skill sets and backgrounds are valuable! (Participants may begin kicking around [[2011 CURATEcamp Hackfest Ideas]].)* Duration: halffull-day* Speaker Bio: Eric Lease Morgan considers himself a librarian first and a computer user second. His professional goal is to discover new ways to use computer to provide better library services. Some Facilitators of his more notable projects included Mr. Serials, Index Morganagus, the Alex Catalogue of Electronic TextsCURATEcamp Hackfest will be:** Shaun Ellis - Digital Library Collections Interface Developer, and MyLibrary. Currently he spends his time investigating the digital humanities and integrating them into VUFind.Princeton University Library** Jason Fowler - Programmer Analyst, UBC Library Systems* Contact: Eric Lease Morgan Mike Giarlo (emorgan michael at ndpsu.edu)* [http://curatecamp.org/pages/curatecamp-hackfest-c4l11-videos Videos] ==Half Day Morning==
=== What's New In Solr ===
* Description: The library world is fired up about Solr. Practically every next-gen catalog is using it (via Blacklight, VuFind, or other technologies). Solr has continued improving in some dramatic ways, including geospatial support, field collapsing/grouping, extended dismax query parsing, pivot/grid/matrix/tree faceting, autosuggest, and more. This session will cover all of these new features, showcasing live examples of them all, including anything new that is implemented prior to the conference.
* Duration: half-day
* Contact: Erik Hatcher (erik.hatcher at lucidimagination.com)
=== Intro to Functional Programming with JavaScript (and a little Haskell) ===* [http://www.slideshare.net/willkurt/intro-to-functional-programming-workshop-code4lib Slides]* [http://dl.dropbox.com/u/5373312/Code4libFPfinaldoc.pdf Workbook] 
* Description: Functional programming is a topic that is becoming increasingly important for programmers to be aware of. Unfortunately it also has the reputation of being an area of programming that is particularly difficult and academic. Languages like Haskell, while being very powerful, certainly live up to this reputation. However many of the essential features of functional programming can be explored through a language as simple and commonplace as JavaScript.
* Contact: Will Kurt (wkurt at unr.edu)
== CURATEcamp Hackfest ==* Description: Want to hack/design/plan/document on a team of people who enjoy learning by creating? Interested in digital curation? Well, this hackfest is for you. Not familiar with the concept of a hackfest? See Roy Tennant's [http://www.libraryjournal.com/article/CA332564.html "Where Librarians Go To Hack"] and the page for the [http://access2010.lib.umanitoba.ca/node/3 Access 2010 Hackfest]. I propose a full-day hackfest with a focus on the domains of digital curation, preservation, and repositories -- think stuff like CDL's microservices, Hydra, Fedora, etc. Here's how it works, roughly: we assemble in the morning and do some whiteboarding, mostly to gauge folks' interests and jot down project ideas; then we separate into teams and hack on stuff for the rest of the day and present our progress at the end. Not a code hacker? No worries; all skill sets and backgrounds are valuable! (Wiki space will be made in advance of the hackfest so participants may begin kicking around ideas.)* Duration: full-day* Speaker Bio: Facilitators of the CURATEcamp Hackfest will be:** Shaun Ellis - Digital Library Collections Interface Developer, Princeton University Library** Jason Fowler - Programmer Analyst, UBC Library Systems* Contact: Mike Giarlo (michael at psu.edu) == Running cloud Servers ===
*Desription: In this pre-conference we will work with the Amazon EC2, S3, and EBS platforms to launch, configure and deploy cloud-based servers. The workshop will include a series of short hands-on tutorials designed to take you from complete novice to semi-skilled cloud server administrator. the tutorials include: 1)short overview of Amazon cloud services and how they are used 2)Amazon registration, 3)Launching, configuring and securing your first instance, 4)Installing a service (Vufind) and 5)Backing up in the cloud - Backup routines and server images.
*Duration: half-day
== Publishing Historic Newspapers with NDNP tools = Creating a new JHOVE2 Format Module===
An in-depth session on publishing and working with historic newspaper content made available through the US National Digital Newspaper Program. The software behind the LC-hosted site at [httpDescription://chroniclingamerica.loc.gov/ chroniclingamerica.loc.gov] (python/django/mysql/solr) JHOVE2 is available under a free/libre/open source license at [http://sourceforge.net/projects/locJava framework and application for format-ndnp/ sourceforge]. This session will include an introduction to the program and working with the software; discussion aware characterization of adding features such as linking between ChromAm at LC files, byte streams within files, and file containers or other institutions publishing the same newspaper content; creating structure and submission for user edited OCR corrections; and article level viewingfile aggregations. This event is open to everyone - non-NDNP participants are invited to join us JHOVE2 examines a digital source unit and learn how to work with this content and help consider how to improve the software. The schedule will include ample time extracts feature information about that source unit for technical discussion purposes of classification, analysis, and hacking on the software itselfuse.
Full day JHOVE2 is a significant re-engineering of its JHOVE (possibly preceded by NDNP[http://hul.harvard.edu/jhove/ http://hul.harvard.edu/jhove/]) predecessor, with a highly modular structure, intended to facilitate the rapid creation of new characterization modules for many formats that can easily be plugged into the JHOVE2 framework. The initial JHOVE2 distribution includes modules for UTF-participants8, SGML, Shapefile, TIFF, WAV, XML, and ICC color profiles, with ZIP, PDF and JPEG-only pre2000 modules expected to be deployed in the next few months. Developers at the Wegener Institute ([http://www.awi-prepotsdam.de http://www.awi-conference session potsdam.de] ) have already created new modules for netCDF and GRIB. Developers at the day beforeFrench National Library (La Bibliothèque nationale de France [http://www.bnf.fr/fr/acc/x.accueil.html http://www.bnf.fr/fr/acc/x.accueil.html])are currently working on GZIP and ARC modules.
Contact: Karen Estlund, University of Oregon Libraries; Dan Chudnov, Library of Congress
This session will provide an overview of the JHOVE2 processing module and plug-in architecture, and will walk through the steps of creating a new format module.
For more information, visit http://jhove2.org.
Duration: half-day
Speaker Bio: Sheila Morrissey is a member of the JHOVE2 development team and is Senior Research Developer at Portico ([http://www.portico.org/digital-preservation/ http://www.portico.org/digital-preservation/])
 
Contact: Sheila Morrissey <sheila dot morrissey at ithaka dot org>
 
==Half Day Afternoon==
 
=== Using JHOVE2 for Policy Assessment of Files ===
 
Description: JHOVE2 is a Java framework and application for format-aware characterization of files, bytestreams within files, and file containers or other file aggregatations. JHOVE2 examine a digital source unit and extracts feature information about that source unit for purposes of classification, analysis, and use.
In addition to detailed output of the features of a format instance, JHOVE2 can provide summary determination of the validity of an item (its conformance to the normative syntactic and semantic requirements defined by an authoritative specification) and can be used for assessing the level of acceptability of a digital object for a specific purpose on the basis of locally-defined policy rules. The latter is one of the significant enhancements of JHOVE2 over its predecessor.
This session will provide some examples of the structure of JHOVE2 format modules, the outputs produced by those modules, and the configuration of the JHOVE2 assessment module so that it can be used to perform rule-based analysis of the reportable properties previously generated during characterization of a source unit.
For more information, visit [http://jhove2.org http://jhove2.org].
Duration: half-day
Speaker Bio: Richard Anderson is a member of the JHOVE2 develpment team and a Software Engineer with the Digital Library Systems and Services unit of Stanford University
Contact: Richard Anderson <rnanders at stanford dot edu>
 
 
 
=== Publishing Historic Newspapers with NDNP tools ===
 
* An in-depth session on publishing and working with historic newspaper content made available through the US National Digital Newspaper Program. The software behind the LC-hosted site at [http://chroniclingamerica.loc.gov/ chroniclingamerica.loc.gov] (python/django/mysql/solr) is available under a free/libre/open source license at [http://sourceforge.net/projects/loc-ndnp/ sourceforge]. This session will include an introduction to the program and working with the software; discussion of adding features such as linking between ChromAm at LC and other institutions publishing the same newspaper content; creating structure and submission for user edited OCR corrections; and article level viewing. This event is open to everyone - non-NDNP participants are invited to join us and learn how to work with this content and help consider how to improve the software. The schedule will include ample time for technical discussion and hacking on the software itself.
 
* Duration: half-day
* Contact: Karen Estlund, University of Oregon Libraries; Dan Chudnov, Library of Congress
 
 
 
=== VIVO Boot Camp===
 
Description: VIVO is an open source semantic web application originally developed and implemented at Cornell University. When installed and populated with researcher interests, activities, and accomplishments, it enables the discovery of research and scholarship across disciplines at that institution. VIVO supports browsing and a search function which returns faceted results for rapid retrieval of desired information and includes options for RDF linked data distribution.
 
This boot camp will be run by members of the NIH/NCRR funded VIVO network and will focus on four components including, an overview of what VIVO is and how it can help researchers, an installation walk-through, how VIVO works (its ontology, visualization functionality, and user interface), and future directions for the project (e.g. profile data re-use in CMSs such as Drupal and Joomla!, federated search, etc.).
For more information, visit [http://vivoweb.org http://vivoweb.org].
Duration: half-day
Speakers:
*Paul Albert, Weill Cornell Medical College
*Nick Cappadona, Cornell University
*Ying Ding, Indiana University
*Bryan Keese, Indiana University
*Micah Linnemeier, Indiana University
*Ryan Cobine, Indiana University
 
 
Contact: Ryan Cobine <rcobine AT indiana DOT edu>
 
=== Islandora Repository System ===
 
Description: The Islandora project (islandora.ca) is growing, with new functionality provided by Solr integration and funding to support the growth of this OS project beyond our library borders. Islandora provides integration between Fedora and Drupal, with custom solution packs to address the needs of multiple data types. This session will review the project's development and current features, as well as providing guidance for basic installation and configuration.
 
Duration: half-day
 
Speaker Bio: Mark Leggott is the founder of the Islandora project. As the UL for the University of Prince Edward Island, and the projects major architect. He has spoken at a number of conferences, and is the founder of a new SaSS company providing services around Islandora software.
 
'''Update 1/12''': Paul Pound will co-lead; Kirsta Stapelfeldt will not attend.
 
Contact: Kirsta Stapelfeldt (kstapelfeldt AT upei.ca)
 
=== Code4Lib Preconference Unconference===
 
* Description: The [http://en.wikipedia.org/wiki/Unconference "Wikipedia entry for unconference"] will give you a good idea what to expect. An "unconference" is "a facilitated, participant-driven conference centered around a theme or purpose." These unconferences came up from the hacker world (see [http://en.wikipedia.org/wiki/Barcamp "BarCamp"]) as a way to avoid high conference fees and sponsored presentations. Unconferences are not spectactor events, nor are they places to "be seen." Participants are involved from the schedule creation to the wrap-up session, and actively present, discuss, and collaborate with fellow participants. In recent years, [http://thatcamp.org/ "THATCamp (The Humanities and Technology Camp)"] has become a popular incarnation of the *camp gathering. In general, check your papers at the door, and just be ready to talk about the work you’re doing, the work you want to do, how you might collaborate with others. Think of it like a conference entirely made up of [[2011_Breakout_Sessions | breakout sessions]], but with some unifying theme. Or not. It depends on you.<br /><br />Now, how will we run an unconference in three hours and in one room? Carefully. I propose a rough schedule of 30 minutes for discussion-of-topics, then three 45-minute bursts of discussions, followed by 15 minutes of wrap-up. As this is all user-generated, it's all up for change in that first 30 minutes. We can have as many concurrent bursts-of-discussion as will fit in the one room, and that would also allow greater flexibility for wandering between groups.<br /><br />This is actually a compressed micro preconference unconference, that should--if all goes according to plan--produce a really fun, interesting, collaborative time, as well as a model that could be taken back to our own workplaces. '''Please''' contact the organizer with questions as well as any ideas for conversations you might want to have; will update this entry accordingly.
 
* Duration: half-day
 
* Organizer / Contact: Julie Meloni (jcmeloni AT gmail dot com)
 
* '''THE SCHEDULE'''<br/>Admin/Schedule setting from 1:30-2:00<br/>Session 1 (210-250pm) A: Web Services & Libraries ([https://docs.google.com/document/d/1PC0vHWCJEA8v8VE0Z_-W-dLPaiYYYwKFDT1JIyCrP9M/edit?hl=en Google Doc]), B: UI/UX Test & Development<br/>Session 2 (2:50-3:30) A: Linked Data/Non-MARC Metadata, B: How Do We Work (Well) [https://docs.google.com/document/d/1sMChUzutBxyhyU9XirKglW75mbFKpaKx5br33F6MvHA/edit?hl=en Google Doc] <br/>Session 3 (3:30-4:10) A: Search (Lots of Things), B: Mobile Web/Location-Based<br/>Wrap-Up/Sharing (until 4:30).
 
 
 
 
[[Category:Code4Lib2011]]

Navigation menu