Difference between revisions of "Hacking Pre-Ingest Assessment Tools (Solr/Ruby/Python)"

From Code4Lib
(+fears and desires)
m (+sections)
Line 8:
  This would maybe be happy with an Event microservice.  Mark Phillips hopes to release a Django app to this effect in April 2011.
- Fears:
+ ==Fear==
  * Identifiers are precious
  * Ingest is forever
Line 14:
  * Hard drives full of junk and an uncorrelated spreadsheet.
- Desires:
+ ==Desires==
  * Command-line statistical analysis (histogram, number of distinct values, etc.) of spreadsheets.

Revision as of 15:51, 7 February 2011

Django/Solr Metadata Archive Tool

As part of my code4lib presentation I (Matienzo) may demo some code that takes Digital Forensics XML and gets it into a Solr index. I've successfully thrown Blacklight on top of it, but want to extend it further, especially in terms of figuring out what else I can do with it and creating a straightforward UI that represents directory hierarchies.

This would maybe be happy with an Event microservice. Mark Phillips hopes to release a Django app to this effect in April 2011.
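
For discussion, here is a minimal sketch of the first half of that pipeline: turning Digital Forensics XML into flat records that a Solr index could hold. It is not Matienzo's code. It assumes the input uses `fileobject` elements with `filename`, `filesize`, and `hashdigest` children, and the output field names (`id`, `parent_path`, `filesize`, `hash_*`) are hypothetical choices, not an existing schema.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip any '{namespace}' prefix from an ElementTree tag."""
    return tag.rsplit("}", 1)[-1]


def fileobjects_to_docs(xml_text):
    """Turn each <fileobject> element into a flat dict suitable for indexing."""
    docs = []
    root = ET.fromstring(xml_text)
    for element in root.iter():
        if local_name(element.tag) != "fileobject":
            continue
        doc = {}
        for child in element:
            name = local_name(child.tag)
            text = (child.text or "").strip()
            if name == "filename":
                # Hypothetical field names: the path doubles as the document id.
                doc["id"] = text
                # Parent directory, kept so a UI could group files by folder.
                doc["parent_path"] = text.rsplit("/", 1)[0] if "/" in text else ""
            elif name == "filesize" and text.isdigit():
                doc["filesize"] = int(text)
            elif name == "hashdigest":
                doc["hash_" + child.get("type", "unknown").lower()] = text
        if "id" in doc:
            docs.append(doc)
    return docs
```

The resulting list of dicts could then be serialized and sent to a Solr update handler; that step is left out here because it depends on the index schema. Storing `parent_path` separately is one possible way to support a directory-hierarchy view through faceting.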

Fear

  • Identifiers are precious
  • Ingest is forever
  • Where does rights management come in?
  • Hard drives full of junk and an uncorrelated spreadsheet.

Desires

  • Command-line statistical analysis (histogram, number of distinct values, etc.) of spreadsheets.
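
As a starting point for the spreadsheet desire above, here is a rough sketch of a command-line tool that reports the number of distinct values and a text histogram for one column of a CSV export. It uses only the Python standard library. The script name `colstats.py` and the example column name are hypothetical, and it assumes the spreadsheet has been saved as CSV with a header row.

```python
import csv
import sys
from collections import Counter


def column_stats(rows, column):
    """Count how often each value occurs in one column of dict-style rows."""
    return Counter((row.get(column) or "").strip() for row in rows)


def render_histogram(counts, width=40):
    """Return text lines: value, count, and a bar scaled to the largest count."""
    if not counts:
        return []
    largest = max(counts.values())
    lines = []
    for value, n in counts.most_common():
        bar = "#" * max(1, round(width * n / largest))
        label = value if value else "(empty)"
        lines.append(f"{label:<20} {n:>6} {bar}")
    return lines


def main(argv):
    """Print distinct-value count and histogram for one CSV column."""
    if len(argv) != 2:
        print("usage: colstats.py FILE.csv COLUMN", file=sys.stderr)
        return
    path, column = argv
    with open(path, newline="", encoding="utf-8") as handle:
        counts = column_stats(csv.DictReader(handle), column)
    print(f"{len(counts)} distinct values in column {column!r}")
    print("\n".join(render_histogram(counts)))


if __name__ == "__main__":
    main(sys.argv[1:])
```

A run might look like `python colstats.py inventory.csv format`. Empty cells are counted under their own `(empty)` label, so gaps in the spreadsheet show up in the histogram instead of being silently dropped.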