<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>https://wiki.code4lib.org/index.php?action=history&amp;feed=atom&amp;title=Talk%3A2015_Prepared_Talk_Proposals</id>
		<title>Talk:2015 Prepared Talk Proposals - Revision history</title>
		<link rel="self" type="application/atom+xml" href="https://wiki.code4lib.org/index.php?action=history&amp;feed=atom&amp;title=Talk%3A2015_Prepared_Talk_Proposals"/>
		<link rel="alternate" type="text/html" href="https://wiki.code4lib.org/index.php?title=Talk:2015_Prepared_Talk_Proposals&amp;action=history"/>
		<updated>2026-04-06T23:38:03Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.26.2</generator>

	<entry>
		<id>https://wiki.code4lib.org/index.php?title=Talk:2015_Prepared_Talk_Proposals&amp;diff=41544&amp;oldid=prev</id>
		<title>Cbeer: Created page with &quot; == The Impossible Search: Pulling data form unknown sources ==   * Riley Childs, no official affiliation (currently a Senior in High School at Charlotte United Christian Academy...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.code4lib.org/index.php?title=Talk:2015_Prepared_Talk_Proposals&amp;diff=41544&amp;oldid=prev"/>
				<updated>2014-09-09T01:06:05Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot; == The Impossible Search: Pulling data form unknown sources ==   * Riley Childs, no official affiliation (currently a Senior in High School at Charlotte United Christian Academy...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
== The Impossible Search: Pulling data form unknown sources ==&lt;br /&gt;
 &lt;br /&gt;
* Riley Childs, no official affiliation (currently a Senior in High School at Charlotte United Christian Academy), rchilds (AT) cucawarriors.com &lt;br /&gt;
&lt;br /&gt;
It's easy to search data you know the structure of, but what if you need to pull in data from sources that don't have a standard structure. The ability to search community events along with your standard catalog search results is an example, but often the only way to pull these events is through XML, JSON, (Insert structured format here), or even just raw html. But how do you get that structure? That simple question is what makes this impossible. The process to define and process this structure takes a lot of manual labor, especially if the data you are pulling is just HTML, and then every time you add data to the index you have to run all the data through a script to pull in data in a format Solr or an other index can use. This talk will focus on Solr, but the principles explained will apply to many other indexes.&lt;/div&gt;</summary>
		<author><name>Cbeer</name></author>	</entry>

	</feed>