« Intellectual Property Disclosure Process, Michael Doran | main | Lightning Talks 4 »
Open-Source Endeca in 250 Lines or Less, Casey Durfee
2 Mar 07
Open-Source Endeca in 250 Lines or Less
Casey Durfee
Seattle Public Library
#bugs ~ lines of code ^ 1.5
demo: http://catalog.spl.org/catalog/
code: http://extranet.spl.org/code/code4lib2007.zip
presentation: http://extranet.spl.org/talks/open_source_endeca/
Solr shortcuts
– results in Python format
– no database
– lucene search syntax
Django
– faster than Rails, can handle concurrent users
– forces you to do things the right way; forces split between coding and design
– object-oriented templates
goal to keep the URL as simple as possible
– no bizarre numeric codes
Solr performance tricks
– <optimize/>
– huge filterCache, very important for faceting; roughly equal to number of bib records in database
– some facets are faster than others; need to warm facets: query all records and do facets on every field (facetwarmer.py run every 10 minutes)
Posted by pzed on March 2, 2007 at 11.47am
Categories: code4lib 2007, conferences, libraries
