Lucene is a high performance, scalable, cross-platform search engine that contains many advanced features that often go untapped by the majority of users. In this session, designed for those familiar with Lucene, we will examine some of Lucene's more advanced topics and their application, including:
1. Term Vectors: Manual and Pseudo relevance feedback; Advanced document collection analysis for domain specialization
2. Span Queries: Better phrase matching; Candidate Identification for Question Answering
3. Tying it all Together: Building a search framework for experimentation and rapid deployment
4. Case Studies from CNLP: Crosslingual/multilingual retrieval in Arabic, English and Dutch; Sublanguage specialization for commercial trouble ticket analysis; Passage retrieval and analysis for Question Answering application
Topics 1 through 3 will provide technical details on implementing the advanced Lucene features, while the fourth topic will provide a broader context for understanding when and where to use these features.
For more information, see the CNLP ApacheCon Info