README ********************** NSF dataset ********************** Local copy (to be added soon): /cs/sandbox/faculty/tyang/290N/NSFabstract/ contains 3 parts. Source: http://archive.ics.uci.edu/ml/datasets/NSF+Research+Award+Abstracts+1990-2003 ********************** Lucene Readme ********************** 1. download lucene and untar wget http://apache.osuosl.org/lucene/java/3.6.2/lucene-3.6.2.tgz 2. set classpath export CLASSPATH=./PATHTOJAR:./PATHTODEMOJAR for example, export CLASSPATH=$HOME/lucene-3.6.2/lucene-core-3.6.2.jar:$HOME/lucene-3.6.2/contrib/demo/lucene-demo-3.6.2.jar 3. index documents java org.apache.lucene.demo.IndexFiles -docs ./PATHTODOCS 4. search documents java org.apache.lucene.demo.SearchFiles You might find this link helpful: http://lucene.apache.org/core/3_6_2/queryparsersyntax.html#Boolean operators ********************** Solr Readme ********************** Official tutorial: http://lucene.apache.org/solr/api/doc-files/tutorial.html 1. download solr and untar wget http://apache.mirrors.lucidnetworks.net/lucene/solr/3.6.2/apache-solr-3.6.2.tgz 2. command line: start connection cd example java -jar start.jar 3. open web browser http://localhost:8983/solr/admin/ 4. index documents cd example/exampledocs type in command: java -jar post.jar for example, java -jar post.jar solr.xml monitor.xml 5. type in search word from browser