The way I do it is less elegant than what it looks like you are shooting for: I preprocess the documents with a named entity recognizer and save all of the entities to a separate file. Then, when publishing to Solr, I read the entities from that file and populate the entity fields (separate fields for people, locations, and organizations). This could be simplified, but since I had already done the parsing for other work, it was easier to just reuse what already existed.
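A minimal sketch of that two-step flow, assuming a JSON entities file keyed by document id and hypothetical Solr field names `person`, `location`, and `organization` (the actual Solr publish call is omitted; in practice you would hand the resulting dict to a client such as pysolr or SolrJ):

```python
import json

# Hypothetical output of the NER preprocessing step: one entry per
# document, listing the entities found, grouped by entity type.
ENTITIES_JSON = """
{
  "doc-1": {
    "PERSON": ["Ada Lovelace"],
    "LOCATION": ["London"],
    "ORGANIZATION": ["Analytical Society"]
  }
}
"""

# Map NER labels to the (hypothetical) Solr field names.
FIELD_FOR_LABEL = {
    "PERSON": "person",
    "LOCATION": "location",
    "ORGANIZATION": "organization",
}

def build_solr_doc(doc_id, text, entities_by_doc):
    """Build the dict that would be sent to Solr for one document."""
    doc = {"id": doc_id, "text": text}
    for label, field in FIELD_FOR_LABEL.items():
        values = entities_by_doc.get(doc_id, {}).get(label, [])
        if values:
            doc[field] = values  # multi-valued entity field
    return doc

entities = json.loads(ENTITIES_JSON)
doc = build_solr_doc("doc-1", "Ada Lovelace worked in London...", entities)
print(doc["person"])  # entities come from the saved file, not re-parsed
```

The point of the split is that the expensive NER pass runs once, and the publish step only does cheap lookups in the saved file.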
Here's an idea I think would work in Lucene, but I have no idea if it's possible in Solr. You could tokenize the string outside the typical TokenStream chain, as you suggest, and then manually add the tokens to the document using the NOT_ANALYZED option. You have to add each token separately with document.add(...), and Lucene will treat them as a single field for searching.
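In older Lucene Java APIs this would look roughly like `doc.add(new Field("entity", token, Field.Store.YES, Field.Index.NOT_ANALYZED))`, called once per token (the exact signature depends on the Lucene version). A toy Python sketch of the idea itself, with a hypothetical in-memory index: tokenization happens outside any analysis chain, and each token is added as a separate, unanalyzed value of the same field, so matching is exact:

```python
import re
from collections import defaultdict

def my_tokenize(text):
    """Stand-in for tokenization done outside the analyzer chain,
    e.g. output of an external NER or custom tokenizer."""
    return re.findall(r"[A-Za-z]+", text)

class ToyIndex:
    """Minimal model of one multi-valued, unanalyzed field: each
    added value is indexed verbatim, so search is exact match."""
    def __init__(self):
        self.postings = defaultdict(set)  # token -> set of doc ids

    def add(self, doc_id, field_value):
        # Analogous to document.add(...) with NOT_ANALYZED: the
        # value is indexed as-is, with no further analysis applied.
        self.postings[field_value].add(doc_id)

    def search(self, term):
        return sorted(self.postings.get(term, set()))

index = ToyIndex()
for token in my_tokenize("Solr and Lucene"):
    index.add("doc-1", token)  # one add(...) call per token

print(index.search("Lucene"))   # exact match hits doc-1
print(index.search("lucene"))   # no lowercasing was applied, so no hit
```

Note the consequence of NOT_ANALYZED: because no analyzer runs at index time, queries only match if they use the exact same token form you added.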
I can't really give you an answer, but what I can give you is a way to a solution: you have to find the angle that you relate to or that piques your interest. A good paper is one that people get drawn into because it reaches them in some way. As for me, when I think of WWII, I think of the Holocaust and the effect it had on the survivors, their families, and those who stood by and did nothing until it was too late.