Currently it is possible to config full text search by providing a custom implementation of IAnalyzerFactory.
The purpose of this work item is to provide one general purpose new implementation as an alternative to DefaultAnalyzerFactory that is configurable but with roughly the same functionality as DefaultAnalyzerFactory.
Here is a sample bigdata.properties file section setting up the new class:
Goal is to provide support for each of the language specific Analyzers from lucene, the PatternAnalyzer, which needs an additional property, WhitespaceAnalyzer, SimpleAnalyzer, KeywordAnalyzer, StopAnalyzer
Some analyzers support stop words, some do not. The stopwords property can have value none or default (if stopwords are supported).
For the PatternAnalyzer that supports stop words but has no default list, the stop words property can be none or the name of a class that does have a default stop words list.
There may be an additional analyzer which is the com.bigdata.search.EmptyAnalyzer which always returns an EmptyTokenStream; this allows for turning bds search off for certain language tags and/or off for the default and on only for specified tags.
When looking up a language tag the rules of rfc4647 should be used.