CLucene - a full-featured, c++ search engine
API Documentation


Analyzers.h File Reference

#include "CLucene/util/Reader.h"
#include "AnalysisHeader.h"
#include "CLucene/util/Misc.h"
#include "CLucene/util/VoidMapSetDefinitions.h"

Go to the source code of this file.

Namespaces

namespace  lucene
namespace  lucene::analysis

Data Structures

class  lucene::analysis::CharTokenizer
 An abstract base class for simple, character-oriented tokenizers. More...
class  lucene::analysis::LetterTokenizer
 A LetterTokenizer is a tokenizer that divides text at non-letters. More...
class  lucene::analysis::LowerCaseTokenizer
 LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. More...
class  lucene::analysis::WhitespaceTokenizer
 A WhitespaceTokenizer is a tokenizer that divides text at whitespace. More...
class  lucene::analysis::WhitespaceAnalyzer
 An Analyzer that uses WhitespaceTokenizer. More...
class  lucene::analysis::SimpleAnalyzer
 An Analyzer that filters LetterTokenizer with LowerCaseFilter. More...
class  lucene::analysis::LowerCaseFilter
 Normalizes token text to lower case. More...
class  lucene::analysis::StopFilter
 Removes stop words from a token stream. More...
class  lucene::analysis::WordlistLoader
 Loader for text files that represent a list of stopwords. More...
class  lucene::analysis::StopAnalyzer
 Filters LetterTokenizer with LowerCaseFilter and StopFilter. More...
class  lucene::analysis::PerFieldAnalyzerWrapper
 This analyzer is used to facilitate scenarios where different fields require different analysis techniques. More...
class  lucene::analysis::ISOLatin1AccentFilter
 A filter that replaces accented characters in the ISO Latin 1 character set (ISO-8859-1) by their unaccented equivalent. More...
class  lucene::analysis::KeywordTokenizer
 Emits the entire input as a single token. More...
class  lucene::analysis::KeywordAnalyzer
 "Tokenizes" the entire stream as a single token. More...
class  lucene::analysis::LengthFilter
 Removes words that are too long and too short from the stream. More...


clucene.sourceforge.net