org.apache.lucene.analysis.miscellaneous.CodepointCountFilter

All Implemented Interfaces:: Closeable, AutoCloseable, Unwrappable<TokenStream>

public final class CodepointCountFilter extends FilteringTokenFilter

Removes words that are too long or too short from the stream.

Note: Length is calculated as the number of Unicode codepoints.

Nested Class Summary

Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.State
Field Summary

Fields inherited from class org.apache.lucene.analysis.TokenFilter
input

Fields inherited from class org.apache.lucene.analysis.TokenStream
DEFAULT_TOKEN_ATTRIBUTE_FACTORY
Constructor Summary

Constructors

Constructor

Description

CodepointCountFilter(TokenStream in, int min, int max)

Create a new CodepointCountFilter.
Method Summary

Modifier and Type

Method

Description

boolean

accept()

Methods inherited from class org.apache.lucene.analysis.FilteringTokenFilter
end, incrementToken, reset

Methods inherited from class org.apache.lucene.analysis.TokenFilter
close, unwrap

Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString

Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait

Class CodepointCountFilter

Nested Class Summary