Module org.apache.lucene.codecs
Class UniformSplitTerms
java.lang.Object
org.apache.lucene.index.Terms
org.apache.lucene.codecs.uniformsplit.UniformSplitTerms
- Direct Known Subclasses:
STUniformSplitTerms
Terms based on the Uniform Split technique.
The index dictionary is loaded lazily, only when TermsEnum.seekCeil(org.apache.lucene.util.BytesRef) or TermsEnum.seekExact(org.apache.lucene.util.BytesRef) is called. It is not loaded for a direct terms enumeration.
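The lazy-loading behaviour described above can be pictured with a small stand-alone model. This is only a sketch in plain Java, not Lucene's implementation: the class `LazyDictionaryTerms`, its `Supplier`-based loader, and the `loadCount` counter are hypothetical names introduced for illustration, and a `TreeSet` stands in for the real index dictionary.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.TreeSet;
import java.util.function.Supplier;

/** Toy model: the dictionary is built on the first seek, never for plain iteration. */
final class LazyDictionaryTerms {
    private final List<String> sortedTerms;          // stands in for the term blocks
    private final Supplier<TreeSet<String>> loader;  // stands in for the dictionary supplier
    private TreeSet<String> dictionary;              // null until a seek needs it
    int loadCount = 0;                               // number of times the dictionary was loaded

    LazyDictionaryTerms(List<String> terms) {
        List<String> copy = new ArrayList<>(terms);
        Collections.sort(copy);
        this.sortedTerms = copy;
        this.loader = () -> new TreeSet<>(copy);
    }

    /** Direct enumeration walks the terms without touching the dictionary. */
    Iterator<String> iterator() {
        return sortedTerms.iterator();
    }

    /** Returns the smallest term >= target, or null if none; loads the dictionary on first use. */
    String seekCeil(String target) {
        return dictionary().ceiling(target);
    }

    /** Returns true if the exact term exists; loads the dictionary on first use. */
    boolean seekExact(String target) {
        return dictionary().contains(target);
    }

    /** Loads the dictionary once and reuses it afterwards (not thread-safe in this sketch). */
    private TreeSet<String> dictionary() {
        if (dictionary == null) {
            dictionary = loader.get();
            loadCount++;
        }
        return dictionary;
    }
}
```

In this model, iterating never increments `loadCount`, while the first `seekCeil` or `seekExact` call loads the dictionary exactly once.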
-
Field Summary
Fields
Modifier and Type                                  Field
protected final BlockDecoder                       blockDecoder
protected final IndexInput                         blockInput
protected final IndexDictionary.BrowserSupplier    dictionaryBrowserSupplier
protected final FieldMetadata                      fieldMetadata
protected final PostingsReaderBase                 postingsReader
Fields inherited from class org.apache.lucene.index.Terms
EMPTY_ARRAY
-
Constructor Summary
Constructors
Modifier     Constructor
protected    UniformSplitTerms(IndexInput blockInput, FieldMetadata fieldMetadata, PostingsReaderBase postingsReader, BlockDecoder blockDecoder, IndexDictionary.BrowserSupplier dictionaryBrowserSupplier)
-
Method Summary
Modifier and Type  Method                                                     Description
protected void     checkIntersectAutomatonType(CompiledAutomaton automaton)
int                getDocCount()                                              Returns the number of documents that have at least one term for this field.
BytesRef           getMax()                                                   Returns the largest term (in lexicographic order) in the field.
long               getSumDocFreq()                                            Returns the sum of TermsEnum.docFreq() for all terms in this field.
long               getSumTotalTermFreq()                                      Returns the sum of TermsEnum.totalTermFreq() for all terms in this field.
boolean            hasFreqs()                                                 Returns true if documents in this field store per-document term frequency (PostingsEnum.freq()).
boolean            hasOffsets()                                               Returns true if documents in this field store offsets.
boolean            hasPayloads()                                              Returns true if documents in this field store payloads.
boolean            hasPositions()                                             Returns true if documents in this field store positions.
TermsEnum          intersect(CompiledAutomaton compiled, BytesRef startTerm)  Returns a TermsEnum that iterates over all terms and documents that are accepted by the provided CompiledAutomaton.
TermsEnum          iterator()                                                 Returns an iterator that will step through all terms.
long               size()                                                     Returns the number of terms for this field, or -1 if this measure isn't stored by the codec.
-
Field Details
-
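blockInput
protected final IndexInput blockInput
-
fieldMetadata
protected final FieldMetadata fieldMetadata
-
postingsReader
protected final PostingsReaderBase postingsReader
-
blockDecoder
protected final BlockDecoder blockDecoder
-
dictionaryBrowserSupplier
protected final IndexDictionary.BrowserSupplier dictionaryBrowserSupplier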
-
-
Constructor Details
-
UniformSplitTerms
protected UniformSplitTerms(IndexInput blockInput, FieldMetadata fieldMetadata, PostingsReaderBase postingsReader, BlockDecoder blockDecoder, IndexDictionary.BrowserSupplier dictionaryBrowserSupplier)
- Parameters:
blockDecoder - Optional block decoder; may be null if there is none. It can be used for decompression or decryption.
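The "optional decoder, may be null" convention for blockDecoder can be sketched as follows. This is a stand-alone illustration under stated assumptions, not Lucene code: `ByteDecoder`, `readBlock`, and `xorDecoder` are hypothetical names, and the `ByteDecoder` interface is deliberately simpler than Lucene's BlockDecoder.

```java
/** Toy model of an optional decoding step applied to raw block bytes. */
final class OptionalDecoderDemo {
    /** Hypothetical stand-in for a block decoder; not Lucene's BlockDecoder interface. */
    interface ByteDecoder {
        byte[] decode(byte[] encoded);
    }

    /** Returns the block unchanged when the decoder is null, otherwise the decoded bytes. */
    static byte[] readBlock(byte[] rawBlock, ByteDecoder decoderOrNull) {
        return decoderOrNull == null ? rawBlock : decoderOrNull.decode(rawBlock);
    }

    /** Example "decryption": XOR every byte with a fixed key (for illustration only). */
    static ByteDecoder xorDecoder(byte key) {
        return encoded -> {
            byte[] out = new byte[encoded.length];
            for (int i = 0; i < encoded.length; i++) {
                out[i] = (byte) (encoded[i] ^ key);
            }
            return out;
        };
    }
}
```

Passing null skips the decoding step entirely, which mirrors the case where blocks are stored without compression or encryption.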
-
-
Method Details
-
iterator
public TermsEnum iterator() throws IOException
Description copied from class: Terms
Returns an iterator that will step through all terms. This method will not return null.
- Specified by:
iterator in class Terms
- Throws:
IOException
-
intersect
public TermsEnum intersect(CompiledAutomaton compiled, BytesRef startTerm) throws IOException
Description copied from class: Terms
Returns a TermsEnum that iterates over all terms and documents that are accepted by the provided CompiledAutomaton. If the startTerm is provided, the returned enum will only return terms > startTerm, but you must still call next() first to get to the first term. Note that the provided startTerm must be accepted by the automaton.
This is an expert low-level API and will only work for NORMAL compiled automata. To handle any compiled automaton, use CompiledAutomaton.getTermsEnum(org.apache.lucene.index.Terms) instead.
NOTE: the returned TermsEnum cannot seek.
- Overrides:
intersect in class Terms
- Throws:
IOException
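The startTerm contract of intersect (only terms strictly greater than startTerm are returned, and next() must be called to reach the first one) can be illustrated with a toy model. This sketch uses plain Java only and is not how Lucene implements intersection: `IntersectModel` is a hypothetical class, a `Predicate` stands in for the compiled automaton, natural String order stands in for Lucene's term order, and the requirement that startTerm itself be accepted is not checked.

```java
import java.util.Iterator;
import java.util.List;
import java.util.TreeSet;
import java.util.function.Predicate;

/** Toy model of the intersect contract: accepted terms strictly greater than startTerm. */
final class IntersectModel {
    private final Iterator<String> candidates;
    private final Predicate<String> accepts;  // stands in for the compiled automaton

    IntersectModel(List<String> terms, Predicate<String> accepts, String startTermOrNull) {
        TreeSet<String> sorted = new TreeSet<>(terms);
        // startTerm is exclusive: only terms > startTerm are ever returned.
        this.candidates = (startTermOrNull == null
                ? sorted
                : sorted.tailSet(startTermOrNull, false)).iterator();
        this.accepts = accepts;
    }

    /** Advances to the next accepted term, or returns null when exhausted. */
    String next() {
        while (candidates.hasNext()) {
            String term = candidates.next();
            if (accepts.test(term)) {
                return term;
            }
        }
        return null;
    }
}
```

Like the enum described above, this model is forward-only: it exposes next() but no seek operation.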
-
checkIntersectAutomatonType
protected void checkIntersectAutomatonType(CompiledAutomaton automaton)
-
getMax
public BytesRef getMax()
Description copied from class: Terms
Returns the largest term (in lexicographic order) in the field. Note that, just like other term measures, this measure does not take deleted documents into account. This returns null when there are no terms.
-
size
public long size()
Description copied from class: Terms
Returns the number of terms for this field, or -1 if this measure isn't stored by the codec. Note that, just like other term measures, this measure does not take deleted documents into account.
-
getSumTotalTermFreq
public long getSumTotalTermFreq()
Description copied from class: Terms
Returns the sum of TermsEnum.totalTermFreq() for all terms in this field. Note that, just like other term measures, this measure does not take deleted documents into account.
- Specified by:
getSumTotalTermFreq in class Terms
-
getSumDocFreq
public long getSumDocFreq()
Description copied from class: Terms
Returns the sum of TermsEnum.docFreq() for all terms in this field. Note that, just like other term measures, this measure does not take deleted documents into account.
- Specified by:
getSumDocFreq in class Terms
-
getDocCount
public int getDocCount()
Description copied from class: Terms
Returns the number of documents that have at least one term for this field. Note that, just like other term measures, this measure does not take deleted documents into account.
- Specified by:
getDocCount in class Terms
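The four statistics documented above (size, getSumTotalTermFreq, getSumDocFreq, getDocCount) are often combined into derived averages. The sketch below shows the arithmetic only. `FieldStats` is a hypothetical holder class, the values are supplied by the caller rather than read from an index, and the average field length is meaningful only on the assumption that the field stores term frequencies. As with the underlying measures, deleted documents are not taken into account.

```java
/** Toy holder for field-level statistics; not part of Lucene. */
final class FieldStats {
    final long size;              // number of terms, or -1 if the codec does not store it
    final long sumTotalTermFreq;  // total term occurrences across the field
    final long sumDocFreq;        // sum of per-term document frequencies
    final int docCount;           // documents with at least one term for the field

    FieldStats(long size, long sumTotalTermFreq, long sumDocFreq, int docCount) {
        this.size = size;
        this.sumTotalTermFreq = sumTotalTermFreq;
        this.sumDocFreq = sumDocFreq;
        this.docCount = docCount;
    }

    /** Average number of term occurrences per document that has the field. */
    double averageFieldLength() {
        return docCount == 0 ? 0.0 : (double) sumTotalTermFreq / docCount;
    }

    /** Average number of documents per term, or NaN when the term count is unknown or zero. */
    double averageDocFreq() {
        return size <= 0 ? Double.NaN : (double) sumDocFreq / size;
    }
}
```

The guard on `size` reflects the documented -1 sentinel: a caller should check for it before dividing.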
-
hasFreqs
public boolean hasFreqs()
Description copied from class: Terms
Returns true if documents in this field store per-document term frequency (PostingsEnum.freq()).
-
hasOffsets
public boolean hasOffsets()
Description copied from class: Terms
Returns true if documents in this field store offsets.
- Specified by:
hasOffsets in class Terms
-
hasPositions
public boolean hasPositions()
Description copied from class: Terms
Returns true if documents in this field store positions.
- Specified by:
hasPositions in class Terms
-
hasPayloads
public boolean hasPayloads()
Description copied from class: Terms
Returns true if documents in this field store payloads.
- Specified by:
hasPayloads in class Terms
-