Class FSTCompletionLookup
- All Implemented Interfaces:
Accountable
Lookup API to FSTCompletion.
This adapter differs from FSTCompletion in that it attempts to discretize any
"weights" as passed from in InputIterator.weight() to match the number of buckets. For
the rationale for bucketing, see FSTCompletion.
Note:Discretization requires an additional sorting pass.
The range of weights for bucketing/ discretization is determined by sorting the input by weight and then dividing into equal ranges. Then, scores within each range are assigned to that bucket.
Note that this means that even large differences in weights may be lost during automaton construction, but the overall distinction between "classes" of weights will be preserved regardless of the distribution of weights.
For fine-grained control over which weights are assigned to which buckets, use FSTCompletion directly or TSTLookup, for example.
- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.search.suggest.Lookup
Lookup.LookupPriorityQueue, Lookup.LookupResult -
Field Summary
FieldsModifier and TypeFieldDescriptionprivate intprivate longNumber of entries the lookup was built withprivate booleanprivate FSTCompletionAutomaton used for completions with higher weights reordering.private static intAn invalid bucket count if we're creating an object of this class from an existing FST.private FSTCompletionAutomaton used for normal completions.private static final intShared tail length for conflating in the created automaton.private final Directoryprivate final StringFields inherited from class org.apache.lucene.search.suggest.Lookup
CHARSEQUENCE_COMPARATORFields inherited from interface org.apache.lucene.util.Accountable
NULL_ACCOUNTABLE -
Constructor Summary
ConstructorsConstructorDescriptionThis constructor should only be used to read a previously saved suggester.FSTCompletionLookup(Directory tempDir, String tempFileNamePrefix) This constructor prepares for creating a suggested FST using thebuild(InputIterator)method.FSTCompletionLookup(Directory tempDir, String tempFileNamePrefix, int buckets, boolean exactMatchFirst) This constructor prepares for creating a suggested FST using thebuild(InputIterator)method.FSTCompletionLookup(Directory tempDir, String tempFileNamePrefix, FSTCompletion completion, boolean exactMatchFirst) This constructor takes a pre-built automaton. -
Method Summary
Modifier and TypeMethodDescriptionvoidbuild(InputIterator iterator) Builds up a new internalLookuprepresentation based on the givenInputIterator.private static intencodeWeight(long value) weight -> costget(CharSequence key) Returns the bucket (weight) as a Long for the provided key if it exists, otherwise null if it does not.Returns nested resources of this class.longgetCount()Get the number of entries the lookup was built withbooleanDiscard current lookup data and load it from a previously saved copy.lookup(CharSequence key, Set<BytesRef> contexts, boolean higherWeightsFirst, int num) Look up a key and return possible completion for this key.longReturn the memory usage of this object in bytes.booleanstore(DataOutput output) Persist the constructed lookup data to a directory.
-
Field Details
-
INVALID_BUCKETS_COUNT
private static int INVALID_BUCKETS_COUNTAn invalid bucket count if we're creating an object of this class from an existing FST.- See Also:
-
tempDir
-
tempFileNamePrefix
-
buckets
private int buckets -
exactMatchFirst
private boolean exactMatchFirst -
higherWeightsCompletion
Automaton used for completions with higher weights reordering. -
normalCompletion
Automaton used for normal completions. -
count
private volatile long countNumber of entries the lookup was built with
-
-
Constructor Details
-
FSTCompletionLookup
public FSTCompletionLookup()This constructor should only be used to read a previously saved suggester. -
FSTCompletionLookup
This constructor prepares for creating a suggested FST using thebuild(InputIterator)method. The number of weight discretization buckets is set toFSTCompletion.DEFAULT_BUCKETSand exact matches are promoted to the top of the suggestions list. -
FSTCompletionLookup
public FSTCompletionLookup(Directory tempDir, String tempFileNamePrefix, int buckets, boolean exactMatchFirst) This constructor prepares for creating a suggested FST using thebuild(InputIterator)method.- Parameters:
buckets- The number of weight discretization buckets (seeFSTCompletionfor details).exactMatchFirst- Iftrueexact matches are promoted to the top of the suggestions list. Otherwise they appear in the order of discretized weight and alphabetical within the bucket.
-
FSTCompletionLookup
public FSTCompletionLookup(Directory tempDir, String tempFileNamePrefix, FSTCompletion completion, boolean exactMatchFirst) This constructor takes a pre-built automaton.- Parameters:
completion- An instance ofFSTCompletion.exactMatchFirst- Iftrueexact matches are promoted to the top of the suggestions list. Otherwise they appear in the order of discretized weight and alphabetical within the bucket.
-
-
Method Details
-
build
Description copied from class:LookupBuilds up a new internalLookuprepresentation based on the givenInputIterator. The implementation might re-sort the data internally.- Specified by:
buildin classLookup- Throws:
IOException
-
encodeWeight
private static int encodeWeight(long value) weight -> cost -
lookup
public List<Lookup.LookupResult> lookup(CharSequence key, Set<BytesRef> contexts, boolean higherWeightsFirst, int num) Description copied from class:LookupLook up a key and return possible completion for this key.- Specified by:
lookupin classLookup- Parameters:
key- lookup key. Depending on the implementation this may be a prefix, misspelling, or even infix.contexts- contexts to filter the lookup by, or null if all contexts are allowed; if the suggestion contains any of the contexts, it's a matchhigherWeightsFirst- return only more popular resultsnum- maximum number of results to return- Returns:
- a list of possible completions, with their relative weight (e.g. popularity)
-
get
Returns the bucket (weight) as a Long for the provided key if it exists, otherwise null if it does not. -
store
Description copied from class:LookupPersist the constructed lookup data to a directory. Optional operation.- Specified by:
storein classLookup- Parameters:
output-DataOutputto write the data to.- Returns:
- true if successful, false if unsuccessful or not supported.
- Throws:
IOException- when fatal IO error occurs.
-
load
Description copied from class:LookupDiscard current lookup data and load it from a previously saved copy. Optional operation.- Specified by:
loadin classLookup- Parameters:
input- theDataInputto load the lookup data.- Returns:
- true if completed successfully, false if unsuccessful or not supported.
- Throws:
IOException- when fatal IO error occurs.
-
ramBytesUsed
public long ramBytesUsed()Description copied from interface:AccountableReturn the memory usage of this object in bytes. Negative values are illegal. -
getChildResources
Description copied from interface:AccountableReturns nested resources of this class. The result should be a point-in-time snapshot (to avoid race conditions).- See Also:
-
getCount
public long getCount()Description copied from class:LookupGet the number of entries the lookup was built with
-