Red Hat Training

A Red Hat training course is available for Red Hat JBoss Enterprise Application Platform

23.2. Configuration

23.2.1. Minimum Configuration

Hibernate Search is designed to be flexible in its configuration and operation, with default values chosen to suit the majority of use cases. At a minimum, a Directory Provider must be configured, along with its properties. The default Directory Provider is `filesystem`, which uses the local filesystem for index storage. For details of the available Directory Providers and their configuration, see Section 23.2.3, “DirectoryProvider Configuration”.

If you are using Hibernate directly, settings such as the DirectoryProvider must be set in the configuration file, either `hibernate.properties` or `hibernate.cfg.xml`. If you are using Hibernate via JPA, the configuration file is `persistence.xml`.
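
As a minimal sketch of the configuration described above, the following `hibernate.properties` fragment selects the `filesystem` Directory Provider explicitly. The `hibernate.search.default.` prefix and the `indexBase` property are not introduced in this section and are assumed here; the path is a hypothetical value chosen for illustration.

```properties
# Use the filesystem Directory Provider for all indexes (this is also the default).
hibernate.search.default.directory_provider = filesystem
# Assumed base directory for index storage; replace with a path valid on your system.
hibernate.search.default.indexBase = /var/lucene/indexes
```

When Hibernate is used via JPA, the same keys would be declared as properties in `persistence.xml` instead.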

Property	Description
hibernate.search.worker.scope	The fully qualified class name of the `Worker` implementation to use. If this property is not set, is empty, or is set to `transaction`, the default `TransactionalWorker` is used.
hibernate.search.worker.*	All configuration properties prefixed with `hibernate.search.worker` are passed to the Worker during initialization. This allows custom, worker-specific parameters to be added.
hibernate.search.worker.batch_size	Defines the maximum number of indexing operations batched per context. Once the limit is reached, indexing is triggered even though the context has not ended yet. This property only works if the `Worker` implementation delegates the queued work to `BatchedQueueingProcessor` (which is what the `TransactionalWorker` does).
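
The worker properties in the table above might be set as in the following sketch. The batch size of 50 is an arbitrary illustrative value, not a recommendation.

```properties
# Keep the default TransactionalWorker (equivalent to leaving the property unset).
hibernate.search.worker.scope = transaction
# Trigger indexing once 50 operations are queued, even if the context has not ended.
# 50 is an illustrative value only.
hibernate.search.worker.batch_size = 50
```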

Property	Description
hibernate.search.<indexName>.worker.execution	`sync`: synchronous execution (default); `async`: asynchronous execution
hibernate.search.<indexName>.worker.thread_pool.size	The back end can apply updates from the same transaction context (or batch) in parallel, using a thread pool. The default value is 1. You can experiment with larger values if you have many operations per transaction.
hibernate.search.<indexName>.worker.buffer_queue.max	Defines the maximum number of work items queued when the thread pool is starved. Useful only for asynchronous execution. Defaults to unlimited. If the limit is reached, the work is done by the main thread.
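
A hedged example of the execution settings above, enabling asynchronous execution with a bounded queue. It assumes that `default` can stand in for `<indexName>` to apply the settings to all indexes; the numeric values are illustrative.

```properties
# Apply index updates asynchronously instead of the default synchronous mode.
hibernate.search.default.worker.execution = async
# Allow two threads to apply updates from the same batch in parallel (default is 1).
hibernate.search.default.worker.thread_pool.size = 2
# Queue at most 50 work items; beyond that, the main thread does the work itself.
hibernate.search.default.worker.buffer_queue.max = 50
```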

Property	Description
hibernate.search.<indexName>.worker.jndi.*	Defines the JNDI properties used to initialize the InitialContext (if needed). JNDI is only used by the JMS back end.
hibernate.search.<indexName>.worker.jms.connection_factory	Mandatory for the JMS back end. Defines the JNDI name used to look up the JMS connection factory (`/ConnectionFactory` by default in Red Hat JBoss Enterprise Application Platform).
hibernate.search.<indexName>.worker.jms.queue	Mandatory for the JMS back end. Defines the JNDI name used to look up the JMS queue. The queue is used to post work messages.
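
The two mandatory JMS properties above could be combined as in this sketch. The `worker.backend` line is an assumption about how the JMS back end is selected, since this table does not cover it, and the queue name is a hypothetical JNDI name.

```properties
# Assumption: the JMS back end is selected through the worker.backend property.
hibernate.search.default.worker.backend = jms
# JNDI name of the JMS connection factory (the default in JBoss EAP).
hibernate.search.default.worker.jms.connection_factory = /ConnectionFactory
# Hypothetical JNDI name of the queue that receives the work messages.
hibernate.search.default.worker.jms.queue = queue/hibernatesearch
```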

Property	Description	Default Value
hibernate.search.[default|<indexName>].exclusive_index_use	Set to `true` when no other process will need to write to the same index. This enables Hibernate Search to work in exclusive mode on the index and improves performance when writing changes to the index.	`true` (improved performance, releases locks only at shutdown)
hibernate.search.[default|<indexName>].max_queue_length	Each index has a separate "pipeline" which contains the updates to be applied to the index. When this queue is full, adding more operations to the queue becomes a blocking operation. Configuring this setting is only meaningful when `worker.execution` is set to `async`.	`1000`
hibernate.search.[default|<indexName>].indexwriter.max_buffered_delete_terms	Determines the minimum number of delete terms required before the buffered in-memory delete terms are applied and flushed. If there are documents buffered in memory at the time, they are merged and a new segment is created.	Disabled (flushes by RAM usage)
hibernate.search.[default|<indexName>].indexwriter.max_buffered_docs	Controls the number of documents buffered in memory during indexing. The larger the value, the more RAM is consumed.	Disabled (flushes by RAM usage)
hibernate.search.[default|<indexName>].indexwriter.max_merge_docs	Defines the largest number of documents allowed in a segment. Smaller values perform better on frequently changing indexes; larger values provide better search performance if the index does not change often.	Unlimited (Integer.MAX_VALUE)
hibernate.search.[default|<indexName>].indexwriter.merge_factor	Controls segment merge frequency and size. Determines how often segment indexes are merged when insertion occurs. With smaller values, less RAM is used while indexing, and searches on unoptimized indexes are faster, but indexing speed is slower. With larger values, more RAM is used during indexing, and while searches on unoptimized indexes are slower, indexing is faster. Thus larger values (> 10) are best for batch index creation, and smaller values (< 10) for indexes that are interactively maintained. The value must not be lower than 2.	10
hibernate.search.[default|<indexName>].indexwriter.merge_min_size	Controls segment merge frequency and size. Segments smaller than this size (in MB) are always considered for the next segment merge operation. Setting this too large might result in expensive merge operations, even though they are less frequent. See also `org.apache.lucene.index.LogDocMergePolicy.minMergeSize`.	0 MB (actually ~1K)
hibernate.search.[default|<indexName>].indexwriter.merge_max_size	Controls segment merge frequency and size. Segments larger than this size (in MB) are never merged into bigger segments. This helps reduce memory requirements and avoids some merging operations at the cost of optimal search speed. When optimizing an index, this value is ignored. See also `org.apache.lucene.index.LogDocMergePolicy.maxMergeSize`.	Unlimited
hibernate.search.[default|<indexName>].indexwriter.merge_max_optimize_size	Controls segment merge frequency and size. Segments larger than this size (in MB) are not merged into bigger segments even when optimizing the index (see the `merge_max_size` setting as well). Applied to `org.apache.lucene.index.LogDocMergePolicy.maxMergeSizeForOptimize`.	Unlimited
hibernate.search.[default|<indexName>].indexwriter.merge_calibrate_by_deletes	Controls segment merge frequency and size. Set to `false` to not consider deleted documents when estimating the merge policy. Applied to `org.apache.lucene.index.LogMergePolicy.calibrateSizeByDeletes`.	`true`
hibernate.search.[default|<indexName>].indexwriter.ram_buffer_size	Controls the amount of RAM in MB dedicated to document buffers. When used together with `max_buffered_docs`, a flush occurs for whichever event happens first. Generally, for faster indexing performance it is best to flush by RAM usage instead of document count and to use as large a RAM buffer as you can.	16 MB
hibernate.search.[default|<indexName>].indexwriter.term_index_interval	Expert: Sets the interval between indexed terms. Large values cause less memory to be used by an IndexReader, but slow random access to terms. Small values cause more memory to be used by an IndexReader, and speed up random access to terms. See the Lucene documentation for more details.	128
hibernate.search.[default|<indexName>].indexwriter.use_compound_file	The advantage of using the compound file format is that fewer file descriptors are used. The disadvantage is that indexing takes more time and temporary disk space. You can set this parameter to `false` in an attempt to improve the indexing time, but you could run out of file descriptors if `mergeFactor` is also large. Boolean parameter, use "`true`" or "`false`". The default value for this option is `true`.	true
hibernate.search.enable_dirty_check	Not all entity changes require a Lucene index update. If none of the updated entity properties (dirty properties) are indexed, Hibernate Search skips the re-indexing process. Disable this option if you use custom `FieldBridge`s which need to be invoked at each update event (even though the property for which the field bridge is configured has not changed). This optimization is not applied to classes using a `@ClassBridge` or a `@DynamicBoost`. Boolean parameter, use "`true`" or "`false`". The default value for this option is `true`.	true
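
To illustrate the `[default|<indexName>]` pattern used in the table above, the following sketch sets index writer defaults for all indexes and overrides one value for a single index. The index name `Animals` and all numeric values are hypothetical, chosen only to show the syntax.

```properties
# Defaults for every index: flush by RAM usage with a 64 MB buffer (default is 16 MB).
hibernate.search.default.indexwriter.ram_buffer_size = 64
# A merge factor above 10 favors batch index creation over interactive updates.
hibernate.search.default.indexwriter.merge_factor = 20
# Override for a hypothetical index named "Animals" that is maintained interactively.
hibernate.search.Animals.indexwriter.merge_factor = 5
# Re-index on every update, for example when custom FieldBridges must always run.
hibernate.search.enable_dirty_check = false
```

Settings under an index name take precedence over those under `default` for that index, so only the values that differ need to be repeated.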
