Apache Solr
To index documents, a collection (SolrCloud Mode) or core (Standalone Mode) must exist in the Apache Solr instance. The collection / core encapsulates various configuration options, such as the search engine schema definition. Because the connector only indexes document metadata for fields that are present in the existing Solr schema, a schema must be defined before the collection / core is created.
High-level step-by-step guide:
- Create a configuration set.
  - Define a Solr schema or use the default one provided with the connector. See the Schema Design Documentation.
  - Define the Solr configuration.
    - Optional: Enable the auto-commit feature of Solr. See the Commit Documentation.
  - Create the configuration set based on the schema and Solr configuration.
    - Standalone & SolrCloud Mode: See the ConfigSet Documentation.
- Create a collection / core based on the created configuration set.
  - Standalone Mode: CoreAdmin API.
  - SolrCloud Mode: Collections API.
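The final step of the guide can be sketched as two request URLs, one per mode. This is a minimal sketch, not the connector's own tooling: the base address `http://localhost:8983/solr`, the helper names, and the collection and configuration set names are assumptions chosen for illustration. The `CREATE` actions and their `name`, `numShards`, `replicationFactor`, `collection.configName`, and `configSet` parameters are those of Solr's Collections API and CoreAdmin API.

```python
from urllib.parse import urlencode

# Assumption: Solr listens on its default local address.
SOLR_BASE = "http://localhost:8983/solr"


def create_collection_url(name, config_name, num_shards=1,
                          replication_factor=1, base=SOLR_BASE):
    """Build a Collections API CREATE URL (SolrCloud Mode)."""
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
        # Name of the configuration set created in the previous step.
        "collection.configName": config_name,
    }
    return f"{base}/admin/collections?{urlencode(params)}"


def create_core_url(name, config_set, base=SOLR_BASE):
    """Build a CoreAdmin API CREATE URL (Standalone Mode)."""
    params = {"action": "CREATE", "name": name, "configSet": config_set}
    return f"{base}/admin/cores?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    print(create_collection_url("connector_docs", "connector_config"))
    print(create_core_url("connector_docs", "connector_config"))
```

The helpers only build the URLs; sending them (for example with `urllib.request.urlopen` or `curl`) is left to the caller, so that authentication and TLS settings of the target Solr instance can be handled separately.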
Further useful Apache Solr control scripts can be found here.
The default schema provided with the connector package can be found in
the schema folder of the connector directory. This example schema is based on
standard connector metadata.
To get an overview of the metadata extracted from the source system, we suggest configuring a collection / core with a schema that contains a catch-all dynamic field, that is, a dynamic field whose name pattern matches every field name and whose type is a text type.
With such a dynamic field in the schema, all available metadata are indexed as text fields.
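The field definition itself is not reproduced in this section. As a hedged sketch of what such a dynamic field could look like, the snippet below generates a `<dynamicField>` element for the schema. The `*` name pattern, the `text_general` field type (present in Solr's stock configuration sets), and the `multiValued` setting are assumptions, as is the helper name; the definition shipped with the connector may differ.

```python
import xml.etree.ElementTree as ET


def catch_all_dynamic_field(field_type="text_general"):
    """Return a <dynamicField> element whose "*" pattern matches every field name."""
    return ET.Element(
        "dynamicField",
        {
            "name": "*",            # assumption: match all field names
            "type": field_type,     # assumption: a text field type defined in the schema
            "indexed": "true",
            "stored": "true",
            "multiValued": "true",  # assumption: a metadata field may carry several values
        },
    )


if __name__ == "__main__":
    # Print the element so it can be pasted into the schema file.
    print(ET.tostring(catch_all_dynamic_field(), encoding="unicode"))
```

Because the catch-all pattern accepts any field name, it is best suited to exploring the available metadata; explicit field definitions remain the better choice once the relevant fields are known.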
The connector does not trigger commits. Therefore, using the auto-commit feature in your Solr configuration is recommended.
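One way to enable auto-commit on an existing collection / core is Solr's Config API, which accepts a `set-property` command for `updateHandler.autoCommit.maxTime`. The following is a minimal sketch under stated assumptions: the base address, the collection name, the helper name, and the 15-second interval are illustrative, not values prescribed by the connector. The same setting can instead be placed in the `<autoCommit>` section of `solrconfig.xml` before the configuration set is created.

```python
import json
from urllib.request import Request


def autocommit_request(collection, max_time_ms=15000,
                       base="http://localhost:8983/solr"):
    """Build a Config API request that sets the hard auto-commit interval."""
    body = {
        "set-property": {
            # Commit pending documents at most max_time_ms after they arrive.
            "updateHandler.autoCommit.maxTime": max_time_ms,
        }
    }
    return Request(
        f"{base}/{collection}/config",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical collection name; the request is built but not sent.
    req = autocommit_request("connector_docs")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would send the request. A hard commit makes documents durable; whether they also become visible to searches depends on the `openSearcher` and soft-commit settings of the Solr configuration.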