SearchStax

The SearchStax® Frequently Asked Questions page includes the following approved question and answer about our Apache Solr Cloud services.


How Do I Update the Solr Schema?

When Solr logs start to show “unknown field” errors, it means that you are sending documents to Solr that do not match the Solr schema. Documents with unknown fields cannot be added to the index, which has obvious drawbacks for your project. For more information, see What Causes Indexing Errors? To remedy this situation, you’ll have to update the schema.

How should you update the Solr schema? Well, that depends. Sitecore users must update the schema through the Sitecore Dashboard. Clients who configure Solr directly use zkcli.bat or zkcli.sh to upload a new schema file. Some clients use the Solr Configset API and Config API, but these are disabled in Managed Solr by default.

——————————————————

SearchStax Managed Solr is a fully-managed hosted Solr SaaS solution that automates, manages and scales Solr infrastructure.

Sitecore Schema Update

Here are instructions for updating the Sitecore schema. Updating the Solr schema(s) from Sitecore is extremely simple, but there is one pitfall to avoid.

  1. Log in to the Sitecore Content Management environment.
  2. Open the Sitecore Control Panel.
  3. In the Indexing tab, click Populate Solr Managed Schema.
  4. Select all indexes and click Populate.
  5. On the same page, go to the Indexing Manager, select all, and click Rebuild.

The above procedure (“select all indexes”) assumes that you used the SearchStax Sitecore Plugin to set up the deployment. The Plugin configures each collection to have its own schema. In that configuration, you can refresh all of the collections at the same time.

If you manually configured the collections to all use the same schema, however, you cannot regenerate all of the schemas in one step. You must regenerate them one at a time. See Why does Sitecore fail to populate schemas?

If you run into any difficulty making the updates, see Sitecore Connection Errors for a list of common Sitecore/Solr issues.

Zkcli Schema Update

Many users have asked how to manually update the Solr schema file in their deployments. Solr documentation recommends using the Solr Schema API to update the schema, but many Solr users find it burdensome and unintuitive. They prefer to edit the managedschema file locally in a text editor and then upload it. The two approaches both work, as long as you don’t mix them.

The following procedure uses the edit-locally-and-upload strategy. The critical part is that changes to field definitions invalidate the collection’s index. The index must be wiped and the documents reloaded as part of a schema update. The process occurs in four steps.

Zookeeper Best Practice

The examples below rely on zkcli, the Zookeeper upload/download utility. This is appropriate to your initial experiments, but it is not the best way to interact with Zookeeper.

For truly secure Zookeeper management, use the SearchStax API. The API’s Zookeeper methods require SearchStax user authentication, and they record entries in the SearchStax access logs. Even better, they are not affected by changes in IP addresses.

Download a copy of the schema!

If you don’t have a copy of your Solr configuration, you can download one.

1. Update Schema (upload a new configuration to Zookeeper).

Edit your schema.xml or managed-schema file in a text editor. Be sure to save it in your local copy of your project-configuration directory.

When you are ready to deploy your new schema file, simply use zkcli to upload your configuration directory to your Zookeeper ensemble. (See zkcli download instructions.)

Run zkcli in a terminal window:

./zkcli.sh -zkhost <zookeeper URLs> -cmd upconfig -confdir <conf directory> -confname <config name>

where <zookeeper URLs> corresponds to the URLs of the Zookeeper servers in the deployment details page:

SearchStax Solr Schema Update

<conf directory> is you local Solr configuration directory (../configsets/basic_configs/conf/ or ../configsets/_default/conf/); and <config name> is the name of the configuration in Zookeeper (test).

2. Delete Existing Data

To delete the data that was loaded under the previous schema, see How to Empty a Solr Index.

3. Reload Collection (distribute new configuration from Zookeeper)

The next step is to download the new configuration from the Zookeeper ensemble to the Solr servers.

curl 'https:/<load_balancer_URL>/solr/admin/collections?action=RELOAD&name=<collection_name>'

where <load_balancer_URL> is the Solr HTTP Endpoint URL from the SearchStax dashboard, and <collection_name> is the name of the Solr collection (testcollection).

Note that you can perform this step directly from the Solr Dashboard.

SearchStax Solr Schema Update

4. Reload Data (re-ingest documents)

We presume that you already know how to ingest documents, but here’s a reminder:

Run the following cURL command from a terminal window.

curl  -X POST -H 'Content-type:application/json' -d <datafile_path> 'https://<load_balancer_URL>/solr/<collection_name>/update?commit=true'

where <load_balancer_URL> is the Solr HTTP Endpoint URL from the deployment details page, <datafile_path> is the location of the document file (@sample.json), and <collection_name> is the name of the Solr collection (testcollection).


We love to answer questions!

Please contact the SearchStax Support Desk immediately if you have any question about Solr Cloud deployments.

Return to Frequently Asked Questions.