Monitoring the CM Elasticsearch Index

Here are a couple of different ways to manage your new CM 9.2 Elasticsearch instance.


via Chrome

Visit the Chrome Web Store and add the ElasticSearch Head extension.

2017-12-05_22-25-23.png

Then, when you launch the extension, you can update the server address.  With the visuals I can easily see that unassigned shards are what's pushing my cluster health to yellow.

2017-12-05_17-01-51.png

You can also browse the data in the index.

2017-12-05_17-04-30.png

via Head Stand-alone Server

This gives you the same functionality as shown above, but the interface runs as a stand-alone server on port 9100.  You'd access it from any browser at http://localhost:9100.  Use this option if you can't use Chrome.
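If you go this route, the head project runs as a small Node.js app.  A minimal setup sketch, assuming you have Git and Node.js installed (you may also need to enable CORS in elasticsearch.yml so the UI can reach the cluster):

```powershell
# Fetch and launch the elasticsearch-head stand-alone server.
git clone https://github.com/mobz/elasticsearch-head.git
cd elasticsearch-head
npm install
npm run start    # serves the UI at http://localhost:9100
```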


via PowerShell

Install Elastico from an elevated PowerShell prompt.

2017-12-05_9-54-45.png
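For reference, the install is a one-liner from the PowerShell Gallery, something like:

```powershell
# Install the Elastico module (elevated prompt), then list what it provides.
Install-Module -Name Elastico
Get-Command -Module Elastico
```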

You have to trust the repository in order to continue.  If you're in a secure environment, visit the GitHub page for the source and manually install the module instead.

2017-12-05_9-55-34.png

Now I can run a command to check the overall cluster's health...

2017-12-05_10-06-05.png
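If you'd rather not rely on the module, the same information is available straight from the REST API.  A minimal sketch, assuming Elasticsearch is listening on the default port 9200:

```powershell
# Pull cluster health directly and show the fields that matter most.
Invoke-RestMethod -Uri 'http://localhost:9200/_cluster/health' |
    Format-List status, number_of_nodes, active_shards, unassigned_shards
```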

Another command to check the statuses of the indexes...

2017-12-05_10-08-45.png
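The raw equivalent here is the _cat API, which returns a plain-text table (the ?v flag adds a header row):

```powershell
# One line per index: health, status, doc count, store size, etc.
Invoke-RestMethod -Uri 'http://localhost:9200/_cat/indices?v'
```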

Note that in both instances I'm calling a script intended for v5 even though I'm using v6.  I can actually run any version of the command, as they all seem to be forward and backward compatible.  It probably still makes the most sense to run the one intended for the most recent version of ES.

I can also perform a search.  I went to the client and found the record I worked with in my last post (I had used Kibana as a front-end to the ES index).  Then I searched via PowerShell.

2017-12-05_10-21-31.png
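Again, the module-free version of the same search is just a POST to _search.  A sketch, where the hpecm_* pattern matches the default CM content index naming and the search term is whatever you're after:

```powershell
# Build a query_string search and run it against the content index.
$body = @{
    query = @{
        query_string = @{ query = 'kennedy' }
    }
} | ConvertTo-Json -Depth 5

Invoke-RestMethod -Uri 'http://localhost:9200/hpecm_*/_search' `
    -Method Post -ContentType 'application/json' -Body $body
```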

via Kibana

When creating your index pattern, keep in mind that the default naming convention for the CM content index starts with "hpecm_".  You could use that, or just a plain asterisk, to configure your access.  If you wish to use the Timelion feature then you should also pick the date registered field as the time filter.

2017-12-05_22-05-46.png
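If you're unsure what the pattern will match, you can check from PowerShell before saving it; a quick sketch:

```powershell
# List only the indices matching the hpecm_ naming convention.
Invoke-RestMethod -Uri 'http://localhost:9200/_cat/indices/hpecm_*?v'
```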

When you then click Discover, you can explore the index.

2017-12-05_22-10-35.png

You can pick a different time period that might show some results...

2017-12-05_22-11-18.png

If I can now get some data, then I know the system is at least partially working.  Note that Kibana doesn't give much detail about the internal workings of the cluster or index.

2017-12-05_22-12-40.png

Creating a PostgreSQL Dataset

Creating a dataset that uses PostgreSQL is just as easy as creating one on SQL Server.  Here I'll step through the process and show screenshots of the various steps.  Although I'm using a locally installed PostgreSQL database, the process for a remote one is the same.

2017-12-04_21-40-10.png

Identifying the dataset requires entering a name and an ID.  Then I selected the PostgreSQL dataset type and clicked Next.

2017-12-04_21-40-50.png

Next I clicked the KwikSelect icon.

2017-12-05_8-41-11.png

Then I picked the appropriate driver.  Note that if you select the ANSI driver, you will still be forced to enable Unicode characters (you cannot uncheck the option later in the wizard).  Therefore you should pick the Unicode driver.

2017-12-05_8-40-44.png

If you don't see any drivers, you need to run the PostgreSQL Application Stack Builder and install the appropriate drivers.  As you can see below, I've already installed the drivers I'll need.

2017-12-05_8-42-09.png

The connection string is not much different from SQL Server's.  Check with your database administrator to verify that your driver choice is correct.

2017-12-04_21-46-01.png
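If you want to sanity-check the string outside the wizard, the ODBC driver can be exercised directly from PowerShell.  A sketch only; the server, database, and credentials below are placeholders for my environment:

```powershell
# Open and close an ODBC connection using the PostgreSQL Unicode driver.
$connStr = 'Driver={PostgreSQL Unicode};Server=localhost;Port=5432;Database=cm92;Uid=cm_user;Pwd=changeme;'
$conn = New-Object System.Data.Odbc.OdbcConnection($connStr)
try {
    $conn.Open()
    "Connected to PostgreSQL $($conn.ServerVersion)"
}
finally {
    $conn.Close()
}
```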

Clicking OK will then enter the connection string into the dialog.  You can change the string manually if you need to for some reason, though you cannot change the password, as it's encrypted here.

Encrypted password

At the last step in the configuration process I made sure to uncheck the GIS columns option.  I don't currently have PostGIS set up and configured.

2017-12-04_22-15-28.png

The creation of the database finished without error.

2017-12-04_22-17-22.png

Now I can inspect the schema that's been created.  The exact same number of tables exists in PostgreSQL as in SQL Server.

2017-12-05_8-36-53.png
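One quick way to compare is to count the tables on the PostgreSQL side.  A sketch, assuming psql is on your PATH; the database name is a placeholder:

```powershell
# Count the CM tables created in the public schema.
& psql -U postgres -d cm92 -c "SELECT count(*) FROM information_schema.tables WHERE table_schema = 'public';"
```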

From within the client you can see the configuration in the setup information dialog.

2017-12-05_8-49-58.png

The Schema Manager feature in Enterprise Studio behaves the same, with all the same options.  To verify that, I manually removed some indexes (sketched below).  Running a check should highlight this.
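Dropping an index is easy enough from psql; the index name below is hypothetical, so substitute a non-critical one from your own schema:

```powershell
# Deliberately remove an index so the schema check has something to flag.
& psql -U postgres -d cm92 -c "DROP INDEX IF EXISTS idx_example;"
```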

2017-12-05_8-54-56.png

As I hoped, the feature works and tells me I'm missing some indexes.

2017-12-05_8-56-44.png

You can also migrate from SQL Server to PostgreSQL by using the export feature in Enterprise Studio.

2017-12-05_9-11-27.png

However, take care to plan ahead.  Any dataset created with the GIS options will require that PostGIS be installed locally.  The header of the export will include a message indicating as much.

2017-12-05_9-13-18.png

Use Kibana to inspect your ElasticSearch index

Now that I've got ElasticSearch installed, configured, and "ready to go", why doesn't it work?  When I "upgraded" my machine to 9.2 I configured ElasticSearch and imported the contents of the ISO.  I then checked that it worked by searching for "content".

2017-12-04_17-59-35.png

Then I imported the JFK archives and tried searching for the word "kennedy".  No results this time. :(

This can't be right!

I checked the event processor and saw it was blocked by an error...

2017-12-04_14-39-40.png

I was actively importing records via a PowerShell script and didn't want to restart the entire workgroup service.  Instead, I killed TRIMEvent.exe and watched it spawn again automatically.  Then the event processor just flew through all of my records.

2017-12-04_14-43-11.png
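For reference, killing just the event processor (rather than restarting the whole service) is a one-liner; Stop-Process matches on the process name without the .exe extension:

```powershell
# Kill TRIMEvent.exe; the workgroup service spawns it again automatically.
Stop-Process -Name TRIMEvent -Force
```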

After I let them finish processing, I tried running a search for kennedy and got some results.  You can see them below.

2017-12-04_17-27-14.png

What I am most excited about, though, is ElasticSearch.  With Kibana installed, I can visualize the electronic content of my records.  For instance, I'll search for the record number highlighted above and show the results below.

The record is returned because the word was in the contents

But that's not all!  Content Manager is pushing lots of data into the index, as shown below.

2017-12-04_17-44-05.png

I can now use open-source tools to turn my records repository into a big data gold mine.  I can use the Visualize feature to generate some reports.  There's even an ElasticSearch connector for MicroStrategy.  At a minimum, if something appears wrong with the content index, I can use Kibana to verify what should be returned for a given record.