Try local file search using Fess on CentOS7

Try a full-text search of local files using Fess. Open Source Full Text Search Server Fess

There is no particular reason, but except for the JDK, download the zip and install it. Here when installing from rpm.

The version of CentOS to install to.

CentOS Linux release 7.9.2009 (Core)

Since there is Document, there was nothing particularly clogged up. Just try it, so run it all in your home directory.

JDK installation

sudo yum install java-11-openjdk-devel

Check if it works

Install the following files, compile and see if it works

HelloWorld.java


public class HelloWorld {
    public static void main(String[] args) {
        System.out.println("Hello World!");
        System.out.println(System.getProperty("java.version"));
    }
}
javac HelloWorld.java
java HelloWorld

Installation of Elasticsearch

Download Elasticsearch Free | Get Started Now | Elastic | ElasticDownload from. The latest at this point is 7.10.1.

wget https://artifacts.elastic.co/downloads/elasticsearch/elasticsearch-7.10.1-linux-x86_64.tar.gz
tar -zxvf elasticsearch-7.10.1-linux-x86_64.tar.gz

Installation of Fess

Get the latest code for fess from github. Releases · codelibs/fess · GitHub The latest at this point is 13.10.2.

wget https://github.com/codelibs/fess/releases/download/fess-13.10.2/fess-13.10.2.zip
unzip fess-13.10.2.zip

Change the directory name.

mv elasticsearch-7.10.1 elasticsearch
mv fess-13.10.2 fess

Elasticsearch settings

Reference: Install

Install the Fess plugin on Elasticsearch. When I checked github, the latest was 7.10.0. Tags · codelibs/elasticsearch-analysis-fess · GitHub

./elasticsearch/bin/elasticsearch-plugin install org.codelibs:elasticsearch-analysis-fess:7.10.0
./elasticsearch/bin/elasticsearch-plugin install org.codelibs:elasticsearch-analysis-extension:7.10.0
./elasticsearch/bin/elasticsearch-plugin install org.codelibs:elasticsearch-configsync:7.10.0
./elasticsearch/bin/elasticsearch-plugin install org.codelibs:elasticsearch-dataformat:7.10.0
./elasticsearch/bin/elasticsearch-plugin install org.codelibs:elasticsearch-minhash:7.10.0

Recognize ./elasticsearch/data/config/. Added settings for accessing Fess. The path is a full path.

./elasticsearch/config/elasticsearch.yml


configsync.config_path: /path/to/elasticsearch/data/config/

He said that he will specify a boot option to connect to the Elasticsearch cluster with Fess.

./fess/bin/fess.in.sh


ES_HTTP_URL=http://localhost:9200
FESS_DICTIONARY_PATH=/path/to/elasticsearch/data/config/

Verification

Start it and check if it can be accessed with a browser.

./elasticsearch/bin/elasticsearch
./fess/bin/fess

Check if it can be accessed with http: // [server IP]: 8080 http: // [server IP]: 8080/admin is the admin login screen. ID: admin PASS: admin

If you cannot access it, check around the firewall.

Crawl local files

Configuration

Reference: File Crawl

This time, the files are placed under / var/fess so that the files under this directory can be searched in full text. Log in to the management screen and register as follows in the left menu Crawler-> File System.

Name: Appropriate Path file:/var/fess / Path to crawl file: /var/fess/.*

I want to run it immediately, so click "Default Crawler" in System-> Scheduler and click the" Start Now Button "to crawl. You can check the crawl status and results with System Information-> Crawl Information, so check if the data is available when finished.

Check if you can search from System Information-> Search.

If you want to delete the search index

Reference: Remove document from Fess index – with a Christian Wife

Enter "*: * "in System Information-> Search` to search and delete with the delete button at the bottom of the screen.

Recommended Posts

Try local file search using Fess on CentOS7
Try local file search using Fess on CentOS8
Try OpenLiteSpeed on CentOS8
Try RabbitMQ + PHP on CentOS
Try DPDK20 SDK on CentOS7 ①Install
Try using Redmine on Mac docker
Install Ruby 2.5 on CentOS 7 using SCL
Try putting CentOS 8 on Raspberry Pi 3
Using a local network on iOS 14
Try DPDK20 SDK on CentOS7 (2) Control DPDK driver
Try using the service on Android Oreo
Try using Firebase Cloud Functions on Android (Java)
HelloWorld using OpenJDK on CentOS8 after brain death
Try similar search of Image Search using Java SDK [Search]
Try communication using gRPC on Android + Java server
Try using libGDX
Try using Maven
Try using powermock-mockito2-2.0.2
Try using GraalVM
Try using jmockit 1.48
Try using sql-migrate
OpenVPN on CentOS 8
Try using SwiftLint
Try using Log4j 2.0
Try Image Search's similar search using Java SDK [Registration]
Try to implement using Rakuten product search API (easy)
Try image classification using TensorFlow Lite on Android (JAVA)
I installed CentOS 8 on Windows 10 using VirtualBox and Vagrant
Try using the query attribute of Ruby on Rails
I installed Squid on CentOS in my local environment