5.1.1. Character filtering

As you can see in the upper left of the figure, Elasticsearch first runs the character filters; these filters are used to transform particular character sequences into other character sequences. This can be used for things like stripping HTML out of text or converting an arbitrary number of characters into other

characters (perhaps correcting the text-message shortening of “I love u 2” into “I love you too”). In figure

5.1 we use the character filter to replace “&” with the word “and.”

Figure 5.2. Analyzer overview