Clarify regex character range case insensitivity limitations (#125413)

* Update regexp-syntax.md

9.x equivalent of https://github.com/elastic/elasticsearch/pull/125412

* use md syntax
This commit is contained in:
Liam Thompson 2025-03-21 17:43:44 +01:00 committed by GitHub
parent 76260267b0
commit 397c9c59c7
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
1 changed files with 7 additions and 0 deletions

View File

@ -138,6 +138,13 @@ A `^` before a character in the brackets negates the character or range. For exa
[^abc\-] # matches any character except 'a', 'b', 'c', or '-'
```
:::{note}
Character range classes such as `[a-c]` do not behave as expected when using `case_insensitive: true` — they remain case sensitive. For example, `[a-c]+` with `case_insensitive: true` will match strings containing only the characters 'a', 'b', and 'c', but not 'A', 'B', or 'C'. Use `[a-zA-Z]` to match both uppercase and lowercase characters.
This is due to a known limitation in Lucene's regular expression engine.
See [Lucene issue #14378](https://github.com/apache/lucene/issues/14378) for details.
:::
## Optional operators [regexp-optional-operators]