elasticsearch

Commit Graph

Author	SHA1	Message	Date
Liam Thompson	ba95390895	[DOCS][9.x] Fix tip placement in lookup-join.md (#127552 ) h/t @alex-spies	2025-04-30 12:15:14 +02:00
Pete Gillin	061a751a09	Fix a one-word typo in the `date` processor docs (#127548 ) This erroneously claimed that the example used a `drop` processor (which drops whole documents) when it actually uses a `remove` processor (which removes fields).	2025-04-30 10:03:43 +02:00
Nik Everett	10336c950c	ESQL: Speed loading stored fields (#127348 ) This speeds up loading from stored fields by opting more blocks into the "sequential" strategy. This really kicks in when loading stored fields like `text`. And when you need less than 100% of documents, but more than, say, 10%. This is most useful when you need 99.9% of field documents. That sort of thing. Here's the perf numbers: ``` %100.0 {"took": 403 -> 401,"documents_found":1000000} %099.9 {"took":3990 -> 436,"documents_found": 999000} %099.0 {"took":4069 -> 440,"documents_found": 990000} %090.0 {"took":3468 -> 421,"documents_found": 900000} %030.0 {"took":1213 -> 152,"documents_found": 300000} %020.0 {"took": 766 -> 104,"documents_found": 200000} %010.0 {"took": 397 -> 55,"documents_found": 100000} %009.0 {"took": 352 -> 375,"documents_found": 90000} %008.0 {"took": 304 -> 317,"documents_found": 80000} %007.0 {"took": 273 -> 287,"documents_found": 70000} %005.0 {"took": 199 -> 204,"documents_found": 50000} %001.0 {"took": 46 -> 46,"documents_found": 10000} ``` Let's explain this with an example. First, jump to `main` and load a million documents: ``` rm -f /tmp/bulk for a in {1..1000}; do echo '{"index":{}}' >> /tmp/bulk echo '{"text":"text '$(printf %04d $a)'"}' >> /tmp/bulk done curl -s -uelastic:password -HContent-Type:application/json -XDELETE localhost:9200/test for a in {1..1000}; do echo -n $a: curl -s -uelastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_bulk?pretty --data-binary @/tmp/bulk \| grep errors done curl -s -uelastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_forcemerge?max_num_segments=1 curl -s -uelastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_refresh echo ``` Now query them all. Run this a few times until it's stable: ``` echo -n "%100.0 " curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{ "query": "FROM test \| STATS SUM(LENGTH(text))", "pragma": { "data_partitioning": "shard" } }' \| jq -c '{took, documents_found}' ``` Now fetch 99.9% of documents: ``` echo -n "%099.9 " curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{ "query": "FROM test \| WHERE NOT text.keyword IN (\"text 0998\") \| STATS SUM(LENGTH(text))", "pragma": { "data_partitioning": "shard" } }' \| jq -c '{took, documents_found}' ``` This should spit out something like: ``` %100.0 { "took":403,"documents_found":1000000} %099.9 {"took":4098, "documents_found":999000} ``` We're loading fewer documents but it's slower! What in the world?! If you dig into the profile you'll see that it's value loading: ``` $ curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{ "query": "FROM test \| STATS SUM(LENGTH(text))", "pragma": { "data_partitioning": "shard" }, "profile": true }' \| jq '.profile.drivers[].operators[] \| select(.operator \| contains("ValuesSourceReaderOperator"))' { "operator": "ValuesSourceReaderOperator[fields = [text]]", "status": { "readers_built": { "stored_fields[requires_source:true, fields:0, sequential: true]": 222, "text:column_at_a_time:null": 222, "text:row_stride:BlockSourceReader.Bytes": 1 }, "values_loaded": 1000000, "process_nanos": 370687157, "pages_processed": 222, "rows_received": 1000000, "rows_emitted": 1000000 } } $ curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{ "query": "FROM test \| WHERE NOT text.keyword IN (\"text 0998\") \| STATS SUM(LENGTH(text))", "pragma": { "data_partitioning": "shard" }, "profile": true }' \| jq '.profile.drivers[].operators[] \| select(.operator \| contains("ValuesSourceReaderOperator"))' { "operator": "ValuesSourceReaderOperator[fields = [text]]", "status": { "readers_built": { "stored_fields[requires_source:true, fields:0, sequential: false]": 222, "text:column_at_a_time:null": 222, "text:row_stride:BlockSourceReader.Bytes": 1 }, "values_loaded": 999000, "process_nanos": 3965803793, "pages_processed": 222, "rows_received": 999000, "rows_emitted": 999000 } } ``` It jumps from 370ms to almost four seconds! Loading fewer values! The second big difference is in the `stored_fields` marker. In the second on it's `sequential: false` and in the first `sequential: true`. `sequential: true` uses Lucene's "merge" stored fields reader instead of the default one. It's much more optimized at decoding sequences of documents. Previously we only enabled this reader when loading compact sequences of documents - when the entire block looks like ``` 1, 2, 3, 4, 5, ... 1230, 1231 ``` If there are any gaps we wouldn't enable it. That was a very conservative thing we did long ago without doing any experiments. We knew it was faster without any gaps, but not otherwise. It turns out it's a lot faster in a lot more cases. I've measured it as faster for 99% gaps, at least on simple documents. I'm a bit worried that this is too aggressive, so I've set made it configurable and made the default being to use the "merge" loader with 10% gaps. So we'd use the merge loader with a block like: ``` 1, 11, 21, 31, ..., 1231, 1241 ```	2025-04-29 23:20:15 +02:00
Pete Gillin	35c2b25415	Add info to `date` processor docs (#127434 ) This does two things: - It describes what the `timezone` option actually does. The existing wording is misleading. - It recommends avoiding short abbreviations for timezones such as `PST`. This has come up at least twice recently.	2025-04-29 13:40:36 +01:00
Liam Thompson	32a4462dfe	[DOCS][9.x] Improve ESQL reference docs information architecture (#127248 ) * [DOCS][9.0] Improve ESQL reference docs IA - reorganized es\|ql reference documentation from flat list to logical hierarchy - created three main sections: syntax reference , special fields, advanced operations - renamed pages with more consistent and task-oriented titles - aligned navigation titles with page content - improved introductory text for each section - used parallel phrasing for similar concepts - clarified the relationship between reference docs and conceptual docs Co-authored-by: Alexander Spies <alexander.spies@elastic.co>	2025-04-25 09:54:45 +02:00
Colleen McGinnis	08552f1c2e	[docs] Fix various syntax and rendering errors (#127062 ) * fix syntax and rendering errors * clean up * fix versions * more clean up * more fixes * more fixes * more fixes	2025-04-24 17:57:03 +02:00
Liam Thompson	c4cba5a545	[DOCS] Update esql-lookup-join.md (#127306 ) - I trimmed the KEEP query in my final iteration in https://github.com/elastic/elasticsearch/pull/127215 but neglected to update the query itself, only the response. This fixes that so the query matches the response. - 🚘 I also updated the table response to match other ESQL response tables	2025-04-24 12:32:17 +02:00
Liam Thompson	7b95ec4767	[DOCS] Clarify update behavior for indices with semantic_text fields, flag CCS/CCR limitation (#127310 )	2025-04-24 12:19:48 +02:00
Ioana Tagirta	a684e109f7	Improve listing of index mode options in docs (#127155 )	2025-04-24 09:58:16 +02:00
Liam Thompson	2c2e9a5266	[DOCS][ESQL] Cleanup and cross-reference LOOKUP JOIN reference and landing pages (#127215 ) * [DOCS][ESQL] Cleanup and cross-reference LOOKUP JOIN reference and landing pages lookup-join.md (syntax reference): - removed tip formatting for simpler direct link to landing page - improved parameter formatting and descriptions - fixed template variable from `{esql}` to `{{esql}}` esql-lookup-join.md (landing page): - added "compare with enrich" section header - simplified "how the command works" with clearer parameter explanation - added code example in how it works section - improved image alt text for accessibility - organized example section with better context and SQL comparison - added dropdown for sample tables to reduce visual clutter - added "query" subheading for clearer organization - included reference to additional examples in command reference - removed excessive whitespace * Improve example, add setup code replaced abstract employee/language example with security monitoring use case added setup instructions for creating test indices included sample data loading via bulk api new practical query example joining firewall logs with threat data simplified results output showing threat detection scenario added note about left-join behavior improved code comments and structure added required index.mode: lookup setting info	2025-04-23 13:22:42 +02:00
István Zoltán Szabó	1e7c6abaf6	[DOCS] Fixes formatting issue on dense vector reference page. (#127214 )	2025-04-23 11:24:17 +02:00
Ahmed Khan	98a3719e46	Update elasticsearch-keystore.md with special character handling and echo command to enter the password. (#127135 ) * Update elasticsearch-keystore.md Customer needs document update for handling special characters and how we can use the echo command to enter the password. * Update docs/reference/elasticsearch/command-line-tools/elasticsearch-keystore.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> * Update docs/reference/elasticsearch/command-line-tools/elasticsearch-keystore.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> * Update elasticsearch-keystore.md Moving the section out of Examples as advised. * Update docs/reference/elasticsearch/command-line-tools/elasticsearch-keystore.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> * Update docs/reference/elasticsearch/command-line-tools/elasticsearch-keystore.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-04-23 09:40:38 +02:00
Charlotte Hoblik	838bb0bbd7	fix superscript (#127147 )	2025-04-22 18:48:15 +02:00
George Wallace	b98a4fa067	Fixing external link (#127114 )	2025-04-21 17:57:48 +02:00
Craig Taverner	f6a05c6a7c	Support depthOffset in MD docs headings for nesting functions (#126984 ) While this change appears subtle at this point, I am using this in a later PR that adds a lot more spatial functions, where nesting them in related groups like this looks much better. The main impact of this is that the On this page navigator on the right panel of the docs will show the nesting Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-04-19 11:28:05 +02:00
Brian Seeders	af6dac5c05	Revert "Forward port release notes for v8.17.5 (#127024 )" This reverts commit `66b504a881`.	2025-04-17 16:16:21 -04:00
elasticsearchmachine	66b504a881	Forward port release notes for v8.17.5 (#127024 )	2025-04-17 16:15:42 -04:00
David Turner	7e62862eab	Clarify queues in thread pool settings (#127027 ) The docs about the queue in a `fixed` pool are a little awkwardly worded, and there is no mention of the queue in a `scaling` pool at all. This commit cleans this area up.	2025-04-17 19:58:02 +01:00
Liam Thompson	b6c9b9b54d	[DOCS] Update URLs for ESQL Kibana generated docs (#127011 )	2025-04-17 18:25:24 +02:00
Samiul Monir	afb83b7551	Updating text_similarity_reranker documentation (#127004 ) * updating documentation to remove duplicate and redundant wording from 9.x * Update links to rerank model landing page --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-04-17 11:54:19 -04:00
Tim Vernum	e53d3ff64b	Update docs to reflect removal of TLSv1.1 (#126892 ) In ES9 and later, we do not enable TLSv1.1 by default, even if the JDK supports it. This updates the docs accordingly. Relates: #121731	2025-04-17 10:15:29 +10:00
Samiul Monir	2e1101cf5e	Updating text_similarity_reranker documentation (#126175 ) * Updating text_similarity_reranker documentation * Updating docs to include urls * remove extra THE from the text --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2025-04-16 17:05:30 -04:00
Liam Thompson	92148cfde3	[DOCS] Update esql-lookup-join.md to mention index mode requirement (#126901 ) * Update esql-lookup-join.md to mention index mode requirement * fix 8.x page mapping metadata	2025-04-16 12:15:45 +02:00
Svilen Mihaylov	02f9af732e	Add multi_match function #121525 (#125062 ) Implement multi_match function for ESQL. Its currently available on snapshot builds pending refinement of the syntax.	2025-04-15 09:38:08 -04:00
Liam Thompson	7de46e9897	[DOCS] Update es-connectors-salesforce.md (#126828 ) * [DOCS] Update es-connectors-salesforce.md 9.x equivalent of https://github.com/elastic/elasticsearch/pull/126791 * Reformat known issues section	2025-04-15 11:47:36 +02:00
Kofi B	08beb534ef	[DOCS] Added sort order explanation (#125182 ) * Added explanation of sort order and default behavior * Update docs/reference/elasticsearch/rest-apis/sort-search-results.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> --------- Co-authored-by: George Wallace <georgewallace@users.noreply.github.com> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-04-14 10:28:03 +02:00
Craig Taverner	ec495e9f0b	Make LOOKUP JOIN docs examples fully tested (#126622 ) The current LOOKUP JOIN docs include examples that are not tested by the ES\|QL tests, unlike most other examples in the documentation. This PR fixes that, changing two examples to use existing tests, and adding a new csv-spec file for the remaining four examples. These four are not required to show results, so the tests have empty data and do not require any results. This means we are testing only the syntax (parsing and semantic analysis), which is sufficient for the docs.	2025-04-14 09:57:58 +02:00
Jan Kuipers	3f2f5ee158	ES\|QL change_point docs and tech preview (#126407 ) * ES\|QL change point docs * Move ES\|QL change_point to tech preview * Update docs/reference/query-languages/esql/esql-commands.md Co-authored-by: Craig Taverner <craig@amanzi.com> * different example + add it the csv tests * Restructure change_point docs to new structure * Added generated test examples to change_point docs * Fixed a few README.md text mistakes and added more details * fix grammar * License check * regen parser * Update docs/reference/query-languages/esql/_snippets/commands/layout/change_point.md Co-authored-by: Craig Taverner <craig@amanzi.com> --------- Co-authored-by: Craig Taverner <craig@amanzi.com>	2025-04-14 09:56:03 +02:00
Lisa Cawley	ae33eaabdb	[DOCS] Fix broken images (#126648 )	2025-04-11 19:04:08 -07:00
Nik Everett	55a6624746	ESQL: TO_IP can handle leading zeros (#126532 ) Modifies TO_IP so it can handle leading `0`s in ipv4s. Here's how it works now: ``` ROW ip = TO_IP("192.168.0.1") // OK! ROW ip = TO_IP("192.168.010.1") // Fails ``` This adds ``` ROW ip = TO_IP("192.168.010.1", {"leading_zeros": "octal"}) ROW ip = TO_IP("192.168.010.1", {"leading_zeros": "decimal"}) ``` We do this because there isn't a consensus on how to parse leading zeros in ipv4s. The standard unix tools like `ping` and `ftp` interpret leading zeros as octal. Java's built in ip parsing interprets them as decimal. Because folks are using this for security rules we need to support all the choices. Closes #125460	2025-04-11 19:45:14 +02:00
Bogdan Pintea	9784e0ec5f	ESQL: Split grouping functions based on their EVAL-ability (#126597 ) This splits the grouping functions in two: those that can be evaluated independently through the EVAL operator (`BUCKET`) and those that don't (like those that that are evaluated through an agg operator, `CATEGORIZE`). Closes #124608	2025-04-11 16:19:54 +02:00
Colleen McGinnis	24dfda583f	update mapped_pages (#126647 )	2025-04-11 08:48:29 -05:00
Kathleen DeRusso	489a38895e	Update chunking_settings docs for semantic_text (#126634 ) * Update chunking_settings docs for semantic_text * Remove redundancy	2025-04-11 08:55:47 -04:00
Liam Thompson	ef633d53bd	Add license mention to ESQL categorize (#126666 ) * Add license mention to ESQL categorize exceptional licensing mention in docs	2025-04-11 11:13:12 +02:00
Larisa Motova	1324f82ed2	Update keyword ignore_above documentation for logsdb (#126651 ) This commit adds a note that ignore_above has a different limit for logsdb indices to the documentation. Related to https://github.com/elastic/docs-content/pull/1092 and https://github.com/elastic/sdh-elasticsearch/issues/8892	2025-04-10 21:49:47 -10:00
Lisa Cawley	627e3099f6	[DOCS] Add node specifications to API conventions (#126571 ) Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>	2025-04-10 19:08:40 +02:00
Lisa Cawley	6c4a230858	[DOCS] Add ranking evaluation API examples (#126577 )	2025-04-10 09:50:15 -07:00
Craig Taverner	67b15ad5d8	Split ES\|QL functions/operators/commands into separate pages for similar functions and make commands examples generated (#126279 ) While the internal structure of the docs is already split into many (over 1000) sub-pages, the final display for the `Functions and Operators` page is a single giant page, making navigation harder. This PR splits it into separate pages, one for each group of similar functions and one for the operators. Twelve new pages. This PR also bundles a few other related changes. In total what is done is: * Split functions/operators into 12 pages, one for each group, maintaining the existing split of each function/operator into a snippet with dynamically generated examples * Split esql-commands.md into source-commands.md and processing-commands.md, each of which is split into individual snippets, one for each command * Each command snippet has it's examples split out into separate files, if they were examples that were dynamically generated in the older asciidoc system * The examples files are overwritten by the ES\|QL unit tests, using a similar mechanism to the examples written for functions and operators) * Some additional refinements to the Kibana definition and markdown files (nicer operator headings, and display text)	2025-04-10 15:56:05 +02:00
Charlotte Hoblik	e9d3328903	[DOCS]: Move ES connectors `Known issues` page in 9.0+ (#126600 ) * add known issues page to es connectors * update known issues * Update docs/reference/search-connectors/es-connectors-known-issues.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> * Update docs/reference/search-connectors/es-connectors-known-issues.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-04-10 15:26:20 +02:00
Colleen McGinnis	1b021c58df	fix cross-repo link syntax (#126554 )	2025-04-09 14:46:19 -04:00
Ryan Ernst	3bac50e818	Use logs dir as working directory (#124966 ) In the unexpected case that Elasticsearch dies due to a segfault or other similar native issue, a core dump is useful in diagnosing the problem. Yet core dumps are written to the working directory, which is read-only for most installations of Elasticsearch. This commit changes the working directory to the logs dir which should always be writeable.	2025-04-09 07:07:11 -07:00
Iván Cea Fontenla	90dcccfc96	ESQL: Updated RENAME docs with the behaviour of multiple column renames (#126462 ) * ESQL: Updated RENAME docs with the behaviour of multiple column renames * Added rename example to csv-spec	2025-04-09 12:39:10 +02:00
Gal Lalouche	953b9fbb83	ESQL: List/get query API (#124832 ) This PR adds two new REST endpoints, for listing queries and getting information on a current query. * Resolves #124827 * Related to #124828 (initial work) Changes from the API specified in the above issues: * The get API is pretty initial, as we don't have a way of fetching the memory used or number of rows processed. List queries response: ``` GET /_query/queries // returns for each of the running queries // query_id, start_time, running_time, query { "queries" : { "abc": { "id": "abc", "start_time_millis": 14585858875292, "running_time_nanos": 762794, "query": "FROM logs* \| STATS BY hostname" }, "4321": { "id":"4321", "start_time_millis": 14585858823573, "running_time_nanos": 90231, "query": "FROM orders \| LOOKUP country_code ON country" } } } ``` Get query response: ``` GET /_query/queries/abc { "id" : "abc", "start_time_millis": 14585858875292, "running_time_nanos": 762794, "query": "FROM logs* \| STATS BY hostname" "coordinating_node": "oTUltX4IQMOUUVeiohTt8A" "data_nodes" : [ "DwrYwfytxthse49X4", "i5msnbUyWlpe86e7"] } ```	2025-04-08 22:21:32 +03:00
Slobodan Adamović	284121ad9f	Set `keyUsage` for generated HTTP certificates and self-signed CA (#126376 ) The `elasticsearch-certutil http` command, and security auto-configuration, generate the HTTP certificate and CA without setting the `keyUsage` extension. This PR fixes this by setting (by default): - `keyCertSign` and `cRLSign` for self-signed CAs - `digitalSignature` and `keyEncipherment` for HTTP certificates and CSRs These defaults can be overridden when running `elasticsearch-certutil http` command. The user will be prompted to change them as they wish. For `elasticsearch-certutil ca`, the default value can be overridden by passing the `--keysage` option, e.g. ``` elasticsearch-certutil ca --keyusage "digitalSignature,keyCertSign,cRLSign" -pem ``` Fixes #117769	2025-04-08 09:44:09 +02:00
Craig Taverner	1f6518f371	Document special behaviour of ignore_malformed for geo_point mappings (#125692 ) With `geo_point` fields, here is the special case of values that have a syntactically valid format, but the numerical values for `latitude` and `longitude` are out of range. If `ignore_malformed` is `false`, an exception will be thrown as usual. But if it is `true`, the document will be indexed correctly, by normalizing the latitude and longitude values into the valid range. The special `_ignored` field will not be set. The original source document will remain as before, but indexed values, doc-values and stored fields will all be normalized.	2025-04-07 11:05:51 +02:00
Lisa Cawley	1d1feb6010	[DOCS] Migrate search profile API examples (#126347 )	2025-04-04 22:42:09 +01:00
George Wallace	ce8b418686	Update esql-lookup-join.md (#126290 )	2025-04-04 09:43:45 -06:00
Kathleen DeRusso	e7d4a28a87	Support configurable chunking in semantic_text fields (#121041 ) * test * Revert "test" This reverts commit `9f4e2adba0`. * Refactor InferenceService to allow passing in chunking settings * Add chunking config to inference field metadata and store in semantic_text field * Fix test compilation errors * Hacking around trying to get ingest to work * Debugging * [CI] Auto commit changes from spotless * POC works and update TODO to fix this * [CI] Auto commit changes from spotless * Refactor chunking settings from model settings to field inference request * A bit of cleanup * Revert a bunch of changes to try to narrow down what broke CI * test * Revert "test" This reverts commit `9f4e2adba0`. * Fix InferenceFieldMetadataTest * [CI] Auto commit changes from spotless * Add chunking settings back in * Update builder to use new map * Fix compilation errors after merge * Debugging tests * debugging * Cleanup * Add yaml test * Update tests * Add chunking to test inference service * Trying to get tests to work * Shard bulk inference test never specifies chunking settings * Fix test * Always process batches in order * Fix chunking in test inference service and yaml tests * [CI] Auto commit changes from spotless * Refactor - remove convenience method with default chunking settings * Fix ShardBulkInferenceActionFilterTests * Fix ElasticsearchInternalServiceTests * Fix SemanticTextFieldMapperTests * [CI] Auto commit changes from spotless * Fix test data to fit within bounds * Add additional yaml test cases * Playing with xcontent parsing * A little cleanup * Update docs/changelog/121041.yaml * Fix failures introduced by merge * [CI] Auto commit changes from spotless * Address PR feedback * [CI] Auto commit changes from spotless * Fix predicate in updated test * Better handling of null/empty ChunkingSettings * Update parsing settings * Fix errors post merge * PR feedback * [CI] Auto commit changes from spotless * PR feedback and fix Xcontent parsing for SemanticTextField * Remove chunking settings check to use what's passed in from sender service * Fix some tests * Cleanup * Test failure whack-a-mole * Cleanup * Refactor to handle memory optimized bulk shard inference actions - this is ugly but at least it compiles * [CI] Auto commit changes from spotless * Minor cleanup * A bit more cleanup * Spotless * Revert change * Update chunking setting update logic * Go back to serializing maps * Revert change to model settings - source still errors on missing model_id * Fix updating chunking settings * Look up model if null * Fix test * Work around https://github.com/elastic/elasticsearch/issues/125723 in semantic text field serialization * Add BWC tests * Add chunking_settings to docs * Refactor/rename * Address minor PR feedback * Add test case for null update * PR feedback - adjust refactor of chunked inputs * Refactored AbstractTestInferenceService to return offsets instead of just Strings * [CI] Auto commit changes from spotless * Fix tests where chunk output was of size 3 * Update mappings per PR feedback * PR Feedback * Fix problems related to merge * PR optimization * Fix test * Delete extra file --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-04-03 17:45:26 -04:00
kanoshiou	30b2a1f729	ESQL: Enhanced `DATE_TRUNC` with arbitrary intervals (#120302 ) Originally, `DATE_TRUNC` only supported 1-month and 3-month intervals for months, and 1-year interval for years, while arbitrary intervals were supported for weeks and days. This PR adds support for `DATE_TRUNC` with arbitrary month and year intervals. Closes #120094	2025-04-03 16:55:56 +02:00
Benjamin Trent	33dcc921be	Mark rescore_vector as generally available (#126038 ) * Mark rescore_vector as generally available * Update docs/changelog/126038.yaml	2025-04-02 16:10:01 -04:00
Joe Gallo	078f7ff9f7	Minor docs fixes (#126143 )	2025-04-02 12:30:07 -04:00
Nik Everett	d30296229b	ESQL: Hide some "extras" from docs (#124763 ) Hides some of the "extra" lines from ESQL's documentation. These lines are required to make the documentation into nice tests which is important to make sure the docs don't get out of date. But readers don't need to see them.	2025-04-01 21:24:15 +01:00
Colleen McGinnis	d966938842	add missing mapped pages (#126054 )	2025-04-01 19:41:37 +02:00
Craig Taverner	7b263b4b83	Kibana updates, remove links from JSON and split is-null/is-not-null (#125986 ) In particular: * Remove all links (both asciidoc and markdown) from the JSON definition files. * This required a two phase edit, from asciidoc links to markdown, and then removal of markdown (replace with markdown text). This is because the asciidoc does not have the display text, and because some links were already markdown. * Split predicates into is_null and is_not_null * We kept the old combined version because the main docs still use that, so now we have both combined and separate versions, and Kibana can select the version they want.	2025-04-01 15:46:24 +02:00
Brandon Morelli	74e4ce23e0	Update limitations.md (#125893 )	2025-03-28 22:35:41 +01:00
Craig Taverner	98a2c711f8	Refine ESQL docs handling of applies_to (#125835 ) This primarily splits the old preview:true warning from the newer applies_to approach. Since all of our current applies_to examples are actually just behaviour modifications of current functions, we do not use the official docs {applies_to} syntax. However there is code to make use of that in the case where we have an entirely new function which will appear in a new version. Co-authored-by: Alexander Spies <alexander.spies@elastic.co>	2025-03-28 22:09:15 +01:00
Bogdan Pintea	1bd80d10a6	ESQL: supplement docs on LIMIT (#125839 ) This adds a few extra details around how ESQL processes input docs and how it limits output results. Closes #125819	2025-03-29 06:03:27 +11:00
Mayya Sharipova	332abe4198	[DOCS] Clarify that min_score applies to aggs (#125882 ) Clarify that min_score param of a search request also applies to aggregations.	2025-03-28 14:41:14 -04:00
Colleen McGinnis	adccaa66a4	remove reliance on redirects in docs-content (#125863 )	2025-03-28 16:41:38 +01:00
Alexander Spies	ea98166919	ESQL: Improve LOOKUP JOIN page (#125688 ) (#125798 ) Forward port of #125688	2025-03-28 09:07:28 +01:00
Benjamin Trent	009a86a0e3	Allow zero for rescore_vector.oversample to indicate by-passing oversample and rescoring (#125599 ) This allows a `rescore_vector: {oversample: 0}` to indicate bypassing oversampling and rescoring. This is useful for: - Updating a quantized mapping to turn off automatic rescoring - Bypassing oversampling at query time in an ad-hoc manner if its on by default in the mapping closes: https://github.com/elastic/elasticsearch/issues/125157	2025-03-27 06:56:51 +11:00
Larisa Motova	10719831b5	[ES\|QL] Add ToAggregateMetricDouble example (#125518 ) Adds AggregateMetricDouble to the ES\|QL CSV tests and examples of how to use the ToAggregateMetricDouble function	2025-03-26 07:56:48 -10:00
Bogdan Pintea	b6b8159ed9	SQL: Docs: Drop examples of LIKE/RLIKE vs QUERY/MATCH equivalence (#125673 ) This drops the examples of LIKE/RLIKE vs QUERY/MATCH equivalence.	2025-03-27 03:28:38 +11:00
Karen Metts	f0168b4b84	Doc: Update links to logstash plugin docs (#125675 ) * Add logstash plugin repo to cross_links	2025-03-26 11:54:37 -04:00
Tommaso Teofili	7a610c30fd	[docs] nested knn only supports score_mode max (#125582 ) * [docs] nested knn only supports score_mode max	2025-03-26 11:31:43 +01:00
Colleen McGinnis	162763bd13	[docs] More updates for docs-assembler (#125509 ) * update docset.yml, add reference/toc.yml, update reference/elasticsearch/index.md * Update docs/docset.yml * add index.md	2025-03-24 14:20:14 -05:00
Alexander Spies	f8536aadda	ESQL: Add more details on ENRICH vs. LOOKUP JOIN to docs (#125487 ) * Add more details on ENRICH vs. LOOKUP JOIN * Move example, fix syntax formatting	2025-03-24 16:26:28 +01:00
Craig Taverner	8ffecb408d	Additional support for docs for ES\|QL operators and version-specific differentiation (#125251 ) This PR was originally focused on improving support for Kibana docs, in particular the missing operator docs, but it has expanded to cover a bunch of related things: * Primarily the main work was to improve operators support. ESQL generated docs cover all functions and most operators for which their is a clear operator class and test class. However, some are built-in behaviour and need additional support. This PR adds more generated content for those operators. * Various specific operators requested by Kibana: Cast & null-predicates, and in particular the addition of examples * Two functions without examples: mv_append and to_date_nanos * Many small visual document cleanups (spelling, grammar, capitalization, etc.) * Initial support for `applies_to` for multi-version differentiation. This last point requires more work, as it is not yet agreed on just how we want this to look. We'll probably need to do refinements in followup PR. Consider the version in this PR as a first step into how this could look.	2025-03-24 09:56:45 +01:00
Jeremy Dahlgren	d7995975d9	Add cache support in TransportGetAllocationStatsAction (#124898 ) Adds a new cache and setting TransportGetAllocationStatsAction.CACHE_TTL_SETTING "cluster.routing.allocation.stats.cache.ttl" to configure the max age for cached NodeAllocationStats on the master. The default value is currently 1 minute per the suggestion in issue 110716. Closes #110716	2025-03-21 20:35:40 +02:00
Liam Thompson	397c9c59c7	Clarify regex character range case insensitivity limitations (#125413 ) * Update regexp-syntax.md 9.x equivalent of https://github.com/elastic/elasticsearch/pull/125412 * use md syntax	2025-03-21 18:43:44 +02:00
Carlos Delgado	160ac698d7	ES\|QL: Add default values for match function options (#125282 )	2025-03-21 10:44:41 +01:00
Colleen McGinnis	9bcd59596d	[docs] Prepare for docs-assembler (#125118 ) * reorg files for docs-assembler and create toc.yml files * fix build error, add redirects * only toc * move images	2025-03-20 12:09:12 -05:00
Mike Pellegrini	f67b5d6e95	Mark semantic text as GA in docs (#124669 )	2025-03-20 08:13:00 -04:00
Lisa Cawley	ec0f8be34d	[DOCS] Clean up Asciidoc links in markdown files (#125046 )	2025-03-19 08:03:55 -07:00
Craig Taverner	65dfaf1c91	Rewrite Kibana docs asciidoc links to be MD links (#125155 ) Did a few things: * Rewrite Kibana docs asciidoc links to be MD links * Make kibana docs links absolute to planned publication path * Clarify which operators are generated and which are static * Removed the trailing .md from kibana docs links	2025-03-19 13:56:05 +01:00
Kofi B	e34bfd166a	[DOCS] Opster Migration: Nested bool query addition (#124455 ) added section related to nested bool queries to provide a more clear example and clean up surrounding language and grammatical issues	2025-03-18 20:42:31 -05:00
Larisa Motova	08ae54e423	[ES\|QL] ToAggregateMetricDouble function (#124595 ) This commit adds a conversion function from numerics (and aggregate metric doubles) to aggregate metric doubles. It is most useful when you have multiple indices, where one index uses aggregate metric double (e.g. a downsampled index) and another uses a normal numeric type like long or double (e.g. an index prior to downsampling).	2025-03-18 11:39:27 -10:00
Charlotte Hoblik	64a56439a6	[DOCS] Restructure user settings reference pages (#125000 ) * add elasticsearch settings page * add logo to ech applicable settings * removing ECH settings page * removing duplicate information from ECH * move settings to correcponding page * update configuration page * fix link * Add applies_to frontmatter to auditing settings * remove duplicate how-to pages * fix broken links * replce cloud icon text * adjust settings pages * add applies_to tag --------- Co-authored-by: lcawl <lcawley@elastic.co>	2025-03-18 18:18:49 +01:00
Craig Taverner	50a7eb09d4	Fix ES\|QL build.gradle for configuration-cache (#125097 ) Earlier work on the ES\|QL port of docs to V3 introduced an issue in the build.gradle file making it fail with --configuration-cache. This fixes that, as well as one other broken link and removes some unused files. In addition we bring back partial support for deleting unused files. It is tricky to have full support for this due to the mix of static and generated content, particularly in the operators snippets.	2025-03-18 17:15:53 +01:00
David Turner	a2d98e44a1	Upgrade `discovery-ec2` to AWS SDK v2 (#122062 )	2025-03-18 19:38:16 +11:00
Craig Taverner	94cad286bc	Restructure query-languages docs files for clarity (#124797 ) In a few previous PR's we restructured the ES\|QL docs to make it possible to generate them dynamically. This PR just moves a few files around to make the query languages docs easier to work with, and a little more organized like the ES\|QL docs. A bit part of this was setting up redirects to the new locations, so other repo's could correctly link to the elasticsearch docs.	2025-03-17 17:58:58 +01:00
Charlotte Hoblik	c9724557a2	add signposts to docs-content (#124866 )	2025-03-17 11:41:52 +01:00
David Turner	37a559c57d	Mention zero-window state in networking docs (#124967 ) Clarify that it is expected sometimes to see inter-node connections sending zero-window advertisements as part of the usual TCP backpressure mechanism.	2025-03-16 19:43:29 +00:00
George Wallace	472536c189	lookup join docs (#124531 ) * lookup join docs --------- Co-authored-by: Alexander Spies <alexander.spies@elastic.co>	2025-03-13 12:47:58 -06:00
Benjamin Trent	b2c1c4e0f0	New `vector_rescore` parameter as a quantized index type option (#124581 ) This adds a new parameter to the quantized index mapping that allows default oversampling and rescoring to occur. This doesn't adjust any of the defaults. It allows it to be configured. When the user provides `rescore_vector: {oversample: <number>}` in the query it will overwrite it. For example, here is how to use it with bbq: ``` PUT rescored_bbq { "mappings": { "properties": { "vector": { "type": "dense_vector", "index_options": { "type": "bbq_hnsw", "rescore_vector": {"oversample": 3.0} } } } } } ``` Then, when querying, it will auto oversample the `k` by `3x` and rerank with the raw vectors. ``` POST _search { "knn": { "query_vector": [...], "field": "vector" } } ```	2025-03-14 00:40:08 +11:00
Craig Taverner	d5ddb909a4	ESQL autogenerate docs v3 (#124312 ) Building on the work started in https://github.com/elastic/elasticsearch/pull/123904, we now want to auto-generate most of the small subfiles from the ES\|QL functions unit tests. This work also investigates any remaining discrepancies between the original asciidoc version and the new markdown, and tries to minimize differences so the docs do not look too different. The kibana json and markdown files are moved to a new location, and the operator docs are a little more generated than before (although still largely manual).	2025-03-13 14:16:46 +01:00
Charlotte Hoblik	9e754ec8f6	[DOCS] Plugin management reference cleanup (#124578 ) * add content to plugin management * add content to Plugin Management * Update docs/reference/elasticsearch-plugins/plugin-management.md Co-authored-by: florent-leborgne <florent.leborgne@elastic.co> * fix applies-to tag * add ech to docset.yml --------- Co-authored-by: florent-leborgne <florent.leborgne@elastic.co>	2025-03-12 17:01:10 +01:00
kanoshiou	deff3df9f0	ES\|QL: Support `::date` in inline cast (#123460 ) * Inline cast to date * Update docs/changelog/123460.yaml * New capability for `::date` casting * More tests * Update tests --------- Co-authored-by: Fang Xing <155562079+fang-xing-esql@users.noreply.github.com>	2025-03-11 17:08:10 -04:00
Mark Tozzi	3e949479d8	ESQL - Include thread names in profile output (#124262 ) Resolves #123053 This adds the thread name to the driver sleep profile output. --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-03-11 15:53:22 +01:00
Carlos Delgado	2b40e73fe9	ES\|QL - Add scoring for full text functions disjunctions (#121793 )	2025-03-11 15:29:15 +01:00
Jan Calanog	435d1db5b9	Remove subs attribute (#124551 )	2025-03-11 12:14:58 +01:00
Charlotte Hoblik	e51b50139b	Fix external URI images (#124350 )	2025-03-10 11:31:47 +01:00
David Kilfoyle	e158cd868b	[Docs] Fix cross-repo links to Beats docs (#124360 ) Co-authored-by: Colleen McGinnis <colleen.mcginnis@elastic.co>	2025-03-07 14:38:46 -05:00
Svilen Mihaylov	ee4bcac1db	Added optional parameters to QSTR ES\|QL function (#121787 ) Adds options to QSTR function. #118619 added named function parameters. This PR uses this mechanism for allowing query string function parameters, so query string parameters can be used in ES\|QL. Closes #120933	2025-03-07 13:00:22 -05:00
Kostas Krikellas	296cae8a30	[DOCS] Document source-related restrictions (#124011 ) * Document source-related restrictions * Update mapping-source-field.md * Update docs/reference/elasticsearch/mapping-reference/mapping-source-field.md Co-authored-by: Marci W <333176+marciw@users.noreply.github.com> * Update mapping-source-field.md --------- Co-authored-by: Marci W <333176+marciw@users.noreply.github.com>	2025-03-06 11:38:09 -05:00
Colleen McGinnis	23be51a04f	[DOCS] fix external links (#124248 )	2025-03-06 17:27:03 +01:00
Marci W	bea3af2467	[DOCS] Clarify support for doc_values (#124047 ) * Update doc-values.md * Make the note more visible * fix link	2025-03-06 09:01:19 -05:00
Lee Hinman	47706b505f	Add index mode to get data stream API (#122486 ) This commit adds the `index_mode` for both the data stream and each backing index to the output of `GET /_data_stream`. An example looks like: ``` { "data_streams" : [ { "name" : "foo-things", "indices" : [ { "index_name" : ".ds-foo-things-2025.02.13-000001", ... "index_mode" : "standard" } ], ... "index_mode" : "standard" }, { "name" : "logs-foo-bar", "indices" : [ { "index_name" : ".ds-logs-foo-bar-2025.02.13-000001", ... "index_mode" : "logsdb" }, { "index_name" : ".ds-logs-foo-bar-2025.02.13-000002", ... "index_mode" : "logsdb" } ], ... "index_mode" : "logsdb", } ] } ```	2025-03-06 07:39:58 +11:00
shainaraskas	a06c8ea5b8	Update node-settings.md (#123997 ) * Update node-settings.md Port change https://github.com/elastic/elasticsearch/pull/123939 forward to new docs system * Update docs/reference/elasticsearch/configuration-reference/node-settings.md	2025-03-05 11:21:16 -05:00
Liam Thompson	2456cd375a	Add note to servicenow connector ref (#124101 )	2025-03-05 15:26:22 +01:00
Craig Taverner	efe7379e67	Split ESQL functions/operators docs files (#123904 ) * Port from asciidocalypse * Fix links for operator lists * Remove unused image files after moving/editing them * Fix lists links * Fix like/rlike links * Fix remaining bad references to /elasticsearch/docs * Fix logstash and beats references * Fix logstash and beats references * Fix image links	2025-03-04 14:59:31 +01:00
John Wagster	be577e382d	Update Flatten Graph Docs to Include a Real Flattened Graph 9.x (#123901 ) updated flatten graph docs to include a real flattened graph	2025-03-03 14:33:53 -06:00
Colleen McGinnis	db5acd8976	add missing pages (#123774 )	2025-03-03 15:02:51 +00:00
Liam Thompson	6b27e420fe	Cleanup search connectors, add some reference -> docs content signposts in various sections (#123733 )	2025-02-28 17:10:09 +00:00
Liam Thompson	91c2654570	Fix broken cross-repo links, versions in search connectors docker instructions (#123700 )	2025-02-28 16:02:54 +01:00
Colleen McGinnis	b7e3a1e14b	[docs] Migrate docs from AsciiDoc to Markdown (#123507 ) * delete asciidoc files * add migrated files * fix errors * Disable docs tests * Clarify release notes page titles * Revert "Clarify release notes page titles" This reverts commit `8be688648d`. * Comment out edternal URI images * Clean up query languages landing pages, link to conceptual docs * Add .md to url * Fixes inference processor nesting. --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: Liam Thompson <leemthompo@gmail.com> Co-authored-by: Martijn Laarman <Mpdreamz@gmail.com> Co-authored-by: István Zoltán Szabó <szabosteve@gmail.com>	2025-02-27 17:56:14 +01:00
Kathleen DeRusso	ae6474db63	Deprecate Behavioral Analytics CRUD apis (#122960 ) * Deprecate Behavioral Analytics CRUD APIs * Add allowed warning for REST Compatibility tests * Update docs/changelog/122960.yaml * Update changelog * Update docs to add deprecation flags and fix failing tests * Update changelog * Update changelog again * Update docs formatting Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> * Skip asciidoc test --------- Co-authored-by: Efe Gürkan YALAMAN <efeyalaman@gmail.com> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: Efe Gürkan YALAMAN <efeguerkan.yalaman@elastic.co>	2025-02-25 16:02:50 +01:00
Craig Taverner	ec82c24a87	Add support to VALUES aggregation for spatial types (#122886 ) The original work at https://github.com/elastic/elasticsearch/pull/106065 did not support geospatial types with this comment: > I made this work for everything but geo_point and cartesian_point because I'm not 100% sure how to integrate with those. We can grab those in a follow up. The geospatial types should be possible to collect using the VALUES aggregation with similar behavior to the `ST_COLLECT` OGC function, based on the Elasticsearch convention that treats multi-value geospatial fields as behaving similarly to any geometry collection. So this implementation is a trivial addition to the existing values types support.	2025-02-25 11:38:51 +01:00
Luke Whiting	e3792d19b5	Allow data stream reindex tasks to be re-run after completion (#122510 ) * Allow data stream reindex tasks to be re-run after completion * Docs update * Update docs/reference/migration/apis/data-stream-reindex.asciidoc Co-authored-by: Keith Massey <keith.massey@elastic.co> --------- Co-authored-by: Keith Massey <keith.massey@elastic.co>	2025-02-20 15:03:51 +00:00
David Turner	cdaa5dd7ad	Clarify breaking change note for #112903 (#122998 ) Closes #122994	2025-02-20 12:11:56 +00:00
Lee Hinman	2ae80c799d	Allow setting the `type` in the reroute processor (#122409 ) * Allow setting the `type` in the reroute processor This allows configuring the `type` from within the ingest `reroute` processor. Similar to `dataset` and `namespace`, the type defaults to the value extracted from the index name. This means that documents sent to `logs-mysql.access.default` will have a default value of `logs` for the type. Resolves #121553 * Update docs/changelog/122409.yaml	2025-02-18 12:38:00 -07:00
Nik Everett	df2f3b3b3f	ESQL: Update kibana signatures (#121951 ) This updates the kibana signature json files in two ways: * Renames `eval` to `scalar` - that's the name we use inside of ESQL and we may as well make the name the same. * Calls the `CATEGORIZE` and `BUCKET` function `grouping` because they can only be used in the "grouping" positions of the `STATS` command. Closes #113411	2025-02-07 09:51:09 -05:00
Fang Xing	f58fdf81e9	[ES\|QL] Change function_named_parameters in Kibana doc to expected format (#121585 ) * change function_named_parameters in kibana doc to expected format	2025-02-04 12:20:34 -05:00
elasticsearchmachine	69bdf465b0	Bump to version 9.1.0	2025-01-30 16:55:46 +00:00
Jim Ferenczi	fb3c666663	Remove outdated reference to internal semantic text format (#121276 ) The semantic text format was updated in #119183. This commit removes the last remaining reference to the old format from the documentation to ensure consistency.	2025-01-30 15:01:55 +01:00
Chris Hegarty	4baffe4de1	Upgrade to Lucene 10.1.0 (#119308 ) This commit upgrades to Lucene 10.1.0.	2025-01-30 13:41:02 +00:00
Liam Thompson	c8dfb4ea9e	[DOCS] Fix missing id syntax (#121264 ) * [DOCS] Fix missing id syntax * Update docs/reference/troubleshooting/common-issues/disk-usage-exceeded.asciidoc * fix id	2025-01-30 12:52:37 +01:00
Jim Ferenczi	dbeb55cb3d	Enable Mapped Field Types to Override Default Highlighter (#121176 ) This commit introduces the `MappedFieldType#getDefaultHighlighter`, allowing a specific highlighter to be enforced for a field. The semantic field mapper utilizes this new functionality to set the `semantic` highlighter as the default. All other fields will continue to use the `unified` highlighter by default.	2025-01-29 21:55:53 +00:00
Slobodan Adamović	c5ab17c3aa	Deprecate certificate-based remote cluster security model (#120806 ) Today, Elasticsearch supports two models to establish secure connections and trust between two Elasticsearch clusters: - API key based security model - Certificate based security model This PR deprecates the _Certificate based security model_ in favour of API key based security model. The _API key based security model_ is preferred way to configure remote clusters, as it allows to follow security best practices when setting up remote cluster connections and defining fine-grained access control. Users are encouraged to migrate remote clusters from certificate to API key authentication.	2025-01-29 19:43:04 +01:00
Kuni Sen	a0f1856a40	(Doc+) Expand watermark resolution (#119174 ) * (Doc+) Expand watermark resolution Relaunch https://github.com/elastic/elasticsearch/pull/116892 since the original one seems to be outdated and hard to update branch. * Apply suggestions from code review Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com> --------- Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>	2025-01-29 19:31:50 +01:00
Luiz Santos	c0f3024c3f	Make it clear that previous enrich indices are deleted every 15 minutes (#109085 ) Before this change, one could interpret that enrich policies are executed every 15 minutes, which is not true.	2025-01-29 19:28:43 +01:00
Liam Thompson	9edd64e608	[DOCS] Fix failing docs test (at least try) (#118934 ) Fix failing docs test: * Unmute test * Replace hardcoded values with regex in snippet test	2025-01-29 19:21:58 +01:00
Nikolaj Volgushev	51b4fffb5e	Default to `SSHA-256` as API key stored credential hasher (#120997 ) API keys are high-entropy secure random strings. This means that the additional work factor of functions like PBKDF or bcrypt are not necessary, and a faster hash function like salted SHA-256 provides adequate security against offline attacks (hash collision, brute force, etc.). This PR adds `SSHA-256` to the list of supported stored hash algorithms for API key secrets, and makes it the default algorithm. Additionally, this PR changes the format of API key secrets, moving from an encoded UUID to a random string which increase the entropy of API keys from 122 bits to 128 bits, without changing overall secret length. Relates: ES-9504	2025-01-30 05:14:15 +11:00
Michael Peterson	d3f20e5b4b	Updated resolve/cluster end user docs with information about the timeout flag and no index expression endpoint (#121199 )	2025-01-29 18:22:40 +01:00
Stanislav Malyshev	3669e061d4	Fix typo in docs example (#121206 )	2025-01-29 09:44:42 -07:00
Peter Straßer	6b76457a23	Fix syntax errors in the rescore retriever example (#121024 )	2025-01-29 16:10:59 +01:00
Michael Peterson	e9b877e58b	Clarify the behavior of remote/info and resolve/cluster for connected status of remotes (#118993 )	2025-01-29 10:08:25 -05:00
Kathleen DeRusso	4b4c59de7f	Fix error in docs code snippet (#121187 )	2025-01-29 16:05:05 +01:00
Benjamin Trent	038aab864e	Mark bbq indices as GA and add rolling upgrade integration tests (#121105 ) With the introduction of our new backing algorithm and making rescoring easier with the `rescore_vector` API, let's mark bbq as GA. Additionally, this commit adds rolling upgrade tests to ensure stability.	2025-01-30 01:58:08 +11:00
Pat Whelan	9009606a47	[Transform] add support for extended_stats (#120340 ) Building off of `stats` and multi-value aggregations, including the limitation: - all values of extended_stats will be mapped to `double` if mapping deduction is used Relates #51925	2025-01-29 15:33:16 +01:00
Martijn van Groningen	952bf229fb	Conditionally enable logsdb by default (#121049 ) Enable logsdb by default if logsdb.prior_logs_usage has not been set to true. Meaning that if no data streams were created matching with the logs-- pattern in 8.x, then logsdb will be enabled by default for data streams matching with logs-- pattern. Also removes LogsPatternUsageService as with version 9.0 and beyond, this component is no longer necessary. Followup from #120708 Closes #106489	2025-01-29 15:03:28 +01:00
Liam Thompson	f5f0e3bd7f	[DOCS] Update getting-started.asciidoc (#116151 ) (#121173 ) Update `new_field` to `language` which is the actual new field added in dynamic mapping Co-authored-by: Ekwinder <ekwindersaini@gmail.com>	2025-01-30 00:52:11 +11:00
Valeriy Khakhutskyy	15b93fefdb	Extend documentation note. (#121146 )	2025-01-29 13:03:42 +01:00
Jihyun(Brian) Jeong	e1207398c7	(Doc+) Clarify dimension field requirements for time_series aggregation (#119442 ) * (Doc+) Clarify dimension field requirements for time_series aggregation 👋 howdy, team! This PR adds a note explaining that time series indices require: - index.mode set to "time_series" - at least one dimension field with time_series_dimension: true - a routing_path array listing those dimension fields Without these settings, the time_series aggregation may return empty buckets or behave unexpectedly. By emphasizing the dimension field requirement, we help users configure their time series indices correctly and see meaningful aggregation results. * Apply suggestions from code review Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com> --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>	2025-01-29 13:03:11 +01:00
Stef Nestor	31597b3897	(Doc+) System Index definition (#120327 )	2025-01-29 11:14:36 +01:00
Kofi B	5bcd170a0b	[DOCS] Added additional context to page (#120569 )	2025-01-29 09:48:25 +01:00
Kofi B	2258911112	[DOCS] Search multiple indices added info (#120572 ) * [DOCS] Search multiple indices added info * Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc Co-authored-by: George Wallace <georgewallace@users.noreply.github.com> * Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc Co-authored-by: George Wallace <georgewallace@users.noreply.github.com> * Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc Co-authored-by: George Wallace <georgewallace@users.noreply.github.com> * Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc Co-authored-by: George Wallace <georgewallace@users.noreply.github.com> * Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc Co-authored-by: George Wallace <georgewallace@users.noreply.github.com> --------- Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>	2025-01-29 09:46:39 +01:00
Kofi B	63a890e30d	[DOCS] Upsert documentation clarification (#120684 ) Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-01-29 09:46:01 +01:00
Martijn van Groningen	8185cafaf2	Emit deprecation warning when executing one of the rollup APIs (#113131 ) Relates to #112690	2025-01-29 08:48:53 +01:00
Parker Timmins	635a4c21de	Add docs for reindex data stream REST endpoints (#120653 ) Add documentation for new REST endpoints related to data stream upgrade. Endpoints: - /_migration/reindex - /_migration/reindex/{index}/_status - /_migration/reindex/{index}/_cancel - /_create_from/{source}/{dest}	2025-01-28 19:44:56 -06:00
Lisa Cawley	bfeba89e0c	[DOCS] Move ML function reference out of appendix (#121111 )	2025-01-28 23:56:24 +01:00
Panagiotis Bailis	375814d007	Adding linear retriever to support weighted sums of sub-retrievers (#120222 )	2025-01-28 19:33:12 +02:00
George Wallace	1a05f41a71	Adjusted alias doc for clarity (#120437 ) (#121064 ) Co-authored-by: Kofi B <kofi.bartlett@elastic.co> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-01-29 03:52:52 +11:00
István Zoltán Szabó	b925b0cbcc	[DOCS] Adds anomaly detection info to migration guide (#121015 ) Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com>	2025-01-28 17:50:37 +01:00
István Zoltán Szabó	08255da9ac	[DOCS] Fixes max_chunk_size parameter name. (#121052 )	2025-01-28 17:10:08 +01:00
István Zoltán Szabó	7400a14995	[DOCS] Documents that deployment_id can be used as inference_id in certain cases. (#121055 )	2025-01-28 17:01:18 +01:00
Slobodan Adamović	953f1749a4	[Docs] Update Query Roles API documentation (#120740 ) The query role API now returns built-in roles as well. This PR notes this and adds an example on how the built-in roles can be filtered out.	2025-01-28 16:29:50 +01:00
Liam Thompson	3939198477	Update match-phrase-query.asciidoc (#118828 ) (#121033 ) (cherry picked from commit `8e9cccba6a`) Co-authored-by: Damien RENIER <153135842+damien-renier-elastic@users.noreply.github.com>	2025-01-28 16:19:14 +01:00
Panagiotis Bailis	8e2044de15	Normalize negative scores for text_similarity_reranker retriever (#120930 )	2025-01-28 16:56:47 +02:00
Amine GANI	38ea49a1b9	Fix incorrect use of "updateable" flag in synonyms documentation (#120866 ) Co-authored-by: Amine GANI <amine.gani@adelean.com> Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com>	2025-01-28 15:39:25 +01:00
Carlos Delgado	a87bd7ae26	ESQL - Allow full text functions disjunctions for non-full text functions (#120291 )	2025-01-28 14:08:13 +01:00
Roberto Seldner	ddc2362592	Update async-search.asciidoc - Indicating `search.max_async_search_response_size` is a Dynamic (#112758 ) Indicating `search.max_async_search_response_size` is a Dynamic setting here as it does not appear to be documented elsewhere.	2025-01-28 11:39:10 +01:00
Pius Fung	38b0e925f5	Add warning on scripted metric aggregation's intermediate state memory usage (#119379 )	2025-01-28 11:10:43 +01:00
Sean Story	636e3645ac	Clarify need to submit for authorization (#119460 )	2025-01-28 11:09:15 +01:00
István Zoltán Szabó	7837a96ce5	[DOCS] Adds EIS reference docs (#120706 )	2025-01-28 11:02:28 +01:00
Sylvain Morin	e18baa12fa	Minor fix in documentation (#119385 ) Co-authored-by: Iraklis Psaroudakis <kingherc@gmail.com>	2025-01-28 10:56:33 +01:00
Charlotte Hoblik	ee0ad557e6	Fix typo in tutorial (#120928 )	2025-01-28 10:54:20 +01:00
Carlos Delgado	d91d51600e	ESQL - Add Match function options (#120360 )	2025-01-28 08:54:33 +01:00
Lee Hinman	e0f5a60d32	Document that disabling stack templates is not recommended (#120963 ) There are many features of the Elasticsearch ecosystem that may malfunction, or fail to work entirely, if these templates are not installed. This commit adds documentation cautioning against disabling the installation of templates.	2025-01-27 15:17:48 -07:00
Joe Gallo	9bc9ba788b	Add a replicate_for option to the ILM searchable_snapshot action (#119003 )	2025-01-27 14:32:46 -05:00
Mark Tozzi	5b3436dce0	Esql - Support date nanos in date extract function (#120727 ) Resolves https://github.com/elastic/elasticsearch/issues/110000 Add support for running the date extract function on nanosecond dates.	2025-01-27 14:34:50 +00:00
Kostas Krikellas	3532d0bb10	[DOCS] Update documentation for index sorting and routing for logsdb (#120721 ) * [DOCS] Update documentation for index sorting and routing for logsdb * update * Apply suggestions from code review Co-authored-by: Marci W <333176+marciw@users.noreply.github.com> * Update logs.asciidoc * Update docs/reference/data-streams/logs.asciidoc Co-authored-by: Marci W <333176+marciw@users.noreply.github.com> * Update logs.asciidoc --------- Co-authored-by: Marci W <333176+marciw@users.noreply.github.com>	2025-01-27 16:21:28 +02:00
Luigi Dell'Aquila	a0840a0463	EQL: set allow_partial_search_results=true by default (#120267 )	2025-01-27 10:23:34 +00:00
Tim Sullivan	7d7a9d9fdb	[Index Management] Doc updates for Kibana Reporting built-ins (#120829 ) * [Index Management] Doc updates for Kibana Reporting built-ins * Update docs/reference/indices/index-templates.asciidoc Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com> --------- Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>	2025-01-24 20:48:33 +00:00
Carlos Delgado	f61f139653	Match, Like and RLike operators improved docs (#120504 )	2025-01-24 07:58:10 +01:00
Mark Tozzi	7e43605e38	Esql Support date nanos on date diff function (#120645 ) Resolves #109999 This adds support for date nanos in the date diff function, as well as mixed nanos/millis use cases. --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-01-23 18:04:38 +00:00
Stanislav Malyshev	f27f74666f	ES\|QL async queries: Partial result on demand (#118122 ) Add capability to stop async query on demand The theory: - User initiates async search request - User sends the stop request (POST _query/async/<ID>/stop) - If the async is finished by that time, it's like regular async get - If it's not finished, the sinks are closed and the request is forcefully finished	2025-01-23 10:21:52 -07:00
Nik Everett	eae93a2097	ESQL: Signatures for `NOT IN` et al (#120673 ) * ESQL: Signatures for `NOT IN` et al This generates signatures for `NOT IN`, `NOT LIKE`, and `NOT RLIKE` using a small hack on top of the process used to generate the signatures for `IN`, `LIKE`, and `RLIKE`. This is a very perl-worth hack, replacing `LIKE` with `NOT LIKE` in the description. But it's useful for our kibana friends and if we need to make it nicer we can do so later. * Zap	2025-01-23 10:57:53 -05:00
Oleksandr Kolomiiets	cdff3defde	Fix typo in synthetic source docs (#120685 )	2025-01-23 07:51:58 -08:00
István Zoltán Szabó	443f0f3ded	[DOCS] Adds note about differences between chat completion and stream API (#120636 )	2025-01-23 14:41:12 +01:00
Liam Thompson	bb0d0ed6dd	Removes outdated admonition (#120556 ) (#120703 ) Resolves /security-docs/https://github.com/elastic/security-docs/issues/6430. Removes an outdated admonition. (cherry picked from commit `63074d8e70`) Co-authored-by: Benjamin Ironside Goldstein <91905639+benironside@users.noreply.github.com>	2025-01-23 14:08:27 +01:00
Marci W	abeb60ff1e	[DOCS] Count API: clarify ways to specify search query (#120564 ) * Clarify query methods; other sprucing * Apply suggestions from review	2025-01-22 18:05:00 -05:00
Michael Peterson	b3a032cc4e	Resolve/cluster allows querying for cluster info only (no index expression required) (#119898 ) Resolve/cluster allows querying for cluster-info-only (no index expression required) This enhancement provides users with the ability to query the _resolve/cluster API endpoint without specifying an index expression to match against. This allows users to quickly test what remote clusters are configured on a cluster and whether they are available for querying. The new endpoint takes no index expression: ``` GET _resolve/cluster ``` and returns the same information as before except for the "matching_indices" field. Example response: ``` { "remote1": { "connected": false, "skip_unavailable": true }, "remote2": { "connected": true, "skip_unavailable": false, "version": { "number": "8.17.0", "build_flavor": "default", "minimum_wire_compatibility_version": "7.17.0", "minimum_index_compatibility_version": "7.0.0" } } } ``` For backwards compatibility, this new endpoint works with clusters from older versions by querying with the index expression `dummy*` on those older clusters and ignoring the matching_indices value in the response they return.	2025-01-22 12:17:29 -05:00
Andrei Stefan	cdf7be27ea	Update search-across-clusters.asciidoc to reflect the `true` default value of `skip_unavailable` setting. (#120592 )	2025-01-22 16:04:56 +02:00
Pete Gillin	b8bf111830	Remove telemetry related to frozen indices (#119890 ) This deprecated feature is being removed in 9.0, so the telemetry is no longer needed. The usage action is retained to support mixed v8/v9 clusters, with annotations to remove in V10. But it is no longer registered in `XPackUsageFeatureAction.ALL` and so the usage data is no longer reported by `GET _xpack/usage`, and if invoked it always returns a count of 0. ES-9736 # comment Removed the telemetry in https://github.com/elastic/elasticsearch/pull/119890	2025-01-22 11:19:15 +00:00
Jim Ferenczi	1db194df22	Add Multi-Field Support for Semantic Text Fields (#120128 ) Semantic text fields now support multi-fields, either as part of a multi-field structure or containing multi-fields internally. This enhancement aligns with the semantic text field's current behavior as a standard text field. Note: Multi-field support is only available for the new index format. Attempting to set a multi-field on an index created with the older format will still result in a failure.	2025-01-21 22:01:11 +01:00
Panagiotis Bailis	3e6b8bf51a	Fix for rrf documentation test using a knn retriever (#120112 )	2025-01-21 19:32:45 +02:00
Tommaso Teofili	1b1296ef54	Move scoring in ES\|QL out of snapshot (#120354 ) * Move scoring in ES\|QL out of snapshot --------- Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com>	2025-01-21 14:22:19 +01:00
István Zoltán Szabó	c60b3be6c7	[DOCS] Rename inference services to inference integrations in docs (#120212 ) Co-authored-by: David Kyle <david.kyle@elastic.co>	2025-01-21 11:19:44 +01:00
Liam Thompson	18b281ea16	[DOCS] Updated wording for clarity for new users (#120257 ) (#120507 ) Co-authored-by: Kofi B <kofi.bartlett@elastic.co>	2025-01-21 20:32:20 +11:00
Liam Thompson	8b00d503a1	[DOCS] Update wildcard query documentation (#120251 ) (#120502 ) Co-authored-by: Kofi B <kofi.bartlett@elastic.co>	2025-01-21 20:29:38 +11:00
Charlotte Hoblik	c760d73c55	Fix aggregation typo (#120461 )	2025-01-20 11:38:50 +01:00
Carlos Delgado	aea4853069	[Docs] kNN vector rescoring for quantized vectors (#118425 )	2025-01-17 17:02:09 +01:00
Iván Cea Fontenla	acb46af612	ESQL: Fix ROUND() with unsigned longs throwing in some edge cases (#119536 ) There were different error cases with `ROUND(number, decimals)`: - Decimals accepted unsigned longs, but threw a 500 with a `can't process [unsigned_long -> long]` in the cast evaluator - Fixed by improving the `resolveType()` - If the number was a BigInteger unsigned long, there were 2 cases throwing an exception: 1. Negative decimals outside the range of integer: Error 2. Negative decimals insie the range of integer, but "big enough" for `BigInteger.TEN.pow(...)` to throw a `BigInteger would overflow supported range` 3. -19 decimals with big unsigned longs like `18446744073709551615` was throwing an `unsigned_long overflow` Also, when the number is a BigInteger and the decimals is a big negative (but not big enough to throw), it may be very slow. Taking _many_ seconds for a single computation (It tries to calculate a `10^(big number)`. I didn't do anything here, but I wonder if we should limit it. To solve most of the cases, a warnExceptions was added for the overflow case, and a guard clause to return 0 for <-19 decimals on unsigned longs. Another issue is that rounding to a number like 7 to -1 returns 0 instead of 10, which may be considered an error. But it's consistent, so I'm leaving it to another PR	2025-01-17 13:38:14 +00:00
Nik Everett	1c13465991	ESQL: Move more test type error testing (#119945 ) This reduces the number of test cases in ESQL a little more ala #119678. It migrates a few random tests and all of the multivalue functions: ``` 92775 -> 43760 3m45 -> 4m04 ``` This adds a few more error test cases that were missing to make sure it all lines up well. And it fixes a few error messages in a few functions. That's likely where the extra time goes.	2025-01-16 20:27:27 +00:00
Nik Everett	ec0cab9a1a	Add operator to ESQL signature for kibana (#120230 ) This adds a field to the kibana defintion files for each signature that looks like: ``` "operator": "+", ``` Kibana wants these symbols.	2025-01-16 19:50:18 +00:00
Lisa Cawley	3129851b8f	[DOCS] Move settings out of reindex API (#120260 )	2025-01-16 09:30:20 -08:00
Jedr Blaszyk	0317c1ce36	[Connector API] Support hard deletes with new URL param in delete endpoint (#120200 ) * [Connector API] Add hard delete support * Undo accidental change * undo accidental build gradle change * Tweak typos * Update docs/changelog/120200.yaml * [CI] Auto commit changes from spotless * Fix yaml test * Actually skip the feature check since we don't have the feature anyway --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-01-16 09:45:10 +01:00
Brandon Morelli	c0f54a94be	Update redirects.asciidoc (#120249 )	2025-01-15 21:58:01 -05:00
George Wallace	1a4c862dd4	Added additional entries for troubleshooting unhealthy cluster (#119914 ) (#120233 ) * Added additional entries for troubleshooting unhealthy cluster Reordered "Re-enable shard allocation" because not as common as other causes Added additional causes of yellow statuses Changed watermark commadn to include high and low watermark so users can make their cluster operate once again. * Drive-by copyedit with suggestions for concision and some formatting fixes. * Concision and some formatting fixes. * Colon added * Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc * Title change * Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc * Spelling fix * Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc * Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc * Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc * Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc --------- Co-authored-by: Kofi B <seanziee@gmail.com> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>	2025-01-16 07:25:57 +11:00
Pat Whelan	958a861cd0	[ML] Update docs to say PUT instead of POST (#120215 )	2025-01-15 13:50:20 -05:00
Mark Tozzi	2708463e12	Esql - support date nanos in date format function (#120143 ) This adds support for passing Date Nanos into the Date Format function. It works for both the single argument and two argument versions. Format strings are unchanged, as the same formatting logic works for both resolutions. resolves #109994 --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-01-15 16:51:08 +00:00
István Zoltán Szabó	defbd96b96	[DOCS] Clarifies param description of model_size_bytes. (#120190 )	2025-01-15 13:14:41 +01:00
Liam Thompson	f7f8ab0012	[DOCS] More targeted link for ESQL in CCS overview (#120125 )	2025-01-15 10:32:33 +01:00
Mark Tozzi	2482f06f3c	ESQL - docs for to_date_nanos (#120124 ) I forgot to link the ToDateNanos docs when I merged that function. --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-01-14 16:31:24 -05:00
Pete Gillin	d85b90ad8c	Remove unfreeze REST endpoint (#119227 ) This adds a sentence to `redirects.asciidoc` explaining what frozen indices were - otherwise, everything will point to the message about the unfreeze API having gone away, which is not very helpful. Some cross-references are updated to point to this rather than to the notice about the removal of the unfreeze API. ES-9736 #comment Removed `_unfreeze` REST endpoint in https://github.com/elastic/elasticsearch/pull/119227	2025-01-14 10:34:46 +00:00
Ioana Tagirta	f5ac68df95	ESQL: Document support for semantic_text field mapping (#120052 ) * Document support for semantic_text field mapping * Address review comments	2025-01-13 22:18:47 +01:00
Nik Everett	c990377c95	ESQL: Limit memory usage of `fold` (#118602 ) `fold` can be surprisingly heavy! The maximally efficient/paranoid thing would be to fold each expression one time, in the constant folding rule, and then store the result as a `Literal`. But this PR doesn't do that because it's a big change. Instead, it creates the infrastructure for tracking memory usage for folding as plugs it into as many places as possible. That's not perfect, but it's better. This infrastructure limit the allocations of fold similar to the `CircuitBreaker` infrastructure we use for values, but it's different in a critical way: you don't manually free any of the values. This is important because the plan itself isn't `Releasable`, which is required when using a real CircuitBreaker. We could have tried to make the plan releasable, but that'd be a huge change. Right now there's a single limit of 5% of heap per query. We create the limit at the start of query planning and use it throughout planning. There are about 40 places that don't yet use it. We should get them plugged in as quick as we can manage. After that, we should look to the maximally efficient/paranoid thing that I mentioned about waiting for constant folding. That's an even bigger change, one I'm not equipped to make on my own.	2025-01-13 15:04:27 +00:00
Jonathan Buttner	838a41a839	[ML] Adding docs for the unified inference API (#118696 ) * Including examples * Using js instead of json * Adding unified docs to main page * Adding missing description text * Refactoring to remove unified route * Addign back references to the _unified route * Update docs/reference/inference/chat-completion-inference.asciidoc Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> * Address feedback --------- Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>	2025-01-13 09:48:23 -05:00
Mark Tozzi	e9f2d78923	Esql additional date format testing (#120000 ) This wires up the randomized testing for DateFormat. Prior to this PR, none of the randomized testing was hitting the one parameter version of the function, so I wired that up as well. This required some compromises on the type signatures, see comments in line.less --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-01-13 14:11:52 +00:00

... 2 3 4 5 6 ...

12664 Commits