12664 Commits

Author SHA1 Message Date
Craig Taverner efe7379e67
Split ESQL functions/operators docs files (#123904)
* Port from asciidocalypse

* Fix links for operator lists

* Remove unused image files after moving/editing them

* Fix lists links

* Fix like/rlike links

* Fix remaining bad references to /elasticsearch/docs

* Fix logstash and beats references

* Fix logstash and beats references

* Fix image links
2025-03-04 14:59:31 +01:00
John Wagster be577e382d
Update Flatten Graph Docs to Include a Real Flattened Graph 9.x (#123901)
updated flatten graph docs to include a real flattened graph
2025-03-03 14:33:53 -06:00
Colleen McGinnis db5acd8976
add missing pages (#123774) 2025-03-03 15:02:51 +00:00
Liam Thompson 6b27e420fe
Cleanup search connectors, add some reference -> docs content signposts in various sections (#123733) 2025-02-28 17:10:09 +00:00
Liam Thompson 91c2654570
Fix broken cross-repo links, versions in search connectors docker instructions (#123700) 2025-02-28 16:02:54 +01:00
Colleen McGinnis b7e3a1e14b
[docs] Migrate docs from AsciiDoc to Markdown (#123507)
* delete asciidoc files

* add migrated files

* fix errors

* Disable docs tests

* Clarify release notes page titles

* Revert "Clarify release notes page titles"

This reverts commit 8be688648d.

* Comment out external URI images

* Clean up query languages landing pages, link to conceptual docs

* Add .md to url

* Fixes inference processor nesting.

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: Liam Thompson <leemthompo@gmail.com>
Co-authored-by: Martijn Laarman <Mpdreamz@gmail.com>
Co-authored-by: István Zoltán Szabó <szabosteve@gmail.com>
2025-02-27 17:56:14 +01:00
Kathleen DeRusso ae6474db63
Deprecate Behavioral Analytics CRUD apis (#122960)
* Deprecate Behavioral Analytics CRUD APIs

* Add allowed warning for REST Compatibility tests

* Update docs/changelog/122960.yaml

* Update changelog

* Update docs to add deprecation flags and fix failing tests

* Update changelog

* Update changelog again

* Update docs formatting

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Skip asciidoc test

---------

Co-authored-by: Efe Gürkan YALAMAN <efeyalaman@gmail.com>
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: Efe Gürkan YALAMAN <efeguerkan.yalaman@elastic.co>
2025-02-25 16:02:50 +01:00
Craig Taverner ec82c24a87
Add support to VALUES aggregation for spatial types (#122886)
The original work at https://github.com/elastic/elasticsearch/pull/106065 did not support geospatial types with this comment:

> I made this work for everything but geo_point and cartesian_point because I'm not 100% sure how to integrate with those. We can grab those in a follow up.

The geospatial types should be possible to collect using the VALUES aggregation with similar behavior to the `ST_COLLECT` OGC function, based on the Elasticsearch convention that treats multi-value geospatial fields as behaving similarly to any geometry collection. So this implementation is a trivial addition to the existing values types support.
2025-02-25 11:38:51 +01:00
Luke Whiting e3792d19b5
Allow data stream reindex tasks to be re-run after completion (#122510)
* Allow data stream reindex tasks to be re-run after completion

* Docs update

* Update docs/reference/migration/apis/data-stream-reindex.asciidoc

Co-authored-by: Keith Massey <keith.massey@elastic.co>

---------

Co-authored-by: Keith Massey <keith.massey@elastic.co>
2025-02-20 15:03:51 +00:00
David Turner cdaa5dd7ad
Clarify breaking change note for #112903 (#122998)
Closes #122994
2025-02-20 12:11:56 +00:00
Lee Hinman 2ae80c799d
Allow setting the `type` in the reroute processor (#122409)
* Allow setting the `type` in the reroute processor

This allows configuring the `type` from within the ingest `reroute` processor. Similar to `dataset`
and `namespace`, the type defaults to the value extracted from the index name. This means that
documents sent to `logs-mysql.access.default` will have a default value of `logs` for the type.
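
A minimal sketch of the defaulting described above, assuming index names follow the `<type>-<dataset>-<namespace>` data stream naming scheme. The function name and the example index name are illustrative and not taken from the processor's source:

```python
def default_reroute_fields(index_name: str) -> dict:
    """Split an index name of the assumed form <type>-<dataset>-<namespace>."""
    # Split on the first two hyphens only, so any later hyphen stays in the namespace.
    parts = index_name.split("-", 2)
    if len(parts) != 3:
        raise ValueError(f"{index_name!r} does not match <type>-<dataset>-<namespace>")
    type_, dataset, namespace = parts
    return {"type": type_, "dataset": dataset, "namespace": namespace}


# Hypothetical index name used only for illustration.
fields = default_reroute_fields("logs-mysql.access-default")
```

Here `fields["type"]` is `logs`, which is the value the processor would fall back to when no explicit `type` is configured.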

Resolves #121553

* Update docs/changelog/122409.yaml
2025-02-18 12:38:00 -07:00
Nik Everett df2f3b3b3f
ESQL: Update kibana signatures (#121951)
This updates the kibana signature json files in two ways:
* Renames `eval` to `scalar` - that's the name we use inside of ESQL and
  we may as well make the name the same.
* Calls the `CATEGORIZE` and `BUCKET` function `grouping` because they
  can only be used in the "grouping" positions of the `STATS` command.

Closes #113411
2025-02-07 09:51:09 -05:00
Fang Xing f58fdf81e9
[ES|QL] Change function_named_parameters in Kibana doc to expected format (#121585)
* change function_named_parameters in kibana doc to expected format
2025-02-04 12:20:34 -05:00
elasticsearchmachine 69bdf465b0 Bump to version 9.1.0 2025-01-30 16:55:46 +00:00
Jim Ferenczi fb3c666663
Remove outdated reference to internal semantic text format (#121276)
The semantic text format was updated in #119183. This commit removes the last remaining reference to the old format from the documentation to ensure consistency.
2025-01-30 15:01:55 +01:00
Chris Hegarty 4baffe4de1
Upgrade to Lucene 10.1.0 (#119308)
This commit upgrades to Lucene 10.1.0.
2025-01-30 13:41:02 +00:00
Liam Thompson c8dfb4ea9e
[DOCS] Fix missing id syntax (#121264)
* [DOCS] Fix missing id syntax

* Update docs/reference/troubleshooting/common-issues/disk-usage-exceeded.asciidoc

* fix id
2025-01-30 12:52:37 +01:00
Jim Ferenczi dbeb55cb3d
Enable Mapped Field Types to Override Default Highlighter (#121176)
This commit introduces the `MappedFieldType#getDefaultHighlighter`, allowing a specific highlighter to be enforced for a field.
The semantic field mapper utilizes this new functionality to set the `semantic` highlighter as the default.
All other fields will continue to use the `unified` highlighter by default.
2025-01-29 21:55:53 +00:00
Slobodan Adamović c5ab17c3aa
Deprecate certificate-based remote cluster security model (#120806)
Today, Elasticsearch supports two models to establish secure connections
and trust between two Elasticsearch clusters:

- API key based security model
- Certificate based security model

This PR deprecates the _Certificate based security model_ in favour of *API key based security model*.
The _API key based security model_ is the preferred way to configure remote clusters,
as it allows following security best practices when setting up remote cluster connections
and defining fine-grained access control.

Users are encouraged to migrate remote clusters from certificate to API key authentication.
2025-01-29 19:43:04 +01:00
Kuni Sen a0f1856a40
(Doc+) Expand watermark resolution (#119174)
* (Doc+) Expand watermark resolution

Relaunch https://github.com/elastic/elasticsearch/pull/116892 since the original one seems to be outdated and hard to update branch.

* Apply suggestions from code review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2025-01-29 19:31:50 +01:00
Luiz Santos c0f3024c3f
Make it clear that previous enrich indices are deleted every 15 minutes (#109085)
Before this change, one could interpret that enrich policies are executed every 15 minutes, which is not true.
2025-01-29 19:28:43 +01:00
Liam Thompson 9edd64e608
[DOCS] Fix failing docs test (at least try) (#118934)
Fix failing docs test:

* Unmute test
* Replace hardcoded values with regex in snippet test
2025-01-29 19:21:58 +01:00
Nikolaj Volgushev 51b4fffb5e
Default to `SSHA-256` as API key stored credential hasher (#120997)
API keys are high-entropy secure random strings. This means that the
additional work factor of functions like PBKDF or bcrypt is not
necessary, and a faster hash function like salted SHA-256 provides
adequate security against offline attacks (hash collision, brute force,
etc.).

This PR adds `SSHA-256` to the list of supported stored hash algorithms
for API key secrets, and makes it the default algorithm. Additionally,
this PR changes the format of API key secrets, moving from an encoded
UUID to a random string, which increases the entropy of API keys from 122
bits to 128 bits, without changing overall secret length. 
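
The entropy figures and the salted-hash idea can be sketched as follows. This is an illustration only: the secret encoding, salt length, and function names are assumptions, not the format Elasticsearch actually stores.

```python
import hashlib
import secrets
from typing import Optional, Tuple

# A version-4 UUID is 128 bits wide, but 4 version bits and 2 variant bits
# are fixed, which leaves 122 random bits.
UUID4_RANDOM_BITS = 128 - 4 - 2


def new_secret() -> str:
    """Return 128 random bits as 32 hex characters (hypothetical encoding)."""
    return secrets.token_hex(16)


def hash_secret(secret: str, salt: Optional[bytes] = None) -> Tuple[bytes, bytes]:
    """Salted SHA-256: a single fast hash, relying on the secret's high entropy."""
    salt = secrets.token_bytes(16) if salt is None else salt
    digest = hashlib.sha256(salt + secret.encode("utf-8")).digest()
    return salt, digest


def verify_secret(secret: str, salt: bytes, expected: bytes) -> bool:
    """Recompute the digest and compare in constant time."""
    _, digest = hash_secret(secret, salt)
    return secrets.compare_digest(digest, expected)
```

A 32-character hex string is the same length as a UUID written without hyphens, which is one way the secret length could stay unchanged while the random bits go from 122 to 128.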

Relates: ES-9504
2025-01-30 05:14:15 +11:00
Michael Peterson d3f20e5b4b
Updated resolve/cluster end user docs with information about the timeout flag and no index expression endpoint (#121199) 2025-01-29 18:22:40 +01:00
Stanislav Malyshev 3669e061d4
Fix typo in docs example (#121206) 2025-01-29 09:44:42 -07:00
Peter Straßer 6b76457a23
Fix syntax errors in the rescore retriever example (#121024) 2025-01-29 16:10:59 +01:00
Michael Peterson e9b877e58b
Clarify the behavior of remote/info and resolve/cluster for connected status of remotes (#118993) 2025-01-29 10:08:25 -05:00
Kathleen DeRusso 4b4c59de7f
Fix error in docs code snippet (#121187) 2025-01-29 16:05:05 +01:00
Benjamin Trent 038aab864e
Mark bbq indices as GA and add rolling upgrade integration tests (#121105)
With the introduction of our new backing algorithm and making rescoring
easier with the `rescore_vector` API, let's mark bbq as GA. 

Additionally, this commit adds rolling upgrade tests to ensure
stability.
2025-01-30 01:58:08 +11:00
Pat Whelan 9009606a47
[Transform] add support for extended_stats (#120340)
Building off of `stats` and multi-value aggregations, including the
limitation:
- all values of extended_stats will be mapped to `double` if mapping
  deduction is used

Relates #51925
2025-01-29 15:33:16 +01:00
Martijn van Groningen 952bf229fb
Conditionally enable logsdb by default (#121049)
Enable logsdb by default if logsdb.prior_logs_usage has not been set to true.

Meaning that if no data streams were created matching with the logs-*-* pattern in 8.x, then logsdb will be enabled by default for data streams matching with logs-*-* pattern.

Also removes LogsPatternUsageService as with version 9.0 and beyond, this component is no longer necessary.

Followup from #120708
Closes #106489
2025-01-29 15:03:28 +01:00
Liam Thompson f5f0e3bd7f
[DOCS] Update getting-started.asciidoc (#116151) (#121173)
Update `new_field` to `language` which is the actual new field added in dynamic mapping

Co-authored-by: Ekwinder <ekwindersaini@gmail.com>
2025-01-30 00:52:11 +11:00
Valeriy Khakhutskyy 15b93fefdb
Extend documentation note. (#121146) 2025-01-29 13:03:42 +01:00
Jihyun(Brian) Jeong e1207398c7
(Doc+) Clarify dimension field requirements for time_series aggregation (#119442)
* (Doc+) Clarify dimension field requirements for time_series aggregation

👋 howdy, team!

This PR adds a note explaining that time series indices require:
- index.mode set to "time_series"
- at least one dimension field with time_series_dimension: true
- a routing_path array listing those dimension fields

Without these settings, the time_series aggregation may return empty buckets or behave unexpectedly. By emphasizing the dimension field requirement, we help users configure their time series indices correctly and see meaningful aggregation results.
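
The three requirements can be checked against an index-creation body with a small helper. This is a sketch under assumptions: the helper name and the example field names (`host`, `cpu`) are made up for illustration, and it only handles literal field names in `routing_path`, not wildcards.

```python
def missing_time_series_requirements(index_body: dict) -> list:
    """Return a list of the requirements from the note that the body does not meet."""
    settings = index_body.get("settings", {}).get("index", {})
    properties = index_body.get("mappings", {}).get("properties", {})
    # Fields explicitly marked as dimensions in the mapping.
    dimensions = {
        name for name, field in properties.items()
        if field.get("time_series_dimension") is True
    }
    problems = []
    if settings.get("mode") != "time_series":
        problems.append('index.mode is not "time_series"')
    if not dimensions:
        problems.append("no field sets time_series_dimension: true")
    routing_path = settings.get("routing_path", [])
    if not routing_path:
        problems.append("routing_path is empty")
    elif any(path not in dimensions for path in routing_path):
        problems.append("routing_path lists a field that is not a dimension")
    return problems


# Example body; the field names are hypothetical.
example_body = {
    "settings": {"index": {"mode": "time_series", "routing_path": ["host"]}},
    "mappings": {
        "properties": {
            "@timestamp": {"type": "date"},
            "host": {"type": "keyword", "time_series_dimension": True},
            "cpu": {"type": "double"},
        }
    },
}
```

Calling `missing_time_series_requirements(example_body)` returns an empty list, while a body without `index.mode` or without any dimension field reports each missing piece.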

* Apply suggestions from code review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2025-01-29 13:03:11 +01:00
Stef Nestor 31597b3897
(Doc+) System Index definition (#120327) 2025-01-29 11:14:36 +01:00
Kofi B 5bcd170a0b
[DOCS] Added additional context to page (#120569) 2025-01-29 09:48:25 +01:00
Kofi B 2258911112
[DOCS] Search multiple indices added info (#120572)
* [DOCS] Search multiple indices added info

* Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc

Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>

* Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc

Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>

* Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc

Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>

* Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc

Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>

* Update docs/reference/search/search-your-data/search-multiple-indices.asciidoc

Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>

---------

Co-authored-by: George Wallace <georgewallace@users.noreply.github.com>
2025-01-29 09:46:39 +01:00
Kofi B 63a890e30d
[DOCS] Upsert documentation clarification (#120684)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2025-01-29 09:46:01 +01:00
Martijn van Groningen 8185cafaf2
Emit deprecation warning when executing one of the rollup APIs (#113131)
Relates to #112690
2025-01-29 08:48:53 +01:00
Parker Timmins 635a4c21de
Add docs for reindex data stream REST endpoints (#120653)
Add documentation for new REST endpoints related to data stream upgrade. 
Endpoints:
- /_migration/reindex
- /_migration/reindex/{index}/_status
- /_migration/reindex/{index}/_cancel
- /_create_from/{source}/{dest}
2025-01-28 19:44:56 -06:00
Lisa Cawley bfeba89e0c
[DOCS] Move ML function reference out of appendix (#121111) 2025-01-28 23:56:24 +01:00
Panagiotis Bailis 375814d007
Adding linear retriever to support weighted sums of sub-retrievers (#120222) 2025-01-28 19:33:12 +02:00
George Wallace 1a05f41a71
Adjusted alias doc for clarity (#120437) (#121064)
Co-authored-by: Kofi B <kofi.bartlett@elastic.co>
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2025-01-29 03:52:52 +11:00
István Zoltán Szabó b925b0cbcc
[DOCS] Adds anomaly detection info to migration guide (#121015)
Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com>
2025-01-28 17:50:37 +01:00
István Zoltán Szabó 08255da9ac
[DOCS] Fixes max_chunk_size parameter name. (#121052) 2025-01-28 17:10:08 +01:00
István Zoltán Szabó 7400a14995
[DOCS] Documents that deployment_id can be used as inference_id in certain cases. (#121055) 2025-01-28 17:01:18 +01:00
Slobodan Adamović 953f1749a4
[Docs] Update Query Roles API documentation (#120740)
The query role API now returns built-in roles as well.
This PR notes this and adds an example on how the 
built-in roles can be filtered out.
2025-01-28 16:29:50 +01:00
Liam Thompson 3939198477
Update match-phrase-query.asciidoc (#118828) (#121033)
(cherry picked from commit 8e9cccba6a)

Co-authored-by: Damien RENIER <153135842+damien-renier-elastic@users.noreply.github.com>
2025-01-28 16:19:14 +01:00
Panagiotis Bailis 8e2044de15
Normalize negative scores for text_similarity_reranker retriever (#120930) 2025-01-28 16:56:47 +02:00
Amine GANI 38ea49a1b9
Fix incorrect use of "updateable" flag in synonyms documentation (#120866)
Co-authored-by: Amine GANI <amine.gani@adelean.com>
Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com>
2025-01-28 15:39:25 +01:00
Carlos Delgado a87bd7ae26
ESQL - Allow full text functions disjunctions for non-full text functions (#120291) 2025-01-28 14:08:13 +01:00
Roberto Seldner ddc2362592
Update async-search.asciidoc - Indicating `search.max_async_search_response_size` is a Dynamic (#112758)
Indicating `search.max_async_search_response_size` is a Dynamic setting here as it does not appear to be documented elsewhere.
2025-01-28 11:39:10 +01:00
Pius Fung 38b0e925f5
Add warning on scripted metric aggregation's intermediate state memory usage (#119379) 2025-01-28 11:10:43 +01:00
Sean Story 636e3645ac
Clarify need to submit for authorization (#119460) 2025-01-28 11:09:15 +01:00
István Zoltán Szabó 7837a96ce5
[DOCS] Adds EIS reference docs (#120706) 2025-01-28 11:02:28 +01:00
Sylvain Morin e18baa12fa
Minor fix in documentation (#119385)
Co-authored-by: Iraklis Psaroudakis <kingherc@gmail.com>
2025-01-28 10:56:33 +01:00
Charlotte Hoblik ee0ad557e6
Fix typo in tutorial (#120928) 2025-01-28 10:54:20 +01:00
Carlos Delgado d91d51600e
ESQL - Add Match function options (#120360) 2025-01-28 08:54:33 +01:00
Lee Hinman e0f5a60d32
Document that disabling stack templates is not recommended (#120963)
There are many features of the Elasticsearch ecosystem that may malfunction, or fail to work entirely, if these templates are not installed. This commit adds documentation cautioning against disabling the installation of templates.
2025-01-27 15:17:48 -07:00
Joe Gallo 9bc9ba788b
Add a replicate_for option to the ILM searchable_snapshot action (#119003) 2025-01-27 14:32:46 -05:00
Mark Tozzi 5b3436dce0
Esql - Support date nanos in date extract function (#120727)
Resolves https://github.com/elastic/elasticsearch/issues/110000

Add support for running the date extract function on nanosecond dates.
2025-01-27 14:34:50 +00:00
Kostas Krikellas 3532d0bb10
[DOCS] Update documentation for index sorting and routing for logsdb (#120721)
* [DOCS] Update documentation for index sorting and routing for logsdb

* update

* Apply suggestions from code review

Co-authored-by: Marci W <333176+marciw@users.noreply.github.com>

* Update logs.asciidoc

* Update docs/reference/data-streams/logs.asciidoc

Co-authored-by: Marci W <333176+marciw@users.noreply.github.com>

* Update logs.asciidoc

---------

Co-authored-by: Marci W <333176+marciw@users.noreply.github.com>
2025-01-27 16:21:28 +02:00
Luigi Dell'Aquila a0840a0463
EQL: set allow_partial_search_results=true by default (#120267) 2025-01-27 10:23:34 +00:00
Tim Sullivan 7d7a9d9fdb
[Index Management] Doc updates for Kibana Reporting built-ins (#120829)
* [Index Management] Doc updates for Kibana Reporting built-ins

* Update docs/reference/indices/index-templates.asciidoc

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

---------

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2025-01-24 20:48:33 +00:00
Carlos Delgado f61f139653
Match, Like and RLike operators improved docs (#120504) 2025-01-24 07:58:10 +01:00
Mark Tozzi 7e43605e38
Esql Support date nanos on date diff function (#120645)
Resolves #109999

This adds support for date nanos in the date diff function, as well as mixed nanos/millis use cases.

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-01-23 18:04:38 +00:00
Stanislav Malyshev f27f74666f
ES|QL async queries: Partial result on demand (#118122)
Add capability to stop async query on demand
The theory:

- User initiates async search request
- User sends the stop request (POST _query/async/<ID>/stop)
- If the async is finished by that time, it's like regular async get
- If it's not finished, the sinks are closed and the request is forcefully finished
2025-01-23 10:21:52 -07:00
Nik Everett eae93a2097
ESQL: Signatures for `NOT IN` et al (#120673)
* ESQL: Signatures for `NOT IN` et al

This generates signatures for `NOT IN`, `NOT LIKE`, and `NOT RLIKE`
using a small hack on top of the process used to generate the signatures
for `IN`, `LIKE`, and `RLIKE`. This is a very perl-worthy hack, replacing
`LIKE` with `NOT LIKE` in the description. But it's useful for our
kibana friends and if we need to make it nicer we can do so later.

* Zap
2025-01-23 10:57:53 -05:00
Oleksandr Kolomiiets cdff3defde
Fix typo in synthetic source docs (#120685) 2025-01-23 07:51:58 -08:00
István Zoltán Szabó 443f0f3ded
[DOCS] Adds note about differences between chat completion and stream API (#120636) 2025-01-23 14:41:12 +01:00
Liam Thompson bb0d0ed6dd
Removes outdated admonition (#120556) (#120703)
Resolves https://github.com/elastic/security-docs/issues/6430. Removes an outdated admonition.

(cherry picked from commit 63074d8e70)

Co-authored-by: Benjamin Ironside Goldstein <91905639+benironside@users.noreply.github.com>
2025-01-23 14:08:27 +01:00
Marci W abeb60ff1e
[DOCS] Count API: clarify ways to specify search query (#120564)
* Clarify query methods; other sprucing

* Apply suggestions from review
2025-01-22 18:05:00 -05:00
Michael Peterson b3a032cc4e
Resolve/cluster allows querying for cluster info only (no index expression required) (#119898)
Resolve/cluster allows querying for cluster-info-only (no index expression required)

This enhancement provides users with the ability to query the _resolve/cluster API endpoint without specifying
an index expression to match against. This allows users to quickly test what remote clusters are configured on
a cluster and whether they are available for querying.

The new endpoint takes no index expression:

```
GET _resolve/cluster
```

and returns the same information as before except for the "matching_indices" field. Example response:

```
{
  "remote1": {
    "connected": false,
    "skip_unavailable": true
  },
  "remote2": {
    "connected": true,
    "skip_unavailable": false,
    "version": {
      "number": "8.17.0",
      "build_flavor": "default",
      "minimum_wire_compatibility_version": "7.17.0",
      "minimum_index_compatibility_version": "7.0.0"
    }
  }
}
```

For backwards compatibility, this new endpoint works with clusters from older versions by querying with the index expression `dummy*` on those older clusters and ignoring the matching_indices value in the response they return.
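
As a sketch of how a client might consume this response, the snippet below parses the example payload shown above and lists which remotes are reachable. The helper names are illustrative, and the payload is copied from the example rather than fetched from a cluster.

```python
import json

# Example response body from the commit message above.
RESPONSE = """
{
  "remote1": {"connected": false, "skip_unavailable": true},
  "remote2": {
    "connected": true,
    "skip_unavailable": false,
    "version": {
      "number": "8.17.0",
      "build_flavor": "default",
      "minimum_wire_compatibility_version": "7.17.0",
      "minimum_index_compatibility_version": "7.0.0"
    }
  }
}
"""


def connected_remotes(response: dict) -> list:
    """Return the aliases of remote clusters reported as connected."""
    return sorted(alias for alias, info in response.items() if info.get("connected"))


def remote_versions(response: dict) -> dict:
    """Map each remote that reports a version to its version number."""
    return {
        alias: info["version"]["number"]
        for alias, info in response.items()
        if "version" in info
    }


parsed = json.loads(RESPONSE)
```

With this payload, `connected_remotes(parsed)` yields only `remote2`, and `remote_versions(parsed)` has no entry for `remote1` because a disconnected remote reports no version.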
2025-01-22 12:17:29 -05:00
Andrei Stefan cdf7be27ea
Update search-across-clusters.asciidoc to reflect the `true` default value of `skip_unavailable` setting. (#120592) 2025-01-22 16:04:56 +02:00
Pete Gillin b8bf111830
Remove telemetry related to frozen indices (#119890)
This deprecated feature is being removed in 9.0, so the telemetry is
no longer needed.

The usage action is retained to support mixed v8/v9 clusters, with
annotations to remove in V10. But it is no longer registered in
`XPackUsageFeatureAction.ALL` and so the usage data is no longer
reported by `GET _xpack/usage`, and if invoked it always returns a
count of 0.

ES-9736 # comment Removed the telemetry in https://github.com/elastic/elasticsearch/pull/119890
2025-01-22 11:19:15 +00:00
Jim Ferenczi 1db194df22
Add Multi-Field Support for Semantic Text Fields (#120128)
Semantic text fields now support multi-fields, either as part of a multi-field structure or containing multi-fields internally.
This enhancement aligns with the semantic text field's current behavior as a standard text field.

Note: Multi-field support is only available for the new index format. Attempting to set a multi-field on an index created with the older format will still result in a failure.
2025-01-21 22:01:11 +01:00
Panagiotis Bailis 3e6b8bf51a
Fix for rrf documentation test using a knn retriever (#120112) 2025-01-21 19:32:45 +02:00
Tommaso Teofili 1b1296ef54
Move scoring in ES|QL out of snapshot (#120354)
* Move scoring in ES|QL out of snapshot

---------

Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com>
2025-01-21 14:22:19 +01:00
István Zoltán Szabó c60b3be6c7
[DOCS] Rename inference services to inference integrations in docs (#120212)
Co-authored-by: David Kyle <david.kyle@elastic.co>
2025-01-21 11:19:44 +01:00
Liam Thompson 18b281ea16
[DOCS] Updated wording for clarity for new users (#120257) (#120507)
Co-authored-by: Kofi B <kofi.bartlett@elastic.co>
2025-01-21 20:32:20 +11:00
Liam Thompson 8b00d503a1
[DOCS] Update wildcard query documentation (#120251) (#120502)
Co-authored-by: Kofi B <kofi.bartlett@elastic.co>
2025-01-21 20:29:38 +11:00
Charlotte Hoblik c760d73c55
Fix aggregation typo (#120461) 2025-01-20 11:38:50 +01:00
Carlos Delgado aea4853069
[Docs] kNN vector rescoring for quantized vectors (#118425) 2025-01-17 17:02:09 +01:00
Iván Cea Fontenla acb46af612
ESQL: Fix ROUND() with unsigned longs throwing in some edge cases (#119536)
There were different error cases with `ROUND(number, decimals)`:
- Decimals accepted unsigned longs, but threw a 500 with a `can't process [unsigned_long -> long]` in the cast evaluator
  - Fixed by improving the `resolveType()`
- If the number was a BigInteger unsigned long, there were 3 cases throwing an exception:
  1. Negative decimals outside the range of integer: Error
  2. Negative decimals inside the range of integer, but "big enough" for `BigInteger.TEN.pow(...)` to throw a `BigInteger would overflow supported range`
  3. -19 decimals with big unsigned longs like `18446744073709551615` was throwing an `unsigned_long overflow`

Also, when the number is a BigInteger and the decimals is a big negative (but not big enough to throw), it may be **very** slow, taking _many_ seconds for a single computation (it tries to calculate a `10^(big number)`). I didn't do anything here, but I wonder if we should limit it.

To solve most of the cases, a warnExceptions was added for the overflow case, and a guard clause to return 0 for <-19 decimals on unsigned longs.

Another issue is that rounding a number like 7 to -1 decimals returns 0 instead of 10, which may be considered an error. But it's consistent, so I'm leaving it to another PR.
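
The arithmetic behind the guard clause and the -19 overflow can be sketched in Python. This is not the ESQL implementation: the function name is hypothetical, returning `None` stands in for the warning path, and the sketch assumes half-up rounding, so it gives 10 for `7` at `-1` rather than the 0 described above.

```python
UNSIGNED_LONG_MAX = 2**64 - 1  # 18446744073709551615


def round_unsigned_long(value: int, decimals: int):
    """Half-up rounding of an unsigned long to a (possibly negative) number of decimals."""
    if not 0 <= value <= UNSIGNED_LONG_MAX:
        raise ValueError("value is not an unsigned long")
    if decimals >= 0:
        return value  # an integer has no fractional digits to round
    if decimals < -19:
        # 10**20 is more than twice UNSIGNED_LONG_MAX, so every value rounds to 0
        # and there is no need to compute a huge power of ten.
        return 0
    unit = 10 ** -decimals
    rounded = (value + unit // 2) // unit * unit
    # The rounded value can exceed the unsigned long range (the overflow case).
    return rounded if rounded <= UNSIGNED_LONG_MAX else None
```

With `decimals = -19` the largest unsigned long would round up to `2 * 10**19`, which does not fit in 64 bits, so the sketch returns `None` there, while anything below -19 short-circuits to 0.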
2025-01-17 13:38:14 +00:00
Nik Everett 1c13465991
ESQL: Move more test type error testing (#119945)
This reduces the number of test cases in ESQL a little more ala #119678.
It migrates a few random tests and all of the multivalue functions:
```
92775 -> 43760
 3m45 -> 4m04
```

This adds a few more error test cases that were missing to make sure it all
lines up well. And it fixes a few error messages in a few functions. That's
*likely* where the extra time goes.
2025-01-16 20:27:27 +00:00
Nik Everett ec0cab9a1a
Add operator to ESQL signature for kibana (#120230)
This adds a field to the kibana definition files for each signature that
looks like:
```
  "operator": "+",
```
Kibana wants these symbols.
2025-01-16 19:50:18 +00:00
Lisa Cawley 3129851b8f
[DOCS] Move settings out of reindex API (#120260) 2025-01-16 09:30:20 -08:00
Jedr Blaszyk 0317c1ce36
[Connector API] Support hard deletes with new URL param in delete endpoint (#120200)
* [Connector API] Add hard delete support

* Undo accidental change

* undo accidental build gradle change

* Tweak typos

* Update docs/changelog/120200.yaml

* [CI] Auto commit changes from spotless

* Fix yaml test

* Actually skip the feature check since we don't have the feature anyway

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-01-16 09:45:10 +01:00
Brandon Morelli c0f54a94be
Update redirects.asciidoc (#120249) 2025-01-15 21:58:01 -05:00
George Wallace 1a4c862dd4
Added additional entries for troubleshooting unhealthy cluster (#119914) (#120233)
* Added additional entries for troubleshooting unhealthy cluster

Reordered "Re-enable shard allocation" because not as common as other causes

Added additional causes of yellow statuses

Changed watermark command to include high and low watermark so users can make their cluster operate once again.

* Drive-by copyedit with suggestions for concision and some formatting fixes.

* Concision and some formatting fixes.

* Colon added

* Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc

* Title change

* Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc

* Spelling fix

* Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc

* Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc

* Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc

* Update docs/reference/troubleshooting/common-issues/red-yellow-cluster-status.asciidoc

---------

Co-authored-by: Kofi B <seanziee@gmail.com>
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2025-01-16 07:25:57 +11:00
Pat Whelan 958a861cd0
[ML] Update docs to say PUT instead of POST (#120215) 2025-01-15 13:50:20 -05:00
Mark Tozzi 2708463e12
Esql - support date nanos in date format function (#120143)
This adds support for passing Date Nanos into the Date Format function. It works for both the single argument and two argument versions. Format strings are unchanged, as the same formatting logic works for both resolutions.

resolves #109994

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-01-15 16:51:08 +00:00
István Zoltán Szabó defbd96b96
[DOCS] Clarifies param description of model_size_bytes. (#120190) 2025-01-15 13:14:41 +01:00
Liam Thompson f7f8ab0012
[DOCS] More targeted link for ESQL in CCS overview (#120125) 2025-01-15 10:32:33 +01:00
Mark Tozzi 2482f06f3c
ESQL - docs for to_date_nanos (#120124)
I forgot to link the ToDateNanos docs when I merged that function.
---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-01-14 16:31:24 -05:00
Pete Gillin d85b90ad8c
Remove unfreeze REST endpoint (#119227)
This adds a sentence to `redirects.asciidoc` explaining what frozen
indices were - otherwise, everything will point to the message about
the unfreeze API having gone away, which is not very helpful. Some
cross-references are updated to point to this rather than to the
notice about the removal of the unfreeze API.

ES-9736 #comment Removed `_unfreeze` REST endpoint in https://github.com/elastic/elasticsearch/pull/119227
2025-01-14 10:34:46 +00:00
Ioana Tagirta f5ac68df95
ESQL: Document support for semantic_text field mapping (#120052)
* Document support for semantic_text field mapping

* Address review comments
2025-01-13 22:18:47 +01:00
Nik Everett c990377c95
ESQL: Limit memory usage of `fold` (#118602)
`fold` can be surprisingly heavy! The maximally efficient/paranoid thing
would be to fold each expression one time, in the constant folding rule,
and then store the result as a `Literal`. But this PR doesn't do that
because it's a big change. Instead, it creates the infrastructure for
tracking memory usage for folding and plugs it into as many places as
possible. That's not perfect, but it's better.

This infrastructure limits the allocations of fold similar to the
`CircuitBreaker` infrastructure we use for values, but it's different
in a critical way: you don't manually free any of the values. This is
important because the plan itself isn't `Releasable`, which is required
when using a real CircuitBreaker. We could have tried to make the plan
releasable, but that'd be a huge change.

Right now there's a single limit of 5% of heap per query. We create the
limit at the start of query planning and use it throughout planning.
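
The idea of a per-query budget that is charged but never manually released can be sketched as below. The class and method names are hypothetical and the heap size is passed in by the caller; this is not the ESQL code, only an illustration of the accounting pattern.

```python
class FoldBudget:
    """Hypothetical allocation budget: usage only grows, nothing is freed by hand."""

    def __init__(self, heap_bytes: int, percent: int = 5):
        # Integer arithmetic keeps the limit exact: 5% of heap by default.
        self.limit = heap_bytes * percent // 100
        self.used = 0

    def track(self, nbytes: int) -> None:
        """Charge an allocation made while folding; fail once the budget is exceeded."""
        if self.used + nbytes > self.limit:
            raise MemoryError(
                f"folding would use {self.used + nbytes} bytes, limit is {self.limit}"
            )
        self.used += nbytes


# One budget is created per query and shared by every fold call during planning.
budget = FoldBudget(heap_bytes=1000)
budget.track(30)
budget.track(20)
```

Because there is no `release` method, the object holding the budget does not need to be releasable; it is simply discarded with the plan when the query finishes.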

There are about 40 places that don't yet use it. We should get them
plugged in as quick as we can manage. After that, we should look to the
maximally efficient/paranoid thing that I mentioned about waiting for
constant folding. That's an even bigger change, one I'm not equipped
to make on my own.
2025-01-13 15:04:27 +00:00
Jonathan Buttner 838a41a839
[ML] Adding docs for the unified inference API (#118696)
* Including examples

* Using js instead of json

* Adding unified docs to main page

* Adding missing description text

* Refactoring to remove unified route

* Adding back references to the _unified route

* Update docs/reference/inference/chat-completion-inference.asciidoc

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Address feedback

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2025-01-13 09:48:23 -05:00
Mark Tozzi e9f2d78923
Esql additional date format testing (#120000)
This wires up the randomized testing for DateFormat. Prior to this PR, none of the randomized testing was hitting the one parameter version of the function, so I wired that up as well. This required some compromises on the type signatures, see comments inline.

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-01-13 14:11:52 +00:00
George Wallace 8ac3c2d371
Adjustments to match Opster, readability (#119812) (#120031)
* Adjustments to match Opster, readability

https://opster.com/guides/elasticsearch/search-apis/elasticsearch-bool-queries/

* Edits

* Spacing

---------

Co-authored-by: Kofi B <23384471+thekofimensah@users.noreply.github.com>
2025-01-13 12:03:59 +01:00
Mayya Sharipova de0914965f
[DOCS] Interval query max_gaps in all_of rule (#119963)
Add more explanation of how `max_gaps` works in interval queries with
`all_of` rule.

Closes #113554
2025-01-11 07:59:32 +11:00
Orestis Floros 3c184b912c
Permissions required for stateful agentless integrations (#118644)
Closes elastic/security-team#11102
Closes elastic/security-team#11104

This allows agentless integrations (via elastic/beats#41446, elastic/kibana#203810) to write to agentless-* indices. Each index is created on-demand by the filebeat client and kibana conditionally extends the API key permissions to allow writing to the index.
2025-01-10 16:30:11 +00:00
Valentin Crettaz 1e608dc223
[DOCS] Improve/fix documentation on stored scripts (#119921)
* Improve/fix documentation on stored scripts

* Update docs/reference/scripting/using.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/scripting/using.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/transform/painless-examples.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2025-01-10 16:13:09 +00:00
Artem Shelkovnikov f2d069e2bf
Update Sharepoint Online connector documentation (#119933)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2025-01-10 15:56:38 +01:00
Aurélien FOUCRET 31f11c3c0c
[ES|QL] Enable KQL function as a tech preview (#119730) 2025-01-10 12:49:28 +01:00
Niels Bauman 80e8017bb6
Run `TransportGetIndexTemplatesAction` on local node (#119837)
This action solely needs the cluster state, it can run on any node.
Additionally, it needs to be cancellable to avoid doing unnecessary work
after a client failure or timeout.

As a drive-by, this removes another usage of the trappy default master
node timeout.
2025-01-10 00:20:16 +00:00
Niels Bauman 65e4ec129c
Run `TransportGetComposableIndexTemplate` on local node (#119830)
This action solely needs the cluster state, it can run on any node.
Additionally, it needs to be cancellable to avoid doing unnecessary work
after a client failure or timeout.

As a drive-by, this removes another usage of the trappy default master
node timeout.
2025-01-10 09:00:31 +10:00
mmahacek 74d235124e
Fix line spacing on ILM docs (#119807) 2025-01-09 08:52:54 -08:00
Kathleen DeRusso c7ec808f45
[Docs] Add docs for new semantic text query functionality (#119520)
* Update docs with new semantic text functionality

* PR feedback

* PR feedback

* PR Feedback
2025-01-09 11:11:20 -05:00
David Kyle 547a567c13
[DOCS][ML] Document the text_expansion task type (#119581) 2025-01-09 15:44:01 +00:00
Arianna Laudazzi 70e5a67904
[AutoOps] Reference AutoOps solution on troubleshooting pages (#119630)
* Reference AutoOps on troubleshooting pages

* Integrate reviewer's feedback
2025-01-09 16:24:20 +01:00
shainaraskas d37e1bd14d
Fix broken anchors (#119802) 2025-01-09 09:15:00 -05:00
Michael Peterson 230acb8ac5
Resolve/cluster should mark remotes as not connected when a security exception is thrown (#119793)
Fixes two bugs in _resolve/cluster.

First, the code that detects older cluster versions and does a fallback to the _resolve/index
endpoint was using an outdated string match for error detection. That has been adjusted.

Second, upon security exceptions, the _resolve/cluster endpoint was marking the clusters as connected: true,
under the assumption that all security exceptions related to cross cluster calls and remote index access were
coming from the remote cluster, but that is not always the case. Some cross-cluster security violations can
be detected on the local querying cluster after issuing the remoteClient.execute call but before the transport
layer actually sends the request remotely. So we now mark the connected status as false for all ElasticsearchSecurityException cases. End user docs have been updated with this information.
2025-01-09 08:56:57 -05:00
Jedr Blaszyk 750a0ab846
[Connector API] Support soft-deletes of connectors (#118669)
* [Connector API] Add interface for soft-deletes

* Define connector deleted system index

* Got soft-delete logic working

* Add unit tests

* Add yaml e2e test and attempt to update permissions

* Fix permissions

* Update docs

* Fix docs

* Update docs/changelog/118282.yaml

* Change logic

* Fix tests

* Remove unnecessary privilege from yaml rest test

* Update changelog

* Update docs/changelog/118669.yaml

* Adapt yaml tests

* Undo changes to muted-tests.yml

* Fix compilation issue after other PR got merged

* Exclude soft-deleted connector from checks about index_name already in use

* Update docs/reference/connector/apis/get-connector-api.asciidoc

Co-authored-by: Tim Grein <tim@4greins.de>

* Update rest-api-spec/src/main/resources/rest-api-spec/api/connector.list.json

Co-authored-by: Tim Grein <tim@4greins.de>

* Adapt comments, add connector wire serializing test

* Introduce new transport versions for passing the delete flag

* Get rid of wire serialisation, use include_deleted instead of deleted flag

* Remove unused import

* Final tweaks

* Adapt variable name in rest layer

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Tim Grein <tim@4greins.de>
2025-01-09 12:20:28 +00:00
Liam Thompson 996a4f8e7d
Update rejected-requests.asciidoc (#119547) (#119673)
I believe this is a typo, as in our 8.16.1 cluster this field appears to be called `combined_coordinating_and_primary`

Co-authored-by: Ian Lee <IanLee1521@gmail.com>
2025-01-09 11:01:46 +01:00
Lisa Cawley 385040d2b5
[DOCS] Fix link in repository-s3.asciidoc (#119719) 2025-01-08 12:02:36 -08:00
James Baiera c3839e1f76
Add selector syntax to index expressions (#118614)
This PR introduces a new syntactical feature to index expression resolution: The selector.

Selectors, denoted with a :: followed by a recognized suffix will allow users to specify which component of 
an index abstraction they would like to operate on within an API call. In this case, an index abstraction is a 
concrete index, data stream, or alias; Any abstraction that can be resolved to a set of indices/shards. We 
define a component of an index abstraction to be some searchable unit of the index abstraction.
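
As a rough illustration of the syntax, an expression can be split on `::` into the abstraction name and the selector suffix. This is a sketch only: the function name `split_selector` is hypothetical, the suffix `failures` in the usage lines is just an example value, and the real resolver also validates the suffix against its list of recognized selectors, which is not modelled here.

```python
from typing import Optional, Tuple


def split_selector(expression: str) -> Tuple[str, Optional[str]]:
    """Split 'name::suffix' into (name, suffix); suffix is None when absent."""
    name, sep, suffix = expression.partition("::")
    if not sep:
        # No selector given: the whole expression is the abstraction name.
        return expression, None
    if not name or not suffix:
        raise ValueError(f"malformed selector expression: {expression!r}")
    return name, suffix


# Example values only; the suffix shown is an assumption for illustration.
with_selector = split_selector("logs::failures")
without_selector = split_selector("logs")
```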
2025-01-08 11:37:03 -05:00
Ievgen Degtiarenko fd1be8ce6f
Hash functions (#118938)
This change adds md5, sha1 and sha256 hash functions.
2025-01-08 16:44:15 +01:00
Albert Zaharovits 12eb1cfda1
Metrics for indexing failures due to version conflicts (#119067)
This exposes new OTel node and index based metrics for indexing failures due to version conflicts.

In addition, the /_cat/shards, /_cat/indices and /_cat/nodes APIs also expose the same metric, under the newly added column iifvc.

Relates: #107601
2025-01-08 12:34:37 +00:00
Benjamin Trent c18b48dbd4
Apply default k for knn query eagerly (#118774)
When originally added, the knn query didn't apply `top-k` restrictions
to the query. Instead it would allow the resulting `num_candidates` to be
combined with sibling queries without restricting to `top-size` results
ahead of time.

This honestly is confusing behavior and leads to some bugs in understanding
how it all works.

This commit addresses this by eagerly gathering only `size` results when
`k==null` before combining with other queries. 

To achieve the previous behavior, this can be done directly by setting
`k==num_candidates` in the query.
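
The defaulting rule can be sketched as follows. The helper `effective_k` is hypothetical and only mirrors the description in this message; it is not the code from the PR.

```python
def effective_k(k, size):
    """Return how many knn hits are gathered before combining with other queries.

    k:    the query's `k`, or None when the user did not set it.
    size: the search request's `size`.
    """
    # With no explicit k, only `size` results are gathered eagerly.
    return size if k is None else k


default_case = effective_k(None, size=10)
# Setting k to the same value as num_candidates restores the old behavior.
old_behavior = effective_k(100, size=10)
```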
2025-01-08 07:40:34 +11:00
Joan Fontanals 53a16ee967
[docs] Add documentation for JinaAI service (#118782) 2025-01-07 18:04:12 +00:00
Svilen Mihaylov 93c349cc76
Add ability to set "max_analyzed_offset" implicitly to "index.highlight (#118895)
Add ability to set "max_analyzed_offset" implicitly to "index.highlight
.max_analyzed_offset", by setting it explicitly to "-1".
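
In other words, `-1` acts as a sentinel meaning "use the index setting". A small sketch of that resolution, with a hypothetical helper name and without modelling what happens when the request omits the parameter:

```python
def resolve_max_analyzed_offset(request_value, index_setting):
    """Return the offset limit a highlight request asks for.

    request_value: value from the highlight request, or None if omitted.
    index_setting: value of index.highlight.max_analyzed_offset.
    """
    if request_value == -1:
        # -1 means: fall back to the index-level setting.
        return index_setting
    return request_value


from_index = resolve_max_analyzed_offset(-1, index_setting=1_000_000)
explicit = resolve_max_analyzed_offset(500, index_setting=1_000_000)
```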

Closes #112822
2025-01-07 11:19:07 -05:00
Liam Thompson 92bb091521
[DOCS] RAG overview (#119590) 2025-01-07 15:18:39 +01:00
Srikanth Manvi 8ded6c4568
Update search-application-api.asciidoc (#118494)
Fixing minor Typo
2025-01-07 13:30:14 +01:00
Liam Thompson c7b61bdc63
[DOCS] Add full-text search overview (#119462) 2025-01-06 17:56:35 +00:00
shainaraskas 17111e1258
[DOCS] Concept cleanup 2 - ES settings (#119373) 2025-01-06 12:07:15 -05:00
Benjamin Trent a5716c8f99
Add new experimental rank_vectors mapping for late-interaction second order ranking (#118804)
Late-interaction models are powerful rerankers. While their size and
overall cost don't lend themselves to HNSW indexing, using them for
second-order "brute-force" reranking can provide excellent boosts in
relevance, generally at lower inference times than large cross-encoders.


This commit exposes a new experimental `rank_vectors` field that allows
for maxSim operations. This unlocks the initial, and most common use of
late-interaction dense-models. 

For example, this is how you would use it via the API:

```
PUT index
{
  "mappings": {
    "properties": {
      "late_interaction_vectors": {
        "type": "rank_vectors"
      }
    }
  }
}
```

Then to index:

```
POST index/_doc
{
  "late_interaction_vectors": [[0.1, ...],...]
}
```

For querying, scoring can be exposed with scripting:

```
POST index/_search
{
  "query": {
    "script_score": {
      "query": {
        "match_all": {}
      },
      "script": {
        "source": "maxSimDotProduct(params.query_vector, 'late_interaction_vectors')",
        "params": {
          "query_vector": [[0.42, ...], ...]
        }
      }
    }
  }
}
```

Of course, the initial ranking should be done before re-scoring or
combining via the `rescore` parameter, or simply passing whatever first
phase retrieval you want as the inner query in `script_score`.
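
For readers unfamiliar with maxSim, here is a plain-Python sketch of the score, assuming the usual late-interaction definition: for each query vector take the best dot product against any document vector, then sum those maxima. The function below is an illustration with a hypothetical name, not the script function's implementation.

```python
def max_sim_dot_product(query_vectors, doc_vectors):
    """Sum, over query vectors, of the largest dot product with any doc vector."""
    total = 0.0
    for q in query_vectors:
        # Best-matching document vector for this query vector.
        total += max(sum(a * b for a, b in zip(q, d)) for d in doc_vectors)
    return total


query = [[1.0, 0.0], [0.0, 1.0]]
doc = [[0.5, 0.5], [1.0, 0.0]]
score = max_sim_dot_product(query, doc)
```

Every query vector is compared with every document vector, which is why this works as a brute-force second stage over a small candidate set rather than as a first-phase retrieval.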
2025-01-07 04:06:59 +11:00
Dan Rubinstein f5cffbf300
Deprecating data_frame_transforms roles (#117519)
* Deprecating data_frame_transforms roles

* Update docs/changelog/117519.yaml

* Update changelog

* Removing deprecation warning

* Cleaning up unused role retrieval function

* Update docs/changelog/117519.yaml

* Update changelog with breaking change details

* Revert ClusterPrivilegeResolver

* Remove Deprecated Transform Role permissions

* Update docs/changelog/117519.yaml

Co-authored-by: Nikolaj Volgushev <n1v0lg@users.noreply.github.com>

* Revert "Remove Deprecated Transform Role permissions"

This reverts commit 4599256dcf.

* Revert docs, update changelog

* Revert privilege tests

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Nikolaj Volgushev <n1v0lg@users.noreply.github.com>
Co-authored-by: Pat Whelan <pat.whelan@elastic.co>
2025-01-06 11:00:16 -05:00
István Zoltán Szabó c32cec165a
[DOCS] Adds note on default ELSER and E5 endpoints to service pages (#119507)
* [DOCS] Adds note on default ELSER and E5 endpoints to service pages.

* [DOCS] Further edits.
2025-01-03 15:53:17 +01:00
Edoardo Tenani 171424727a
[DOCS] update recommended template priority for override to 500 (#119450) 2025-01-03 11:12:25 +01:00
Stanislav Malyshev 0292905ef6
Add ESQL telemetry collection (#119474)
* Add ESQL telemetry collection
2025-01-02 14:05:21 -07:00
shainaraskas 9862a43cb6
[DOCS] Concept cleanup (extracting conceptual docs from reference content pt I of ?) (#119016) 2025-01-02 15:05:08 +00:00
Lisa Cawley 8a821f0a96
[DOCS] More links to new API site (#119380) 2024-12-31 12:02:59 -08:00
Lisa Cawley ba8beecdb0
[DOCS] More links to new API site (#119377) 2024-12-31 11:32:29 -08:00
Ryan Ernst 7fb6ca447a
Add ephemeral node id to shutdown metadata (#118722)
Shutdown metadata is keyed on node id. This makes sense since only one
node with a given node id can exist within a cluster. However, it is
possible that shutdown was initiated for one instance of a node, but
that node is restarted. This commit adds the ephemeral node id to
shutdown metadata so that nodes with the same id but different ephemeral
id can be distinguished.
2024-12-30 10:13:58 -08:00
Lisa Cawley 5e0fbef58b
[DOCS] Link to new API site (#119038)
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-12-30 16:52:16 +00:00
Max Hniebergall 7e126d75a4
[Inference API] Update get-inference docs to use elasticsearch service (#119238)
* update get-inference docs to use elasticsearch service

* fix indentation
2024-12-30 10:11:40 -05:00
Jim Ferenczi 12e86b1cd0
Refactor semantic text field to align with text field behaviour (#119183)
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
2024-12-30 09:31:02 +01:00
Stef Nestor ec35dc2049
(Doc+) Videos of rejected requests (#119311)
* (Doc+) Videos of rejected requests
2024-12-27 13:49:57 -07:00
Stef Nestor c8f2703cdf
(Doc+) Flush out Slow Logs (#118518)
* (Doc+) Slow Logs

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-12-27 10:12:00 -07:00
Stef Nestor db1c41b41d
(Doc+) Enrich run on ingest+data nodes not coordinating-only (#119136)
* (Doc+) Enrich run on ingest+data nodes not coordinating-only

👋 howdy, team! I'm not otherwise finding it so documenting https://github.com/elastic/elasticsearch/issues/95969 in ES docs

> Currently we tell users of enrich that they should co-locate the nodes that perform the enrichment (ingest nodes) with the actual enrich data so that enrich operations don't require a remote search operation.

* feedback

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

---------

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2024-12-25 08:17:18 -07:00
Niels Bauman 9641c7623f
Run TransportGetComponentTemplateAction on local node (#116868)
This action solely needs the cluster state, it can run on any node.
Additionally, it needs to be cancellable to avoid doing unnecessary work
after a client failure or timeout.

The `?local` parameter becomes a no-op and is marked as deprecated.

Relates #101805
Relates #107984
2024-12-23 20:01:21 +00:00
Niels Bauman dac3bfd52e
Fix usage API docs test (#119192)
This ensures the usage API docs tests are passing again. We achieve this
by: 1. ignoring the contents of `inference.models` because the models
might not yet have been initialized and 2. adding missing fields to the
`logsdb` usage.
2024-12-23 16:03:47 +01:00
Tanguy Leroux 103d29f0c8
Revert min_read_only_index_version in doc after #119083 (#119125) 2024-12-20 05:38:09 +11:00
Carlos Delgado 6ee641bdfd
ESQL - Update WHERE command docs with MATCH and full text functions examples (#118987) 2024-12-19 16:44:53 +01:00
Marci W 696ee806e7
Revise content to match new troubleshooting guidelines (#118033)
* Revise to match new guidelines

* Address review suggestions and comments

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-12-19 10:09:14 -05:00
kosabogi 78bd9ec6f0
[DOCS] Updates SharePoint Online page (#118318) 2024-12-19 11:43:34 +01:00
Jim Ferenczi 6f261067f2
Add a generic `rescorer` retriever based on the search request's rescore functionality (#118585)
This pull request introduces a new retriever called `rescorer`, which leverages the `rescore` functionality of the search request.  
The `rescorer` retriever re-scores only the top documents retrieved by its child retriever, offering fine-tuned scoring capabilities.  

All rescorers supported in the `rescore` section of a search request are available in this retriever, and the same format is used to define the rescore configuration.  

<details>
<summary>Example:</summary>

```yaml
  - do:
      search:
        index: test
        body:
          retriever:
            rescorer:
              rescore:
                window_size: 10
                query:
                  rescore_query:
                    rank_feature:
                      field: "features.second_stage"
                      linear: { }
                  query_weight: 0
              retriever:
                standard:
                  query:
                    rank_feature:
                      field: "features.first_stage"
                      linear: { }
          size: 2
```

</details>

Closes #118327

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-12-18 19:47:12 +00:00
Bogdan Pintea bc3b629d8d
ESQL: Docs: add example of date bucketing with offset (#116680)
Add an example of how to create date histograms with an offset.

Fixes #114167
2024-12-18 17:12:14 +01:00
Sean Story 5255bfb6fb
Replace 'ent-search-generic' with 'search-default' pipeline (#118899)
* Replace 'ent-search-generic' with 'search-default' pipeline

* missed one

* [CI] Auto commit changes from spotless

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2024-12-18 08:03:08 -06:00
Liam Thompson 54fa07450a
[DOCS] Make Wolfi hardened Docker option more prominent (#118755) 2024-12-18 13:54:17 +01:00
Ievgen Degtiarenko 7cf28a910e
ESQL Add esql hash function (#117989)
This change introduces esql hash(alg, input) function that relies on the Java MessageDigest to compute the hash.
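
A Python analogue of the `hash(alg, input)` shape, using `hashlib` in place of Java's `MessageDigest`. This is only a sketch: the helper name, the UTF-8 encoding of the input, and the lowercase hex output are assumptions, and the algorithm names follow `hashlib` conventions rather than whatever the ES|QL function accepts.

```python
import hashlib


def hash_like(algorithm: str, text: str) -> str:
    """Digest `text` with the named algorithm and return it as a hex string."""
    digest = hashlib.new(algorithm)      # e.g. "md5", "sha1", "sha256"
    digest.update(text.encode("utf-8"))  # assumed input encoding
    return digest.hexdigest()


md5_hex = hash_like("md5", "abc")
sha256_hex = hash_like("sha256", "abc")
```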
2024-12-18 09:56:42 +01:00
Satyam Mishra 41c3dde414
Updated ilm docs as per the issue (#118148)
This PR updates the Elasticsearch ILM tutorial for the newer screenshot
and the primary shard update in the text.
2024-12-18 10:32:39 +11:00
Pius Fung 1054503ba8
Update start-trained-model-deployment.asciidoc (#118887)
Updating with changes in https://github.com/elastic/elasticsearch/pull/115041
2024-12-17 13:03:28 -08:00
Tanguy Leroux f3a1664906
Add min. read-only index version compatible to DiscoveryNode (#118744)
#118443 added a new index version for indices that can be opened in read-only mode by Lucene. This change adds this information to the discovery node's VersionInformation and the transport serialization logic.

In the near future we'd like to use this information in methods like IndexMetadataVerifier#checkSupportedVersion and NodeJoinExecutor to allow opening indices in N-2 versions as read-only indices on ES V9.
2024-12-17 18:59:49 +01:00
Luigi Dell'Aquila 1d2840ece1
EQL: add support for partial search results (#116388)
Allow queries to succeed if some shards are failing
2024-12-17 17:13:52 +01:00
Adam Szaraniec f64c05ac32
Update alias.asciidoc (#118553)
Add section about removing index
2024-12-17 10:09:22 -06:00
Rene Groeschke 6516a535ab
Add wolfi documentation from 8.16 branch (#118835)
port from https://github.com/elastic/elasticsearch/pull/118684
2024-12-17 23:45:27 +11:00
Slobodan Adamović 6c56c32f7a
Grant necessary Kibana application privileges to `reporting_user` role (#118058)
Previously, Kibana was authorizing (and granting application privileges)
to create reports, simply based on the `reporting_user` role name. This
PR makes these application privileges explicitly granted to the
`reporting_user` role.
2024-12-14 04:40:23 +11:00
Mark Tozzi 1e26791515
Esql bucket function for date nanos (#118474)
This adds support for running the bucket function over a date nanos field. Code wise, this just delegates to DateTrunc, which already supports date nanos, so most of the PR is just tests and the auto-generated docs.

Resolves #118031
2024-12-13 09:25:52 -05:00
Liam Thompson 528593b55f
[DOCS] Link to Elastic Rerank model landing page (#118574)
- Add link to Python notebook
- Fix heading level
2024-12-13 14:00:29 +00:00
Gal Lalouche 2be4cd983f
ESQL: Support ST_EXTENT_AGG (#117451)
This PR adds support for ST_EXTENT_AGG aggregation, i.e., computing a bounding box over a set of points/shapes (Cartesian or geo). Note the difference between this aggregation and the already implemented scalar function ST_EXTENT.

This isn't a very efficient implementation, and future PRs will attempt to read these extents directly from the doc values.
We currently always use longitude wrapping, i.e., we may wrap around the dateline for a smaller bounding box. Future PRs will let the user control this behavior.
Fixes #104659.
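
Conceptually, the aggregation folds every point into a running bounding box. The sketch below covers the Cartesian case only and ignores the longitude wrapping mentioned above; the function name and the `(min_x, min_y, max_x, max_y)` ordering are assumptions for illustration.

```python
def extent(points):
    """Return (min_x, min_y, max_x, max_y) for (x, y) pairs, or None if empty."""
    box = None
    for x, y in points:
        if box is None:
            box = (x, y, x, y)
        else:
            # Grow the box just enough to include the new point.
            box = (min(box[0], x), min(box[1], y), max(box[2], x), max(box[3], y))
    return box


bbox = extent([(1, 5), (3, 2), (-2, 4)])
```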
2024-12-13 12:41:24 +02:00
Alexander Spies 140d88c59a
ESQL: Dependency check for binary plans (#118326)
Make the dependency checker for query plans take into account binary plans and make sure that fields required from the left hand side are actually obtained from there (and analogously for the right).
2024-12-13 11:38:53 +01:00
John Wagster ce0e4e4fa8
small doc fix for updates related to _source defaults found by community member (#118605) 2024-12-12 13:17:32 -06:00
Marci W 3cbdfba610
Fix invalid index mode (#118579) 2024-12-12 10:59:47 -05:00
Mayya Sharipova b460f081c2
[DOCS] _index_prefix for highlight matched_fields (#118569)
Enhance documentation to explain that "_index_prefix" subfield must
be added to `matched_fields` param for highlighting a main field.
When doing prefix queries on fields that are indexed with prefixes,
"_index_prefix" subfield is used. If we try to highlight the main
field, we may not get any results. "_index_prefix" subfield must
be added to `matched_fields` which instructs ES to use matches
from "_index_prefix" to highlight the main field.
2024-12-12 10:24:55 -05:00
kosabogi b2998378a3
[DOCS] Adds default inference endpoints information (#118463)
* Adds default inference endpoints information

* Update docs/reference/inference/inference-apis.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-12-12 12:15:28 +01:00
kosabogi 8c10f0cc38
Changes elser service to elasticsearch service in the Semantic search with the inference API page (#118536) 2024-12-12 12:15:17 +01:00
kosabogi 3cf7f97141
Adds CCS matrix for 8.17 (#118527)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-12-12 11:17:16 +01:00
Liam Thompson 8fb6edab63
[DOCS] Consolidate connectors release notes on one page (#118464) 2024-12-12 10:52:41 +01:00
Sean Story 4db3f7b751
Add known issue for salesforce DLS (#118489) 2024-12-11 13:34:00 -06:00
Marci W ae9bb90fd1
Update and edit logsdb docs for logsdb / synthetic source GA (#118303)
* Update licensing; fix screenshots; edit generally

* Small edit for clarity and style

* Update docs/reference/index-modules.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Apply changes from review

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Address review comments

* Match similar change from review

* More changes from review

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Update docs/reference/data-streams/logs.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

* Change to general subscription note

* Apply suggestions from review

Co-authored-by: Oleksandr Kolomiiets <olkolomiiets@gmail.com>

* Apply suggestions from review

Co-authored-by: Oleksandr Kolomiiets <olkolomiiets@gmail.com>

* Apply suggestions from review; additional edits

* Apply suggestions from review; clarity tweaks

* Restore previous paragraph structure and context

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
Co-authored-by: Oleksandr Kolomiiets <olkolomiiets@gmail.com>
2024-12-11 13:24:24 -05:00
Liam Thompson 56e1ca52ea
[DOCS][101] Aggregations quickstart tutorial (#116251) 2024-12-11 16:06:24 +01:00
David Kyle a7fdc10bd4
[DOCS][ML] Use elasticsearch service instead of deprecated elser service in tutorials (#118007) 2024-12-11 10:51:24 +00:00
Quentin Pradet c473da5e64
Fix rest-api-spec and docs for bulk API (#118415) 2024-12-11 12:38:42 +04:00
Benjamin Trent 645657cc56
Remove old _knn_search tech preview API in v9 (#118104)
Removes the old `_knn_search` API that was never out of tech preview and
deprecated throughout the v8 cycle. 

To keep using the API, set `compatible-with=8`.
2024-12-11 02:01:25 +11:00
István Zoltán Szabó 17765e7398
[DOCS] Adds transform breaking changes to the migration guide (#118285)
* [DOCS] Adds transform breaking changes to the migration guide.

* [DOCS] Fixes typo.

* [DOCS] Fixes ordered list.
2024-12-10 15:39:03 +01:00
István Zoltán Szabó d026904dda
[DOCS] Reviews docker examples. (#118339) 2024-12-10 15:16:27 +01:00
István Zoltán Szabó 27b07b3406
[DOCS] Documents `dimensions` param for `openai` service of Inference API (#118317)
Co-authored-by: David Kyle <david.kyle@elastic.co>
2024-12-10 14:12:10 +01:00
Carlos Delgado eb59b989ef
ESQL: Expand type compatibility for match function and operator (#117555) 2024-12-09 19:56:10 +01:00
Dan Rubinstein f40dc99f91
Adding transforms migration guide for 9.0 (#117353)
* Adding transforms migration guide for 9.0

* Adding shared transform attribute and simplifying wording

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-12-09 10:14:32 -05:00
Pete Gillin bc25a73543
Update `UpdateForV9` in `AttachmentProcessor` (#118186)
We are not going to make this change in V9. We may do it in V10. This
change just bumps the annotation to remind us to revisit.

Since we are living with this for a while, it seems worth improving
the documentation. This now encourages explicitly setting the option
one way or the other, since you get a warning if you omit it. It also
changes the existing examples to use true rather than false, as that's
our recommendation. And it adds a new section with an example where
it's true, and moves the content previously in a note into that
section.
2024-12-09 14:28:24 +00:00
Liam Thompson 931f675891
Update connectors overview diagram (#118261) 2024-12-09 15:25:10 +01:00
kanoshiou 67ee03411b
ESQL: Enable async get to support formatting (#111104)
I've updated the listener for GET /_query/async/{id} to EsqlResponseListener, so it now accepts parameters (delimiter, drop_null_columns and format) like the POST /_query API. Additionally, I have added tests to verify the correctness of the code.

You can now set the format in the request parameters to specify the return style.

Closes #110926
2024-12-09 13:08:48 +01:00
kosabogi 6bb0799893
Updates h7 and h8 formatting (#118132) 2024-12-09 11:37:41 +01:00
kosabogi b2b8e3f762
[DOCS] [8.17] Adds new default inference endpoint information (#117985)
* Adds new default inference information

* Update docs/reference/mapping/types/semantic-text.asciidoc

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Update docs/reference/search/search-your-data/semantic-search-semantic-text.asciidoc

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Update docs/reference/mapping/types/semantic-text.asciidoc

Co-authored-by: David Kyle <david.kyle@elastic.co>

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Co-authored-by: David Kyle <david.kyle@elastic.co>
2024-12-09 09:05:11 +01:00
Jim Ferenczi c580024ea9
Add Highlighter for Semantic Text Fields (#118064)
This PR introduces a new highlighter, `semantic`, tailored for semantic text fields.
It extracts the most relevant fragments by scoring nested chunks using the original semantic query.

In this initial version, the highlighter returns only the original chunks computed during ingestion. However, this is an implementation detail, and future enhancements could combine multiple chunks to generate the fragments.
2024-12-06 18:42:50 +00:00
Mark Tozzi 7cd17d2185
Esql compare nanos and millis (#118027)
Resolves #116281

Introduces support for comparing millisecond dates with nanosecond dates, without the need for casting. Millisecond dates outside of the nanosecond date range are handled correctly.
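
One way to picture the comparison, as a sketch under stated assumptions rather than the PR's code: scale the millisecond value to nanoseconds, but first rule out values that cannot be represented as a nanosecond date. The function name is hypothetical, and the sketch assumes nanosecond dates are non-negative and fit in a signed 64-bit long.

```python
NANOS_PER_MILLI = 1_000_000
MAX_NANOS = 2**63 - 1  # largest value of a signed 64-bit nanosecond timestamp


def compare_millis_to_nanos(millis: int, nanos: int) -> int:
    """Return -1, 0, or 1 if the millis date is before, equal to, or after the nanos date."""
    if millis < 0:
        return -1  # before the epoch, so before every (assumed non-negative) nanos date
    if millis > MAX_NANOS // NANOS_PER_MILLI:
        return 1   # later than any representable nanos date
    # Safe to scale now; in a 64-bit language this multiplication cannot overflow here.
    scaled = millis * NANOS_PER_MILLI
    return (scaled > nanos) - (scaled < nanos)


same_instant = compare_millis_to_nanos(1, 1_000_000)
too_large = compare_millis_to_nanos(10**16, MAX_NANOS)
```

Python integers do not overflow, so the range checks matter mainly as a model of what a fixed-width implementation has to guard against.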
2024-12-06 09:17:32 -05:00
István Zoltán Szabó f27cb5efd3
[DOCS] Adds examples to inference processor docs (#116018) 2024-12-06 09:15:15 +01:00
Tommaso Teofili 91605860ee
Term query for ES|QL (#117359)
This commit adds a `term` function for ES|QL to run `TermQueries`.

For example:
FROM test | WHERE term(content, "dog")
2024-12-06 07:42:48 +00:00
kosabogi 9d35053760
Adds warning to Create inference API page (#118073) 2024-12-05 16:32:59 +01:00
Benjamin Trent 9a81eb2dbe
Indicate that rescore isn't allowed with retrievers, yet (#118019) 2024-12-04 14:10:32 -05:00
Jim Ferenczi 0901a2734e
Add option to store `sparse_vector` outside `_source` (#117917)
This PR introduces an option for `sparse_vector` to store its values separately from `_source` by using term vectors.
This capability is primarily needed by the semantic text field.
2024-12-04 17:29:46 +00:00
kosabogi 6855a4ecd3
[DOCS] Adds adaptive allocations information to Inference APIs (#117546)
* Adds adaptive allocations information to Inference APIs

* Update docs/reference/inference/inference-apis.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/inference/put-inference.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/inference/inference-apis.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-12-04 14:22:59 +01:00
kosabogi ba1d5be199
Updates minimum_number_of_allocations description (#117746) 2024-12-04 14:22:26 +01:00
Craig Taverner c7e985c3b6
Support ST_ENVELOPE and related ST_XMIN, etc. (#116964)
Support ST_ENVELOPE and related ST_XMIN, etc.

Based on the PostGIS equivalents:

https://postgis.net/docs/ST_Envelope.html
https://postgis.net/docs/ST_XMin.html
https://postgis.net/docs/ST_XMax.html
https://postgis.net/docs/ST_YMin.html
https://postgis.net/docs/ST_YMax.html
2024-12-04 12:20:47 +01:00
Niels Bauman 032b42fcf7
Make TransportLocalClusterStateAction wait for cluster to unblock (#117230)
This will make `TransportLocalClusterStateAction` wait for a new state
that is not blocked. This means we need a timeout (again). For
consistency's sake, we're reusing the REST param `master_timeout` for
this timeout as well.

The only class that was using `TransportLocalClusterStateAction` was
`TransportGetAliasesAction`, so its request needed to accept a timeout
again as well.
2024-12-04 12:17:13 +01:00
Jakob Reiter 4496aa0cbf
Update troubleshooting-unstable-cluster.asciidoc (#117887)
Added missing word
2024-12-04 10:56:58 +00:00
Jan Kuipers 31508f00a1
Document ES|QL categorize limitations (#117892)
* Document ES|QL categorize limitations

* Update x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/expression/function/grouping/Categorize.java

Co-authored-by: Alexander Spies <alexander.spies@elastic.co>

---------

Co-authored-by: Alexander Spies <alexander.spies@elastic.co>
2024-12-04 09:53:21 +01:00
mmahacek 5c1b3c7197
Update email.asciidoc (#117867)
Fix error in documentation.
2024-12-03 09:10:02 -05:00
Marci W 97a626b5ea
Remove ccs banner (#117844) 2024-12-02 14:46:41 -05:00
Mark Tozzi 913e0fbca8
ESQL Date Nanos Addition and Subtraction (#116839)
Resolves #109995

This adds support and tests for addition and subtraction of date nanos with periods and durations. It does not include support for date_diff, which is a separate ticket (#109999). The bulk of the PR is testing, the actual date math is all handled by library functions.

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-12-02 14:08:07 -05:00
Jan Kuipers ddc8b959ee
ES|QL categorize docs (#117827)
* Move ES|QL categorize out of snapshot functions

* Categorize docs

* Add experimental + fix docs

* Add experimental + fix docs
2024-12-02 16:41:02 +01:00
David Turner 9dcd9751f4
Add IMDSv2 support to `repository-s3` (#117748)
The version of the AWS Java SDK we use already magically switches to
IMDSv2 if available, but today we cannot claim to support IMDSv2 in
Elasticsearch since we have no tests demonstrating that the magic really
works for us. In particular, this sort of thing often risks falling foul
of some restrictions imposed by the security manager (if not now then
maybe in some future release).

This commit adds proper support for IMDSv2 by enhancing the test suite
to add the missing coverage to avoid any risk of breaking this magical
SDK behaviour in future.

Closes #105135 Closes ES-9984
2024-12-02 20:01:48 +11:00
David Turner c5166ccf6f
Revert "(+Doc) Link split-brain wiki (#108914)"
This reverts commit 12aab08330.
2024-12-02 08:11:34 +00:00
Panagiotis Bailis ad83d9b35d
Updating retriever-examples documentation to run validation tests on the provided snippets (#116643) 2024-11-29 12:50:01 +00:00
Liam Thompson ab604ada78
[DOCS] Update tutorial example (#117538) 2024-11-28 16:34:57 +01:00
Martijn van Groningen 6a4b68d263
Add source mode stats to MappingStats (#117463) 2024-11-28 10:53:39 +01:00
kosabogi 79d70686b3
Fixes typo (#117684) 2024-11-28 09:26:16 +01:00
Liam Thompson c3ac2bd58a
[DOCS] Add Elastic Rerank usage docs (#117625) 2024-11-28 08:23:28 +01:00
Nik Everett 9022cccba7
ESQL: CATEGORIZE as a BlockHash (#114317)
Re-implement `CATEGORIZE` in a way that works for multi-node clusters.

This requires that data is first categorized on each data node in a first pass, then the categorizers from each data node are merged on the coordinator node and previously categorized rows are re-categorized.

BlockHashes, used in HashAggregations, already work in a very similar way. E.g. for queries like `... | STATS ... BY field1, field2` they map values for `field1` and `field2` to unique integer ids that are then passed to the actual aggregate functions to identify which "bucket" a row belongs to. When passed from the data nodes to the coordinator, the BlockHashes are also merged to obtain unique ids for every value in `field1, field2` that is seen on the coordinator (not only on the local data nodes).
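
The id assignment and merge step described above can be sketched roughly as follows. This is a minimal Python illustration, not the actual Java implementation: `ToyBlockHash` and its methods are hypothetical names, and real BlockHashes operate on blocks of column data rather than Python tuples.

```python
class ToyBlockHash:
    """Maps each distinct grouping key to a dense integer id (illustrative only)."""

    def __init__(self):
        self.ids = {}   # key -> id
        self.keys = []  # id -> key

    def add(self, key):
        """Return the id for key, assigning the next free id on first sight."""
        if key not in self.ids:
            self.ids[key] = len(self.keys)
            self.keys.append(key)
        return self.ids[key]

    def merge(self, other):
        """Absorb another hash; return a list mapping other's ids to ids in self."""
        return [self.add(key) for key in other.keys]


# Two data nodes assign ids independently for (field1, field2) pairs.
node_a, node_b = ToyBlockHash(), ToyBlockHash()
rows_a = [node_a.add(k) for k in [("x", 1), ("y", 2), ("x", 1)]]
rows_b = [node_b.add(k) for k in [("y", 2), ("z", 3)]]

# The coordinator merges both hashes and translates node-local ids
# into ids that are unique across everything it has seen.
coordinator = ToyBlockHash()
remap_a = coordinator.merge(node_a)
remap_b = coordinator.merge(node_b)
global_rows_b = [remap_b[i] for i in rows_b]
```

Here `("y", 2)` has id 0 on the second node but id 1 on the coordinator, which is why rows arriving from a data node have to be remapped before aggregation state is combined.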

Therefore, we re-implement `CATEGORIZE` as a special BlockHash.

To choose the correct BlockHash when a query plan is mapped to physical operations, the `AggregateExec` query plan node needs to know that we will be categorizing the field `message` in a query containing `... | STATS ... BY c = CATEGORIZE(message)`. For this reason, _we do not extract the expression_ `c = CATEGORIZE(message)` into an `EVAL` node, in contrast to e.g. `STATS ... BY b = BUCKET(field, 10)`. The expression `c = CATEGORIZE(message)` simply remains inside the `AggregateExec`'s groupings.

**Important limitation:** For now, to use `CATEGORIZE` in a `STATS` command, there can be only 1 grouping (the `CATEGORIZE`) overall.
2024-11-27 17:44:55 +01:00
George Wallace 9e61089414
[DOCS] : swap allocation sections (#116518)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-11-27 11:39:07 +01:00
Oleksandr Kolomiiets f57c43cdf5
Include a link to downsampling a TSDS using DSL document (#117510) 2024-11-26 08:09:30 -08:00
Jedr Blaszyk 5e028220c9
[Docs] Update incremental sync note (#117545) 2024-11-26 11:06:52 +00:00
Liam Thompson a860d3ab33
[DOCS] Trivial: remove tech preview badge (#117461) 2024-11-26 10:48:35 +01:00
Benjamin Trent 374c88a832
Correct bit * byte and bit * float script comparisons (#117404)
I goofed on the bit * byte and bit * float comparisons. Naturally, these
should be big-endian and compare the dimensions with the binary ones
appropriately.

Additionally, I added a test to ensure that this is handled correctly.
2024-11-26 03:38:06 +11:00
Craig Taverner 8c22fc479f
Make spatial search functions not preview (#117489) 2024-11-25 17:04:48 +01:00
padmaprasath21 b7d801809f
Update tsds-reindex.asciidoc (#117446) 2024-11-25 07:56:17 -08:00
florent-leborgne fa9f2bff0e
Docs for starred esql queries in Kibana (#117468) 2024-11-25 15:13:23 +01:00
Philippus Baalman fd6e8857bc
Mention `bbq_hnsw` for `m` and `ef_construction` options in docs (#117022) 2024-11-25 14:50:09 +01:00
Aurélien FOUCRET ff58d891a1
ES|QL kql function. (#116764) 2024-11-25 14:22:11 +01:00
István Zoltán Szabó 339e431081
[DOCS] Documents that ELSER is the default service for `semantic_text` (#115769) 2024-11-25 08:07:30 -05:00
Luke Whiting 1d4c8d85f6
(#34659) - Add Timezone Configuration to Watcher (#117033)
* Add timezone support to Cron objects

* Add timezone support to CronnableSchedule

* XContent change to support parsing and display of TimeZone fields on schedules

* Case insensitive timezone parsing

* Doc changes

* YAML REST tests

* Equals, toString and HashCode now include timezone

* Additional random testing for DST transitions

* Migrate Cron class to use wrapped LocalDateTime

The algorithm depends on some quirks of Calendar, but LocalDateTime
correctly ignores DST during calculations, so this uses a LocalDateTime
with a wrapper to emulate the Calendar behaviours that the Cron
algorithm depends on.

* Additional documentation to explain discontinuity event behaviour

* Remove redundant conversions from ZoneId to TimeZone following move to LocalDateTime

* Add documentation warning that manual clock changes will cause unpredictable watch execution

* Update docs/reference/watcher/trigger/schedule.asciidoc

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

---------

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2024-11-25 09:51:11 +00:00
Larisa Motova 7e801e0410
[ES|QL] Add a standard deviation function (#116531)
Uses Welford's online algorithm, as well as the parallel version, to
calculate standard deviation.
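
For readers unfamiliar with the technique, here is a minimal Python sketch of Welford's online update together with the parallel merge step. It is not the code from this PR: the class name `RunningStd` is hypothetical, and returning the population (rather than sample) standard deviation is an assumption made for the example.

```python
import math


class RunningStd:
    """Single-pass mean/variance accumulator (Welford), mergeable across partitions."""

    def __init__(self):
        self.n = 0       # number of values seen
        self.mean = 0.0  # running mean
        self.m2 = 0.0    # running sum of squared deviations from the mean

    def add(self, x):
        """Online update for one value."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    def merge(self, other):
        """Parallel step: combine two partial states without revisiting the data."""
        if other.n == 0:
            return
        total = self.n + other.n
        delta = other.mean - self.mean
        self.m2 += other.m2 + delta * delta * self.n * other.n / total
        self.mean += delta * other.n / total
        self.n = total

    def std(self):
        """Population standard deviation (assumption), or None with no input."""
        return math.sqrt(self.m2 / self.n) if self.n else None


# Each "node" accumulates its own partition; the partial states are then merged.
left, right = RunningStd(), RunningStd()
for v in [2, 4, 4, 4]:
    left.add(v)
for v in [5, 5, 7, 9]:
    right.add(v)
left.merge(right)
result = left.std()
```

The merged state carries only three numbers per partition, which is what makes the approach suitable for combining partial results from several nodes.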
2024-11-22 12:33:46 -10:00
Nik Everett 4ecc7518ef
ESQL: Add docs for MV_PERCENTILE (#117377)
We built this a while back. Let's document it.
2024-11-23 06:41:18 +11:00
Nik Everett 893dfd3c9a
ESQL: Make WEIGHTED_AVG not preview (#117356)
It's not PREVIEW.
2024-11-22 16:28:06 +00:00
Bogdan Pintea 1fe3ed1e85
Add docs for aggs filtering (#116681)
Add documentation for aggs filtering (the WHERE in STATS command).

Fixes: #115083
2024-11-22 13:26:30 +01:00
Luigi Dell'Aquila a1247b3e60
ES|QL: fix validation of SORT by aggregate functions (#117316) 2024-11-22 12:12:09 +01:00
Slobodan Adamović 6ea3e01958
Upgrade Bouncy Castle FIPS dependencies (#112989)
This PR updates `bc-fips` and `bctls-fips` dependencies to the latest
minor versions.
2024-11-22 21:39:25 +11:00
Lisa Cawley 8fe8d22f7c
[DOCS] Remove broken migration guide link (#117293) 2024-11-21 14:02:18 -08:00
elasticsearchmachine b378a1bb54 Bump 8.x to 8.18.0 2024-11-21 14:37:05 -05:00
Carlos Delgado ea4b41fca8
ESQL - match operator included in non-snapshot builds (#116819) 2024-11-21 07:45:22 +01:00
Mark Tozzi c3f73d0319
Esql Enable Date Nanos (#117080)
This enables date nanos support as tech preview. Basic operations, like reading values, binary comparisons, and functions that don't care about type should work, but some functions are not yet supported. Most notably, Bucket is not yet supported, although Date_Trunc is and can be used for grouping. See the docs for the full list of limitations.

relates to #109352
2024-11-20 09:31:01 -05:00
Costin Leau bc785f5ca1
Esql/lookup join grammar (#116515)
First PR for adding LOOKUP JOIN in ESQL.
Introduces grammar and wires the main building blocks to execute a query; follow-ups are required (see #116208 for more details).

Co-authored-by: Nik Everett <nik9000@users.noreply.github.com>
2024-11-19 17:52:24 -08:00
Stef Nestor 72c44595f4
(Doc+) link videos for allocation and ilm (#116880)
* (Doc+) link videos for allocation and ilm

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-11-19 14:43:50 -07:00
Liam Thompson c699af2c67
[DOCS] Rename how-to subsection, move recipes to search relevance (#117044) 2024-11-19 18:27:05 +01:00
Craig Taverner f3cd48209e
Added stricter range type checks and runtime warnings for ENRICH (#115091)
It has been noted that strange or incorrect error messages are returned if the ENRICH command uses incompatible data types, for example a KEYWORD with value 'foo' used in an int_range match: https://github.com/elastic/elasticsearch/issues/107357

This error is thrown at runtime and contradicts the ES|QL policy of only throwing errors at planning time, while at runtime we should instead set results to null and add a warning. However, we could make the planner stricter and block potentially mismatching types earlier.

However, runtime parsing of KEYWORD fields has been a feature of ES|QL ENRICH since its inception; in particular, we even have tests asserting that KEYWORD fields containing parsable IP data can be joined to an ip_range ENRICH index.

In order to not create a backwards compatibility problem, we have compromised with the following:

* Strict range type checking at the planner time for incompatible range types, unless the incoming index field is KEYWORD
* For KEYWORD fields, allow runtime parsing of the fields, but when parsing fails, set the result to null and add a warning

Added extra tests to verify behaviour of match policies on non-keyword fields. They all behave as keywords (the enrich field is converted to keyword at policy execution time, and the input data is converted to keyword at lookup time).
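
The KEYWORD compromise above (parse at runtime, and on failure return null plus a warning instead of throwing) can be illustrated with a small Python sketch. The function `match_int_range`, the range tuples and the warning text are all hypothetical and only mirror the behaviour described here, not the actual ES|QL code.

```python
def match_int_range(value, ranges, warnings):
    """Look up a KEYWORD value in integer ranges, parsing it at runtime.

    ranges: list of (low, high, enrich_value) with inclusive bounds.
    Returns the enrich value, or None (null) when parsing fails or nothing matches.
    """
    try:
        number = int(value)
    except ValueError:
        # Runtime failure: do not raise, set the result to null and warn instead.
        warnings.append(f"cannot parse {value!r} as an integer; result set to null")
        return None
    for low, high, enrich_value in ranges:
        if low <= number <= high:
            return enrich_value
    return None


warnings = []
ranges = [(0, 17, "minor"), (18, 120, "adult")]
results = [match_int_range(v, ranges, warnings) for v in ["42", "foo", "7"]]
```

The unparsable `'foo'` yields a null result and one warning, while the other rows are enriched normally.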
2024-11-19 16:34:21 +01:00
Simon Cooper b30a4b23f2
Output a consistent format when generating error json (#90529)
Now, error fields will always have 'type' and 'reason' fields, and the information in those fields is the same regardless of whether the output is detailed or not
2024-11-19 13:35:04 +00:00
Fang Xing d33bff6468
[ES|QL][DOCS] Add docs for date_period and time_duration (#116368)
* add docs for date_period and time_duration
2024-11-19 07:48:35 -05:00
Bogdan Pintea b5addca40a
ESQL: Docs: COUNT: add an explanation to the use of the 3VL (#116684)
Add an explanation of why `... OR NULL` is needed with `COUNT(...)`.
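
As a rough illustration of the three-valued logic involved, the Python sketch below uses `None` for NULL. The helpers `or3` and `count` are hypothetical stand-ins, assuming that COUNT counts every non-null value (so `false` is counted) and that `false OR NULL` evaluates to NULL.

```python
def or3(a, b):
    """Kleene three-valued OR, with None standing in for NULL."""
    if a is True or b is True:
        return True
    if a is None or b is None:
        return None
    return False


def count(values):
    """COUNT-style aggregate: counts every non-null value, including False."""
    return sum(1 for v in values if v is not None)


status = [200, 404, 200, 500]
is_ok = [s == 200 for s in status]                 # True, False, True, False
plain = count(is_ok)                               # counts the False rows too
with_or_null = count(or3(v, None) for v in is_ok)  # False rows become NULL and drop out
```

Without `OR NULL` the count is simply the number of rows; with it, only the rows where the condition holds are counted.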

Fixes: #99954
2024-11-19 10:37:47 +01:00
Jason Tu efc3ba9958
Update indexing-speed.asciidoc (#116559) 2024-11-18 13:17:17 -05:00
Peter Straßer c804953105
Provide access to new settings for HyphenationCompoundWordTokenFilter (#115585)
Allow the new flags added in Lucene in the HyphenationCompoundWordTokenFilter

Adds access to the two new flags no_sub_matches and no_overlapping_matches.

Lucene issue: https://github.com/apache/lucene/issues/9231
2024-11-18 17:38:49 +01:00
Luca Cavanna 99689281e0
Remove support for deprecated force_source highlighting parameter (#116943)
force_source has been parsed as a no-op since 8.8. This commit removes support
for it at REST, meaning a search request that provides it now gets an error back.
2024-11-18 17:36:39 +01:00
Cauê Marcondes e019fc03e0
Remove apm_user role (#116712)
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-11-18 15:28:30 +00:00
Liam Thompson 4e17c61d39
[DOCS] Remove 'rescore' from retriever.asciidoc (#116921) 2024-11-18 11:34:28 +01:00
Sean Story f55e5d020b
Note a limitation in Basic Sync Rules (#116859) 2024-11-18 09:33:03 +01:00
shainaraskas 2d2ad00872
fix formatting errors (#116843) 2024-11-14 15:45:16 -05:00
Liam Thompson a193fc34a3
[Docs] Link to ECK Azure snapshot docs (#111586) 2024-11-14 18:39:38 +01:00
Brendan Cully b77df851b1
Add warning about restart migration (#116769)
We have gotten more than one SDH due to customers not understanding
why restarts involving fully-mounted indices can pull a lot of data
from the snapshot tier, so it may help to be more explicit about
why this happens and how it can be avoided.
2024-11-14 18:07:09 +01:00
Gal Lalouche c45977a5fd
[ESQL] Update docs format (missing space before '=') (#116808) 2024-11-14 16:05:28 +02:00
Luke Whiting 2f26ec2351
Introduce Email Address Allow Lists For Watcher (#116672)
* New setting plus mutual exclusiveness validation

* New domain list checking

* Email service tests

* Documentation updates

* PR Changes

Fix comment
2024-11-14 12:38:14 +01:00
Gal Lalouche 591cd591ad
[ES|QL] Update length docs (#116734)
ESQL Update length docs (#116734)
2024-11-14 13:14:43 +02:00
Fang Xing b37a829efa
[ES|QL] Implicit casting string literal to intervals in EsqlScalarFunction and GroupingFunction (#115814)
* implicit casting from string literals to datetime intervals
2024-11-13 18:25:06 -05:00
Kathleen DeRusso 1b03a96e52
Add tracking for query rule types (#116357)
* Add total rule type counts to list calls and xpack usage

* Add feature

* Update docs/changelog/116357.yaml

* Fix docs test failure & update yaml tests

* remove additional spaces

---------

Co-authored-by: Mark J. Hoy <mark.hoy@elastic.co>
2024-11-13 17:05:05 +01:00
Max Hniebergall d1788af03f
Update service-elser.asciidoc (#116272) 2024-11-13 08:42:07 -05:00
kosabogi bada2a60ed
Updates chunk settings documentation (#116719) 2024-11-13 14:14:56 +01:00
Gal Lalouche b4898c959f
[ES|QL] Add support BYTE_LENGTH scalar function (#116591)
Also added documentation and examples for BIT_LENGTH and LENGTH regarding unicode.
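
The difference between the three measures shows up as soon as a string contains non-ASCII characters. A small Python sketch, assuming strings are UTF-8 encoded and that "length" means the number of Unicode code points:

```python
text = "caf\u00e9 \u2615"  # "café ☕": 'é' takes 2 bytes in UTF-8, '☕' takes 3

length = len(text)                       # characters (code points)
byte_length = len(text.encode("utf-8"))  # bytes once encoded as UTF-8
bit_length = byte_length * 8             # bits, always 8 per byte
```

For pure ASCII input the character and byte counts coincide; they diverge only when multi-byte characters are present.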
2024-11-13 00:42:19 +02:00
Liam Thompson a71c132481
[DOCS] Update sharepoint-online connector perms (#116641) 2024-11-12 16:14:02 +01:00
Jake Landis b7167b73e3
Docs for monitor_stats privilege (#116533)
This commit adds docs for monitor_stats and updates an example snippet to include both remote_indices and remote_cluster.
2024-11-12 09:13:37 -06:00
Jedr Blaszyk 098c8dad90
[Docs] Fix sharepoint docs for 8.16 release (#116661) 2024-11-12 15:27:58 +01:00
Liam Thompson f121e09fbb
[DOCS] Connectors 8.16.0 release notes (#115856) 2024-11-12 10:59:20 +01:00
Johannes Fredén bfb30d2e72
[DOCS] Remove tech preview from bulk create/update/delete roles (#116601)
Mark bulk create/update/delete roles GA in 9.0 and 8.17
2024-11-12 18:42:34 +11:00
Joe Gallo dd32cb6439
Document new ip_location processor (#116623) 2024-11-11 19:55:57 -05:00
Benjamin Trent 89467b212e
Clarify the vector files utilized for preloading (#116488)
Adds clarification for vector preloading, what extension is to what
storage kind, and that quantized vectors are stored in separate files
allowing for individual preload. 

closes: https://github.com/elastic/elasticsearch/issues/116273
2024-11-12 08:30:48 +11:00
Joe Gallo 2302cdbe45
Document new ip_location APIs (#116611) 2024-11-11 13:52:47 -05:00
Joe Gallo b517abcb07
Document new ip geolocation fields (#116603) 2024-11-11 11:13:56 -05:00
Ievgen Degtiarenko 3ebc1f48aa
Clarify docs around disk capacity expectation. (#115745)
Make it explicit that es expects disks to have the same capacity across all the nodes in the same data tier.
2024-11-11 15:59:11 +01:00
Panagiotis Bailis 64c362b154
Adding more retriever examples to documentation (#116196) 2024-11-11 13:19:20 +02:00
Jack Pan 0914679225
Remove trailing semicolon in REPEAT function example (#116218)
Remove trailing semicolon in REPEAT function example (Closes #116156)
2024-11-11 11:10:05 +01:00
Jake Landis af99654dac
Add a monitor_stats privilege and allow that privilege for remote cluster privileges (#114964)
This commit does the following:
   * Add a new monitor_stats privilege
   * Ensure that monitor_stats can be set in the remote_cluster privileges
   * Gives Kibana the ability to remotely call monitor_stats via RCS 2.0

Since this is the first case where there is more than 1 remote_cluster privilege,
the following framework concerns have been addressed:
    * Ensure that when sending to older RCS 2.0 clusters we don't send the new privilege
        (previously, only all-or-nothing remote_cluster blocks were supported)
    * Ensure that when sending API key role descriptors that contain remote_cluster,
       we don't send the new privileges for RCS 1.0/2.0 if the receiving cluster is not new enough
    * Fix and extend the BWC tests for RCS 1.0 and RCS 2.0
2024-11-08 11:59:32 -06:00
Nik Everett b285204fde
DOCS: Correct explanation of percentiles_bucket (#116499)
Corrects the explanation of `percentiles_bucket` so it's clear that it
returns the `nth` largest item always, and it rounds `n` towards
infinity. That's how it's worked since 2016 but the docs talked about
"not greater than" which I don't think is particularly clear.
2024-11-08 12:40:53 -05:00
Benjamin Trent c48e5e5980
Fixing list for size estimates (#116486) 2024-11-08 15:59:34 +01:00
David Kyle b161f2c22a
Document using xpack.ml.use_auto_machine_memory_percent setting in docker getting started (#114009)
If xpack.ml.use_auto_machine_memory_percent is not explicitly set to true then
the default value (false) means ML will only use 30% of the available memory, making
it impractical to run the ELSER model. Enabling the setting is useful for users wanting
to get started with semantic search. The single node docker instructions have been
updated with a command that gives the container enough memory to run the ELSER model
and enables xpack.ml.use_auto_machine_memory_percent. For the multi-node guide the docker
compose file is updated to enable the ml setting for every node in the cluster.
2024-11-08 13:46:13 +01:00
florent-leborgne ba65914285
refresh ESQL kibana docs (#116441) 2024-11-08 10:39:18 +01:00
Jake Landis e27c90969c
Fix missing remote_cluster docs (#116366)
Documentation for the remote_cluster in the role was added
in #111682 and #108840, but a few places were missed.
This commit fills the gaps in the documentation.
2024-11-07 13:43:50 -06:00
Liam Thompson c42b1ef95a
[DOCS] Use explicit link text in query rules retriever (#116389) 2024-11-07 14:22:53 +01:00
Liam Thompson 22c55fa1ca
[DOCS] Fix boolean for native connectors (#116394) 2024-11-07 12:18:51 +01:00
Tim Grein 81fd1de76b
Add ES|QL bit_length function (#115792) 2024-11-07 08:51:26 +01:00
Kathleen DeRusso 14a7b8fe67
Add documentation for query rules retriever (#115696)
* Add initial query rules retriever docs

* Add docs tests

* Apply suggestions from code review

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* PR feedback

* Make query rules guide retriever-first

* Add warning to DSL doc

* Update docs/reference/search/retriever.asciidoc

Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>

* Update docs/reference/search/retriever.asciidoc

Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>

* Apply suggestions from code review

Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>

* Give parameters subheading an explicit id

* Fix formatting

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
2024-11-06 14:42:06 -05:00
Sean Story 337188244f
Clarify that MSSQL supports only SQL Server auth (#116340)
* Clarify that MSSQL supports only SQL Server auth

* typo
2024-11-06 17:52:06 +01:00
Liam Thompson 8a988445b3
[DOCS] Fix typo in percentile-aggregation.asciidoc (#116268) 2024-11-06 10:21:16 +01:00
Benjamin Trent d33a03ce6b
Add support for bitwise inner-product in painless (#116082)
This adds bitwise inner product to painless. 

The idea here is:

 - For two bit arrays, which we determine to be a byte array whose dimensions match `dense_vector.dim/8`, we simply return bitwise `&`
 - For a stored bit array (remember, with `dense_vector.dim/8` bytes), sum up the provided byte or float array using the bit array as a mask.

This is effectively supporting asynchronous quantization. A prime
example of how this works is:
https://github.com/cohere-ai/BinaryVectorDB

Basically, you do your initial search against the binary space and then
rerank with a differently quantized vector allowing for more information
without additional storage space. 
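
A minimal Python sketch of the two cases listed above. It is not the Painless implementation: the function names are hypothetical, the bit-by-bit case is read here as the population count of the bitwise `&`, and bits are assumed to be packed most-significant-bit first within each byte.

```python
def bit_and_popcount(a: bytes, b: bytes) -> int:
    """Inner product of two packed bit vectors: positions where both bits are 1."""
    if len(a) != len(b):
        raise ValueError("bit vectors must have the same number of bytes")
    return sum(bin(x & y).count("1") for x, y in zip(a, b))


def masked_sum(bits: bytes, query) -> float:
    """Sum query[i] for every i whose bit is set in the stored bit vector.

    The query holds one byte or float value per bit, so it needs
    len(bits) * 8 entries. Bit i lives in byte i // 8, MSB first (assumption).
    """
    if len(query) != len(bits) * 8:
        raise ValueError("query must have 8 values per stored byte")
    total = 0.0
    for i, q in enumerate(query):
        if (bits[i // 8] >> (7 - i % 8)) & 1:
            total += q
    return total


stored = bytes([0b10110000])  # bits 0, 2 and 3 set
other = bytes([0b10010001])   # bits 0, 3 and 7 set
both = bit_and_popcount(stored, other)

query = [0.5, 1.0, 2.0, 4.0, 8.0, 16.0, 32.0, 64.0]
score = masked_sum(stored, query)  # 0.5 + 2.0 + 4.0
```

The second function is the reranking case: the stored vector stays binary while the query keeps its higher-precision values.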

closes:  https://github.com/elastic/elasticsearch/issues/111232
2024-11-06 09:22:04 +11:00
Stef Nestor e2bbee104c
(Doc+) Resolve watermark on Cloud via Autoscaling (#115990) 2024-11-05 21:11:46 +01:00
kosabogi 954ab8ab79
Updates Connectors section page references (#116239) 2024-11-05 14:32:14 +01:00
David Turner a1daddc9e3
Add note about incompleteness of CBs (#116176)
The docs kinda imply that circuit breakers protect against OOMEs, at
least that's how some customers seem to interpret them. This commit adds
a note spelling out that this isn't the case.
2024-11-04 16:07:54 +00:00
Giorgos Bamparopoulos 9ad09b6ee0
Fix a typo in the example for using pre-existing pipeline definitions (#116084) 2024-11-04 16:06:16 +01:00
Mark Tozzi 744eb507f6
[ESQL] clean up date trunc tests (#116111)
While working on #110008 I discovered that the Date Trunc tests were only running in folding mode, because the interval types are marked as not representable. The correct way to test this is to set the forceLiteral flag for those fields, which will (as the name suggests) force them to be literals even in non-folding tests.

Doing that turned up errors in the evaluatorToString tests, which I fixed. There are two big changes here. First, the second parameter to the evaluator is a Rounding instance, not the actual interval. Since Rounding includes some information about the specific rounding in the toString results, I am just using a starts with matcher to validate the majority of the string, rather than trying to reconstruct the expected rounding string. Second, passing in a literal null for the interval parameter folds the whole expression to null, and thus a completely different toString. I added a clause in AnyNullIsNull to account for this.

While I was in there, I moved some specific test cases to a different file. I know moving code is something we're trying to minimize right now, but this seemed worth it. The tests in question do not depend on the parameters of the test case, but all methods in the class get run for every set of parameters. This was causing these tests to be run many times with the same values, which bloats our test run time and test count. Moving them to a distinct class means they'll only be executed once per test run. I feel like this benefit outweighs the cost of git history complexity.
2024-11-04 15:32:53 +01:00
Craig Taverner 535ad91bdb
Refine ESQL limitations (full-text, TEXT fields, unassigned indexes) (#116098)
* Refine ESQL limitations (full-text, TEXT fields, unassigned indexes)

This PR refactors a section of the ES|QL Limitations page to:
* Refactor both full-text and text-behaves-as-keyword sections to better reflect the new behaviour (the old text implies that no full-text search of any kind exists anywhere, which immediately contradicts the statements directly above it).
* Update text-behaves-as-keyword to include my recent work on making all functions return KEYWORD instead of TEXT or SEMANTIC_TEXT
* Add a section on multi-index querying to cover two limitations (union types and unassigned indexes).

* Fix full-text-search examples
2024-11-01 17:03:49 +01:00
Nhat Nguyen 6d4e11d6bc
Add logsdb telemetry (#115994)
This PR adds telemetry for logsdb. However, this change only tracks the 
count of indices using logsdb and those that use synthetic source. 
Additional stats, such as shard, indexing, and search stats, will be
added in a follow-up, as they require reaching out to data nodes.
2024-11-01 08:25:40 -07:00
Artem Prigoda d93d333141
Remove checking of sync commit ids (#114246)
A Lucene commit doesn't contain sync ids `SegmentInfos` anymore, so we can't rely on them during recovery. The field was marked as deprecated in #102343.
2024-11-01 16:18:12 +01:00
Chris Hegarty 2275894ca0
ES|QL Add full-text search to the functions docs page (#116024)
Now that the match and qstr functions are in tech preview, we should add them to the top-level functions doc page.

Co-authored-by: Craig Taverner <craig@amanzi.com>
2024-11-01 12:04:55 +00:00
Craig Taverner c9c1765986
Remove duplicate 'the the' (#116023)
There were many places where `the the` was typed, in comments, docs and messages. All were incorrect and have been replaced with a single `the`.
2024-10-31 19:14:58 +01:00
Jake Landis aaf7a3ec72
Prep docs for v9 (#115375)
This commit prepares the documentation for version 9. 

Some of the automation generates docs that are not correct for version 9. 
The content has been commented out with a reference to an internal issue 
for us to address before this documentation is used.
2024-10-31 09:46:19 -05:00
Tim Grein 6a3a447f18
Remove double "the" from median absolute deviation description (#115826) 2024-10-31 15:25:20 +01:00
István Zoltán Szabó c77fb33070
Adds hands-on learning for Search link to the landing page (#116007)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-10-31 14:49:11 +01:00
Aurélien FOUCRET 0416812456
Term Stats documentation (#115933)
* Term Stats documentation

* Update docs/reference/reranking/learning-to-rank-model-training.asciidoc

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Fix query example.

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2024-10-30 15:31:26 +01:00
Liam Thompson feea0a09b8
[DOCS] Update connectors link on landing page (#115904) 2024-10-30 10:14:22 +01:00
kosabogi aa979b6f11
Adds 8.16 version to css matrix (#115788) 2024-10-29 07:37:03 +01:00
Slobodan Adamović 023c856c24
[DOCS] `_cat/shards`: clarify required permissions for restricted indices (#115650)
Clarify security requirements when requesting detailed shard information
for restricted indices.
2024-10-28 20:19:51 +01:00