elasticsearch/libs
Jan-Kazlouski-elastic beb18a87c3
Add Llama support to Inference Plugin (#130092)
* Refactor Hugging Face service settings and completion request methods for consistency

* Add Llama model support for embeddings and chat completions

* Refactor Llama request classes to improve secret settings handling

* Refactor DeltaParser in LlamaStreamingProcessor to improve argument handling

* Enhance Llama streaming processing by adding support for nullable object arrays

* [CI] Auto commit changes from spotless

* Fix error messages in LlamaActionCreator

* [CI] Auto commit changes from spotless

* Add detailed Javadoc comments to Llama classes for improved documentation

* Enhance LlamaChatCompletionResponseHandler to support mid-stream error handling and improve error response parsing

* Add Javadoc comments to Llama classes for improved documentation and clarity

* Fix checkstyle

* Update LlamaEmbeddingsRequest to use mediaTypeWithoutParameters for content type header

* Add unit tests for LlamaActionCreator and related models

* Add unit tests for LlamaChatCompletionServiceSettings to validate configuration parsing and serialization

* Add unit tests for LlamaEmbeddingsServiceSettings to validate configuration parsing and serialization

* Add unit tests for LlamaEmbeddingsServiceSettings to validate various configuration scenarios

* Add unit tests for LlamaChatCompletionResponseHandler to validate error response handling

* Refactor Llama embedding and chat completion tests for consistency and clarity

* Add unit tests for LlamaChatCompletionRequestEntity to validate message serialization

* Add unit tests for LlamaEmbeddingsRequest to validate request creation and truncation behavior

* Add unit tests for LlamaEmbeddingsRequestEntity to validate XContent serialization

* Add unit tests for LlamaErrorResponse to validate error handling from HTTP responses

* Add unit tests for LlamaChatCompletionServiceSettings to validate configuration parsing and serialization

* Add tests for LlamaService request configuration validation and error handling

* Fix error message formatting in LlamaServiceTests for better localization support

* Refactor Llama model classes to implement accept method for action visitors

* Hide Llama service from configuration API to enhance security and reduce exposure

* Refactor Llama model classes to remove modelId and update embedding request handling

* Refactor Llama request classes to use pattern matching for secret settings

* Update embeddings handler to use HuggingFace response entity

* Refactor Mistral model classes to remove modelId and update rate limit hashing

* Refactor Mistral action classes to remove taskSettings parameter and streamline action creation

* Refactor Llama and Mistral models to remove taskSettings parameter and simplify model instantiation

* Refactor Llama service tests to use Model instead of CustomModel and update similarity measure to DOT_PRODUCT

* Remove unused tests and imports from LlamaServiceTests

* Add chunking settings support to Llama embeddings model tests

* Add changelog

* Add support for version checks in Llama settings and define new transport version

* Refactor Llama model assertions and remove unused version support methods

* Refactor Llama service constructors to include ClusterService and improve error message handling

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-07-18 16:26:20 -04:00
..
cli Use @WithoutEntitlements for terminal tests (#130614) 2025-07-04 14:56:04 +02:00
core Remove old entitlements legacy code (#131224) 2025-07-15 08:03:09 +02:00
dissect Use directory name as project name for libs (#115720) 2024-10-29 13:02:28 -07:00
entitlement Log NotEntitledExceptions using logger with <package> suffix (#131031) 2025-07-15 13:21:54 +02:00
geo Fix #123425 numerical floating point edge case (#127982) 2025-05-10 16:37:29 +02:00
grok Fix a bunch of non-final static fields (#119185) 2024-12-26 19:14:36 +01:00
h3 Reapply "Update Gradle wrapper to 8.13 (#122421)" (#123889) (#123896) 2025-03-05 08:02:13 +01:00
log4j Patch log4j JAR to remove JndiLookup class (#81629) 2021-12-10 15:51:38 -08:00
logging Set root logger level for CLIs (#123742) 2025-03-02 17:55:09 +00:00
logstash-bridge Make enrich project-aware (#124099) 2025-03-06 19:20:46 +01:00
lz4 Use directory name as project name for libs (#115720) 2024-10-29 13:02:28 -07:00
native Add Boolean#parseBoolean and Boolean.valueOf to forbidden-apis (#129684) 2025-07-08 14:09:10 +02:00
plugin-analysis-api [Build] Remove deprecated BuildParams (#116984) 2024-11-22 16:30:57 +01:00
plugin-api Add AGPLv3 as a supported license 2024-09-13 15:29:46 -07:00
plugin-scanner Update ASM 9.7 -> 9.7.1 to support JDK 24 (#118094) 2024-12-06 08:16:46 +01:00
simdvec Add bulk processing capabilities to ES91Int4VectorsScorer (#131202) 2025-07-14 14:58:40 +01:00
ssl-config Allow adjustment of transport TLS handshake timeout (#130909) 2025-07-10 16:50:20 +01:00
tdigest Reapply "Update Gradle wrapper to 8.13 (#122421)" (#123889) (#123896) 2025-03-05 08:02:13 +01:00
x-content Add Llama support to Inference Plugin (#130092) 2025-07-18 16:26:20 -04:00
build.gradle Support entitlements in internal cluster tests (#130710) 2025-07-17 16:46:00 +02:00