elasticsearch/libs/x-content
Jan-Kazlouski-elastic beb18a87c3
Add Llama support to Inference Plugin (#130092)
* Refactor Hugging Face service settings and completion request methods for consistency

* Add Llama model support for embeddings and chat completions

* Refactor Llama request classes to improve secret settings handling

* Refactor DeltaParser in LlamaStreamingProcessor to improve argument handling

* Enhance Llama streaming processing by adding support for nullable object arrays

* [CI] Auto commit changes from spotless

* Fix error messages in LlamaActionCreator

* [CI] Auto commit changes from spotless

* Add detailed Javadoc comments to Llama classes for improved documentation

* Enhance LlamaChatCompletionResponseHandler to support mid-stream error handling and improve error response parsing

* Add Javadoc comments to Llama classes for improved documentation and clarity

* Fix checkstyle

* Update LlamaEmbeddingsRequest to use mediaTypeWithoutParameters for content type header

* Add unit tests for LlamaActionCreator and related models

* Add unit tests for LlamaChatCompletionServiceSettings to validate configuration parsing and serialization

* Add unit tests for LlamaEmbeddingsServiceSettings to validate configuration parsing and serialization

* Add unit tests for LlamaEmbeddingsServiceSettings to validate various configuration scenarios

* Add unit tests for LlamaChatCompletionResponseHandler to validate error response handling

* Refactor Llama embedding and chat completion tests for consistency and clarity

* Add unit tests for LlamaChatCompletionRequestEntity to validate message serialization

* Add unit tests for LlamaEmbeddingsRequest to validate request creation and truncation behavior

* Add unit tests for LlamaEmbeddingsRequestEntity to validate XContent serialization

* Add unit tests for LlamaErrorResponse to validate error handling from HTTP responses

* Add unit tests for LlamaChatCompletionServiceSettings to validate configuration parsing and serialization

* Add tests for LlamaService request configuration validation and error handling

* Fix error message formatting in LlamaServiceTests for better localization support

* Refactor Llama model classes to implement accept method for action visitors

* Hide Llama service from configuration API to enhance security and reduce exposure

* Refactor Llama model classes to remove modelId and update embedding request handling

* Refactor Llama request classes to use pattern matching for secret settings

* Update embeddings handler to use HuggingFace response entity

* Refactor Mistral model classes to remove modelId and update rate limit hashing

* Refactor Mistral action classes to remove taskSettings parameter and streamline action creation

* Refactor Llama and Mistral models to remove taskSettings parameter and simplify model instantiation

* Refactor Llama service tests to use Model instead of CustomModel and update similarity measure to DOT_PRODUCT

* Remove unused tests and imports from LlamaServiceTests

* Add chunking settings support to Llama embeddings model tests

* Add changelog

* Add support for version checks in Llama settings and define new transport version

* Refactor Llama model assertions and remove unused version support methods

* Refactor Llama service constructors to include ClusterService and improve error message handling

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-07-18 16:26:20 -04:00
impl          Optimized text for full unicode and some escape sequences (#129169)  2025-06-12 09:55:07 -07:00
src           Add Llama support to Inference Plugin (#130092)                      2025-07-18 16:26:20 -04:00
build.gradle  Use directory name as project name for libs (#115720)                2024-10-29 13:02:28 -07:00