Author

Thomas Veasey

Articles

Improving information retrieval in the Elastic Stack: Improved inference performance with ELSER v2

ML Research

•17 October 2023

Improving information retrieval in the Elastic Stack: Improved inference performance with ELSER v2

Learn about the improvements we've made to the inference performance of ELSER v2.

Thomas Veasey

Quentin Herreros

Valeriy Khakhutskyy

Improving information retrieval in the Elastic Stack: Optimizing retrieval with ELSER v2

ML Research

•17 October 2023

Improving information retrieval in the Elastic Stack: Optimizing retrieval with ELSER v2

Learn about how we're reducing retrieval costs for ELSER v2.

Thomas Veasey

Quentin Herreros

Valeriy Khakhutskyy

RAG evaluation metrics: A journey through metrics

ML Research

•1 December 2023

RAG evaluation metrics: A journey through metrics

Explore RAG evaluation metrics like BLEU score, ROUGE score, PPL, BARTScore, and more. Discover how Elastic is evaluating RAG with UniEval.

Quentin Herreros

Thomas Veasey

Thanos Papaoikonomou

Evaluating scalar quantization in Elasticsearch

ML Research

•3 May 2024

Evaluating scalar quantization in Elasticsearch

Learn how scalar quantization can be used to reduce the memory footprint of vector embeddings in Elasticsearch through an experiment.

Thanos Papaoikonomou

Thomas Veasey

Scalar Quantization Optimized for Vector Databases

ML Research

•25 April 2024

Scalar Quantization Optimized for Vector Databases

Optimizing scalar quantization for the vector database use case allows us to achieve significantly better performance for the same retrieval quality at high compression ratios.

Thomas Veasey

Benjamin Trent

Understanding Int4 scalar quantization in Lucene

LuceneML Research

•25 April 2024

Understanding Int4 scalar quantization in Lucene

This blog explains how int4 quantization works in Lucene, how it lines up, and the benefits of using int4 quantization.

Benjamin Trent

Thomas Veasey

Improving information retrieval in the Elastic Stack: Introducing Elastic Learned Sparse Encoder, our new retrieval model

ML Research

•21 June 2023

Improving information retrieval in the Elastic Stack: Introducing Elastic Learned Sparse Encoder, our new retrieval model

Deep learning has transformed how people retrieve information. We've created a retrieval model that works with a variety of text with streamlined processes to deploy it. Learn about the model's performance, its architecture, and how it was trained.

Thomas Veasey

Quentin Herreros

Lucene

•12 March 2024

Speeding Up Multi-graph Vector Search

Sharing information among segment searches in multi-graph vector search allows us to achieve significant search speedups.

Mayya Sharipova

Thomas Veasey