View profile for Jack FitzGerald

Chief Science Officer at EdgeRunner AI

Engineering a performant AI system is all about tradeoffs. As one example, when creating a vector store over which to perform retrieval augmented generation, what size of embeddings should you choose? Researchers at DeepMind sought to characterize the limitations of embedding based retrieval systems as a function of the embeddings size, providing both theoretical analyses and a new benchmark called LIMIT. In this talk we'll discuss the paper, the broader context, and applications. On the Theoretical Limitations of Embedding-Based Retrieval Paper: https://lnkd.in/g7ur79Ze Repo: https://lnkd.in/gkQdkjen Factual Knowledge Acquisition in Pretraining: https://lnkd.in/gvanC-HE Hybrid Search: https://lnkd.in/gQPsZSCq Matryoshka Models: https://lnkd.in/gP-ZduUi

To view or add a comment, sign in

Explore content categories