Close

Novel “Kaputt” dataset sets new benchmark for large-scale visual defect detection

A new dataset with over 238,000 images challenges and advances the state of the art in visual defect detection for complex retail applications.

Science in the age of foundation models

To transform scientific domains, foundation models will require physical-constraint satisfaction, uncertainty quantification, and specialized forecasting techniques that overcome data scarcity while maintaining scientific rigor.

Scientific frontiers of agentic AI

The language AI agents might speak, sharing context without compromising privacy, modeling agentic negotiations, and understanding users’ commonsense policies are some of the open scientific questions that researchers in agentic AI will need to grapple with.

Revolutionizing warehouse automation with scientific simulation

With a novel parallel-computing architecture, a CAD-to-USD pipeline, and the use of OpenUSD as ground truth, a new simulator can explore hundreds of sensor configurations in the time it takes to test just a few physical setups.

Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases

A decade of database innovation: The Amazon Aurora story

From reimagining storage to serverless computing, Aurora continues to push the boundaries of what's possible in database technology.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

The latest research from Amazon scientists.

View all

Simplifying book discovery with ML-powered visual autocomplete suggestions

September 2, 2025

Audible's ML algorithms connect users directly to relevant titles, reducing the number of purchase steps for millions of daily users.

Search and information retrieval
Amazon builds first foundation model for multirobot coordination

August 11, 2025

Robotics
A better path to pruning large language models

August 8, 2025

Conversational AI
Three challenges in machine-based reasoning

August 4, 2025

Automated reasoning
Multiagent AI for generating chain-of-thought training data

July 31, 2025

Conversational AI

View all

Now open: Call for proposals

Amazon Research Awards have opened their Fall call for proposals, which closes November 5, 2025.

Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

DocTalk: Scalable graph-based dialogue synthesis for enhancing LLM conversational capabilities

Jing Yang Lee, Hamed Bonab, Nasser Zalmout, Ming Zeng, Sanket Lokegaonkar, Colin Lockard, Binxuan Huang, Ritesh Sarkhel, Haodong Wang

SIGDIAL 2025

2025

Large Language Models (LLMs) are increasingly employed in multi-turn conversational tasks, yet their pre-training data predominantly consists of continuous prose, creating a potential mismatch between required capabilities and training paradigms. We introduce a novel approach to address this discrepancy by synthesizing conversational data from existing text corpora. We present a pipeline that transforms

Conversational AI
MDSEval: A meta-evaluation benchmark for multimodal dialogue summarization

Yinhong Liu, Jianfeng He, Hang Su, Ruixue Lian, Yi Nian, Jake Vincent, Srikanth Vishnubhotla, Robinson Piramuthu, Saab Mansour

EMNLP 2025 Findings

2025

Multimodal Dialogue Summarization (MDS) is a critical task with wide-ranging applications. To support the development of effective MDS models, robust automatic evaluation methods are essential for reducing both cost and human effort. However, such methods require a strong meta-evaluation benchmark grounded in human annotations. In this work, we introduce MDSEval, the first meta-evaluation benchmark for

Conversational AI
BYOKG-RAG: Multi-strategy graph retrieval for knowledge graph question answering

Costas Mavromatis, Soji Adeshina, Vassilis N. Ioannidis, Zhen Han, Qi Zhu, Ian Robinson, Bryan Thompson, Huzefa Rangwala, George Karypis

EMNLP 2025

2025

Knowledge graph question answering (KGQA) presents significant challenges due to the structural and semantic variations across input graphs. Existing works rely on Large Language Model (LLM) agents for graph traversal and retrieval; an approach that is sensitive to traversal initialization, as it is prone to entity linking errors and may not generalize well to custom (“bring-your-own”) KGs. We introduce

Search and information retrieval
Analyzing and improving coherence of large language models in question answering

Ivano Lauriola, Stefano Campese, Alessandro Moschitti

NAACL 2025

2025

Large language models (LLMs) have recently revolutionized natural language processing. These models, however, often suffer from instability or lack of coherence, that is the ability of the models to generate semantically equivalent outputs when receiving diverse yet semantically equivalent input variations. In this work, we analyze the behavior of multiple LLMs, including Mixtral-8x7B, Llama2-70b, Smaug

Conversational AI
AutoMixAlign: Adaptive data mixing for multi-task preference optimization in LLMs

Nicholas Corrado, Julian Katz-Samuels, Adithya M Devraj, Hyokun Yun, Chao Zhang, Yi Xu, Yi Pan, Bing Yin, Trishul Chilimbi

ACL 2025

2025

When aligning large language models (LLMs), their performance on various tasks (such as being helpful, harmless, and honest) depends heavily on the composition of their training data. However, selecting a data mixture that achieves strong performance across all tasks is challenging. Existing approaches rely on large ablation studies, heuristics, or human intuition, but these can be prohibitively expensive

Conversational AI

COLM 2025

October 7 - 10, 2025

Montreal

Machine learning

ICCV 2025

October 19 - 23, 2025

Honolulu, Hawaii

Computer vision

INFORMS 2025

October 26 - 29, 2025

Atlanta, GA

Operations research and optimization

EMNLP 2025

November 4 - 9, 2025

Suzhou, China

Conversational AI

NeurIPS 2025

December 2 - 7, 2025

San Diego, California

Machine learning

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Conferences

Academia

Work with us