Generative AI Content on InfoQ

News

Architecture & Design

Inside Target’s LLM-Based System for Semantic Matching in Marketing Forecast Pipelines

Target built a generative AI system to improve marketing campaign forecasting by retrieving and ranking similar historical campaigns. It combines embeddings, vector search, and LLM ranking to replace rule-based workflows. Evaluation shows 75% top-1 and 100% top-3 coverage. The system reduces manual effort, improves consistency, and refines retrieval through feedback loops driven by campaign outcomes.

Leela Kumili
on Jun 29, 2026
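
The top-1 and top-3 coverage figures in the Target item above can be read as "how often the expected historical campaign appears among the first k retrieved results". The following is a minimal sketch of that metric over a cosine-similarity search, not Target's implementation: the campaign names, the two-dimensional vectors, and the helper names `cosine`, `top_k`, and `coverage_at_k` are all made up for illustration, and a real system would use model-generated embeddings and a vector index.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query_vec, corpus, k):
    """Return the k corpus keys most similar to query_vec, best first."""
    ranked = sorted(corpus, key=lambda name: cosine(query_vec, corpus[name]), reverse=True)
    return ranked[:k]


def coverage_at_k(labelled_queries, corpus, k):
    """Fraction of queries whose expected match appears in the top k results."""
    hits = sum(
        1 for query_vec, expected in labelled_queries
        if expected in top_k(query_vec, corpus, k)
    )
    return hits / len(labelled_queries)


# Hypothetical toy data: three historical campaigns as 2-D "embeddings".
corpus = {
    "spring_sale": [1.0, 0.0],
    "back_to_school": [0.0, 1.0],
    "holiday": [1.0, 1.0],
}

# Each query is (embedding of the new campaign, campaign a reviewer expects to match).
labelled_queries = [
    ([0.9, 0.1], "spring_sale"),
    ([0.1, 0.9], "back_to_school"),
    ([0.6, 0.5], "spring_sale"),  # nearest neighbour is "holiday", so this misses at k=1
    ([1.0, 1.0], "holiday"),
]

top1 = coverage_at_k(labelled_queries, corpus, 1)
top3 = coverage_at_k(labelled_queries, corpus, 3)
print(f"top-1 coverage: {top1:.2f}, top-3 coverage: {top3:.2f}")
```

In this toy set one of the four queries only finds its expected campaign at rank 2, which is the kind of case a later ranking stage is meant to correct.
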
Cloud

AWS Previews FinOps Agent for Cost Analysis and Optimization

Amazon has released AWS FinOps Agent in public preview, a managed service that automates several common FinOps workflows. The agent can investigate cost anomalies, correlate spend changes with AWS activity data, and integrate with tools such as Slack and Jira to route findings to resource owners.

Renato Losio
on Jun 28, 2026
AI, ML & Data Engineering

AI Coding Agents Get a Stack Overflow of Their Own

Stack Overflow has announced Stack Overflow for Agents, a beta API-first knowledge exchange aimed at AI coding agents rather than human developers. The service is presented as a way to close what the company calls the Ephemeral Intelligence Gap, where agents repeatedly rediscover the same fixes and patterns in isolation instead of sharing them through a common memory.

Matt Saunders
on Jun 16, 2026
Java

Oracle's OpenJDK Bans Generative AI Contributions While Oracle's GraalVM Allows Them

Two related, Oracle-backed projects published opposing policies on open-source contributions created with generative AI: the OpenJDK Governing Board approved an interim policy prohibiting such contributions, while the Coding Assistants policy from GraalVM permits them. Both projects require contributors to sign the same Oracle Contributor Agreement (OCA) for intellectual property.

Karsten Silz
on Jun 12, 2026
Development

Celebrating 20 Years of InfoQ

InfoQ celebrates its 20th anniversary. To mark the occasion, we have published a walk-through of the trends InfoQ called early, where they sit on the adoption curve today, and how that curve may evolve over the next decade.

InfoQ
on Jun 08, 2026
AI, ML & Data Engineering

Sarang Kulkarni on Lessons from Building Deep Research Agents in Production

Deep Research Agentic Systems are AI agents designed to conduct multi-step research on complex tasks using dynamic reasoning and multi-hop information retrieval, and to generate structured analytical reports. Sarang Kulkarni from Thoughtworks spoke at the Arc of AI Conference 2026 about deploying multi-agent research systems for deep reasoning and the lessons learned from developing Deep Research Agents.

Srini Penchikala
on May 27, 2026
Architecture & Design

Uber Improves Restaurant Recommendations Using Real-Time Signals and Listwise Ranking

Uber has updated its Uber Eats Home Feed recommendation system with near real-time user sequence features and a Generative Recommender (GenRec) model. The system moves from hand-crafted features to transformer-based sequence modeling, cuts feature staleness from 24 hours to seconds, and shifts from pointwise scoring to listwise ranking with GenRec for improved contextual ranking and real-time personalization.

Leela Kumili
on May 22, 2026
Development

Anthropic Traces Six Weeks of Claude Code Quality Complaints to Three Overlapping Product Changes

Anthropic published a postmortem tracing six weeks of Claude Code quality complaints to three overlapping product-layer changes: a reasoning effort downgrade, a caching bug that progressively erased the model's own thinking, and a system prompt verbosity limit that caused a 3% quality drop. The API and model weights were unaffected. All issues were resolved April 20.

Steef-Jan Wiggers
on May 14, 2026
Cloud

Cloudflare Announces Agent Memory, a Managed Persistent Memory Service for AI Agents

Cloudflare announced Agent Memory in private beta, a managed service that extracts structured memories from AI agent conversations and retrieves them on demand using five-channel parallel retrieval with Reciprocal Rank Fusion. Shared memory profiles let teams of agents access common knowledge. Competitors include Mem0, Zep, LangMem, and Letta.

Steef-Jan Wiggers
on Apr 30, 2026
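
Reciprocal Rank Fusion, mentioned in the Cloudflare item above, merges several ranked lists by giving each item a score of 1 / (k + rank) in every list that contains it and summing those scores. The following is a minimal sketch of the general technique, not Cloudflare's implementation: the three example channels and the memory IDs are hypothetical, and the constant k = 60 is the value commonly used in the literature rather than anything stated in the article.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists of IDs into one list, best first.

    Each list contributes 1 / (k + rank) for every ID it contains,
    with rank starting at 1. IDs missing from a list get nothing from it.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, item_id in enumerate(ranking, start=1):
            scores[item_id] += 1.0 / (k + rank)
    # Sort by descending score; break ties by ID so the output is deterministic.
    return sorted(scores, key=lambda item_id: (-scores[item_id], item_id))


# Hypothetical results from three retrieval channels for one query.
keyword_hits = ["m1", "m2", "m3"]
vector_hits = ["m2", "m1"]
recency_hits = ["m2", "m4"]

fused = reciprocal_rank_fusion([keyword_hits, vector_hits, recency_hits])
print(fused)
```

Because the fusion uses only rank positions, the channels do not need comparable score scales, which is why the method suits retrieval paths that run in parallel.
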
DevOps

DBmaestro MCP Server Puts Natural Language in Control of Database Pipelines

DBmaestro has launched an MCP server that connects AI agents and enterprise copilots to its database DevOps platform, so teams can issue natural language commands that trigger real, governed platform workflows. Announced on 7 April 2026, the server lets DBAs expose DBmaestro's release automation, source control, CI/CD orchestration, and compliance capabilities through MCP.

Matt Saunders
on Apr 30, 2026
DevOps

AWS Announces General Availability of DevOps Agent for Automated Incident Investigation

AWS has announced the general availability of DevOps Agent, a generative AI–powered assistant designed to help developers and operators troubleshoot issues, analyze deployments, and automate operational tasks across AWS environments.

Renato Losio
on Apr 18, 2026
AI, ML & Data Engineering

Google Opens Gemma 4 Under Apache 2.0 with Multimodal and Agentic Capabilities

Google has announced the release of Gemma 4, a series of open-weight AI models, including variants with 2B, 4B, 26B, and 31B parameters, under the Apache 2.0 license. Key features include enhanced video and image processing, audio input on smaller models, and extended context windows up to 256K tokens.

Hien Luu
on Apr 16, 2026
Architecture & Design

Zendesk Says AI Makes Code Abundant, Shifting the Bottleneck to “Absorption Capacity”

Zendesk argues that GenAI shifts the bottleneck in software delivery from writing code to “absorption capacity”, which is the organisation’s ability to define problems clearly, integrate changes into the wider system, and turn implementation into reliable value. As code becomes abundant, architectural coherence, review capacity, and delivery flow become the main constraints.

Eran Stiller
on Apr 15, 2026
AI, ML & Data Engineering

Teleport Report Finds Over-Privileged AI Systems Linked to Fourfold Rise in Security Incidents

Enterprises that grant excessive access permissions to AI systems experience 4.5 times as many security incidents as those that do not, according to The 2026 State of AI in Enterprise Infrastructure Security, a report published by infrastructure identity company Teleport. The study found that identity management hasn't kept up with AI adoption in production systems.

Matt Saunders
on Mar 28, 2026
DevOps

QCon London 2026: AI Agents Write Your Code. What’s Left for Humans?

Hannah Foxwell began her QCon London 2026 talk by noting that the long-sought velocity in development has arrived, but the industry is unsure how to use it. She set aside the technical details of agentic coding, focusing instead on its implications for the people working with these systems.

Matt Saunders
on Mar 27, 2026
