Updated May 23, 2026

Tech Trends

AI-aggregated emerging technology news from top sources. Updated daily via automated Python scraper + GitHub Actions.

AI/ML

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Hugging Face - Blog · May 23, 2026
AI/ML

Catch up on the Dialogues stage at Google I/O 2026.

Alphabet CEO Sundar Pichai in conversation on the I/O 2026 Dialogues stage

AI · May 22, 2026
AI/ML

Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook

Hugging Face - Blog · May 22, 2026
AI/ML

Temporal Contrastive Transformer for Financial Crime Detection: Self-Supervised Sequence Embeddings via Predictive Contrastive Coding

arXiv:2605.21490v1. Announce Type: new. Abstract: We introduce the Temporal Contrastive Transformer (TCT), a representation learning framework designed to capture contextual temporal dynamics in sequences of financial transactions. The model is trained using a self-supervised contrastive objective to…

cs.LG updates on arXiv.org · May 22, 2026
AI/ML

Teaching Language Models to Forecast Research Success Through Comparative Idea Evaluation

arXiv:2605.21491v1. Announce Type: new. Abstract: As language models accelerate scientific research by automating hypothesis generation and implementation, a new bottleneck emerges: evaluating and filtering hundreds of AI-generated ideas without exhaustive experimentation. We ask whether LMs can lear…

cs.LG updates on arXiv.org · May 22, 2026
AI/ML

Tool-Augmented Agent for Closed-loop Optimization, Simulation, and Modeling Orchestration

arXiv:2605.20190v1. Announce Type: new. Abstract: Iterative industrial design-simulation optimization is bottlenecked by the CAD-CAE semantic gap: translating simulation feedback into valid geometric edits under diverse, coupled constraints. To fill this gap, we propose COSMO-Agent (Closed-loop Optim…

cs.AI updates on arXiv.org · May 22, 2026
AI/ML

SOLAR: A Self-Optimizing Open-Ended Autonomous Agent for Lifelong Learning and Continual Adaptation

arXiv:2605.20189v1. Announce Type: new. Abstract: Despite the remarkable success of large language models (LLMs), they still face bottlenecks while deploying in dynamic, real-world settings with primary challenges being concept drift and the high cost of gradient-based adaptation. Traditional fine-tu…

cs.AI updates on arXiv.org · May 22, 2026
AI/ML

GraphDiffMed: Knowledge-Constrained Differential Attention with Pharmacological Graph Priors for Medication Recommendation

arXiv:2605.20188v1. Announce Type: new. Abstract: Recommending safe and effective medication combinations from electronic health records (EHRs) is a core clinical AI problem, yet it remains difficult because patient trajectories are long, noisy, and clinically heterogeneous. Existing methods typicall…

cs.LG updates on arXiv.org · May 21, 2026
AI/ML

Neural Estimation of Pairwise Mutual Information in Masked Discrete Sequence Models

arXiv:2605.20187v1. Announce Type: new. Abstract: Understanding dependencies between variables is critical for interpretability and efficient generation in masked diffusion models (MDMs), yet these models primarily expose marginal conditional distributions and do not explicitly represent inter-variab…

cs.LG updates on arXiv.org · May 21, 2026
AI/ML

We’re announcing new community investments in Missouri.

We’re helping build the state’s next-generation workforce and investing in energy programs.

AI · May 20, 2026
AI/ML

100 things we announced at I/O 2026

Image with the words "Ready, Set, I/O" and a colorful Gemini logo

AI · May 20, 2026
AI/ML

Robust Basis Spline Decoupling for the Compression of Transformer Models

arXiv:2605.18794v1. Announce Type: new. Abstract: Decoupling is a powerful modeling paradigm for representing multivariate functions as compositions of linear transformations and univariate nonlinear functions. A single-layer decoupling can be viewed as a fully connected neural network with a single…

cs.LG updates on arXiv.org · May 20, 2026
AI/ML

Dimensional Balance Improves Large Scale Spatiotemporal Prediction Performance

arXiv:2605.18793v1. Announce Type: new. Abstract: Accurate spatiotemporal pattern analysis is critical in fields such as urban traffic, meteorology, and public health monitoring. However, existing methods face performance bottlenecks, typically yielding only incremental gains and often exhibiting lim…

cs.LG updates on arXiv.org · May 20, 2026
AI/ML

Operationalizing Document AI: A Microservice Architecture for OCR and LLM Pipelines in Production

arXiv:2605.18818v1. Announce Type: new. Abstract: Academic research tends to focus on new models for document understanding creating a wide gap in the literature between model definition and running models at production scale. To close that gap, we present a microservice architecture that encapsulate…

cs.AI updates on arXiv.org · May 20, 2026
AI/ML

Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance

arXiv:2605.18801v1. Announce Type: new. Abstract: Data is fundamental to large language models (LLMs). However, understanding of what makes certain data useful for different stages of an LLM workflow, including training, tuning, alignment, in-context learning, etc., and why, remains an open question.

cs.AI updates on arXiv.org · May 20, 2026
AI/ML

OlmoEarth v1.1: A more efficient family of models

OlmoEarth v1.1: A more efficient family of Earth observation models

Hugging Face - Blog · May 19, 2026
AI/ML

I/O 2026

At Google I/O 2026, we shared how we’re making AI more helpful for everyone. See everything we announced.

AI · May 19, 2026
AI/ML

How AI Mode is changing the way people search in the U.S.

A graphic features the text "How people are using AI Mode in the U.S." surrounded by colorful, stylized illustrations of a pencil, planet, banana, gift box, cursor, gamepad, and lipstick on a light blue background.

AI · May 19, 2026
AI/ML

AgentWall: A Runtime Safety Layer for Local AI Agents

arXiv:2605.16265v1. Announce Type: new. Abstract: The safety of autonomous AI agents is increasingly recognized as a critical open problem. As agents transition from passive text generators to active actors capable of executing shell commands, modifying files, calling APIs, and browsing the web, the…

cs.AI updates on arXiv.org · May 19, 2026
AI/ML

ANNEAL: Adapting LLM Agents via Governed Symbolic Patch Learning

arXiv:2605.16309v1. Announce Type: new. Abstract: LLM-based agents can recover from individual execution errors, yet they repeatedly fail on the same fault when the underlying process knowledge--operator schemas, preconditions, and constraints--remains unrepaired. Existing self-evolving approaches ad…

cs.AI updates on arXiv.org · May 19, 2026
AI/ML

Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra

arXiv:2605.16259v1. Announce Type: new. Abstract: While real-time image generation using diffusion models has advanced rapidly on NVIDIA GPUs, systematic optimization research on non-CUDA platforms such as Apple Silicon remains extremely limited. In this study, we conducted comprehensive optimization…

cs.LG updates on arXiv.org · May 19, 2026
AI/ML

Mirror Descent-Type Algorithms for the Variational Inequality Problem with Functional Constraints

arXiv:2605.16262v1. Announce Type: new. Abstract: Variational inequalities play a key role in machine learning research, such as generative adversarial networks, reinforcement learning, adversarial training, and generative models. This paper is devoted to the constrained variational inequality proble…

cs.LG updates on arXiv.org · May 19, 2026
AI/ML

Introducing the Ettin Reranker Family

Hugging Face - Blog · May 19, 2026
AI/ML

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

Hugging Face - Blog · May 18, 2026
AI/ML

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

Hugging Face - Blog · May 18, 2026
AI/ML

SDOF: Taming the Alignment Tax in Multi-Agent Orchestration with State-Constrained Dispatch

arXiv:2605.15204v1. Announce Type: new. Abstract: Multi-agent orchestration frameworks such as LangChain, LangGraph, and CrewAI route tasks through graph-based pipelines but do not enforce the stage constraints that govern real business processes. We present SDOF, a framework that treats multi-agent…

cs.AI updates on arXiv.org · May 18, 2026
AI/ML

AgentStop: Terminating Local AI Agents Early to Save Energy in Consumer Devices

arXiv:2605.15206v1. Announce Type: new. Abstract: Autonomous agents powered by large language models (LLMs) are increasingly used to automate complex, multi-step tasks such as coding or web-based question answering. While remote, cloud-based agents offer scalability and ease of deployment, they raise…

cs.LG updates on arXiv.org · May 18, 2026
AI/ML

DeepSlide: From Artifacts to Presentation Delivery

arXiv:2605.15202v1. Announce Type: new. Abstract: Presentations are a primary medium for scholarly communication, yet most AI slide generators optimize the artifact (a visually plausible deck) while under-optimizing the delivery process (pacing, narrative, and presentation preparation). We present De…

cs.AI updates on arXiv.org · May 18, 2026
AI/ML

TeamTR: Trust-Region Fine-Tuning for Multi-Agent LLM Coordination

arXiv:2605.15207v1. Announce Type: new. Abstract: Multi-agent LLM systems have shown promise for complex reasoning, yet recent evaluations reveal they often underperform single-model baselines. We identify a structural failure mode in sequential fine-tuning of shared-context teams: updating one agent…

cs.LG updates on arXiv.org · May 18, 2026
AI/ML

Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders

arXiv:2605.13930v1. Announce Type: new. Abstract: EEG foundation models achieve state-of-the-art clinical performance, yet the internal computations driving their predictions remain opaque: a barrier to clinical trust. We apply TopK Sparse Autoencoders (SAEs) across three architecturally distinct EEG…

cs.LG updates on arXiv.org · May 15, 2026
AI/ML

GraphBit: A Graph-based Agentic Framework for Non-Linear Agent Orchestration

arXiv:2605.13848v1. Announce Type: new. Abstract: Agentic LLM frameworks that rely on prompted orchestration, where the model itself determines workflow transitions, often suffer from hallucinated routing, infinite loops, and non-reproducible execution. We introduce GraphBit, an engine-orchestrated f…

cs.AI updates on arXiv.org · May 15, 2026
AI/ML

Mixed Integer Goal Programming for Personalized Meal Optimization with User-Defined Serving Granularity

arXiv:2605.13849v1. Announce Type: new. Abstract: Determining what to eat to satisfy nutritional requirements is one of the oldest optimization problems in operations research, yet existing formulations have two persistent limitations: continuous variables produce impractical fractional servings (1.7…

cs.AI updates on arXiv.org · May 15, 2026
AI/ML

Vision-Based Runtime Monitoring under Varying Specifications using Semantic Latent Representations

arXiv:2605.13923v1. Announce Type: new. Abstract: We study certified runtime monitoring of past-time signal temporal logic (ptSTL) from visual observations under partial observability. The monitor must infer safety-relevant quantities from images and provide finite-sample guarantees, while being …

cs.LG updates on arXiv.org · May 15, 2026
AI/ML

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Hugging Face - Blog · May 14, 2026
AI/ML

Think Twice, Act Once: Verifier-Guided Action Selection For Embodied Agents

arXiv:2605.12620v1. Announce Type: new. Abstract: Building generalist embodied agents capable of solving complex real-world tasks remains a fundamental challenge in AI. Multimodal Large Language Models (MLLMs) have significantly advanced the reasoning capabilities of such agents through strong vision…

cs.AI updates on arXiv.org · May 14, 2026
AI/ML

Macro-Action Based Multi-Agent Instruction Following through Value Cancellation

arXiv:2605.12655v1. Announce Type: new. Abstract: Multi-agent reinforcement learning (MARL) in real-world use cases may need to adapt to external natural language instructions that interrupt ongoing behavior and conflict with long-horizon objectives. However, conditioning rewards on instructions intr…

cs.AI updates on arXiv.org · May 14, 2026
AI/ML

Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance

arXiv:2605.12561v1. Announce Type: new. Abstract: Safe reinforcement learning (RL) typically asks $\textit{what}$ an agent should do. We ask $\textit{when}$ it needs to act, and show that a single policy can jointly learn control inputs and communication-efficient timing decisions under a pointwise L…

cs.LG updates on arXiv.org · May 14, 2026
AI/ML

CAWI: Copula-Aligned Weight Initialization for Randomized Neural Networks

arXiv:2605.12580v1. Announce Type: new. Abstract: Randomized neural networks (RdNNs) enable efficient, backpropagation-free training by freezing randomly initialized input-to-hidden weights, which permits a closed-form solution for the output layer. However, conventional random initialization is blin…

cs.LG updates on arXiv.org · May 14, 2026
AI/ML

Unlocking asynchronicity in continuous batching

Hugging Face - Blog · May 14, 2026
AI/ML

QuIDE: Mastering the Quantized Intelligence Trade-off via Active Optimization

arXiv:2605.10959v1. Announce Type: new. Abstract: There is currently no unified metric for evaluating the efficiency of quantized neural networks. We propose QuIDE, built around the Intelligence Index I = (C x P)/log_2(T+1), which collapses the compression-accuracy-latency trade-off into a single sco…

cs.LG updates on arXiv.org · May 13, 2026
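
The formula quoted in the QuIDE abstract can be evaluated directly. The sketch below is only an illustration: the abstract is cut off before it defines C, P, and T, so reading them as a compression factor, an accuracy score, and a latency value is an assumption based on the phrase "compression-accuracy-latency trade-off", and the input numbers are made up.

```python
import math


def intelligence_index(c: float, p: float, t: float) -> float:
    """Evaluate I = (C * P) / log2(T + 1) as written in the abstract.

    ASSUMPTION: c = compression factor, p = accuracy score, t = latency.
    The truncated abstract does not define these symbols or their units.
    """
    if t <= 0:
        # log2(t + 1) must be strictly positive for the ratio to be defined.
        raise ValueError("t must be positive")
    return (c * p) / math.log2(t + 1)


# Made-up inputs: log2(7 + 1) = 3, so the index is (4 * 0.9) / 3, about 1.2.
print(intelligence_index(c=4.0, p=0.9, t=7.0))
```

Because T enters through a logarithm, doubling latency lowers the score far less than halving C or P would.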
AI/ML

Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation

arXiv:2605.10947v1. Announce Type: new. Abstract: EEG microstate analysis segments continuous brain electrical activity into brief, quasi-stable topographic configurations that reflect discrete functional brain states. Conventional approaches such as Modified K-Means operate directly in electrode spa…

cs.LG updates on arXiv.org · May 13, 2026
AI/ML

Reinforcement learning for inverse structural design and rapid laser cutting of kirigami prototypes

arXiv:2605.08098v1. Announce Type: new. Abstract: Kirigami is an increasingly useful fabrication method to produce shape-programmable metamaterial structures. However, inverse design remains difficult because deployment is nonlinear, and feasible cut layouts must satisfy discrete compatibility rules,…

cs.LG updates on arXiv.org · May 12, 2026
AI/ML

Path-Based Gradient Boosting for Graph-Level Prediction

arXiv:2605.08102v1. Announce Type: new. Abstract: We propose PathBoost, a gradient tree boosting method for graph-level classification and regression that learns discriminative path-based features directly from the input graph structure. Building on a previous work, which was tailored to a specific c…

cs.LG updates on arXiv.org · May 12, 2026
AI/ML

Spatial Priming Outperforms Semantic Prompting: A Grid-Based Approach to Improving LLM Accuracy on Chart Data Extraction

arXiv:2605.08220v1. Announce Type: new. Abstract: The automated extraction of data from scientific charts is a critical task for large-scale literature analysis. While multimodal Large Language Models (LLMs) show promise, their accuracy on non-standardized charts remains a challenge. This raises a ke…

cs.AI updates on arXiv.org · May 12, 2026
AI/ML

Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits

arXiv:2605.08200v1. Announce Type: new. Abstract: A pervasive intuition holds that vision-language models (VLMs) are most trustworthy when their attention maps look sharp: concentrated attention on the queried region should imply a confident, calibrated answer. We test this Attention-Confidence Assum…

cs.AI updates on arXiv.org · May 12, 2026
AI/ML

Building Blocks for Foundation Model Training and Inference on AWS

Hugging Face - Blog · May 11, 2026
AI/ML

The new AI-powered Google Finance is expanding to Europe.

A screenshot of the AI-powered experience on Google Finance.

AI · May 11, 2026
AI/ML

RateQuant: Optimal Mixed-Precision KV Cache Quantization via Rate-Distortion Theory

arXiv:2605.06675v1. Announce Type: new. Abstract: Large language models cache all previously computed key-value (KV) pairs during generation, and this KV cache grows linearly with sequence length, making it a primary memory bottleneck for serving. Quantizing the KV cache to fewer bits reduces this co…

cs.LG updates on arXiv.org · May 11, 2026
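
The linear growth mentioned in the RateQuant abstract can be checked with back-of-the-envelope arithmetic. The sketch below is not taken from the paper: it assumes a plain decoder-only transformer that stores one key and one value vector per layer, per KV head, per token, and the model dimensions are hypothetical. It uses a single uniform bit width rather than the mixed-precision scheme in the title, and it ignores quantization metadata such as scales and zero points.

```python
def kv_cache_bytes(
    seq_len: int,
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    bits_per_value: int = 16,
    batch_size: int = 1,
) -> int:
    """Approximate KV cache size in bytes for a decoder-only transformer.

    ASSUMPTION: standard attention with no eviction or sharing across
    layers; quantization metadata (scales, zero points) is not counted.
    """
    # Factor 2 covers the key and the value stored for every token.
    n_values = 2 * n_layers * n_kv_heads * head_dim * seq_len * batch_size
    return n_values * bits_per_value // 8


# Hypothetical model: 32 layers, 8 KV heads, head dimension 128.
fp16 = kv_cache_bytes(4096, n_layers=32, n_kv_heads=8, head_dim=128)
int4 = kv_cache_bytes(4096, n_layers=32, n_kv_heads=8, head_dim=128,
                      bits_per_value=4)
print(fp16 / 2**20, "MiB at 16 bits")  # 512.0 MiB
print(int4 / 2**20, "MiB at 4 bits")   # 128.0 MiB
```

Doubling `seq_len` doubles the result, and moving from 16 to 4 bits divides it by four under these assumptions.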
AI/ML

More Thinking, More Bias: Length-Driven Position Bias in Reasoning Models

arXiv:2605.06672v1. Announce Type: new. Abstract: Chain-of-thought (CoT) reasoning and reasoning-tuned models such as DeepSeek-R1 are commonly assumed to reduce shallow heuristic biases by thinking carefully. We test this on position bias in multiple-choice QA and find a different story: within any r…

cs.AI updates on arXiv.org · May 11, 2026
AI/ML

LKV: End-to-End Learning of Head-wise Budgets and Token Selection for LLM KV Cache Eviction

arXiv:2605.06676v1. Announce Type: new. Abstract: Long-context inference in Large Language Models (LLMs) is bottlenecked by the linear growth of Key-Value (KV) cache memory. Existing KV cache compression paradigms are fundamentally limited by heuristics: heuristic budgeting relies on statistical prio…

cs.LG updates on arXiv.org · May 11, 2026
