Customer-obsessed science
Research areas
-
June 8, 20267 min readFour approaches can dramatically improve the performance and trustworthiness of AI agents in operational environments.
-
-
-
-
May 26, 20265 min read
Featured news
-
Conventional adaptive bitrate (ABR) streaming systems typically rely on static bitrate ladders to optimize Quality of Experience (QoE). While operationally simple, this 'one-size-fits-all' approach neglects content-specific characteristics, often compromising streaming efficiency. Per-title optimization methods address this by predicting the rate-distortion convex hull directly from the source content,
-
2026LLMs can perform multi-step reasoning through Chain-of-Thought (CoT), but they cannot reliably verify their own logic. Even when they reach correct answers, the underlying reasoning may be flawed, undermining trust in high-stakes scenarios. To mitigate this issue, we introduce VERICOT, a neuro-symbolic method that extracts and verifies formal logical arguments from CoT reasoning. VERICOT formalizes each
-
2026Search-augmented LLM agents can produce deep research reports (DRRs), but claim-level factuality remains hard to verify. Existing fact-checkers are optimized for short, general-domain claims and often reduce verification to matching snippets or checking cited sources, missing uncited synthesis and broader scientific consensus. Yet evaluating better DRR verifiers is itself difficult: the usual solution,
-
IEEE Symposium on Product Compliance Engineering 20262026Lightning-induced surge voltages constitute a primary failure mechanism for outdoor electronic equipment deployed in residential and commercial environments. This paper presents a physics-based Distance Effect model that establishes the quantitative relationship between lightning-induced voltage magnitude and the spatial volume fraction capable of producing that voltage level, yielding an inverse cubic
-
2026Context management enables agentic models to solve long-horizon tasks through iterative summarization of previous interaction histories. However, this process typically incurs substantial decoding overhead for the extra summarization tokens, which significantly affect the end-to-end response latency at deployment. In this paper, we introduce COMEM, a novel framework that decouples memory management from
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all