Artificial Intelligence

Authors and titles for recent submissions

Total of 1615 entries

Fri, 3 Oct 2025 (showing first 50 of 222 entries)

[1] arXiv:2510.02276 [pdf, html, other]
Title: BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals
Chenqi Li, Yu Liu, Timothy Denison, Tingting Zhu
Subjects: Artificial Intelligence (cs.AI)
[2] arXiv:2510.02263 [pdf, html, other]
Title: RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
Yuxiao Qu, Anikait Singh, Yoonho Lee, Amrith Setlur, Ruslan Salakhutdinov, Chelsea Finn, Aviral Kumar
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[3] arXiv:2510.02250 [pdf, html, other]
Title: The Unreasonable Effectiveness of Scaling Agents for Computer Use
Gonzalo Gonzalez-Pumariega, Vincent Tu, Chih-Lun Lee, Jiachen Yang, Ang Li, Xin Eric Wang
Comments: 23 pages, 7 figures, 10 tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4] arXiv:2510.02230 [pdf, html, other]
Title: The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models
Phuc Minh Nguyen, Chinh D. La, Duy M. H. Nguyen, Nitesh V. Chawla, Binh T. Nguyen, Khoa D. Doan
Comments: 23 pages, 15 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[5] arXiv:2510.02194 [pdf, html, other]
Title: UpSafe$^\circ$C: Upcycling for Controllable Safety in Large Language Models
Yuhao Sun, Zhuoer Xu, Shiwen Cui, Kun Yang, Lingyun Yu, Yongdong Zhang, Hongtao Xie
Subjects: Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[6] arXiv:2510.02190 [pdf, html, other]
Title: A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports
Yang Yao, Yixu Wang, Yuxuan Zhang, Yi Lu, Tianle Gu, Lingyu Li, Dingyi Zhao, Keming Wu, Haozhe Wang, Ping Nie, Yan Teng, Yingchun Wang
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[7] arXiv:2510.02133 [pdf, html, other]
Title: FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models
Karan Dua, Hitesh Laxmichand Patel, Puneet Mittal, Ranjeet Gupta, Amit Agarwal, Praneet Pabolu, Srikant Panda, Hansa Meghwani, Graham Horwood, Fahad Shah
Comments: Accepted at EMNLP 2025
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2510.02125 [pdf, html, other]
Title: Do AI Models Perform Human-like Abstract Reasoning Across Modalities?
Claas Beger, Ryan Yi, Shuhao Fu, Arseny Moskvichev, Sarah W. Tsai, Sivasankaran Rajamanickam, Melanie Mitchell
Comments: 10 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[9] arXiv:2510.02091 [pdf, html, other]
Title: Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning
Xinyuan Song, Keyu Wang, PengXiang Li, Lu Yin, Shiwei Liu
Comments: ICASSP 2025
Subjects: Artificial Intelligence (cs.AI)
[10] arXiv:2510.02060 [pdf, html, other]
Title: ReTabAD: A Benchmark for Restoring Semantic Context in Tabular Anomaly Detection
Sanghyu Yoon, Dongmin Kim, Suhee Yoon, Ye Seul Sim, Seungdong Yoa, Hye-Seung Cho, Soonyoung Lee, Hankook Lee, Woohyung Lim
Comments: 9 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[11] arXiv:2510.02027 [pdf, html, other]
Title: Zero-shot reasoning for simulating scholarly peer-review
Khalid M. Saqr
Subjects: Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[12] arXiv:2510.01924 [pdf, html, other]
Title: To Mask or to Mirror: Human-AI Alignment in Collective Reasoning
Crystal Qian, Aaron Parisi, Clémentine Bouleau, Vivian Tsai, Maël Lebreton, Lucas Dixon
Subjects: Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[13] arXiv:2510.01902 [pdf, html, other]
Title: Constrained Adaptive Rejection Sampling
Paweł Parys, Sairam Vaidya, Taylor Berg-Kirkpatrick, Loris D'Antoni
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[14] arXiv:2510.01857 [pdf, html, other]
Title: Learning a Dense Reasoning Reward Model from Expert Demonstration via Inverse Reinforcement Learning
Claudio Fanconi, Nicolás Astorga, Mihaela van der Schaar
Subjects: Artificial Intelligence (cs.AI)
[15] arXiv:2510.01833 [pdf, html, other]
Title: Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
Zhihao Dou, Qinjian Zhao, Zhongwei Wan, Dinggen Zhang, Weida Wang, Towsif Raiyan, Benteng Chen, Qingtao Pan, Yang Ouyang, Zhiqiang Gao, Shufei Zhang, Sumon Biswas
Comments: 19 pages and 5 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[16] arXiv:2510.01815 [pdf, other]
Title: Human-AI Teaming Co-Learning in Military Operations
Clara Maathuis, Kasper Cools
Comments: Submitted to Sensors + Imaging; presented on 18th of September (Artificial Intelligence for Security and Defence Applications III)
Subjects: Artificial Intelligence (cs.AI)
[17] arXiv:2510.01800 [pdf, html, other]
Title: REBot: From RAG to CatRAG with Semantic Enrichment and Graph Routing
Thanh Ma, Tri-Tam La, Lam-Thu Le Huu, Minh-Nghi Nguyen, Khanh-Van Pham Luu, Huu-Hoa Nguyen
Subjects: Artificial Intelligence (cs.AI)
[18] arXiv:2510.01751 [pdf, other]
Title: A cybersecurity AI agent selection and decision support framework
Masike Malatji
Comments: 6 figures, 6 tables, AI agents decision support framework
Subjects: Artificial Intelligence (cs.AI)
[19] arXiv:2510.01724 [pdf, other]
Title: MetaboT: AI-based agent for natural language-based interaction with metabolomics knowledge graphs
Madina Bekbergenova (ICN), Lucas Pradi (ICN), Benjamin Navet (ICN), Emma Tysinger (ICN), Franck Michel (WIMMICS), Matthieu Feraud (ICN), Yousouf Taghzouti (ICN, WIMMICS), Yan Zhou Chen, Olivier Kirchhoffer (UNIGE), Florence Mehl (SIB), Martin Legrand (ICN), Tao Jiang (ICN), Marco Pagni (SIB), Soha Hassoun, Jean-Luc Wolfender (UNIGE), Wout Bittremieux (UA), Fabien Gandon (WIMMICS, Laboratoire I3S - SPARKS), Louis-Félix Nothias (CNRS, UniCA, ICN)
Journal-ref: ISMB/ECCB 2025, Jul 2025, Liverpool, United Kingdom
Subjects: Artificial Intelligence (cs.AI)
[20] arXiv:2510.01700 [pdf, html, other]
Title: VaPR -- Vision-language Preference alignment for Reasoning
Rohan Wadhawan, Fabrice Y Harel-Canada, Zi-Yi Dou, Suhaila Shakiah, Robinson Piramuthu, Nanyun Peng
Journal-ref: COLM 2025
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[21] arXiv:2510.01687 [pdf, html, other]
Title: Improving AGI Evaluation: A Data Science Perspective
John Hawkins
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[22] arXiv:2510.01671 [pdf, other]
Title: A Locally Executable AI System for Improving Preoperative Patient Communication: A Multi-Domain Clinical Evaluation
Motoki Sato (Nagasaki University, Japan), Yuki Matsushita (Nagasaki University, Japan), Hidekazu Takahashi (Boston Medical Sciences, Tokyo, Japan), Tomoaki Kakazu (Showa Medical University Koto Toyosu Hospital, Japan), Sou Nagata (Nagasaki University, Japan), Mizuho Ohnuma (Nagasaki University, Japan), Atsushi Yoshikawa (Kanto Gakuin University, Japan), Masayuki Yamamura (Institute of Science Tokyo, Japan)
Comments: 32 pages, 4 figures, 10 tables. This paper is currently under review at ACM Transactions on Computing for Healthcare. Reproducibility resources: this http URL
Subjects: Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[23] arXiv:2510.01670 [pdf, html, other]
Title: Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness
Erfan Shayegani, Keegan Hines, Yue Dong, Nael Abu-Ghazaleh, Roman Lutz, Spencer Whitehead, Vidhisha Balachandran, Besmira Nushi, Vibhav Vineet
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computers and Society (cs.CY); Machine Learning (cs.LG)
[24] arXiv:2510.01664 [pdf, html, other]
Title: GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents
Yejin Kim, Youngbin Lee, Juhyeong Kim, Yongjae Lee
Comments: 7 pages, 2 figures
Journal-ref: CIKM 2025 Workshop on Advances in Financial AI: Innovations, Risk, and Responsibility in the Era of LLMs
Subjects: Artificial Intelligence (cs.AI)
[25] arXiv:2510.01639 [pdf, html, other]
Title: Understanding the Geospatial Reasoning Capabilities of LLMs: A Trajectory Recovery Perspective
Thinh Hung Truong, Jey Han Lau, Jianzhong Qi
Subjects: Artificial Intelligence (cs.AI)
[26] arXiv:2510.01620 [pdf, html, other]
Title: Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CDMPs
Peidong Liu, Junjiang Lin, Shaowen Wang, Yao Xu, Haiqing Li, Xuhao Xie, Siyi Wu, Hao Li
Subjects: Artificial Intelligence (cs.AI)
[27] arXiv:2510.01611 [pdf, html, other]
Title: PychoBench: Evaluating the Psychology Intelligence of Large Language Models
Min Zeng
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[28] arXiv:2510.01609 [pdf, html, other]
Title: AgentRec: Next-Generation LLM-Powered Multi-Agent Collaborative Recommendation with Adaptive Intelligence
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Lau
Subjects: Artificial Intelligence (cs.AI)
[29] arXiv:2510.01586 [pdf, html, other]
Title: AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Zhenyu Pan, Yiting Zhang, Zhuo Liu, Yolo Yunlong Tang, Zeliang Zhang, Haozheng Luo, Yuwei Han, Jianshu Zhang, Dennis Wu, Hong-Yu Chen, Haoran Lu, Haoyang Fang, Manling Li, Chenliang Xu, Philip S. Yu, Han Liu
Subjects: Artificial Intelligence (cs.AI)
[30] arXiv:2510.01569 [pdf, html, other]
Title: InvThink: Towards AI Safety via Inverse Reasoning
Yubin Kim, Taehan Kim, Eugene Park, Chunjong Park, Cynthia Breazeal, Daniel McDuff, Hae Won Park
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[31] arXiv:2510.01544 [pdf, html, other]
Title: Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models
Shaoan Xie, Lingjing Kong, Xiangchen Song, Xinshuai Dong, Guangyi Chen, Eric P. Xing, Kun Zhang
Subjects: Artificial Intelligence (cs.AI)
[32] arXiv:2510.01531 [pdf, html, other]
Title: Information Seeking for Robust Decision Making under Partial Observability
Djengo Cyun-Jyun Fang, Tsung-Wei Ke
Comments: The project page is available at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[33] arXiv:2510.01530 [pdf, html, other]
Title: LOGicalThought: Logic-Based Ontological Grounding of LLMs for High-Assurance Reasoning
Navapat Nananukul, Yue Zhang, Ryan Lee, Eric Boxer, Jonathan May, Vibhav Giridhar Gogate, Jay Pujara, Mayank Kejriwal
Subjects: Artificial Intelligence (cs.AI)
[34] arXiv:2510.01528 [pdf, html, other]
Title: Towards Interpretable and Inference-Optimal COT Reasoning with Sparse Autoencoder-Guided Generation
Daniel Zhao, Abhilash Shankarampeta, Lanxiang Hu, Tajana Rosing, Hao Zhang
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[35] arXiv:2510.01500 [pdf, html, other]
Title: Lateral Tree-of-Thoughts Surpasses ToT by Incorporating Logically-Consistent, Low-Utility Candidates
Abhinav Madahar
Subjects: Artificial Intelligence (cs.AI)
[36] arXiv:2510.01474 [pdf, html, other]
Title: AIReg-Bench: Benchmarking Language Models That Assess AI Regulation Compliance
Bill Marino, Rosco Hunter, Zubair Jamali, Marinos Emmanouil Kalpakos, Mudra Kashyap, Isaiah Hinton, Alexa Hanson, Maahum Nazir, Christoph Schnabl, Felix Steffek, Hongkai Wen, Nicholas D. Lane
Subjects: Artificial Intelligence (cs.AI)
[37] arXiv:2510.01444 [pdf, html, other]
Title: VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Rui Liu, Dian Yu, Tong Zheng, Runpeng Dai, Zongxia Li, Wenhao Yu, Zhenwen Liang, Linfeng Song, Haitao Mi, Pratap Tokekar, Dong Yu
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[38] arXiv:2510.01432 [pdf, html, other]
Title: On the Role of Domain Experts in Creating Effective Tutoring Systems
Sarath Sreedharan, Kelsey Sikes, Nathaniel Blanchard, Lisa Mason, Nikhil Krishnaswamy, Jill Zarestky
Comments: Accepted to AIED 2025 Blue Sky Track
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[39] arXiv:2510.01427 [pdf, html, other]
Title: A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
Sipeng Zhang, Longfei Yun, Zilong Wang, Jingbo Shang, Letian Peng
Subjects: Artificial Intelligence (cs.AI)
[40] arXiv:2510.01409 [pdf, html, other]
Title: OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models
Luca Cotti, Idilio Drago, Anisa Rula, Devis Bianchini, Federico Cerutti
Comments: 20 pages, 6 tables, 7 figures
Subjects: Artificial Intelligence (cs.AI)
[41] arXiv:2510.01398 [pdf, html, other]
Title: Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents
Yang Liu, Zaid Abulawi, Abhiram Garimidi, Doyeong Lim
Subjects: Artificial Intelligence (cs.AI)
[42] arXiv:2510.01375 [pdf, other]
Title: Fine-tuning with RAG for Improving LLM Learning of New Skills
Humaid Ibrahim, Nikolai Rozanov, Marek Rei
Comments: Under review at ICLR 2026
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[43] arXiv:2510.01367 [pdf, other]
Title: Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort
Xinpeng Wang, Nitish Joshi, Barbara Plank, Rico Angell, He He
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[44] arXiv:2510.01363 [pdf, other]
Title: Retrieval-Augmented Framework for LLM-Based Clinical Decision Support
Leon Garza, Anantaa Kotal, Michael A. Grasso, Emre Umucu
Subjects: Artificial Intelligence (cs.AI)
[45] arXiv:2510.01353 [pdf, other]
Title: MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments
Darshan Deshpande, Varun Gangal, Hersh Mehta, Anand Kannappan, Rebecca Qian, Peng Wang
Comments: Accepted to NeurIPS 2025 SEA Workshop
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[46] arXiv:2510.01346 [pdf, html, other]
Title: Aristotle: IMO-level Automated Theorem Proving
Tudor Achim, Alex Best, Kevin Der, Mathïs Fédérico, Sergei Gukov, Daniel Halpern-Leister, Kirsten Henningsgard, Yury Kudryashov, Alexander Meiburg, Martin Michelsen, Riley Patterson, Eric Rodriguez, Laura Scharff, Vikram Shanker, Vladmir Sicca, Hari Sowrirajan, Aidan Swope, Matyas Tamas, Vlad Tenev, Jonathan Thomm, Harold Williams, Lawrence Wu
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[47] arXiv:2510.01304 [pdf, html, other]
Title: Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models
Yu Zeng, Wenxuan Huang, Shiting Huang, Xikun Bao, Yukun Qi, Yiming Zhao, Qiuchen Wang, Lin Chen, Zehui Chen, Huaian Chen, Wanli Ouyang, Feng Zhao
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[48] arXiv:2510.01295 [pdf, html, other]
Title: The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation
Zarreen Reza
Comments: 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling
Subjects: Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[49] arXiv:2510.01293 [pdf, html, other]
Title: Cyber Academia-Chemical Engineering (CA-ChemE): A Living Digital Town for Self-Directed Research Evolution and Emergent Scientific Discovery
Zekun Jiang, Chunming Xu, Tianhang Zhou
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[50] arXiv:2510.01272 [pdf, html, other]
Title: Modeling Others' Minds as Code
Kunal Jha, Aydan Yuenan Huang, Eric Ye, Natasha Jaques, Max Kleiman-Weiner
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)