cv-arxiv-daily

Updated on 2026.04.29

Usage instructions: here

Table of Contents

<a href=#visual-localization>Visual Localization</a>
<a href=#point-cloud-place-recognition>Point Cloud Place Recognition</a>
<a href=#cross-modality-localization>Cross-modality Localization</a>
<a href=#3d-gs>3D GS</a>
<a href=#autonomous-driving>Autonomous Driving</a>
<a href=#map>Map</a>
<a href=#non-rigid-registration>Non-rigid Registration</a>
<a href=#moe>MoE</a>
<a href=#mamba>Mamba</a>
<a href=#hsi-classification>HSI Classification</a>

Visual Localization

Publish Date	Title	Authors	PDF	Code
2026-04-28	GeoSearch: Augmenting Worldwide Geolocalization with Web-Scale Reverse Image Search and Image Matching	Tung-Duong Le-Duc et.al.	2604.25390	null
2026-04-28	COMPASS: COmpact Multi-channel Prior-map And Scene Signature for Floor-Plan-Based Visual Localization	Muhammad Shaheer et.al.	2604.25388	null
2026-04-27	Geometric Analysis of Self-Supervised Vision Representations for Semantic Image Retrieval	Esteban Rodríguez-Betancourt et.al.	2604.24469	null
2026-04-24	Region Matters: Efficient and Reliable Region-Aware Visual Place Recognition	Shunpeng Chen et.al.	2604.22390	null
2026-04-24	Revisiting Geometric Obfuscation with Dual Convergent Lines for Privacy-Preserving Image Queries in Visual Localization	Jeonggon Kim et.al.	2604.22310	null
2026-04-24	TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval	Zixu Li et.al.	2604.21806	link
2026-04-22	StarLoc: Pinpointing Transmitting LEO Satellites from a Single Passive Array	Ishani Janveja et.al.	2604.21147	null
2026-04-22	ConeSep: Cone-based Robust Noise-Unlearning Compositional Network for Composed Image Retrieval	Zixu Li et.al.	2604.20358	link
2026-04-22	UniCVR: From Alignment to Reranking for Unified Zero-Shot Composed Visual Retrieval	Haokun Wen et.al.	2604.20318	null
2026-04-22	Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval	Zhiheng Fu et.al.	2604.19386	null
2026-04-20	Colour Extraction Pipeline for Odonates using Computer Vision	Megan Mirnalini Sundaram Rajaraman et.al.	2604.18725	null
2026-04-20	T-REN: Learning Text-Aligned Region Tokens Improves Dense Vision-Language Alignment and Scalability	Savya Khosla et.al.	2604.18573	null
2026-04-20	S2H-DPO: Hardness-Aware Preference Optimization for Vision-Language Models	Nitish Shukla et.al.	2604.18512	null
2026-04-20	INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval	Zhiwei Chen et.al.	2604.18051	null
2026-04-20	HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval	Zixu Li et.al.	2604.18037	null
2026-04-20	Brain-Inspired Capture: Evidence-Driven Neuromimetic Perceptual Simulation for Visual Decoding	Feixue Shao et.al.	2604.17927	null
2026-04-20	ReTrack: Evidence-Driven Dual-Stream Directional Anchor Calibration Network for Composed Video Retrieval	Zixu Li et.al.	2604.17898	null
2026-04-20	Subject-Aware Multi-Granularity Alignment for Zero-Shot EEG-to-Image Retrieval	Lin Jiang et.al.	2604.17782	null
2026-04-18	PPEDCRF: Dynamic-CRF-Guided Selective Perturbation for Background-Based Location Privacy in Video Sequences	Bo Ma et.al.	2604.17163	null
2026-04-18	mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval	Kyeong Seon Kim et.al.	2604.17054	null
2026-04-18	KIRA: Knowledge-Intensive Image Retrieval and Reasoning Architecture for Specialized Visual Domains	Parthaw Goswami et.al.	2604.16915	null
2026-04-17	Where Do Vision-Language Models Fail? World Scale Analysis for Image Geolocalization	Siddhant Bharadwaj et.al.	2604.16248	null
2026-04-17	Continual Hand-Eye Calibration for Open-world Robotic Manipulation	Fazeng Li et.al.	2604.15814	null
2026-04-17	Sketch and Text Synergy: Fusing Structural Contours and Descriptive Attributes for Fine-Grained Image Retrieval	Siyuan Wang et.al.	2604.15735	null
2026-04-16	G-MIXER: Geodesic Mixup-based Implicit Semantic Expansion and Explicit Semantic Re-ranking for Zero-Shot Composed Image Retrieval	Jiyoung Lim et.al.	2604.14710	null
2026-04-15	Comprehensive Review of Doppler Shift Localization Methods: Advances, Limitations, and Research Opportunities	Rafal Szczepanik et.al.	2604.14413	null
2026-04-15	SceneGlue: Scene-Aware Transformer for Feature Matching without Scene-Level Annotation	Songlin Du et.al.	2604.13941	null
2026-04-14	Indexing Multimodal Language Models for Large-scale Image Retrieval	Bahey Tharwat et.al.	2604.13268	null
2026-04-14	A Sanity Check on Composed Image Retrieval	Yikun Liu et.al.	2604.12904	null
2026-04-14	VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale	Parth Parag Kulkarni et.al.	2604.12159	null
2026-04-13	Human-Inspired Context-Selective Multimodal Memory for Social Robots	Hangyeol Kang et.al.	2604.12081	null
2026-04-13	Privacy-Preserving Structureless Visual Localization via Image Obfuscation	Vojtech Panek et.al.	2604.12068	null
2026-04-13	Seeing Through Touch: Tactile-Driven Visual Localization of Material Regions	Seongyu Kim et.al.	2604.11579	null
2026-04-13	CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space	Sohwi Lim et.al.	2604.11539	null
2026-04-11	Buried Fiber-Optic Geolocalization with Distributed Acoustic Sensing	Khen Cohen et.al.	2604.10331	null
2026-04-11	FashionMV: Product-Level Composed Image Retrieval with Multi-View Fashion Data	Peng Yuan et.al.	2604.10297	null
2026-04-10	AsymLoc: Towards Asymmetric Feature Matching for Efficient Visual Localization	Mohammad Omama et.al.	2604.09445	null
2026-04-10	FIRE-CIR: Fine-grained Reasoning for Composed Fashion Image Retrieval	François Gardères et.al.	2604.09114	null
2026-04-10	Towards Lifelong Aerial Autonomy: Geometric Memory Management for Continual Visual Place Recognition in Dynamic Environments	Xingyu Shao et.al.	2604.09038	null
2026-04-10	Skill-Conditioned Visual Geolocation for Vision-Language	Chenjie Yang et.al.	2604.09025	null
2026-04-07	Pretrain-then-Adapt: Uncertainty-Aware Test-Time Adaptation for Text-based Person Search	Jiahao Zhang et.al.	2604.08598	null
2026-04-10	Bag of Bags: Adaptive Visual Vocabularies for Genizah Join Image Retrieval	Sharva Gogawale et.al.	2604.08138	null
2026-04-09	SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving	Felix Embacher et.al.	2604.08008	null
2026-04-09	Learning to Search: A Decision-Based Agent for Knowledge-Based Visual Question Answering	Zhuohong Chen et.al.	2604.07146	null
2026-04-08	VGGT-SLAM++	Avilasha Mandal et.al.	2604.06830	null
2026-04-06	CraterBench-R: Instance-Level Crater Retrieval for Planetary Scale	Jichao Fang et.al.	2604.06245	null
2026-04-08	Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models	Zonghao Ying et.al.	2604.05853	null
2026-04-07	Stealthy and Adjustable Text-Guided Backdoor Attacks on Multimodal Pretrained Models	Yiyang Zhang et.al.	2604.05809	null
2026-04-07	Conditional Publics: Shared Events and Divergent Meanings in the European Twitter Debate on the Ukraine War	Corrado Monti et.al.	2604.05800	null
2026-04-07	WRF4CIR: Weight-Regularized Fine-Tuning Network for Composed Image Retrieval	Yizhuo Xu et.al.	2604.05583	null
2026-04-07	LSGS-Loc: Towards Robust 3DGS-Based Visual Localization for Large-Scale UAV Scenarios	Xiang Zhang et.al.	2604.05402	null
2026-04-07	Beyond Semantic Search: Towards Referential Anchoring in Composed Image Retrieval	Yuxin Yang et.al.	2604.05393	null
2026-04-05	GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces	Xinyu Geng et.al.	2604.04017	null
2026-03-24	Event-Driven Neuromorphic Vision Enables Energy-Efficient Visual Place Recognition	Geoffroy Keime et.al.	2604.03277	null
2026-04-08	EarthEmbeddingExplorer: A Web Application for Cross-Modal Retrieval of Global Satellite Images	Yijie Zheng et.al.	2603.29441	null
2026-03-31	MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network	Guozhi Qiu et.al.	2603.29291	null
2026-03-30	The Problem of Dynamic Spatial Sampling and Geofence Surveillance	Marty Davidson et.al.	2603.28958	null
2026-03-29	RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization	Junwei Zheng et.al.	2603.27758	null
2026-03-29	NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries	Mahdi Erfanian et.al.	2603.27464	null
2026-03-28	Zero-shot Vision-Language Reranking for Cross-View Geolocalization	Yunus Talha Erzurumlu et.al.	2603.27251	null
2026-03-27	Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones	Moritz Nottebaum et.al.	2603.26551	null
2026-03-27	HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network	Mingyu Zhang et.al.	2603.26341	null
2026-03-26	Bayesian Deep Count Regression and Anomaly Detection: Evidence from GDELT Event Panels	Hsin-Hsiung Huang et.al.	2603.25970	null
2026-03-26	Few Shots Text to Image Retrieval: New Benchmarking Dataset and Optimization Methods	Ofer Idan et.al.	2603.25891	null
2026-03-26	Just Zoom In: Cross-View Geo-Localization via Autoregressive Zooming	Yunus Talha Erzurumlu et.al.	2603.25686	null
2026-03-26	On-Demand Instructional Material Providing Agent Based on MLLM for Tutoring Support	Takumi Kato et.al.	2603.25195	null
2026-03-28	TIGeR: A Unified Framework for Time, Images and Geo-location Retrieval	David G. Shatwell et.al.	2603.24749	null
2026-03-25	GeoRouter: Dynamic Paradigm Routing for Worldwide Image Geolocalization	Pengyue Jia et.al.	2603.24376	null
2026-03-25	Combi-CAM: A Novel Multi-Layer Approach for Explainable Image Geolocalization	David Faget et.al.	2603.24117	null
2026-03-24	Sparse Autoencoders for Interpretable Medical Image Representation Learning	Philipp Wesp et.al.	2603.23794	null
2026-03-24	ARGENT: Adaptive Hierarchical Image-Text Representations	Chuong Huynh et.al.	2603.23311	null
2026-03-24	Retrieval-Guided Photovoltaic Inventory Estimation from Satellite Imagery for Distribution Grid Planning	Muhao Guo et.al.	2603.22856	null
2026-03-24	SOUPLE: Enhancing Audio-Visual Localization and Segmentation with Learnable Prompt Contexts	Khanh Binh Nguyen et.al.	2603.22732	null
2026-03-24	HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment	Sangmin Jo et.al.	2603.22721	null
2026-03-23	GeoFlow: Real-Time Fine-Grained Cross-View Geolocalization via Iterative Flow Prediction	Ayesh Abu Lehyeh et.al.	2603.21943	null
2026-03-23	ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive Text-to-Image Retrieval	Zhuocheng Zhang et.al.	2603.21886	null
2026-03-21	SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval	Qunjie Huang et.al.	2603.20738	null
2026-03-21	A Multihead Continual Learning Framework for Fine-Grained Fashion Image Retrieval with Contrastive Learning and Exponential Moving Average Distillation	Ling Xiao et.al.	2603.20648	null
2026-03-20	IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment	Simone Magistri et.al.	2603.19862	null
2026-03-20	IUP-Pose: Decoupled Iterative Uncertainty Propagation for Real-time Relative Pose Regression via Implicit Dense Alignment	Jun Wang et.al.	2603.19625	null
2026-03-24	LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment	Shuaibang Peng et.al.	2603.19609	null
2026-03-19	Mapping the Midweek Mountain: The New Geography of Hybrid Work	Norman Guo et.al.	2603.18440	null
2026-03-18	MCoT-MVS: Multi-level Vision Selection by Multi-modal Chain-of-Thought Reasoning for Composed Image Retrieval	Xuri Ge et.al.	2603.17360	null
2026-03-17	Visual Product Search Benchmark	Karthik Sulthanpete Govindappa et.al.	2603.17186	null
2026-03-17	Retrieving Counterfactuals Improves Visual In-Context Learning	Guangzhi Xiong et.al.	2603.16737	null
2026-03-17	HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture	Aojie Yuan et.al.	2603.16679	null
2026-03-17	Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty	Mangyu Kong et.al.	2603.16538	null
2026-03-17	Geometric Search for Hawking Radiation from Nearby Primordial Black Holes	Shuo Xiao et.al.	2603.16508	null
2026-03-18	VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents	Zhengbo Zhang et.al.	2603.16289	null
2026-03-14	Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics	Dennis Haitz et.al.	2603.13917	null
2026-03-14	Sky2Ground: A Benchmark for Site Modeling under Varying Altitude	Zengyan Wang et.al.	2603.13740	null
2026-03-13	Design and evaluation of an agentic workflow for crisis-related synthetic tweet datasets	Roben Delos Reyes et.al.	2603.13625	null
2026-03-13	A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks	Tangzheng Lian et.al.	2603.12998	null
2026-03-13	Text-Phase Synergy Network with Dual Priors for Unsupervised Cross-Domain Image Retrieval	Jing Yang et.al.	2603.12711	null
2026-03-13	CM-Bench: A Comprehensive Cross-Modal Feature Matching Benchmark Bridging Visible and Infrared Images	Liangzheng Sun et.al.	2603.12690	null
2026-03-12	Unequal changes in commuting patterns across socio-economic strata in response to pandemic restrictions	Cristiano Marinelli et.al.	2603.11758	null
2026-03-12	FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval	Chenchen Zhao et.al.	2603.11520	null
2026-03-12	Efficient Cross-View Localization in 6G Space-Air-Ground Integrated Network	Min Hao et.al.	2603.11398	null
2026-03-11	Imaging flat band electron hydrodynamics in biased bilayer graphene	Canxun Zhang et.al.	2603.11175	null
2026-03-11	Learning to Wander: Improving the Global Image Geolocation Ability of LMMs via Actionable Reasoning	Yushuo Zheng et.al.	2603.10463	link
2026-03-10	Composed Vision-Language Retrieval for Skin Cancer Case Search via Joint Alignment of Global and Local Representations	Yuheng Wang et.al.	2603.09108	null
2026-03-09	Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling	Bowen Liu et.al.	2603.08063	null
2026-03-09	$L^3$: Scene-agnostic Visual Localization in the Wild	Yu Zhang et.al.	2603.07937	null
2026-03-08	Fluctuation imaging of disorder in monolayer semiconductors	Tom T. C. Sistermans et.al.	2603.07418	null
2026-03-08	QdaVPR: A novel query-based domain-agnostic model for visual place recognition	Shanshan Wan et.al.	2603.07414	null
2026-03-06	EventGeM: Global-to-Local Feature Matching for Event-Based Visual Place Recognition	Adam D. Hines et.al.	2603.05807	null
2026-03-06	Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval	Donghoon Han et.al.	2603.05781	null
2026-03-05	Interpretable Perception and Reasoning for Audiovisual Geolocation	Yiyang Su et.al.	2603.05708	null
2026-03-04	PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing	Rohan Mahadev et.al.	2603.04598	null
2026-03-04	SSR: A Generic Framework for Text-Aided Map Compression for Localization	Mohammad Omama et.al.	2603.04272	null
2026-03-04	Long-Term Visual Localization in Dynamic Benthic Environments: A Dataset, Footprint-Based Ground Truth, and Visual Place Recognition Benchmark	Martin Kvisvik Larsen et.al.	2603.04056	null
2026-03-04	HE-VPR: Height Estimation Enabled Aerial Visual Place Recognition Against Scale Variance	Mengfan He et.al.	2603.04050	null
2026-03-04	DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative Negative Sampling in Composed Image Retrieval	Geon Park et.al.	2603.04037	null
2026-03-03	From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes	Qifan Zhang et.al.	2603.03577	null
2026-03-03	LOO-PIT predictive model checking	Herman Tesso et.al.	2603.02928	null
2026-03-03	Cross-view geo-localization, Image retrieval, Multiscale geometric modeling, Frequency domain enhancement	Hongying Zhang et.al.	2603.02726	null
2026-03-02	Contributions of geolocated weather and building related data for insurance assessment of flood risks	Mulah Moriah et.al.	2603.02418	null
2026-03-02	GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis	Srikumar Sastry et.al.	2603.02172	null
2026-03-02	Learning to Read Where to Look: Disease-Aware Vision-Language Pretraining for 3D CT	Simon Ging et.al.	2603.02026	null
2026-03-02	Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning	Haonan Jia et.al.	2603.01696	null
2026-03-01	MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning	Eileen Wang et.al.	2603.01055	null
2026-02-28	Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning	Ruoshuang Du et.al.	2603.00511	null
2026-02-27	Altitude-Aware Visual Place Recognition in Top-Down View	Xingyu Shao et.al.	2602.23872	null
2026-02-26	VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale	Sven Elflein et.al.	2602.23361	null
2026-03-07	WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval	Tianyue Wang et.al.	2602.23029	null
2026-02-26	Autoregressive Visual Decoding from EEG Signals	Sicheng Dai et.al.	2602.22555	null
2026-02-26	Pix2Key: Controllable Open-Vocabulary Retrieval with Semantic Decomposition and Self-Supervised Visual Dictionary Learning	Guoyizhe Wei et.al.	2602.22510	null
2026-02-25	Global-Aware Edge Prioritization for Pose Graph Initialization	Tong Wei et.al.	2602.21963	null
2026-03-04	Automatic Map Density Selection for Locally-Performant Visual Place Recognition	Somayeh Hussaini et.al.	2602.21473	null
2026-02-24	Seeing Through Words: Controlling Visual Retrieval Quality with Language Models	Jianglin Lu et.al.	2602.21175	null
2026-02-24	Long-Term Multi-Session 3D Reconstruction Under Substantial Appearance Change	Beverley Gorry et.al.	2602.20584	null
2026-02-23	Unlocking Multimodal Document Intelligence: From Current Triumphs to Future Frontiers of Visual Document Retrieval	Yibo Yan et.al.	2602.19961	null
2026-02-23	Evaluating the Impact of Data Anonymization on Image Retrieval	Marvin Chen et.al.	2602.19641	null
2026-02-22	Knowledge-aware Visual Question Generation for Remote Sensing Images	Siran Li et.al.	2602.19224	null
2026-02-22	Questions beyond Pixels: Integrating Commonsense Knowledge in Visual Question Generation for Remote Sensing	Siran Li et.al.	2602.19217	null
2026-02-19	VQPP: Video Query Performance Prediction Benchmark	Adrian Catalin Lutu et.al.	2602.17814	null
2026-02-19	Visual Model Checking: Graph-Based Inference of Visual Routines for Image Retrieval	Adrià Molina et.al.	2602.17386	null
2026-02-18	SCAR: Satellite Imagery-Based Calibration for Aerial Recordings	Henry Hölzemann et.al.	2602.16349	null
2026-02-17	Automated Assessment of Kidney Ureteroscopy Exploration for Training	Fangjie Li et.al.	2602.15988	null
2026-02-17	Privacy-Preserving and Secure Spectrum Sharing for Database-Driven Cognitive Radio Networks	Saleh Darzia et.al.	2602.15705	null
2026-02-17	GMAIL: Generative Modality Alignment for generated Image Learning	Shentong Mo et.al.	2602.15368	null
2026-02-16	AIC CTU@AVerImaTeC: dual-retriever RAG for image-text fact checking	Herbert Ullrich et.al.	2602.15190	null
2026-02-16	Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery	Chandrakanth Gudavalli et.al.	2602.14929	null
2026-02-15	Towards Spatial Transcriptomics-driven Pathology Foundation Models	Konstantin Hemker et.al.	2602.14177	null
2026-02-14	High-fidelity 3D reconstruction for planetary exploration	Alfonso Martínez-Petersen et.al.	2602.13909	null
2026-02-14	A Deep Convolutional Network to Extract Real-Time Landmarks for UAV Navigation	Osman Tokluoglu et.al.	2602.13814	null
2026-02-13	InfoCIR: Multimedia Analysis for Composed Image Retrieval	Ioannis Dravilas et.al.	2602.13402	null
2026-02-13	EPRBench: A High-Quality Benchmark Dataset for Event Stream Based Visual Place Recognition	Xiao Wang et.al.	2602.12919	null
2026-02-13	GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics	Modi Jin et.al.	2602.12617	null
2026-02-12	DiffPlace: Street View Generation via Place-Controllable Diffusion Model Enhancing Place Recognition	Ji Li et.al.	2602.11875	null
2026-02-12	Arbitrary Ratio Feature Compression via Next Token Prediction	Yufan Liu et.al.	2602.11494	null
2026-02-11	WHEREIS: IP Address Registration Geo-Consistency	Robert Beverly et.al.	2602.11102	null
2026-02-11	DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories	Chenlong Deng et.al.	2602.10809	null
2026-02-09	Large Language Models for Geolocation Extraction in Humanitarian Crisis Response	G. Cafferata et.al.	2602.08872	null
2026-02-09	OSCAR: Optimization-Steered Agentic Planning for Composed Image Retrieval	Teng Wang et.al.	2602.08603	null
2026-02-16	NovaMoon: A Strategic Lunar Reference Station for Positioning, Timing, and Largely Enhanced Science in the Earth-Moon System	Serena Molli et.al.	2602.08432	null
2026-02-09	A Sketch+Text Composed Image Retrieval Dataset for Thangka	Jinyu Xu et.al.	2602.08411	null
2026-02-09	UrbanGraphEmbeddings: Learning and Evaluating Spatially Grounded Multimodal Embeddings for Urban Science	Jie Zhang et.al.	2602.08342	null
2026-02-10	WristMIR: Coarse-to-Fine Region-Aware Retrieval of Pediatric Wrist Radiographs with Radiology Report-Driven Learning	Mert Sonmezer et.al.	2602.07872	null
2026-02-04	Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?	Ruixin Yang et.al.	2602.05023	null
2026-02-04	SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation	David F. Ramirez et.al.	2602.04712	null
2026-02-05	SDR-CIR: Semantic Debias Retrieval Framework for Training-Free Zero-Shot Composed Image Retrieval	Yi Sun et.al.	2602.04451	null
2026-02-04	Quantile Transfer for Reliable Operating Point Selection in Visual Place Recognition	Dhyey Manish Rajani et.al.	2602.04401	null
2026-02-04	Beyond Static Cropping: Layer-Adaptive Visual Localization and Decoding Enhancement	Zipeng Zhu et.al.	2602.04304	null
2026-02-03	LaVPR: Benchmarking Language and Vision for Place Recognition	Ofer Idan et.al.	2602.03253	null
2026-02-03	ObjEmbed: Towards Universal Multimodal Object Embeddings	Shenghao Fu et.al.	2602.01753	link
2026-02-02	Real-Time Loop Closure Detection in Visual SLAM via NetVLAD and Faiss	Enguang Fan et.al.	2602.01673	null
2026-02-02	ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval	Tianyu Yang et.al.	2602.01639	null
2026-02-01	Interacted Planes Reveal 3D Line Mapping	Zeran Ke et.al.	2602.01296	null
2026-02-05	Invariance on Manifolds: Understanding Robust Visual Representations for Place Recognition	Jintao Cheng et.al.	2602.00841	null
2026-02-03	Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval	Tong Wang et.al.	2602.00813	null
2026-01-31	VVLoc: Prior-free 3-DoF Vehicle Visual Localization	Ze Huang et.al.	2602.00810	null
2026-01-31	Audio-to-Image Bird Species Retrieval without Audio-Image Pairs via Text Distillation	Ilyass Moummad et.al.	2602.00681	null
2026-01-30	HierLoc: Hyperbolic Entity Embeddings for Hierarchical Visual Geolocation	Hari Krishna Gadi et.al.	2601.23064	null
2026-01-30	Compact Hypercube Embeddings for Fast Text-based Wildlife Observation Retrieval	Ilyass Moummad et.al.	2601.22783	null
2026-01-29	Variance & Greediness: A comparative study of metric-learning losses	Donghuo Zeng et.al.	2601.21450	null
2026-01-29	GeoRC: A Benchmark for Geolocation Reasoning Chains	Mohit Talreja et.al.	2601.21278	null
2026-01-28	When Vision Meets Texts in Listwise Reranking	Hongyi Cai et.al.	2601.20623	null
2026-01-28	Eliminating Hallucination in Diffusion-Augmented Interactive Text-to-Image Retrieval	Zhuocheng Zhang et.al.	2601.20391	null
2026-01-30	VGGT-SLAM 2.0: Real-time Dense Feed-forward Scene Reconstruction	Dominic Maggio et.al.	2601.19887	null
2026-01-27	LocationAgent: A Hierarchical Agent for Image Geolocation via Decoupling Strategy and Evidence from Parametric Knowledge	Qiujun Li et.al.	2601.19155	null
2026-01-27	Pixel-Grounded Retrieval for Knowledgeable Large Multimodal Models	Jeonghwan Kim et.al.	2601.19060	null
2026-01-25	A Multi-Modal Fusion Platform for Joint Environment Sensing and Channel Sounding in Highly Dynamic Scenarios	Xuejian Zhang et.al.	2601.17809	null
2026-01-23	X-Aligner: Composed Visual Retrieval without the Bells and Whistles	Yuqian Zheng et.al.	2601.16582	null
2026-01-22	Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing	Tingyu Song et.al.	2601.16125	null
2026-01-21	Unified Multimodal and Multilingual Retrieval via Multi-Task Learning with NLU Integration	Xinyuan Zhang et.al.	2601.14714	null
2026-01-21	LookBench: A Live and Holistic Open Benchmark for Fashion Image Retrieval	Chao Gao et.al.	2601.14706	null
2026-01-20	XR: Cross-Modal Agents for Composed Image Retrieval	Zhongyu Yang et.al.	2601.14245	null
2026-01-20	Fine-Grained Zero-Shot Composed Image Retrieval with Complementary Visual-Semantic Integration	Yongcong Ye et.al.	2601.14060	null
2026-01-20	Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning	Hongbo Bai et.al.	2601.13942	null
2026-01-19	DC-VLAQ: Query-Residual Aggregation for Robust Visual Place Recognition	Hanyu Zhu et.al.	2601.12729	null
2026-01-18	Abusing the Internet of Medical Things: Evaluating Threat Models and Forensic Readiness for Multi-Vector Attacks on Connected Healthcare Devices	Isabel Straw et.al.	2601.12593	null
2026-01-17	SupScene: Learning Overlap-Aware Global Descriptor for Unconstrained SfM	Xulei Shi et.al.	2601.11930	null
2026-01-22	Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning	Haomiao Tang et.al.	2601.11393	null
2026-01-16	Simple Models, Rich Representations: Visual Decoding from Primate Intracortical Neural Signals	Matteo Ciferri et.al.	2601.11108	null
2026-01-20	Multilingual-To-Multimodal (M2M): Unlocking New Languages with Monolingual Text	Piyush Singh Pasi et.al.	2601.10096	null
2026-01-20	UniHash: Unifying Pointwise and Pairwise Hashing Paradigms for Seen and Unseen Category Retrieval	Xiaoxu Ma et.al.	2601.09828	null
2026-01-14	Hybrid guided variational autoencoder for visual place recognition	Ni Wang et.al.	2601.09248	null
2026-01-13	Spatial Context Improves the Integration of Text with Remote Sensing for Mapping Environmental Variables	Valerie Zermatten et.al.	2601.08750	null
2026-01-13	Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation	Kang Fu et.al.	2601.08311	null
2026-01-13	Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization	Miao Pan et.al.	2601.06224	link
2026-01-09	Descriptor: Multi-Regional Cloud Honeypot Dataset (MURHCAD)	Enrique Feito-Casares et.al.	2601.05813	null
2026-01-08	Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization	Yuxiang Ji et.al.	2601.05432	null
2026-01-08	Multi-task Cross-modal Learning for Chest X-ray Image Retrieval	Zhaohui Liang et.al.	2601.05399	null
2026-01-07	ImLoc: Revisiting Visual Localization with Image-based Representation	Xudong Jiang et.al.	2601.04185	null
2026-01-07	CSMCIR: CoT-Enhanced Symmetric Alignment with Memory Bank for Composed Image Retrieval	Zhipeng Qian et.al.	2601.03728	null
2026-01-07	BREATH-VL: Vision-Language-Guided 6-DoF Bronchoscopy Localization via Semantic-Geometric Fusion	Qingyao Tian et.al.	2601.03713	null
2026-01-07	HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps	Xuchang Zhong et.al.	2601.02730	null
2026-01-06	Loop Closure using AnyLoc Visual Place Recognition in DPV-SLAM	Wenzheng Zhang et.al.	2601.02723	null
2026-01-07	Comparative Analysis of Binarization Methods For Medical Image Hashing On Odir Dataset	Nedim Muzoglu et.al.	2601.02564	null
2026-01-04	Breadcrumbs in the Digital Forest: Tracing Criminals through Torrent Metadata with OSINT	Annelies de Jong et.al.	2601.01492	null
2026-01-05	Vision-Language Reasoning for Geolocalization: A Reinforcement Learning Approach	Biao Wu et.al.	2601.00388	null
2025-12-31	OCP-LS: An Efficient Algorithm for Visual Localization	Jindi Zhong et.al.	2512.24552	null
2025-12-29	Learning to Feel the Future: DreamTacVLA for Contact-Rich Manipulation	Guo Ye et.al.	2512.23864	null
2026-01-07	MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning	Jiawei Chen et.al.	2512.23412	null
2025-12-29	Anomaly Detection by Effectively Leveraging Synthetic Images	Sungho Kang et.al.	2512.23227	null
2025-12-26	Reloc-VGGT: Visual Re-localization with Geometry Grounded Transformer	Tianchen Deng et.al.	2512.21883	null
2025-12-24	Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval	Dao Sy Duy Minh et.al.	2512.21221	null
2025-12-28	UniPR-3D: Towards Universal Visual Place Recognition with Visual Geometry Grounded Transformer	Tianchen Deng et.al.	2512.21078	null
2025-12-23	Soft Filtering: Guiding Zero-shot Composed Image Retrieval with Prescriptive and Proscriptive Constraints	Youjin Jung et.al.	2512.20781	null
2025-12-23	Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark	Hao Guo et.al.	2512.20174	null
2025-12-23	Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach	Hao Li et.al.	2512.20056	null
2025-12-22	Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis	Argha Kamal Samanta et.al.	2512.19663	null
2025-12-22	Finer-Personalization Rank: Fine-Grained Retrieval Examines Identity Preservation for Personalized Generation	Connor Kilrain et.al.	2512.19026	null
2025-12-21	Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments	Saeideh Yousefzadeh et.al.	2512.18613	null
2025-12-20	Through the PRISm: Importance-Aware Scene Graphs for Image Retrieval	Dimitrios Georgoulopoulos et.al.	2512.18407	null
2025-12-20	GeoSense-AI: Fast Location Inference from Crisis Microblogs	Deepit Sapru et.al.	2512.18225	null
2025-12-19	MMLANDMARKS: a Cross-View Instance-Level Benchmark for Geo-Spatial Understanding	Oskar Kristoffersen et.al.	2512.17492	null
2025-12-19	Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors	Son Tung Nguyen et.al.	2512.17226	null
2025-12-18	The Effect of Negation on CLIP in Medical Imaging: Limitations of Contrastive Language-Image Pretraining	Jasmine Vu et.al.	2512.17121	null
2025-12-18	Plug to Place: Indoor Multimedia Geolocation from Electrical Sockets for Digital Investigation	Kanwal Aftab et.al.	2512.16620	null
2025-12-18	MACL: Multi-Label Adaptive Contrastive Learning Loss for Remote Sensing Image Retrieval	Amna Amir et.al.	2512.16294	null
2025-12-16	CLNet: Cross-View Correspondence Makes a Stronger Geo-Localizationer	Xianwei Cao et.al.	2512.14560	null
2025-12-16	Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries	Emanuele Mezzi et.al.	2512.14102	null
2025-12-15	Towards Test-time Efficient Visual Place Recognition via Asymmetric Query Processing	Jaeyoon Kim et.al.	2512.13055	null
2025-12-14	Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching	Wonseok Choi et.al.	2512.12610	null
2025-12-11	Beyond Pixels: A Training-Free, Text-to-Text Framework for Remote Sensing Image Retrieval	J. Xiao et.al.	2512.10596	null
2025-12-10	YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos	Ryan Meegan et.al.	2512.09903	null
2025-12-09	Adaptive Thresholding for Visual Place Recognition using Negative Gaussian Mixture Statistics	Nick Trinh et.al.	2512.09071	null
2025-12-08	Generalized Referring Expression Segmentation on Aerial Photos	Luís Marnoto et.al.	2512.07338	null
2025-12-07	Spatial Retrieval Augmented Autonomous Driving	Xiaosong Jia et.al.	2512.06865	null
2025-12-06	Language-driven Fine-grained Retrieval	Shijie Wang et.al.	2512.06255	null
2025-12-05	GuideNav: User-Informed Development of a Vision-Only Robotic Navigation Assistant For Blind Travelers	Hochul Hwang et.al.	2512.06147	null
2025-12-05	M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG	David Anugraha et.al.	2512.05959	null
2025-12-05	World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty	Zhiting Mei et.al.	2512.05927	link
2025-12-05	Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator	Md. Mahbub Hasan Akash et.al.	2512.05866	null
2025-12-05	Distilling Expert Surgical Knowledge: How to train local surgical VLMs for anatomy explanation in Complete Mesocolic Excision	Lennart Maack et.al.	2512.05740	null
2025-12-05	NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections	Juho Korkeala et.al.	2512.05610	null
2025-12-05	Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer	Rong Wang et.al.	2512.05593	null
2025-12-05	A Comprehensive Framework for Automated Quality Control in the Automotive Industry	Panagiota Moraiti et.al.	2512.05579	null
2025-12-05	MedDIFT: Multi-Scale Diffusion-Based Correspondence in 3D Medical Imaging	Xingyu Zhang et.al.	2512.05571	link
2025-12-05	2K-Characters-10K-Stories: A Quality-Gated Stylized Narrative Dataset with Disentangled Control and Sequence Consistency	Xingxi Yin et.al.	2512.05557	null
2025-12-05	Know-Show: Benchmarking Video-Language Models on Spatio-Temporal Grounded Reasoning	Chinthani Sugandhika et.al.	2512.05513	link
2025-12-05	Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation	Fan Zhang et.al.	2512.05494	null
2025-12-05	WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field	Qi Zhu et.al.	2512.05492	null
2025-12-05	YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications	Yida Lin et.al.	2512.05412	null
2025-12-05	LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models	Qingqiao Hu et.al.	2512.05391	null
2025-12-05	Hypothesis-Based Particle Detection for Accurate Nanoparticle Counting and Digital Diagnostics	Neil H. Kim et.al.	2512.05346	null
2025-12-05	CATNUS: Coordinate-Aware Thalamic Nuclei Segmentation Using T1-Weighted MRI	Anqi Feng et.al.	2512.05329	null
2025-12-04	Nerves of generalized multicategories	Soichiro Fujii et.al.	2512.05232	null
2025-12-04	Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models	Rowan Bradbury et.al.	2512.05198	null
2025-12-04	ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Shengyuan Ding et.al.	2512.05111	null
2025-12-04	Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark	Haobo Yuan et.al.	2512.05091	null
2025-12-04	Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding	Abhigyan Bhattacharya et.al.	2512.05039	null
2025-12-04	Revealing stimulus-dependent dynamics through statistical complexity	Edson V. de Paula et.al.	2512.05007	null
2025-12-04	Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis	Supriya Bordoloi et.al.	2512.04989	null
2025-12-04	Rethinking the Use of Vision Transformers for AI-Generated Image Detection	NaHyeon Park et.al.	2512.04969	link
2025-12-04	LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging	Zhijian Shu et.al.	2512.04939	null
2025-12-04	You Only Train Once (YOTO): A Retraining-Free Object Detection Framework	Priyanto Hidayatullah et.al.	2512.04888	null
2025-12-04	Are Your Agents Upward Deceivers?	Dadi Guo et.al.	2512.04864	null
2025-12-04	Terahertz Fourier Ptychographic Imaging	Pitambar Mukherjee et.al.	2512.04783	null
2025-12-04	TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards	Mauro Martini et.al.	2512.04772	null
2025-12-04	MemLoRA: Distilling Expert Adapters for On-Device Memory Systems	Massimo Bini et.al.	2512.04763	null
2025-12-04	Spectral micro-CT for quantitative analysis of calcification in fibrocartilage	Vittoria Mazzini et.al.	2512.04662	null
2025-12-04	Metric dimension of Cartesian product of stars	Akbar Davoodi et.al.	2512.04620	null
2025-12-04	Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence	Tianyu Yuan et.al.	2512.04619	null
2025-12-04	Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot	Sheng Hang et.al.	2512.04599	null
2025-12-04	Structure-Aware Adaptive Kernel MPPCA Denoising for Diffusion MRI	Ananya Singhal et.al.	2512.04586	null
2025-12-04	Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation	Houzhang Fang et.al.	2512.04581	null
2025-12-04	Prompt2Craft: Generating Functional Craft Assemblies with LLMs	Vitor Hideyo Isume et.al.	2512.04568	null
2025-12-04	Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex	Zhizhen Wu et.al.	2512.04556	null
2025-12-03	RELIC: Interactive Video World Model with Long-Horizon Memory	Yicong Hong et.al.	2512.04040	null
2025-12-03	Needle beams and structured space-time wavepackets	Ruediger Grunwald et.al.	2512.03993	null
2025-12-03	DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment	Sheng-Hao Liao et.al.	2512.03981	link
2025-12-03	Dual Cross-Attention Siamese Transformer for Rectal Tumor Regrowth Assessment in Watch-and-Wait Endoscopy	Jorge Tapias Gomez et.al.	2512.03883	null
2025-12-03	Traffic Image Restoration under Adverse Weather via Frequency-Aware Mamba	Liwen Pan et.al.	2512.03852	null
2025-12-03	Algorithms for Boolean Matrix Factorization using Integer Programming and Heuristics	Christos Kolomvakis et.al.	2512.03807	null
2025-12-04	CaFTRA: Frequency-Domain Correlation-Aware Feedback-Free MIMO Transmission and Resource Allocation for 6G and Beyond	Bo Qian et.al.	2512.03767	null
2025-12-03	Revealing Nanoscale Molecular Organization in Liquid Crystals via Cryogenic Atom Probe Tomography	Kuan Meng et.al.	2512.03734	null
2025-12-03	DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction	Kaichen Zhang et.al.	2512.03715	null
2025-12-03	Structured Uncertainty Similarity Score (SUSS): Learning a Probabilistic, Interpretable, Perceptual Metric Between Images	Paula Seidler et.al.	2512.03701	null
2025-12-03	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2512.03684	null
2025-12-03	Multi-Scale Visual Prompting for Lightweight Small-Image Classification	Salim Khazem et.al.	2512.03663	null
2025-12-03	Evaluation of Foundational Machine Learned Interatomic Potentials for Migration Barrier Predictions	Achinthya Krishna Bheemaguli et.al.	2512.03642	null
2025-12-03	HBFormer: A Hybrid-Bridge Transformer for Microtumor and Miniature Organ Segmentation	Fuchen Zheng et.al.	2512.03597	null
2025-12-03	Global-Local Aware Scene Text Editing	Fuxiang Yang et.al.	2512.03574	null
2025-12-03	M3DR: Towards Universal Multilingual Multimodal Document Retrieval	Adithya S Kolavi et.al.	2512.03514	null
2025-12-03	Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles	Haicheng Liao et.al.	2512.03454	null
2025-12-03	Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation	Xieji Li et.al.	2512.03445	link
2025-12-03	Multimodal Reinforcement Learning with Agentic Verifier for AI Agents	Reuben Tan et.al.	2512.03438	null
2025-12-03	Building a Radio AGN Sample from Cosmic Morning – The Radio High-Redshift Quasar Catalog (RHzQCat): I. Catalog from SDSS Quasars and Radio Surveys at $z > 3$	Yingkang Zhang et.al.	2512.03415	null
2025-12-02	MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues	Zichen Liu et.al.	2512.03046	link
2025-12-02	Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation	Zeqi Xiao et.al.	2512.03040	null
2025-12-02	Stability of knot equivalence at low regularity, and symmetric critical knots for the Möbius energy	Simon Blatt et.al.	2512.02998	null
2025-12-02	MIRI spectrophotometry of GN-z11: Detection and nature of an optical red continuum component	A. Crespo Gómez et.al.	2512.02997	null
2025-12-02	GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection	Md Sohag Mia et.al.	2512.02991	null
2025-12-02	LoVoRA: Text-guided and Mask-free Video Object Removal and Addition with Learnable Object-aware Localization	Zhihan Xiao et.al.	2512.02933	null
2025-12-02	MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding	Fan Yang et.al.	2512.02906	null
2025-12-02	Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models	Pierpaolo Serio et.al.	2512.02897	null
2025-12-02	Terahertz Emission from Spintronic Stack Nanodecorated with Drop-Cast Core-Shell Plasmonic Nanoparticles	Vittorio Cecconi et.al.	2512.02889	null
2025-12-02	Leveraging generative adversarial networks with spatially adaptive denormalization for multivariate stochastic seismic data inversion	Roberto Miele et.al.	2512.02863	null
2025-12-02	BOOM: Beyond Only One Modality KIT’s Multimodal Multilingual Lecture Companion	Sai Koneru et.al.	2512.02817	null
2025-12-02	Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control	Yongrui Yu et.al.	2512.02814	null
2025-12-02	Direct observational evidence that higher-luminosity type 1 active galactic nuclei are most commonly triggered by galaxy mergers	Yongmin Yoon et.al.	2512.02805	null
2025-12-14	HUD: Hierarchical Uncertainty-Aware Disambiguation Network for Composed Video Retrieval	Zhiwei Chen et.al.	2512.02792	link
2025-12-02	Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone	Tristan Amadei et.al.	2512.02737	link
2025-12-02	DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions	Yifan Zhou et.al.	2512.02727	null
2025-12-02	Training Data Attribution for Image Generation using Ontology-Aligned Knowledge Graphs	Theodoros Aivalis et.al.	2512.02713	null
2025-12-02	GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization	Zixuan Song et.al.	2512.02697	link
2025-12-02	ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data	Yuxing Liu et.al.	2512.02686	null
2025-12-02	Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation	Agathoklis Georgiou et.al.	2512.02660	null
2025-12-01	Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback	Aiden Yiliu Li et.al.	2512.01979	null
2025-12-01	SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception	Gurmeher Khurana et.al.	2512.01908	null
2025-12-01	KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM	Zaid Nasser et.al.	2512.01889	null
2025-12-01	Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval	Xin Wang et.al.	2512.01636	null
2025-12-01	Depth Matching Method Based on ShapeDTW for Oil-Based Mud Imager	Fengfeng Li et.al.	2512.01611	null
2025-12-01	Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track	Mo Chen et.al.	2512.01608	null
2025-12-01	Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation	Thao Thi Phuong Dao et.al.	2512.01589	null
2025-12-01	Near-infrared polarimetric imaging with nonlinear flat-optics	Evgenii Menshikov et.al.	2512.01525	null
2025-12-01	QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions	Can Polat et.al.	2512.01519	null
2025-12-01	Winning Solutions for the Rayan AI Contest: Compositional Retrieval, Zero-Shot Anomaly Detection, and Backdoor Detection	Ali Nafisi et.al.	2512.01498	null
2025-12-01	ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers	Yiyang Ma et.al.	2512.01426	null
2025-12-01	Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries	Tushar Pranav et.al.	2512.01419	null
2025-12-01	Rethinking Intracranial Aneurysm Vessel Segmentation: A Perspective from Computational Fluid Dynamics Applications	Feiyang Xiao et.al.	2512.01319	null
2025-12-01	DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy	Jaewoo Song et.al.	2512.01302	null
2025-12-01	Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI	Kamal Basha S et.al.	2512.01291	null
2025-12-01	Egent: An Autonomous Agent for Equivalent Width Measurement	Yuan-Sen Ting et.al.	2512.01270	null
2025-12-01	Social Media Data Mining of Human Behaviour during Bushfire Evacuation	Junfeng Wu et.al.	2512.01262	null
2025-12-01	M4-BLIP: Advancing Multi-Modal Media Manipulation Detection through Face-Enhanced Local Analysis	Hang Wu et.al.	2512.01214	null
2025-11-30	A sudden fine-scale bright kernel captured by Hi-C Flare during an M1.6-class solar flare’s post-maximum phase	Sanjiv K. Tiwari et.al.	2512.01140	null
2025-11-30	OmniFD: A Unified Model for Versatile Face Forgery Detection	Haotian Liu et.al.	2512.01128	link
2025-11-28	DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline	Rui Zhang et.al.	2511.23377	link
2025-11-28	FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting	Tianhao Xie et.al.	2511.23292	null
2025-11-28	Robust 3DGS-based SLAM via Adaptive Kernel Smoothing	Shouhe Zhang et.al.	2511.23221	null
2025-11-28	PowerCLIP: Powerset Alignment for Contrastive Pre-Training	Masaki Kawamura et.al.	2511.23170	null
2025-11-28	DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation	Hongfei Zhang et.al.	2511.23127	link
2025-11-28	DNA-Prior: Unsupervised Denoise Anything via Dual-Domain Prior	Yanqi Cheng et.al.	2511.23124	null
2025-11-28	Geodiffussr: Generative Terrain Texturing with Elevation Fidelity	Tai Inui et.al.	2511.23029	null
2025-11-28	JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization	Yunlong Lin et.al.	2511.23002	null
2025-11-28	Imaging propagating terahertz collective modes in two-dimensional semiconductor double layers	Andrew T. Pierce et.al.	2511.22962	null
2025-11-28	HMR3D: Hierarchical Multimodal Representation for 3D Scene Understanding with Large Vision-Language Model	Chen Li et.al.	2511.22961	null
2025-11-28	A Trainable Centrality Framework for Modern Data	Minh Duc Vu et.al.	2511.22959	null
2025-11-28	Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records	Shiyu Shen et.al.	2511.22958	null
2025-11-28	See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection	YuEun Lee et.al.	2511.22906	null
2025-11-28	MARVO: Marine-Adaptive Radiance-aware Visual Odometry	Sacchin Sundar et.al.	2511.22860	null
2025-11-28	Breaking the Visual Shortcuts in Multimodal Knowledge-Based Visual Question Answering	Dosung Lee et.al.	2511.22843	null
2025-11-28	Captain Safari: A World Engine	Yu-Cheng Chou et.al.	2511.22815	null
2025-11-27	Alzheimer’s Disease Prediction Using EffNetViTLoRA and BiLSTM with Multimodal Longitudinal MRI Data	Mahdieh Behjat Khatooni et.al.	2511.22774	null
2025-11-27	ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering	Alberto Compagnoni et.al.	2511.22715	null
2025-11-27	Test-time scaling of diffusions with flow maps	Amirmojtaba Sabour et.al.	2511.22688	null
2025-11-27	VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models	Silin Cheng et.al.	2511.22664	null
2025-11-27	GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents	Xinyu Zhang et.al.	2511.22441	null
2025-11-27	UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries	Hoang-Bao Le et.al.	2511.22253	null
2025-11-26	Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models	Naifu Zhang et.al.	2511.21663	null
2025-11-26	Fast 3D Ultrasound Localization Microscopy via Projection-based Processing Framework	Jingke Zhang et.al.	2511.21647	null
2025-11-26	Qwen3-VL Technical Report	Shuai Bai et.al.	2511.21631	null
2025-11-26	Scale-Agnostic Kolmogorov-Arnold Geometry in Neural Networks	Mathew Vanherreweghe et.al.	2511.21626	null
2025-11-26	Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy	Teng Hu et.al.	2511.21579	null
2025-11-26	CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation	Shizhe Sun et.al.	2511.21503	null
2025-11-26	Semantic-Enhanced Feature Matching with Learnable Geometric Verification for Cross-Modal Neuron Registration	Wenwei Li et.al.	2511.21452	null
2025-11-26	Hierarchical Besov-Laplace priors for spatially inhomogeneous binary classification	Patric Dolmeta et.al.	2511.21441	null
2025-11-26	FITRep: Attention-Guided Item Representation via MLLMs	Guoxiao Zhang et.al.	2511.21389	null
2025-11-26	Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning	Xin Gu et.al.	2511.21375	null
2025-11-26	The Directed Prediction Change - Efficient and Trustworthy Fidelity Assessment for Local Feature Attribution Methods	Kevin Iselborn et.al.	2511.21363	null
2025-11-26	HTTM: Head-wise Temporal Token Merging for Faster VGGT	Weitian Wang et.al.	2511.21317	null
2025-11-26	Neural NMPC through Signed Distance Field Encoding for Collision Avoidance	Martin Jacquet et.al.	2511.21312	null
2025-11-26	Low-dose Chemically Specific Bioimaging via Deep-UV Lensless Holographic Microscopy on a Standard Camera	Piotr Arcab et.al.	2511.21311	null
2025-11-26	Adaptive Lighting Control in Visible Light Systems: An Integrated Sensing, Communication, and Illumination Framework	Xinyan Xie et.al.	2511.21271	null
2025-11-26	Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition	Baoli Sun et.al.	2511.21202	null
2025-11-26	CAHS-Attack: CLIP-Aware Heuristic Search Attack Method for Stable Diffusion	Shuhan Xia et.al.	2511.21180	null
2025-11-26	LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs	Shichu Sun et.al.	2511.21150	link
2025-11-26	Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval	Anup Roy et.al.	2511.21121	null
2025-11-26	Scaling Foundation Models for Radar Scene Understanding	Pushkal Mishra et.al.	2511.21105	null
2025-11-25	Efficient Greedy Algorithms for Feature Selection in Robot Visual Localization	Vivek Pandey et.al.	2511.20894	null
2025-11-25	The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment	Ziheng Ouyang et.al.	2511.20614	null
2025-11-25	Adaptive Hopfield Network: Rethinking Similarities in Associative Memory	Shurong Wang et.al.	2511.20609	null
2025-11-25	New York Smells: A Large Multimodal Dataset for Olfaction	Ege Ozguroglu et.al.	2511.20544	null
2025-11-25	Wide Area Surface Dosimetry with Conformal Scintillator Array for External Beam Radiotherapy	Roman Vasyltsiv et.al.	2511.20472	null
2025-11-25	Power-Efficient Autonomous Mobile Robots	Liangkai Liu et.al.	2511.20467	null
2025-11-25	STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow	Jiatao Gu et.al.	2511.20462	null
2025-11-25	Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search	Yunqi Zhou et.al.	2511.20460	link
2025-11-25	A meshless data-tailored approach to compute statistics from scattered data with adaptive radial basis functions	Damien Rigutto et.al.	2511.20449	null
2025-11-25	A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control	Jiawei Lin et.al.	2511.20401	null
2025-11-25	Real-Space Imaging of Moiré-Confined Excitons in Twisted Bilayer MoS$_2$	Laurens J. M. Westenberg et.al.	2511.20398	null
2025-11-25	Interactive Visualization of Proof-of-Work Consensus Protocol on Raspberry Pi	Anton Ivashkevich et.al.	2511.20391	null
2025-11-25	From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations	Zhiqing Guo et.al.	2511.20359	null
2025-11-25	3D Motion Perception of Binocular Vision Target with PID-CNN	Shi Jiazhao et.al.	2511.20332	null
2025-11-25	TaCo: Capturing Spatio-Temporal Semantic Consistency in Remote Sensing Change Detection	Han Guo et.al.	2511.20306	null
2025-11-25	Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations	Chao Wang et.al.	2511.20295	null
2025-11-25	Bootstrapping Physics-Grounded Video Generation through VLM-Guided Iterative Self-Refinement	Yang Liu et.al.	2511.20280	null
2025-11-25	ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis	Advik Sinha et.al.	2511.20274	null
2025-11-25	DRL-Guided Neural Batch Sampling for Semi-Supervised Pixel-Level Anomaly Detection	Amirhossein Khadivi Noghredeh et.al.	2511.20270	null
2025-11-25	XiCAD: Camera Activation Detection in the Da Vinci Xi User Interface	Alexander C. Jenke et.al.	2511.20254	null
2025-11-25	V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs	Sen Nie et.al.	2511.20223	link
2025-11-25	Intelligent Image Search Algorithms Fusing Visual Large Models	Kehan Wang et.al.	2511.19920	null
2025-11-24	Wigner and Gabor phase-space analysis of propagators for evolution equations	Elena Cordero et.al.	2511.19400	null
2025-11-24	Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments	Jorge Ortigoso-Narro et.al.	2511.19396	null
2025-11-24	Neural Architecture Search for Quantum Autoencoders	Hibah Agha et.al.	2511.19246	null
2025-11-24	In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings	Teresa Guallart-Naval et.al.	2511.19226	null
2025-11-24	Can Modern Vision Models Understand the Difference Between an Object and a Look-alike?	Itay Cohen et.al.	2511.19200	null
2025-11-24	From Pixels to Posts: Retrieval-Augmented Fashion Captioning and Hashtag Generation	Moazzam Umer Gondal et.al.	2511.19149	null
2025-11-24	When Semantics Regulate: Rethinking Patch Shuffle and Internal Bias for Generated Image Detection with CLIP	Beilin Chu et.al.	2511.19126	null
2025-11-24	DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection	Hai Ci et.al.	2511.19111	null
2025-11-24	Graph-based 3D Human Pose Estimation using WiFi Signals	Jichao Chen et.al.	2511.19105	null
2025-11-24	Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach	Fan Nie et.al.	2511.19080	null
2025-11-24	Granular Computing-driven SAM: From Coarse-to-Fine Guidance for Prompt-Free Segmentation	Qiyang Yu et.al.	2511.19062	null
2025-11-24	LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space	Hai Wu et.al.	2511.19057	null
2025-11-24	Multi-height probing of horizontal flows in the solar photosphere	Teodor Kostić et.al.	2511.19048	null
2025-11-24	Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors	Haihang Wu et.al.	2511.19031	null
2025-11-24	Dynamic Granularity Matters: Rethinking Vision Transformers Beyond Fixed Patch Splitting	Qiyang Yu et.al.	2511.19021	null
2025-11-24	AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization	Christos Koutlis et.al.	2511.18993	null
2025-11-24	Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models	Santiago Moreno et.al.	2511.18978	null
2025-11-24	MagicWorld: Interactive Geometry-driven Video World Exploration	Guangyuan Li et.al.	2511.18886	null
2025-11-24	Personalized Federated Segmentation with Shared Feature Aggregation and Boundary-Focused Calibration	Ishmam Tashdeed et.al.	2511.18847	null
2025-11-24	SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation	Nimeshika Udayangani et.al.	2511.18816	null
2025-11-23	AIA-UltraNeRF: Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization	Shuai Zhang et.al.	2511.18293	null
2025-11-23	SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes	Jungho Lee et.al.	2511.18290	null
2025-11-22	Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models	Dachuan Zhao et.al.	2511.18123	null
2025-11-21	Effect of local environment on Ly$\alpha$ line profile in DESI/ODIN LAEs	Ana Sofía M. Uzsoy et.al.	2511.17498	null
2025-11-21	GPR-OdomNet: Difference and Similarity-Driven Odometry Estimation Network for Ground Penetrating Radar-Based Localization	Huaichao Wang et.al.	2511.17457	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	null
2025-11-21	Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers	Christopher Boland et.al.	2511.17421	null
2025-11-21	IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation	Yifan Li et.al.	2511.17384	null
2025-11-21	SVRecon: Sparse Voxel Rasterization for Surface Reconstruction	Seunghun Oh et.al.	2511.17364	null
2025-11-21	NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior	Dongbo Shi et.al.	2511.17322	null
2025-11-21	MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning	Wenrui Zhang et.al.	2511.17300	null
2025-11-21	Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation	Chuancheng Shi et.al.	2511.17282	null
2025-11-21	A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback	Bulat Khaertdinov et.al.	2511.17255	null
2025-11-21	Mixed Reality Scenic Live Streaming for Cultural Heritage: Visual Interactions in a Historic Landscape	Zeyu Huang et.al.	2511.17246	null
2025-11-21	Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers	Cris Claessens et.al.	2511.17209	null
2025-11-21	SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors	Kunyi Li et.al.	2511.17207	null
2025-11-21	Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition	Aditya Mishra et.al.	2511.17183	null
2025-11-21	Reflection-Based Relative Localization for Cooperative UAV Teams Using Active Markers	Tim Lakemann et.al.	2511.17166	null
2025-11-21	A lightweight detector for real-time detection of remote sensing images	Qianyi Wang et.al.	2511.17147	null
2025-11-21	Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation	Shuo Wang et.al.	2511.17097	null
2025-11-21	Spanning Tree Autoregressive Visual Generation	Sangkyu Lee et.al.	2511.17089	null
2025-11-21	ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion	Junming Liu et.al.	2511.17068	null
2025-11-21	Stable Offline Hand-Eye Calibration for any Robot with Just One Mark	Sicheng Xie et.al.	2511.17001	null
2025-11-20	Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation	Ziyu Guo et.al.	2511.16671	null
2025-11-20	Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems	Elias Lumer et.al.	2511.16654	null
2025-11-20	Measurement incompatibility in Bayesian multiparameter quantum estimation	Francesco Albarelli et.al.	2511.16645	null
2025-11-20	SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction	Guolin Huang et.al.	2511.16635	null
2025-11-20	SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking	Haofeng Liu et.al.	2511.16618	link
2025-11-20	POMA-3D: The Point Map Way to 3D Scene Understanding	Ye Mao et.al.	2511.16567	link
2025-11-20	NutriScreener: Retrieval-Augmented Multi-Pose Graph Attention Network for Malnourishment Screening	Misaal Khan et.al.	2511.16566	null
2025-11-20	Investigating Optical Flow Computation: From Local Methods to a Multiresolution Horn-Schunck Implementation with Bilinear Interpolation	Haytham Ziani et.al.	2511.16535	null
2025-11-20	Contrastive vision-language learning with paraphrasing and negation	Kwun Ho Ngan et.al.	2511.16527	null
2025-11-20	BoxingVI: A Multi-Modal Benchmark for Boxing Action Recognition and Localization	Rahul Kumar et.al.	2511.16524	null
2025-11-20	YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras	Fan Yang et.al.	2511.16521	null
2025-11-20	TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models	Li Zhang et.al.	2511.16423	null
2025-11-20	DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration	Meng-Cheng Shih et.al.	2511.16364	null
2025-11-20	CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering	Joni Vanherck et.al.	2511.16349	null
2025-11-20	Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks	Yi Ting Tsai et.al.	2511.16341	null
2025-11-20	Non-squeezing and other global rigidity results in locally conformal symplectic geometry	Mélanie Bertelson et.al.	2511.16329	null
2025-11-20	Real-Time Inference for Distributed Multimodal Systems under Communication Delay Uncertainty	Victor Croisfelt et.al.	2511.16225	null
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-20	AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers	Boxun Xu et.al.	2511.16047	null
2025-11-20	InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer	Muyao Yuan et.al.	2511.15967	null
2025-11-19	GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization	Yikun Wang et.al.	2511.15705	link
2025-11-19	First Frame Is the Place to Go for Video Content Customization	Jingxi Chen et.al.	2511.15700	link
2025-11-19	Hyperspectral Image Classification using Spectral-Spatial Mixer Network	Mohammed Q. Alkhatib et.al.	2511.15692	link
2025-11-19	Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning	Tao Hu et.al.	2511.15633	null
2025-11-19	Catching the 2021 γ-ray flare in the blazar TXS 2013+370	Giorgos Michailidis et.al.	2511.15601	null
2025-11-19	Multi-Text Guided Few-Shot Semantic Segmentation	Qiang Jiao et.al.	2511.15515	null
2025-11-19	SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome	Dabin Jeong et.al.	2511.15464	null
2025-11-19	HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation	Linyin Luo et.al.	2511.15435	null
2025-11-19	The Empowerment of Science of Science by Large Language Models: New Tools and Methods	Guoqiang Liang et.al.	2511.15370	null
2025-11-19	On the phase aberration estimation using common mid-angle correlations	Naiara Korta Martiartu et.al.	2511.15336	null
2025-11-19	C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models	Nayoung Oh et.al.	2511.15333	null
2025-11-19	Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval	Qing Wang et.al.	2511.15201	null
2025-11-19	Probing Electro-Magnetic Field Enhancement in 3D Plasmonic Nanopores Using DNA-PAINT and Nanorulers	German Lanzavecchia et.al.	2511.15181	null
2025-11-19	Multimodal Wireless Foundation Models	Ahmed Aboulfotouh et.al.	2511.15162	null
2025-11-19	Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation	Jin Wang et.al.	2511.15118	link
2025-11-19	BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer	Wenhan Yu et.al.	2511.15090	null
2025-11-19	Hyperspectral Super-Resolution with Inter-Image Variability via Degradation-based Low-Rank and Residual Fusion Method	Yue Wen et.al.	2511.15052	null
2025-11-18	Reconstruction of three-dimensional shapes of normal and disease-related erythrocytes from partial observations using multi-fidelity neural networks	Haizhou Wen et.al.	2511.14962	null
2025-11-18	FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding	Zhenshi Li et.al.	2511.14901	null
2025-11-18	Quantum Transport Spectroscopy of Pseudomagnetic Field in Graphene	Divya Sahani et.al.	2511.14888	null
2025-11-18	FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation	Yunfeng Wu et.al.	2511.14712	link
2025-11-18	Cell Shape Emerges from Motion	Gautham Gopinath et.al.	2511.14707	link
2025-11-18	Seeing Beyond the Image: ECG and Anatomical Knowledge-Guided Myocardial Scar Segmentation from Late Gadolinium-Enhanced Images	Farheen Ramzan et.al.	2511.14702	null
2025-11-18	Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities	Huiyan Zou et.al.	2511.14687	null
2025-11-18	A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases	Tao Yang et.al.	2511.14638	null
2025-11-18	Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains	Qingwei Ben et.al.	2511.14625	link
2025-11-18	Deep Learning-Based Regional White Matter Hyperintensity Mapping as a Robust Biomarker for Alzheimer’s Disease	Julia Machnio et.al.	2511.14588	null
2025-11-18	Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction	Jaume Ros et.al.	2511.14544	null
2025-11-18	D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images	Taifour Yousra Nabila et.al.	2511.14518	null
2025-11-18	Aerial Assistance System for Automated Firefighting during Turntable Ladder Operations	Jan Quenzel et.al.	2511.14504	null
2025-11-18	VENUS: A Strongly Lensed Clumpy Galaxy at $z\sim11-12$ behind the Galaxy Cluster MACS J0257.1-2325	Minami Nakane et.al.	2511.14483	null
2025-11-18	Multi-network Topology Underlying Individual Language Learning Success	Peilun Song et.al.	2511.14453	null
2025-11-18	DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval	Zongwei Zhen et.al.	2511.14449	null
2025-11-18	Agentic Video Intelligence: A Flexible Framework for Advanced Video Exploration and Understanding	Hong Gao et.al.	2511.14446	null
2025-11-18	Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving	Kangqiao Zhao et.al.	2511.14386	null
2025-11-18	O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model	Rishi Gupta et.al.	2511.14368	null
2025-11-18	Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors	Jeryes Danial et.al.	2511.14335	null
2025-11-18	SAM-Fed: SAM-Guided Federated Semi-Supervised Learning for Medical Image Segmentation	Sahar Nasirihaghighi et.al.	2511.14302	null
2025-11-18	NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration	Luohong Wu et.al.	2511.14286	null
2025-11-18	Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery	Yiming Zeng et.al.	2511.14270	null
2025-11-17	Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images	Yinuo Xu et.al.	2511.13586	null
2025-11-17	Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification	Linhan Zhou et.al.	2511.13575	link
2025-11-17	Language-Guided Invariance Probing of Vision-Language Models	Jae Joong Lee et.al.	2511.13494	null
2025-11-17	Unlocking the Forgery Detection Potential of Vanilla MLLMs: A Novel Training-Free Pipeline	Rui Zuo et.al.	2511.13442	null
2025-11-17	Attention Grounded Enhancement for Visual Document Retrieval	Wanqing Cui et.al.	2511.13415	null
2025-11-17	An Unusual Velocity Field in a Sunspot Penumbra	H. Balthasar et.al.	2511.13374	null
2025-11-17	Stray Light Correction for the Helioseismic and Magnetic Imager	A. A. Norton et.al.	2511.13348	null
2025-11-17	GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models	Yushuo Zheng et.al.	2511.13259	null
2025-11-17	Referring Camouflaged Object Detection With Multi-Context Overlapped Windows Cross-Attention	Yu Wen et.al.	2511.13249	null
2025-11-17	Uncovering and Mitigating Transient Blindness in Multimodal Model Editing	Xiaoqi Han et.al.	2511.13243	null
2025-11-17	GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry	Chiyun Noh et.al.	2511.13216	null
2025-11-17	Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework	Diego Ortego et.al.	2511.13189	null
2025-11-17	GenTract: Generative Global Tractography	Alec Sargood et.al.	2511.13183	null
2025-11-17	THIR: Topological Histopathological Image Retrieval	Zahra Tabatabaei et.al.	2511.13170	null
2025-11-17	SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration	Haodong Wang et.al.	2511.13168	null
2025-11-17	MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications	Gagan Raj Gupta et.al.	2511.13131	null
2025-11-17	Region-Point Joint Representation for Effective Trajectory Similarity Learning	Hao Long et.al.	2511.13125	null
2025-11-17	Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining	Zhaocheng Yu et.al.	2511.13113	null
2025-11-17	uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data	Dahyun Chung et.al.	2511.13036	null
2025-11-17	Towards 3D Object-Centric Feature Learning for Semantic Scene Completion	Weihua Wang et.al.	2511.13031	null
2025-11-14	DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding	Dawei Zhu et.al.	2511.11552	null
2025-11-14	STEM EBIC as a Quantitative Probe of Semiconductor Devices	Sebastian Schneider et.al.	2511.11528	null
2025-11-14	Bridging Hidden States in Vision-Language Models	Benjamin Fein-Ashley et.al.	2511.11526	null
2025-11-14	OpenUS: A Fully Open-Source Foundation Model for Ultrasound Image Analysis via Self-Adaptive Masked Contrastive Learning	Xiaoyu Zheng et.al.	2511.11510	link
2025-11-14	Planetary nebulae as tracers of stellar population properties: a pilot study with MUSE	Ana Inés Ennis et.al.	2511.11479	null
2025-11-14	Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs	Francisco Nogueira et.al.	2511.11427	null
2025-11-14	Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment	Lukun Wu et.al.	2511.11422	link
2025-11-14	Bidimensional measurements of photon statistics within a multimodal temporal framework	C. Hainaut et.al.	2511.11403	null
2025-11-14	GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes	Shumit A. Mitra et.al.	2511.11401	null
2025-11-14	Shadow-Induced Warps in Protoplanetary disks	Shangjia Zhang et.al.	2511.11358	null
2025-11-14	Gluing sheaves along Harder-Narasimhan strata of $\mathrm{Bun}_G$	Jon Miles et.al.	2511.11327	null
2025-11-14	StochEP: Stochastic Equilibrium Propagation for Spiking Convergent Recurrent Neural Networks	Jiaqi Lin et.al.	2511.11320	null
2025-11-14	DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding	Tanveer Hannan et.al.	2511.11313	null
2025-11-14	MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising	Chenghan Fu et.al.	2511.11305	null
2025-11-14	Coordinative Learning with Ordinal and Relational Priors for Volumetric Medical Image Segmentation	Haoyi Wang et.al.	2511.11276	link
2025-11-14	3D Stokes polarimetric imaging at nanoscales	Isael Herrera et.al.	2511.11222	null
2025-11-14	Positional Bias in Multimodal Embedding Models: Do They Favor the Beginning, the Middle, or the End?	Kebin Wu et.al.	2511.11216	null
2025-11-14	Inverse modeling of porous flow through deep neural networks: the case of coffee percolation	Antoniorenee Barletta et.al.	2511.11194	null
2025-11-14	CareCom: Generative Image Composition with Calibrated Reference Features	Jiaxuan Chen et.al.	2511.11060	null
2025-11-14	MPCGNet: A Multiscale Feature Extraction and Progressive Feature Aggregation Network Using Coupling Gates for Polyp Segmentation	Wei Wang et.al.	2511.11032	null
2025-11-13	Multitask GLocal OBIA-Mamba for Sentinel-2 Landcover Mapping	Zack Dewis et.al.	2511.10604	null
2025-11-13	Excitonic Landscapes in Monolayer Lateral Heterostructures Revealed by Unsupervised Machine Learning	Maninder Kaur et.al.	2511.10600	null
2025-11-13	Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering	Bavana Durgapraveen et.al.	2511.10591	null
2025-11-13	Tight Robustness Certification through the Convex Hull of $\ell_0$ Attacks	Yuval Shapira et.al.	2511.10576	null
2025-11-13	Two Americas of Well-Being: Divergent Rural-Urban Patterns of Life Satisfaction and Happiness from 2.6 B Social Media Posts	Stefano Maria Iacus et.al.	2511.10542	null
2025-11-13	SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation	Wei Li et.al.	2511.10518	null
2025-11-13	Measuring dissimilarity between convex cones by means of max-min angles	Welington de Oliveira et.al.	2511.10483	null
2025-11-13	Extending the Frontier of Spatially-Resolved Supermassive Black Hole Mass Measurements to at $1\lesssim z\lesssim2$: Simulations with ELT/MICADO High-Resolution Mass Models and HARMONI Integral-Field Stellar Kinematics	Dieu D. Nguyen et.al.	2511.10427	null
2025-11-13	Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators	Maximiliane Gruber et.al.	2511.10424	null
2025-11-13	MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns	Jiarui Zhang et.al.	2511.10390	null
2025-11-13	Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery	Prince Mensah et.al.	2511.10387	null
2025-11-13	DermAI: Clinical dermatology acquisition through quality-driven image collection for AI classification in mobile	Thales Bezerra et.al.	2511.10367	null
2025-11-13	Rethinking Visual Information Processing in Multimodal LLMs	Dongwan Kim et.al.	2511.10301	null
2025-11-13	H3Former: Hypergraph-based Semantic-Aware Aggregation via Hyperbolic Hierarchical Contrastive Loss for Fine-Grained Visual Classification	Yongji Zhang et.al.	2511.10260	null
2025-11-13	TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding	Jinxuan Li et.al.	2511.10241	null
2025-11-13	Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization	Ashutosh Anshul et.al.	2511.10212	null
2025-11-13	Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA	Yiran Zhang et.al.	2511.10182	null
2025-11-13	GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval	Hao Zou et.al.	2511.10154	null
2025-11-13	Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction	Mingda Jia et.al.	2511.10134	null
2025-11-13	GridPrune: From “Where to Look” to “What to Select” in Visual Token Pruning for MLLMs	Yuxiang Duan et.al.	2511.10081	null
2025-11-10	TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research	Han Zhang et.al.	2511.07412	null
2025-11-10	LeCoT: revisiting network architecture for two-view correspondence pruning	Luanyuan Dai et.al.	2511.07078	null
2025-11-09	DiffusionUavLoc: Visually Prompted Diffusion for Cross-View UAV Localization	Tao Liu et.al.	2511.06422	link
2025-11-09	ALIGN: A Vision-Language Framework for High-Accuracy Accident Location Inference through Geo-Spatial Neural Reasoning	MD Thamed Bin Zaman Chowdhury et.al.	2511.06316	null
2025-11-08	Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era	Feng Lu et.al.	2511.06024	link
2025-11-08	Hilbert-Guided Block-Sparse Local Attention	Yunge Li et.al.	2511.05832	null
2025-11-07	Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments	Laura Alejandra Encinar Gonzalez et.al.	2511.05404	null
2025-11-07	DAFM: Dynamic Adaptive Fusion for Multi-Model Collaboration in Composed Image Retrieval	Yawei Cai et.al.	2511.05020	null
2025-11-06	Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA	Itbaan Safwan et.al.	2511.04384	null
2025-11-06	An Efficient Algorithm for Learning-Based Visual Localization	Jindi Zhong et.al.	2511.04232	null
2025-11-05	The Human Flourishing Geographic Index: A County-Level Dataset for the United States, 2013–2023	Stefano M. Iacus et.al.	2511.03915	null
2025-11-04	Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization	Tao Liu et.al.	2511.02489	null
2025-11-04	LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment	Rohan Wandre et.al.	2511.02371	null
2025-11-03	SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment	Xinyu Mao et.al.	2511.01390	null
2025-11-02	GeoToken: Hierarchical Geolocalization of Images via Next Token Prediction	Narges Ghasemi et.al.	2511.01082	null
2025-11-02	Dynamic Multi-level Weighted Alignment Network for Zero-shot Sketch-based Image Retrieval	Hanwen Su et.al.	2511.00925	null
2025-10-31	GEDICorrect: A Scalable Python Tool for Orbit-, Beam-, and Footprint-Level GEDI Geolocation Correction	Leonel Corado et.al.	2511.00319	null
2025-10-31	Approximate Diverse $k$-nearest Neighbor Search in Vector Database	Jiachen Zhao et.al.	2510.27243	null
2025-11-03	Evaluating Perspectival Biases in Cross-Modal Retrieval	Teerapol Saengsukhiran et.al.	2510.26861	null
2025-10-30	Scaling Image Geo-Localization to Continent Level	Philipp Lindenberger et.al.	2510.26795	null
2025-10-29	Citizen science dataset on residents’ urban heat perception in outdoor public spaces of climate-vulnerable neighborhoods	Ferran Larroya et.al.	2510.25645	null
2025-10-29	Instance-Level Composed Image Retrieval	Bill Psomas et.al.	2510.25387	null
2025-10-28	DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts	Binbin Li et.al.	2510.24813	null
2025-10-27	Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment	Hongyi Wang et.al.	2510.23224	null
2025-10-26	Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models	Yang Zhang et.al.	2510.22868	null
2025-10-30	Cross-view Localization and Synthesis – Datasets, Challenges and Opportunities	Ningli Xu et.al.	2510.22736	null
2025-10-26	STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models	Mahiro Ukai et.al.	2510.22571	null
2025-10-25	Cross-Platform Short-Video Diplomacy: Topic and Sentiment Analysis of China-US Relations on Douyin and TikTok	Zheng Wei et.al.	2510.22415	null
2025-10-24	BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models	Ziheng Zhang et.al.	2510.20095	null
2025-10-18	Small Language Models Offer Significant Potential for Science Community	Jian Zhang et.al.	2510.18890	null
2025-10-21	Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection	Ji Du et.al.	2510.18437	link
2025-10-21	ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization	Yuanhe Guo et.al.	2510.18433	null
2025-10-21	DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing	Luxuan Li et.al.	2510.18218	null
2025-10-20	Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition	Timur Ismagilov et.al.	2510.17739	null
2025-10-18	iWatchRoadv2: Pothole Detection, Geospatial Mapping, and Intelligent Road Governance	Rishi Raj Sahoo et.al.	2510.16375	link
2025-10-16	Acquisition of interpretable domain information during brain MR image harmonization for content-based image retrieval	Keima Abe et.al.	2510.14535	null
2025-10-15	Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition	Emily Miller et.al.	2510.13464	null
2025-10-15	Mobile Coverage Analysis using Crowdsourced Data	Timothy Wong et.al.	2510.13459	null
2025-10-13	Embedding the Teacher: Distilling vLLM Preferences for Scalable Image Retrieval	Eric He et.al.	2510.12014	null
2025-10-13	Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across Scales	Zhaofang Qian et.al.	2510.10880	null
2025-10-08	Population synthesis with geographic coordinates	Jacopo Lenti et.al.	2510.09669	null
2025-10-10	Hierarchical Scheduling for Multi-Vector Image Retrieval	Maoliang Li et.al.	2510.08976	null
2025-10-09	DarkHash: A Data-Free Backdoor Attack Against Deep Hashing	Ziqi Zhou et.al.	2510.08094	null
2025-10-09	CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning	Weihuang Lin et.al.	2510.08003	null
2025-10-09	Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision	Xiaoxu Ma et.al.	2510.07703	null
2025-10-08	Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval	Didrik Bergström et.al.	2510.06868	null
2025-10-07	CalibCLIP: Contextual Calibration of Dominant Semantics for Text-Driven Image Retrieval	Bin Kang et.al.	2510.05586	null
2025-10-06	Personalizing Retrieval using Joint Embeddings or “the Return of Fluffy”	Bruno Korbar et.al.	2510.05411	null
2025-10-05	Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition	Yu Kiu et.al.	2510.04282	null
2025-10-04	The Overlooked Value of Test-time Reference Sets in Visual Place Recognition	Mubariz Zaffar et.al.	2510.03751	null
2025-10-03	Team Xiaomi EV-AD VLA: Caption-Guided Retrieval System for Cross-Modal Drone Navigation – Technical Report for IROS 2025 RoboSense Challenge Track 4	Lingfeng Zhang et.al.	2510.02728	null
2025-10-01	A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features	Axel Barroso-Laguna et.al.	2510.00978	null
2025-10-01	Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions	Thanh Nguyen Canh et.al.	2510.00783	null
2025-09-30	Video Object Segmentation-Aware Audio Generation	Ilpo Viertola et.al.	2509.26604	null
2025-09-30	SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval	Ren-Di Wu et.al.	2509.26330	null
2025-09-30	SETR: A Two-Stage Semantic-Enhanced Framework for Zero-Shot Composed Image Retrieval	Yuqi Xiao et.al.	2509.26012	null
2025-09-30	SAGE: Spatial-visual Adaptive Graph Exploration for Visual Place Recognition	Shunpeng Chen et.al.	2509.25723	null
2025-09-29	Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity	Tu-Hoa Pham et.al.	2509.25520	null
2025-09-29	Performance-Efficiency Trade-off for Fashion Image Retrieval	Julio Hurtado et.al.	2509.24477	null
2025-09-28	Prepare for Warp Speed: Sub-millisecond Visual Place Recognition Using Event Cameras	Vignesh Ramanathan et.al.	2509.24094	null
2025-09-27	Terrorism & Democracy in Burkina-Faso	P Carmel Marie Zagre et.al.	2509.23046	null
2025-09-26	Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation	Jinpeng Lu et.al.	2509.22307	null
2025-09-25	Enhancing Contrastive Learning for Geolocalization by Discovering Hard Negatives on Semivariograms	Boyi Chen et.al.	2509.21573	null
2025-09-23	SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment	Binod Singh et.al.	2509.20401	null
2025-09-24	A Versatile Foundation Model for AI-enabled Mammogram Interpretation	Fuxiang Huang et.al.	2509.20271	null
2025-09-23	Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions	Ioanna Ntinou et.al.	2509.19203	link
2025-09-30	OrthoLoC: UAV 6-DoF Localization and Calibration Using Orthographic Geodata	Oussema Dhaouadi et.al.	2509.18350	link
2025-09-21	Learning Attribute-Aware Hash Codes for Fine-Grained Image Retrieval via Query Optimization	Peng Wang et.al.	2509.17049	null
2025-09-20	PM25Vision: A Large-Scale Benchmark Dataset for Visual Estimation of Air Quality	Yang Han et.al.	2509.16519	null
2025-09-25	Efficient Multimodal Dataset Distillation via Generative Models	Zhenghao Zhao et.al.	2509.15472	link
2025-09-18	SERVAL: Surprisingly Effective Zero-Shot Visual Document Retrieval Powered by Large Vision and Language Models	Thong Nguyen et.al.	2509.15432	link
2025-09-18	Assessing metadata privacy in neuroimaging	Emilie Kibsgaard et.al.	2509.15278	null
2025-09-18	PRISM: Product Retrieval In Shopping Carts using Hybrid Matching	Arda Kabadayi et.al.	2509.14985	null
2025-09-18	Chain-of-Thought Re-ranking for Image Retrieval Tasks	Shangrong Wu et.al.	2509.14746	null
2025-09-18	DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising	Li Gao et.al.	2509.14565	null
2025-09-18	Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods	Adam D. Hines et.al.	2509.14516	link
2025-09-17	Hashing-Baseline: Rethinking Hashing in the Age of Pretrained Models	Ilyass Moummad et.al.	2509.14427	link
2025-09-17	CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts	Leonard Hackel et.al.	2509.14104	null
2025-09-16	Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization	Yujia Lin et.al.	2509.13474	null
2025-09-18	MapAnything: Universal Feed-Forward Metric 3D Reconstruction	Nikhil Keetha et.al.	2509.13414	link
2025-09-17	DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval	Zechao Liu et.al.	2509.12824	null
2025-09-16	Ketto and the Science of Giving: A Data-Driven Investigation of Crowdfunding for India	Karuna Chandra et.al.	2509.12616	null
2025-09-15	Bridging Vision Language Models and Symbolic Grounding for Video Question Answering	Haodi Ma et.al.	2509.11862	null
2025-09-14	UnLoc: Leveraging Depth Uncertainties for Floorplan Localization	Matthias Wüest et.al.	2509.11301	null
2025-09-12	A Stochastic Birth-and-Death Approach for Street Furniture Geolocation in Urban Environments	Evan Murphy et.al.	2509.10310	null
2025-09-11	Listening for “You”: Enhancing Speech Image Retrieval via Target Speaker Extraction	Wenhao Yang et.al.	2509.09306	null
2025-09-09	Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark	Yandi Yang et.al.	2509.07362	null
2025-09-08	Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval	Emil Demić et.al.	2509.06566	null
2025-09-06	Augmenting Human-Centered Racial Covenant Detection and Georeferencing with Plug-and-Play NLP Pipelines	Jiyoon Pyo et.al.	2509.05829	null
2025-09-05	Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)	Emanuela Boros et.al.	2509.04948	null
2025-09-05	FloodVision: Urban Flood Depth Estimation Using Foundation Vision-Language Models and Domain Knowledge Graph	Zhangding Liu et.al.	2509.04772	null
2025-09-05	Global-to-Local or Local-to-Global? Enhancing Image Retrieval with Efficient Local Search and Effective Global Re-ranking	Dror Aiger et.al.	2509.04351	null
2025-09-05	GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization	Pengyue Jia et.al.	2509.04334	link
2025-09-04	DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval	Ruohong Yang et.al.	2509.04193	null
2025-09-04	A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning	Qika Lin et.al.	2509.03906	null
2025-09-02	Scale, Don’t Fine-tune: Guiding Multimodal LLMs for Efficient Visual Place Recognition at Test-Time	Jintao Cheng et.al.	2509.02129	null
2025-09-02	Ensemble-Based Event Camera Place Recognition Under Varying Illumination	Therese Joseph et.al.	2509.01968	null
2025-09-01	ConamArray: A 32-Element Broadband MEMS Ultrasound Transducer Array	Dennis Laurijssen et.al.	2509.01372	null
2025-09-01	M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision	Che Liu et.al.	2509.01360	null
2025-09-01	Street-Level Geolocalization Using Multimodal Large Language Models and Retrieval-Augmented Generation	Yunus Serhat Bicakci et.al.	2509.01341	null
2025-09-01	ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization	Thinh-Phuc Nguyen et.al.	2509.01259	null
2025-09-03	Multimodal Iterative RAG for Knowledge Visual Question Answering	Changin Choi et.al.	2509.00798	null
2025-08-31	Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification	Y Hop Nguyen et.al.	2509.00752	null
2025-08-31	EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions	Dinh-Khoi Vo et.al.	2509.00751	null
2025-08-29	Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders	Faizan Farooq Khan et.al.	2509.00177	null
2025-08-29	HCCM: Hierarchical Cross-Granularity Contrastive and Matching Learning for Natural Language-Guided Drones	Hao Ruan et.al.	2508.21539	null
2025-08-27	Disentangling Latent Embeddings with Sparse Linear Concept Subspaces (SLiCS)	Zhi Li et.al.	2508.20322	null
2025-08-27	Low-exposure, high-quality multimodal speckle X-ray imaging via an intrinsic gradient-flow approach	Jayvan Liu et.al.	2508.20209	null
2025-08-27	Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study	Max Torop et.al.	2508.20188	null
2024-09-27	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049	null
2024-08-14	Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN	Hao Li et.al.	2408.06761	null
2024-07-25	MeshVPR: Citywide Visual Place Recognition Using 3D Meshes	Gabriele Berton et.al.	2406.02776	null
2024-04-02	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546	null
2023-09-11	Comparative Study of Visual SLAM-Based Mobile Robot Localization Using Fiducial Markers	Jongwon Lee et.al.	2309.04441	null
2023-08-16	Wide-Area Geolocalization with a Limited Field of View Camera in Challenging Urban Environments	Lena M. Downes et.al.	2308.07432	null
2023-04-18	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426	null
2023-05-19	Wide-Area Geolocalization with a Limited Field of View Camera	Lena M. Downes et.al.	2209.11854	null
2022-08-09	A Survey on Visual Map Localization Using LiDARs and Cameras	Elhousni Mahdi et.al.	2208.03376	null
2022-07-26	ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization	Ivan Cisneros et.al.	2207.12317	null
2022-06-01	Investigating the Role of Image Retrieval for Visual Localization – An exhaustive benchmark	Martin Humenberger et.al.	2205.15761	null
2022-05-25	VPAIR – Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments	Michael Schleiss et.al.	2205.11567	null
2021-05-10	Probabilistic Visual Place Recognition for Hierarchical Localization	Ming Xu et.al.	2105.03091	null
2021-02-26	Scene Retrieval for Contextual Visual Mapping	William H. B. Smith et.al.	2102.12728	null
2020-12-02	Benchmarking Image Retrieval for Visual Localization	Noé Pion et.al.	2011.11946	null
2023-05-02	City-Scale Visual Place Recognition with Deep Local Features Based on Multi-Scale Ordered VLAD Pooling	Duc Canh Le et.al.	2009.09255	null
2019-04-16	Localizing Discriminative Visual Landmarks for Place Recognition	Zhe Xin et.al.	1904.06635	null
2018-09-18	UAV Pose Estimation using Cross-view Geolocalization with Satellite Imagery	Akshay Shetty et.al.	1809.05979	null
2018-05-16	Visual Global Localization with a Hybrid WNN-CNN Approach	Avelino Forechi et.al.	1805.03183	link
2017-04-28	Real-Time Visual Place Recognition for Personal Localization on a Mobile Device	Michał Nowicki et.al.	1611.02061	null

(<a href=#updated-on-20260429>back to top</a>)

Point Cloud Place Recognition

Publish Date	Title	Authors	PDF	Code
2026-04-02	Riemannian and Symplectic Geometry for Hierarchical Text-Driven Place Recognition	Tianyi Shang et.al.	2604.01598	null
2026-03-16	Voronoi-based Second-order Descriptor with Whitened Metric in LiDAR Place Recognition	Jaein Kim et.al.	2603.14974	null
2026-03-09	RLPR: Radar-to-LiDAR Place Recognition via Two-Stage Asymmetric Cross-Modal Alignment for Autonomous Driving	Zhangshuo Qi et.al.	2603.07920	null
2026-03-06	PROBE: Probabilistic Occupancy BEV Encoding with Analytical Translation Robustness for 3D Place Recognition	Jinseop Lee et.al.	2603.05965	null
2026-01-29	Advanced techniques and applications of LiDAR Place Recognition in Agricultural Environments: A Comprehensive Survey	Judith Vilella-Cantos et.al.	2601.22198	null
2026-04-22	Low Cost, High Efficiency: LiDAR Place Recognition in Vineyards with Matryoshka Representation Learning	Judith Vilella-Cantos et.al.	2601.18714	null
2025-12-05	NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections	Juho Korkeala et.al.	2512.05610	null
2025-12-04	A dynamic memory assignment strategy for dilation-based ICP algorithm on embedded GPUs	Qiong Chang et.al.	2512.04996	null
2025-12-04	TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards	Mauro Martini et.al.	2512.04772	null
2025-12-03	DM3D: Deformable Mamba via Offset-Guided Gaussian Sequencing for Point Cloud Understanding	Bin Liu et.al.	2512.03424	null
2025-12-03	What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models	Tianchen Deng et.al.	2512.03422	null
2025-12-02	GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection	Md Sohag Mia et.al.	2512.02991	null
2025-12-02	Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models	Pierpaolo Serio et.al.	2512.02897	null
2025-12-01	Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching	Yue Pan et.al.	2512.01850	null
2025-12-01	RoboLoc: A Benchmark Dataset for Point Place Recognition and Localization in Indoor-Outdoor Integrated Environments	Jaejin Jeon et.al.	2512.01194	null
2025-11-30	LAHNet: Local Attentive Hashing Network for Point Cloud Registration	Wentao Qu et.al.	2512.00927	null
2025-11-27	BrepGPT: Autoregressive B-rep Generation with Voronoi Half-Patch	Pu Li et.al.	2511.22171	null
2025-11-27	Constant-Volume Deformation Manufacturing for Material-Efficient Shaping	Lei Li et.al.	2511.22042	null
2025-11-26	Diagonal Scaling: A Multi-Dimensional Resource Model and Optimization Framework for Distributed Databases	Shahir Abdullah et.al.	2511.21612	null
2025-11-26	PFF-Net: Patch Feature Fitting for Point Cloud Normal Estimation	Qing Li et.al.	2511.21365	null
2025-11-26	$δ$-core subsampling, strong collapses and TDA	Elias Gabriel Minian et.al.	2511.20954	null
2025-11-25	Accelerating Sparse Convolutions in Voxel-Based Point Cloud Networks	Dionysios Adamopoulos et.al.	2511.20834	null
2025-11-25	DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion	Yinghui Li et.al.	2511.20278	link
2025-11-25	FLaTEC: Frequency-Disentangled Latent Triplanes for Efficient Compression of LiDAR Point Clouds	Xiaoge Zhang et.al.	2511.20065	null
2025-11-24	PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion	Yichen Yang et.al.	2511.18801	null
2025-11-23	Object-centric Task Representation and Transfer using Diffused Orientation Fields	Cem Bilaloglu et.al.	2511.18563	null
2025-11-22	Two-step Generalized RBF-Generated Finite Difference Method on Manifolds	Rongji Li et.al.	2511.18049	null
2025-11-21	RL-AD-Net: Reinforcement Learning Guided Adaptive Displacement in Latent Space for Refined Point Cloud Completion	Bhanu Pratap Paregi et.al.	2511.17054	null
2025-11-20	CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering	Joni Vanherck et.al.	2511.16349	null
2025-11-20	Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion	Lirui Zhang et.al.	2511.16161	null
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-19	Atlas Gaussian processes on restricted domains and point clouds	Mu Niu et.al.	2511.15822	null
2025-11-21	The MeerKAT Fornax Survey VI. The collapse of the galaxy HI Mass Function in Fornax	D. Kleiner et.al.	2511.15795	null
2025-11-19	Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2511.15597	null
2025-11-19	Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language	Yan Xia et.al.	2511.15308	null
2025-11-18	NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration	Luohong Wu et.al.	2511.14286	null
2025-11-17	Part-X-MLLM: Part-aware 3D Multimodal Large Language Model	Chunshi Wang et.al.	2511.13647	link
2025-11-18	ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes	Yixuan Yang et.al.	2511.12977	null
2025-11-12	Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement	Lian He et.al.	2511.11702	null
2025-11-18	LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning	Xinran Yang et.al.	2511.10040	null
2025-11-13	AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models	Xinyi Wang et.al.	2511.10017	link
2025-11-12	PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model	Yunqian Cheng et.al.	2511.09724	link
2025-11-12	IFG: Internet-Scale Guidance for Functional Grasping Generation	Ray Muxin Liu et.al.	2511.09558	null
2026-04-09	HOTFLoc++: End-to-End Hierarchical LiDAR Place Recognition, Re-Ranking, and 6-DoF Metric Localisation in Forests	Ethan Griffiths et.al.	2511.09170	null
2025-11-11	Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention Mechanisms	Jiaxun Guo et.al.	2511.08833	null
2025-11-11	Hierarchical Direction Perception via Atomic Dot-Product Operators for Rotation-Invariant Point Clouds Learning	Chenyu Hu et.al.	2511.08240	null
2025-11-11	Accurate and Efficient Surface Reconstruction from Point Clouds via Geometry-Aware Local Adaptation	Eito Ogawa et.al.	2511.08233	null
2025-11-10	Semi-distributed Cross-modal Air-Ground Relative Localization	Weining Lu et.al.	2511.06749	null
2025-11-10	PointCubeNet: 3D Part-level Reasoning with 3x3x3 Point Cloud Blocks	Da-Yeong Kim et.al.	2511.06744	null
2025-11-07	Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments	Laura Alejandra Encinar Gonzalez et.al.	2511.05404	null
2025-11-10	Rethinking Metrics and Diffusion Architecture for 3D Point Cloud Generation	Matteo Bastico et.al.	2511.05308	null
2025-11-07	Implicit reconstruction from point cloud: an adaptive level-set-based semi-Lagrangian method	Silvia Preda et.al.	2511.05145	null
2025-11-04	Curvature of high-dimensional data	Jiayi Chen et.al.	2511.02873	null
2025-11-02	GauDP: Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies	Ziye Wang et.al.	2511.00998	null
2025-11-02	Modeling Microenvironment Trajectories on Spatial Transcriptomics with NicheFlow	Kristiyan Sakalyan et.al.	2511.00977	link
2025-11-04	Towards classification-based representation learning for place recognition on LiDAR scans	Maksim Konoplia et.al.	2511.00738	null
2025-11-01	Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles	Hyungtae Lim et.al.	2511.00635	link
2025-10-31	MambaNetLK: Enhancing Colonoscopy Point Cloud Registration with Mamba	Linzhe Jiang et.al.	2511.00260	null
2025-10-29	Figuring Out Gas & Galaxies In Enzo (FOGGIE) XI: Circumgalactic O VI Emission Traces Clumpy Inflowing Recycled Gas	Cassandra Lochhaas et.al.	2510.25844	null
2025-10-02	LangGrasp: Leveraging Fine-Tuned LLMs for Language Interactive Robot Grasping with Ambiguous Instructions	Yunhan Lin et.al.	2510.02104	null
2025-08-09	LifelongPR: Lifelong point cloud place recognition based on sample replay and prompt learning	Xianghong Zou et.al.	2507.10034	null
2025-08-08	ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models	Minwoo Jung et.al.	2505.18364	null
2025-05-26	MinkUNeXt-SI: Improving point cloud-based place recognition including spherical coordinates and LiDAR intensity	Judith Vilella-Cantos et.al.	2505.17591	null
2025-05-12	Ranking-aware Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2505.07198	null
2025-08-27	OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion	Shuhao Kang et.al.	2504.19258	null
2025-06-19	An Iterative Task-Driven Framework for Resilient LiDAR Place Recognition in Adverse Weather	Xiongwei Zhao et.al.	2504.14806	null
2025-04-16	Diffusion Based Robust LiDAR Place Recognition	Benjamin Krummenacher et.al.	2504.12412	null
2025-10-03	Vehicle-Scene Interaction: A Text-Driven 3D Lidar Place Recognition Method for Autonomous Driving	Tianyi Shang et.al.	2503.18035	null
2025-10-29	L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery	Ziwei Shi et.al.	2503.11245	null
2025-03-21	HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views	Ethan Griffiths et.al.	2503.08140	null
2025-03-06	ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images	Yanqing Shen et.al.	2503.04475	null
2025-03-20	CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework	Yanlong Xu et.al.	2503.02593	null
2025-02-07	HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer	Minwoo Jung et.al.	2501.18943	null
2024-12-20	SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning	Yuhao Li et.al.	2412.15577	null
2025-04-04	PerLA: Perceptive 3D Language Assistant	Guofeng Mei et.al.	2411.19774	link
2024-10-10	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2025-05-19	A Deeper Look into Second-Order Feature Aggregation for LiDAR Place Recognition	Saimunur Rahman et.al.	2409.15919	null
2024-09-06	Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments	Therese Joseph et.al.	2409.03998	null
2024-10-02	Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition	Hogyun Kim et.al.	2408.07330	null
2024-07-31	SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition	Raktim Gautam Goswami et.al.	2407.08260	null
2024-06-21	Voxel-Based Point Cloud Localization for Smart Spaces Management	F. S. Mortazavi et.al.	2406.15110	null
2024-10-09	PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture	T. Barros et.al.	2405.19038	link
2024-05-14	OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition	Qiuchi Xiang et.al.	2405.07966	link
2025-03-14	VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition	Yun-Jin Li et.al.	2403.14594	link
2024-08-30	Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests	Haedam Oh et.al.	2403.14326	null
2024-02-27	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-03-19	HeLiPR: Heterogeneous LiDAR Dataset for inter-LiDAR Place Recognition under Spatiotemporal Variations	Minwoo Jung et.al.	2309.14590	null
2023-08-25	VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition	Gengxuan Tian et.al.	2308.12870	null
2024-09-25	SelFLoc: Selective Feature Fusion for Large-scale Point Cloud-based Place Recognition	Qibo Qiu et.al.	2306.01205	null
2025-06-26	BEVPlace: Learning LiDAR-based Place Recognition using Bird’s Eye View Images	Lun Luo et.al.	2302.14325	null
2023-11-14	Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map	Haodong Yuan et.al.	2206.03062	null
2022-11-30	InCloud: Incremental Learning for Point Cloud Place Recognition	Joshua Knights et.al.	2203.00807	link
2025-06-26	BVMatch: Lidar-based Place Recognition Using Bird’s-eye View Images	Lun Luo et.al.	2109.00317	null
2021-04-15	MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition	Jacek Komorowski et.al.	2104.05327	link
2021-04-23	Robust Place Recognition using an Imaging Lidar	Tixiao Shan et.al.	2103.02111	null
2021-06-21	Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning	Huan Yin et.al.	2102.04960	link
2021-08-05	A Registration-aided Domain Adaptation Network for 3D Point Cloud Based Place Recognition	Zhijian Qiao et.al.	2012.05018	null
2020-08-04	PIC-Net: Point Cloud and Image Collaboration Network for Large-Scale Place Recognition	Yuheng Lu et.al.	2008.00658	null
2020-07-06	LOL: Lidar-Only Odometry and Localization in 3D Point Cloud Maps	David Rozenberszki et.al.	2007.01595	link

(<a href=#updated-on-20260429>back to top</a>)

Cross-modality Localization

Publish Date	Title	Authors	PDF	Code
2026-03-08	TAPFormer: Robust Arbitrary Point Tracking via Transient Asynchronous Fusion of Frames and Events	Jiaxiong Liu et.al.	2603.04989	null
2026-01-07	SpatiaLoc: Leveraging Multi-Level Spatial Enhanced Descriptors for Cross-Modal Localization	Tianyi Shang et.al.	2601.03579	null
2025-12-05	Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures	Amirkia Rafiei Oskooei et.al.	2512.05908	null
2025-12-04	Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models	Manar Alnaasan et.al.	2512.04425	null
2025-12-02	GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection	Md Sohag Mia et.al.	2512.02991	null
2025-12-02	Reasoning-Aware Multimodal Fusion for Hateful Video Detection	Shuonan Yang et.al.	2512.02743	null
2025-12-02	GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization	Zixuan Song et.al.	2512.02697	null
2025-12-01	TBT-Former: Learning Temporal Boundary Distributions for Action Localization	Thisara Rathnayaka et.al.	2512.01298	null
2025-11-29	CourseTimeQA: A Lecture-Video Benchmark and a Latency-Constrained Cross-Modal Fusion Method for Timestamped QA	Vsevolod Kovalev et.al.	2512.00360	null
2025-11-27	MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning	Kyeongha Rho et.al.	2512.00115	null
2025-11-28	Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records	Shiyu Shen et.al.	2511.22958	null
2025-11-27	Enhanced Graph Convolutional Network with Chebyshev Spectral Graph and Graph Attention for Autism Spectrum Disorder Classification	Adnan Ferdous Ashrafi et.al.	2511.22178	link
2025-11-28	Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy	Teng Hu et.al.	2511.21579	null
2025-11-26	Semantic-Enhanced Feature Matching with Learnable Geometric Verification for Cross-Modal Neuron Registration	Wenwei Li et.al.	2511.21452	null
2025-11-25	Prompt-Aware Adaptive Elastic Weight Consolidation for Continual Learning in Medical Vision-Language Models	Ziyuan Gao et.al.	2511.20732	null
2025-11-25	ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis	Advik Sinha et.al.	2511.20274	null
2025-11-25	ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction	Yuanzhe Li et.al.	2511.20020	null
2025-11-24	Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach	Fan Nie et.al.	2511.19080	null
2025-11-24	AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization	Christos Koutlis et.al.	2511.18993	null
2025-11-24	A Theory-Inspired Framework for Few-Shot Cross-Modal Sketch Person Re-Identification	Yunpeng Gong et.al.	2511.18677	null
2025-11-22	CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking	Hao Li et.al.	2511.17967	null
2025-11-21	Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics	Wei Zhang et.al.	2511.17685	null
2025-11-21	Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers	Cris Claessens et.al.	2511.17209	null
2025-11-21	Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition	Aditya Mishra et.al.	2511.17183	null
2025-11-19	Multi-Text Guided Few-Shot Semantic Segmentation	Qiang Jiao et.al.	2511.15515	null
2025-11-19	Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language	Yan Xia et.al.	2511.15308	null
2025-11-18	NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration	Luohong Wu et.al.	2511.14286	null
2025-11-18	SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts	Fan Zhang et.al.	2511.14093	null
2025-11-17	Attention Grounded Enhancement for Visual Document Retrieval	Wanqing Cui et.al.	2511.13415	null
2025-11-17	Uncovering and Mitigating Transient Blindness in Multimodal Model Editing	Xiaoqi Han et.al.	2511.13243	null
2025-11-17	3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale	Yijia Fan et.al.	2511.13211	null
2025-11-17	SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration	Haodong Wang et.al.	2511.13168	null
2025-11-15	FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention	Peng Zhang et.al.	2511.12215	link
2025-11-15	Calibrated Multimodal Representation Learning with Missing Modalities	Xiaohao Liu et.al.	2511.12034	null
2025-11-14	DEFT-LLM: Disentangled Expert Feature Tuning for Micro-Expression Recognition	Ren Zhang et.al.	2511.10948	null
2025-11-13	Frequency-Aware Vision-Language Multimodality Generalization Network for Remote Sensing Image Classification	Junjie Zhang et.al.	2511.10774	null
2025-11-13	URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding	Yongxin Shi et.al.	2511.10552	null
2025-11-13	Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization	Ashutosh Anshul et.al.	2511.10212	null
2025-11-13	HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction	Yueran Zhao et.al.	2511.10211	null
2025-11-13	Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction	Mingda Jia et.al.	2511.10134	null
2025-11-14	FreDFT: Frequency Domain Fusion Transformer for Visible-Infrared Object Detection	Wencong Wu et.al.	2511.10046	null
2025-11-12	BronchOpt: Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation	Hongchao Shu et.al.	2511.09443	null
2025-11-12	xHAP: Cross-Modal Attention for Haptic Feedback Estimation in the Tactile Internet	Georgios Kokkinis et.al.	2511.09137	null
2025-11-11	Multi-modal Deepfake Detection and Localization with FPN-Transformer	Chende Zheng et.al.	2511.08031	null
2025-11-11	Cross Modal Fine-grained Alignment via Granularity-aware and Region-uncertain Modeling	Jiale Liu et.al.	2511.07710	link
2025-11-10	Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding	Yuzhen Li et.al.	2511.06908	null
2025-11-10	Semi-distributed Cross-modal Air-Ground Relative Localization	Weining Lu et.al.	2511.06749	null
2025-11-09	Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation	Tzu-Jung Lin et.al.	2511.06240	null
2025-11-04	C3-Diff: Super-resolving Spatial Transcriptomics via Cross-modal Cross-content Contrastive Diffusion Modelling	Xiaofei Wang et.al.	2511.05571	null
2025-11-06	DINOv2 Driven Gait Representation Learning for Video-Based Visible-Infrared Person Re-identification	Yujie Yang et.al.	2511.04281	null
2025-11-06	CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation	Yuwen Tao et.al.	2511.03992	null
2025-11-04	Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization	Tao Liu et.al.	2511.02489	null
2025-11-03	3EED: Ground Everything Everywhere in 3D	Rong Li et.al.	2511.01755	null
2025-11-03	SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment	Xinyu Mao et.al.	2511.01390	null
2025-11-02	Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya	Hassan Ugail et.al.	2511.01000	null
2025-11-02	VesSAM: Efficient Multi-Prompting for Segmenting Complex Vessel	Suzhong Fu et.al.	2511.00981	null
2025-10-24	A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization	LinFeng Li et.al.	2510.20291	null
2025-10-20	Closed-Loop Transfer for Weakly-supervised Affordance Grounding	Jiajin Tang et.al.	2510.17384	null
2025-09-27	AttAnchor: Guiding Cross-Modal Token Alignment in VLMs with Attention Anchors	Junyang Zhang et.al.	2509.23109	null
2025-09-30	InterKey: Cross-modal Intersection Keypoints for Global Localization on OpenStreetMap	Nguyen Hoang Khoi Tran et.al.	2509.13857	null
2025-12-28	Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval	Hao Yin et.al.	2509.13754	null
2025-09-16	Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization	Yujia Lin et.al.	2509.13474	null
2025-09-12	TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion	Xiaodong Guo et.al.	2509.10005	null
2025-09-09	Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark	Yandi Yang et.al.	2509.07362	null
2025-10-10	SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization	Hanjun Kim et.al.	2506.15175	null
2025-08-27	OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion	Shuhao Kang et.al.	2504.19258	null
2024-12-19	Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration	Ziheng Zhou et.al.	2412.12628	null
2024-12-02	Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures	Qiyuan Shen et.al.	2412.01299	null
2024-11-02	X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios	Yichen Xie et.al.	2411.01123	null
2025-05-12	Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization	Ling Xing et.al.	2409.07967	null
2025-02-24	MambaPlace: Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms	Tianyi Shang et.al.	2408.15740	null
2024-06-26	Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2406.17679	null
2024-05-13	JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation	Xubo Luo et.al.	2405.07429	null
2024-04-27	Instance-free Text to Point Cloud Localization with Relative Position Awareness	Lichao Wang et.al.	2404.17845	null
2024-07-15	SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs	Yang Miao et.al.	2404.00469	null
2024-03-11	LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map	Xinrui Wu et.al.	2403.05002	null
2023-12-27	LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization	Sai Shubodh Puligilla et.al.	2312.16648	null
2023-09-20	Sound Source Localization is All about Cross-Modal Alignment	Arda Senocak et.al.	2309.10724	null
2023-10-17	Counterfactual Cross-modality Reasoning for Weakly Supervised Video Moment Localization	Zezhong Lv et.al.	2308.05648	link
2023-06-06	Energy-Based Models for Cross-Modal Localization using Convolutional Transformers	Alan Wu et.al.	2306.04021	null
2023-05-07	Poses as Queries: Image-to-LiDAR Map Localization with Transformers	Jinyu Miao et.al.	2305.04298	null
2023-03-23	Egocentric Audio-Visual Object Localization	Chao Huang et.al.	2303.13471	null
2023-02-20	Champion Solution for the WSDM2023 Toloka VQA Challenge	Shengyi Gao et.al.	2301.09045	null
2023-01-13	Text to Point Cloud Localization with Relation-Enhanced Transformer	Guangzhi Wang et.al.	2301.05372	link
2022-12-06	Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds	Zhipeng Zhao et.al.	2212.02757	null
2022-10-31	Visual Answer Localization with Cross-modal Mutual Knowledge Transfer	Yixuan Weng et.al.	2210.14823	null
2022-08-04	Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos	Juncheng Li et.al.	2208.01954	null
2023-01-18	CSDN: Cross-modal Shape-transfer Dual-refinement Network for Point Cloud Completion	Zhe Zhu et.al.	2208.00751	null
2022-04-06	Text2Pos: Text-to-Point-Cloud Cross-Modal Localization	Manuel Kolmet et.al.	2203.15125	null
2022-02-15	Visual Sound Localization in the Wild by Cross-Modal Interference Erasing	Xian Liu et.al.	2202.06406	null
2021-08-18	Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences	Hyunjong Park et.al.	2108.07422	null
2021-07-28	Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization	Fa-Ting Hong et.al.	2107.12589	null
2020-09-15	RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization	Niluthpol Chowdhury Mithun et.al.	2009.05695	null

(<a href=#updated-on-20260429>back to top</a>)

3D GS

Publish Date	Title	Authors	PDF	Code
2026-04-28	Generalizable Human Gaussian Splatting via Multi-view Semantic Consistency	Jingi Kim et.al.	2604.25466	null
2026-04-28	GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning	Yufei Jia et.al.	2604.25459	null
2026-04-28	Generalizable 3D Gaussian Splatting enabled Semantic Coding for Real-Time Immersive Video Communications	Dingxi Yang et.al.	2604.25330	null
2026-04-28	Thermodynamic Phase Transitions in Einstein-Maxwell-Scalar-Gauss-Bonnet Gravity	Cristián Erices et.al.	2604.25100	null
2026-04-27	Large-Scale Photogrammetric Documentation of St. John’s Co-Cathedral: A Workflow for Cultural Heritage Preservation	Matthew Kenely et.al.	2604.24316	null
2026-04-27	Light ‘em Up: Enabling Few-Shot Low-Light 3D Gaussian Splatting with Multi-Scale Explicit Retinex Illumination Decoupling	YuHao Yin et.al.	2604.24053	null
2026-04-26	Bringing a Personal Point of View: Evaluating Dynamic 3D Gaussian Splatting for Egocentric Scene Reconstruction	Jan Warchocki et.al.	2604.23803	null
2026-04-26	GS-DOT: Gaussian splatting-based image reconstruction for diffuse optical tomography	Jingjing Jiang et.al.	2604.23675	null
2026-04-26	Spatiotemporal Degradation-Aware 3D Gaussian Splatting for Realistic Underwater Scene Reconstruction	Shaohua Liu et.al.	2604.23551	null
2026-04-24	NRGS: Neural Regularization for Robust 3D Semantic Gaussian Splatting	Zaiyan Yang et.al.	2604.22439	null
2026-04-28	Flow4DGS-SLAM: Optical Flow-Guided 4D Gaussian Splatting SLAM	Yunsong Wang et.al.	2604.22339	null
2026-04-24	EvFlow-GS: Event Enhanced Motion Deblurring with Optical Flow for 3D Gaussian Splatting	Feiyu An et.al.	2604.22183	null
2026-04-24	PAGaS: Pixel-Aligned 1DoF Gaussian Splatting for Depth Refinement	David Recasens et.al.	2604.22129	null
2026-04-20	High-Fidelity 3D Gaussian Human Reconstruction via Region-Aware Initialization and Geometric Priors	Yang Liu et.al.	2604.21714	null
2026-04-23	DualSplat: Robust 3D Gaussian Splatting via Pseudo-Mask Bootstrapping from Reconstruction Failures	Xu Wang et.al.	2604.21631	null
2026-04-24	You Only Gaussian Once: Controllable 3D Gaussian Splatting for Ultra-Densely Sampled Scenes	Jinrang Jia et.al.	2604.21400	null
2026-04-23	WildSplatter: Feed-forward 3D Gaussian Splatting with Appearance Control from Unconstrained Images	Yuki Fujimura et.al.	2604.21182	null
2026-04-22	GSCompleter: A Distillation-Free Plugin for Metric-Aware 3D Gaussian Splatting Completion in Seconds	Ao Gao et.al.	2604.20155	null
2026-04-23	Gaussians on a Diet: High-Quality Memory-Bounded 3D Gaussian Splatting Training	Yangming Zhang et.al.	2604.20046	null
2026-04-21	FluSplat: Sparse-View 3D Editing without Test-Time Optimization	Haitao Huang et.al.	2604.20038	null
2026-04-21	Camera Control for Text-to-Image Generation via Learning Viewpoint Tokens	Xinxuan Lu et.al.	2604.19954	null
2026-04-21	TransSplat: Unbalanced Semantic Transport for Language-Driven 3DGS Editing	Yanhui Chen et.al.	2604.19571	null
2026-04-21	An Object-Centered Data Acquisition Method for 3D Gaussian Splatting using Mobile Phones	Yuezhe Zhang et.al.	2604.19216	null
2026-04-21	SketchFaceGS: Real-Time Sketch-Driven Face Editing and Generation with Gaussian Splatting	Bo Li et.al.	2604.19202	null
2026-04-21	BALTIC: A Benchmark and Cross-Domain Strategy for 3D Reconstruction Across Air and Underwater Domains Under Varying Illumination	Michele Grimaldi et.al.	2604.19133	null
2026-04-21	OT-UVGS: Revisiting UV Mapping for Gaussian Splatting as a Capacity Allocation Problem	Byunghyun Kim et.al.	2604.19127	null
2026-04-21	AdaGScale: Viewpoint-Adaptive Gaussian Scaling in 3D Gaussian Splatting to Reduce Gaussian-Tile Pairs	Joongho Jo et.al.	2604.18980	null
2026-04-20	A Comparative Evaluation of Geometric Accuracy in NeRF and Gaussian Splatting	Mikolaj Zielinski et.al.	2604.18205	null
2026-04-20	Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs	Samuel G. Balter et.al.	2604.18203	null
2026-04-20	GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting	Mingyu Shi et.al.	2604.18047	null
2026-04-23	E3VS-Bench: A Benchmark for Viewpoint-Dependent Active Perception in 3D Gaussian Splatting Scenes	Koya Sakamoto et.al.	2604.17969	null
2026-04-20	Voronoi-guided Bilateral 2D Gaussian Splatting for Arbitrary-Scale Hyperspectral Image Super-Resolution	Jie Zhang et.al.	2604.17727	null
2026-04-18	Instant Colorization of Gaussian Splats	Daniel Lieber et.al.	2604.17155	null
2026-04-18	mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval	Kyeong Seon Kim et.al.	2604.17054	null
2026-04-18	LAGS: Low-Altitude Gaussian Splatting with Groupwise Heterogeneous Graph Learning	Yikun Wang et.al.	2604.16910	null
2026-04-17	Incoherent Deformation, Not Capacity: Diagnosing and Mitigating Overfitting in Dynamic Gaussian Splatting	Ahmad Droby et.al.	2604.16747	null
2026-04-17	Neural Gabor Splatting: Enhanced Gaussian Splatting with Neural Gabor for High-frequency Surface Reconstruction	Haato Watanabe et.al.	2604.15941	null
2026-04-17	CLOTH-HUGS: Cloth Aware Human Gaussian Splatting	Sadia Mubashshira et.al.	2604.15875	null
2026-04-17	Splats in Splats++: Robust and Generalizable 3D Gaussian Splatting Steganography	Yijia Guo et.al.	2604.15862	null
2026-04-17	GaussianFlow SLAM: Monocular Gaussian Splatting SLAM Guided by GaussianFlow	Dong-Uk Seo et.al.	2604.15612	null
2026-04-17	GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens	Roni Itkin et.al.	2604.15284	null
2026-04-16	TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens	Jiawei Ren et.al.	2604.15239	null
2026-04-16	One-shot Compositional 3D Head Avatars with Deformable Hair	Yuan Sun et.al.	2604.14782	null
2026-04-16	NG-GS: NeRF-Guided 3D Gaussian Splatting Segmentation	Yi He et.al.	2604.14706	null
2026-04-15	HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds	Team HY-World et.al.	2604.14268	null
2026-04-15	ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction	Jie Liang et.al.	2604.13746	null
2026-04-18	Dehaze-then-Splat: Generative Dehazing with Physics-Informed 3D Gaussian Splatting for Smoke-Free Novel View Synthesis	Boss Chen et.al.	2604.13589	null
2026-04-15	RadarSplat-RIO: Indoor Radar-Inertial Odometry with Gaussian Splatting-Based Radar Bundle Adjustment	Pou-Chun Kung et.al.	2604.13492	null
2026-04-15	DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis	Cheng-You Lu et.al.	2604.13416	null
2026-04-14	MSGS: Multispectral 3D Gaussian Splatting	Iris Zheng et.al.	2604.13340	null
2026-04-14	SSD-GS: Scattering and Shadow Decomposition for Relightable 3D Gaussian Splatting	Iris Zheng et.al.	2604.13333	null
2026-04-14	PatchPoison: Poisoning Multi-View Datasets to Degrade 3D Reconstruction	Prajas Wadekar et.al.	2604.13153	null
2026-04-14	GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts	Amir Hossein Kargaran et.al.	2604.12978	null
2026-04-14	RMGS-SLAM: Real-time Multi-sensor Gaussian Splatting SLAM	Dongen Li et.al.	2604.12942	null
2026-04-14	GGD-SLAM: Monocular 3DGS SLAM Powered by Generalizable Motion Model for Dynamic Environments	Yi Liu et.al.	2604.12837	null
2026-04-14	Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting	Ziyuan Xia et.al.	2604.12626	null
2026-04-14	ELoG-GS: Dual-Branch Gaussian Splatting with Luminance-Guided Enhancement for Extreme Low-light 3D Reconstruction	Yuhao Liu et.al.	2604.12592	null
2026-04-16	PDF-GS: Progressive Distractor Filtering for Robust 3D Gaussian Splatting	Kangmin Seo et.al.	2604.12580	null
2026-04-14	ArtifactWorld: Scaling 3D Gaussian Splatting Artifact Restoration via Video Generation Models	Xinliang Wang et.al.	2604.12251	null
2026-04-14	VVGT: Visual Volume-Grounded Transformer	Yuxuan Wang et.al.	2604.12217	null
2026-04-13	ReefMapGS: Enabling Large-Scale Underwater Reconstruction by Closing the Loop Between Multimodal SLAM and Gaussian Splatting	Daniel Yang et.al.	2604.11992	null
2026-04-13	Unfolding 3D Gaussian Splatting via Iterative Gaussian Synopsis	Yuqin Lu et.al.	2604.11685	null
2026-04-13	GS4City: Hierarchical Semantic Gaussian Splatting via City-Model Priors	Qilin Zhang et.al.	2604.11401	null
2026-04-13	Naka-GS: A Bionics-inspired Dual-Branch Naka Correction and Progressive Point Pruning for Low-Light 3DGS	Runyu Zhu et.al.	2604.11142	null
2026-04-13	ViserDex: Visual Sim-to-Real for Robust Dexterous In-hand Reorientation	Arjun Bhardwaj et.al.	2604.11138	null
2026-04-13	Efficient Transceiver Design for Aerial Image Transmission and Large-scale Scene Reconstruction	Zeyi Ren et.al.	2604.11098	null
2026-04-13	LumiMotion: Improving Gaussian Relighting with Scene Dynamics	Joanna Kaleta et.al.	2604.10994	null
2026-04-13	Ψ-Map: Panoptic Surface Integrated Mapping Enables Real2Sim Transfer	Xuan Yu et.al.	2604.10982	null
2026-04-13	Fast-SegSim: Real-Time Open-Vocabulary Segmentation for Robotics in Simulation	Xuan Yu et.al.	2604.10951	null
2026-04-14	STGV: Spatio-Temporal Hash Encoding for Gaussian-based Video Representation	Jierun Lin et.al.	2604.10910	null
2026-04-12	WARPED: Wrist-Aligned Rendering for Robot Policy Learning from Egocentric Human Demonstrations	Harry Freeman et.al.	2604.10809	null
2026-04-12	MonoEM-GS: Monocular Expectation-Maximization Gaussian Splatting SLAM	Evgenii Kruzhkov et.al.	2604.10593	null
2026-04-12	Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models	Dehui Wang et.al.	2604.10578	null
2026-04-12	Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images	Bo Zhou et.al.	2604.10573	null
2026-04-12	FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation	Chenhan Jiang et.al.	2604.10512	null
2026-04-11	Real-Time Human Reconstruction and Animation using Feed-Forward Gaussian Splatting	Devdoot Chatterjee et.al.	2604.10259	null
2026-04-11	A 129FPS Full HD Real-Time Accelerator for 3D Gaussian Splatting	Fang-Chi Chang et.al.	2604.10223	null
2026-04-10	PointSplat: Efficient Geometry-Driven Pruning and Transformer Refinement for 3D Gaussian Splatting	Anh Thuan Tran et.al.	2604.09903	null
2026-04-10	F3G-Avatar: Face Focused Full-body Gaussian Avatar	Willem Menu et.al.	2604.09835	null
2026-04-10	Structure-Aware Fine-Grained Gaussian Splatting for Expressive Avatar Reconstruction	Yuze Su et.al.	2604.09324	null
2026-04-10	A GPU-enhanced workflow for non-Fourier SENSE reconstruction	Samuel Bianchi et.al.	2604.09233	null
2026-04-10	Scene-Agnostic Object-Centric Representation Learning for 3D Gaussian Splatting	Tsuheng Hsu et.al.	2604.09045	null
2026-04-10	AudioGS: Spectrogram-Based Audio Gaussian Splatting for Sound Field Reconstruction	Chunhao Bi et.al.	2604.08967	null
2026-04-09	SIC3D: Style Image Conditioned Text-to-3D Gaussian Splatting Generation	Ming He et.al.	2604.08760	null
2026-04-09	BLaDA: Bridging Language to Functional Dexterous Actions within 3DGS Fields	Fan Yang et.al.	2604.08410	null
2026-04-09	SurfelSplat: Learning Efficient and Generalizable Gaussian Surfel Representations for Sparse-View Surface Reconstruction	Chensheng Dai et.al.	2604.08370	null
2026-04-10	Generative 3D Gaussian Splatting for Arbitrary-Resolution Atmospheric Downscaling and Forecasting	Tao Han et.al.	2604.07928	null
2026-04-09	ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video	Boyuan Wang et.al.	2604.07882	null
2026-04-09	GEAR: GEometry-motion Alternating Refinement for Articulated Object Modeling with Gaussian Splatting	Jialin Li et.al.	2604.07728	null
2026-04-08	From Blobs to Spokes: High-Fidelity Surface Reconstruction via Oriented Gaussians	Diego Gomez et.al.	2604.07337	null
2026-04-08	Splats under Pressure: Exploring Performance-Energy Trade-offs in Real-Time 3D Gaussian Splatting under Constrained GPU Budgets	Muhammad Fahim Tajwar et.al.	2604.07177	null
2026-04-08	Genie Sim PanoRecon: Fast Immersive Scene Generation from Single-View Panorama	Zhijun Li et.al.	2604.07105	null
2026-04-08	Radio-Frequency Inverse Rendering for Wireless Environment Modeling	Fuhai Wang et.al.	2604.07086	null
2026-04-09	AnchorSplat: Feed-Forward 3D Gaussian Splatting with 3D Geometric Priors	Xiaoxue Zhang et.al.	2604.07053	null
2026-04-08	DOC-GS: Dual-Domain Observation and Calibration for Reliable Sparse-View Gaussian Splatting	Hantang Li et.al.	2604.06739	null
2026-04-08	4D Vessel Reconstruction for Benchtop Thrombectomy Analysis	Ethan Nguyen et.al.	2604.06671	link
2026-04-07	GS-Surrogate: Deformable Gaussian Splatting for Parameter Space Exploration of Ensemble Simulations	Ziwei Li et.al.	2604.06358	null
2026-04-07	Appearance Decomposition Gaussian Splatting for Multi-Traversal Reconstruction	Yangyi Xiao et.al.	2604.05908	null
2026-04-07	GaussianGrow: Geometry-aware Gaussian Growing from 3D Point Clouds with Text Guidance	Weiqi Zhang et.al.	2604.05721	null
2026-04-07	In Depth We Trust: Reliable Monocular Depth Supervision for Gaussian Splatting	Wenhui Xiao et.al.	2604.05715	null
2026-04-07	3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models	Xinye Zheng et.al.	2604.05687	null
2026-04-07	PanopticQuery: Unified Query-Time Reasoning for 4D Scenes	Ruilin Tang et.al.	2604.05638	null
2026-04-07	LSGS-Loc: Towards Robust 3DGS-Based Visual Localization for Large-Scale UAV Scenarios	Xiang Zhang et.al.	2604.05402	null
2026-04-07	3DTurboQuant: Training-Free Near-Optimal Quantization for 3D Reconstruction Models	Jae Joong Lee et.al.	2604.05366	null
2026-04-07	Indoor Asset Detection in Large Scale 360° Drone-Captured Imagery via 3D Gaussian Splatting	Monica Tang et.al.	2604.05316	null
2026-04-07	SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration	Xueming Fu et.al.	2604.05301	null
2026-04-06	GaussFly: Contrastive Reinforcement Learning for Visuomotor Policies in 3D Gaussian Fields	Yuhang Zhang et.al.	2604.05062	null
2026-04-06	AvatarPointillist: AutoRegressive 4D Gaussian Avatarization	Hongyu Liu et.al.	2604.04787	link
2026-04-06	3D Gaussian Splatting for Annular Dark Field Scanning Transmission Electron Microscopy Tomography Reconstruction	Beiyuan Zhang et.al.	2604.04693	null
2026-04-07	PR-IQA: Partial-Reference Image Quality Assessment for Diffusion-Based Novel View Synthesis	Inseong Choi et.al.	2604.04576	link
2026-04-06	GA-GS: Generation-Assisted Gaussian Splatting for Static Scene Reconstruction	Yedong Shen et.al.	2604.04331	null
2026-04-05	4C4D: 4 Camera 4D Gaussian Splatting	Junsheng Zhou et.al.	2604.04063	link
2026-04-05	HOIGS: Human-Object Interaction Gaussian Splatting	Taewoo Kim et.al.	2604.04016	null
2026-04-04	M2StyleGS: Multi-Modality 3D Style Transfer with Gaussian Splatting	Xingyu Miao et.al.	2604.03773	null
2026-04-04	CGHair: Compact Gaussian Hair Reconstruction with Card Clustering	Haimin Luo et.al.	2604.03716	null
2026-04-03	SpectralSplat: Appearance-Disentangled Feed-Forward Gaussian Splatting for Driving Scenes	Quentin Herau et.al.	2604.03462	null
2026-04-03	Flash-Mono: Feed-Forward Accelerated Gaussian Splatting Monocular SLAM	Zicheng Zhang et.al.	2604.03092	link
2026-04-03	SparseSplat: Towards Applicable Feed-Forward 3D Gaussian Splatting with Pixel-Unaligned Prediction	Zicheng Zhang et.al.	2604.03069	link
2026-04-03	Rendering Multi-Human and Multi-Object with 3D Gaussian Splatting	Weiquan Wang et.al.	2604.02996	null
2026-04-03	GP-4DGS: Probabilistic 4D Gaussian Splatting from Monocular Video via Variational Gaussian Processes	Mijeong Kim et.al.	2604.02915	null
2026-04-03	Streaming Real-Time Rendered Scenes as 3D Gaussians	Matti Siekkinen et.al.	2604.02851	null
2026-04-03	NavCrafter: Exploring 3D Scenes from a Single Image	Hongbo Duan et.al.	2604.02828	null
2026-04-03	UNICA: A Unified Neural Framework for Controllable 3D Avatars	Jiahe Zhu et.al.	2604.02799	null
2026-04-03	DynFOA: Generating First-Order Ambisonics with Conditional Diffusion for Dynamic and Acoustically Complex 360-Degree Videos	Ziyu Luo et.al.	2604.02781	null
2026-04-03	Differentiable Stroke Planning with Dual Parameterization for Efficient and High-Fidelity Painting Creation	Jinfan Liu et.al.	2604.02752	null
2026-04-03	VBGS-SLAM: Variational Bayesian Gaussian Splatting Simultaneous Localization and Mapping	Yuhan Zhu et.al.	2604.02696	null
2026-04-02	TrackerSplat: Exploiting Point Tracking for Fast and Robust Dynamic 3D Gaussians Reconstruction	Daheng Yin et.al.	2604.02586	null
2026-04-02	GEMM-GS: Accelerating 3D Gaussian Splatting on Tensor Cores with GEMM-Compatible Blending	Haomin Li et.al.	2604.02120	null
2026-04-02	ProDiG: Progressive Diffusion-Guided Gaussian Splatting for Aerial to Ground Reconstruction	Sirshapan Mitra et.al.	2604.02003	null
2026-04-02	Resonance4D: Frequency-Domain Motion Supervision for Preset-Free Physical Parameter Learning in 4D Dynamic Physical Scene Simulation	Changshe Zhang et.al.	2604.01994	null
2026-04-02	GS^2: Graph-based Spatial Distribution Optimization for Compact 3D Gaussian Splatting	Xianben Yang et.al.	2604.01884	null
2026-04-02	FaCT-GS: Fast and Scalable CT Reconstruction with Gaussian Splatting	Pawel Tomasz Pieta et.al.	2604.01844	null
2026-04-02	Director: Instance-aware Gaussian Splatting for Dynamic Scene Modeling and Understanding	Yuheng Jiang et.al.	2604.01678	null
2026-04-02	F3DGS: Federated 3D Gaussian Splatting for Decentralized Multi-Agent World Modeling	Morui Zhu et.al.	2604.01605	null
2026-04-03	Satellite-Free Training for Drone-View Geo-Localization	Tao Liu et.al.	2604.01581	null
2026-04-02	ColorGradedGaussians: Palette-Based Color Grading for 3D Gaussian Splatting via View-Space Sparse Decomposition	Cheng-Kang Ted Chao et.al.	2604.01551	null
2026-04-01	Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars	Derek Austin et.al.	2604.01447	null
2026-04-01	LESV: Language Embedded Sparse Voxel Fusion for Open-Vocabulary 3D Scene Understanding	Fusang Wang et.al.	2604.01388	null
2026-04-01	Neural Harmonic Textures for High-Quality Primitive Based Neural Reconstruction	Jorge Condor et.al.	2604.01204	null
2026-04-01	Diff3R: Feed-forward 3D Gaussian Splatting with Uncertainty-aware Differentiable Optimization	Yueh-Cheng Liu et.al.	2604.01030	null
2026-04-01	Autoregressive Appearance Prediction for 3D Gaussian Avatars	Michael Steiner et.al.	2604.00928	null
2026-04-01	Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM	Monica M. Q. Li et.al.	2604.00804	null
2026-04-01	DirectFisheye-GS: Enabling Native Fisheye Input in Gaussian Splatting with Cross-View Joint Optimization	Zhengxian Yang et.al.	2604.00648	null
2026-04-01	TRiGS: Temporal Rigid-Body Motion for Scalable 4D Gaussian Splatting	Suwoong Yeom et.al.	2604.00538	null
2026-04-01	RT-GS: Gaussian Splatting with Reflection and Transmittance Primitives	Kunnong Zeng et.al.	2604.00509	null
2026-04-01	ARGS: Auto-Regressive Gaussian Splatting via Parallel Progressive Next-Scale Prediction	Quanyuan Ruan et.al.	2604.00494	null
2026-03-31	GRVS: a Generalizable and Recurrent Approach to Monocular Dynamic View Synthesis	Thomas Tanay et.al.	2603.29734	null
2026-03-31	Adversarial Prompt Injection Attack on Multimodal Large Language Models	Meiwen Ding et.al.	2603.29418	null
2026-03-31	AA-Splat: Anti-Aliased Feed-forward Gaussian Splatting	Taewoo Suh et.al.	2603.29394	null
2026-03-31	MotionScale: Reconstructing Appearance, Geometry, and Motion of Dynamic Scenes with Scalable 4D Gaussian Splatting	Haoran Zhou et.al.	2603.29296	null
2026-03-31	LightHarmony3D: Harmonizing Illumination and Shadows for Object Insertion in 3D Gaussian Splatting	Tianyu Huang et.al.	2603.29209	link
2026-03-31	Efficient Camera Pose Augmentation for View Generalization in Robotic Policy Learning	Sen Wang et.al.	2603.29192	null
2026-03-31	Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting	Huaqi Tao et.al.	2603.29185	null
2026-03-31	LG-HCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting	Xuan Deng et.al.	2603.28431	null
2026-03-30	ObjectMorpher: 3D-Aware Image Editing via Deformable 3DGS Models	Yuhuan Xie et.al.	2603.28152	null
2026-03-30	SVGS: Single-View to 3D Object Editing via Gaussian Splatting	Pengcheng Xue et.al.	2603.28126	null
2026-03-30	4DSurf: High-Fidelity Dynamic Scene Surface Reconstruction	Renjie Wu et.al.	2603.28064	null
2026-03-30	Physically Inspired Gaussian Splatting for HDR Novel View Synthesis	Huimin Zeng et.al.	2603.28020	null
2026-03-29	GS3LAM: Gaussian Semantic Splatting SLAM	Linfei Li et.al.	2603.27781	null
2026-03-31	SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering	Jiahao Niu et.al.	2603.27516	null
2026-03-28	DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification	Kenji Tojo et.al.	2603.27151	null
2026-03-26	arg-VU: Affordance Reasoning with Physics-Aware 3D Geometry for Visual Understanding in Robotic Surgery	Nan Xiao et.al.	2603.26814	null
2026-03-27	Detailed Geometry and Appearance from Opportunistic Motion	Ryosuke Hirai et.al.	2603.26665	null
2026-03-27	Drive-Through 3D Vehicle Exterior Reconstruction via Dynamic-Scene SfM and Distortion-Aware Gaussian Splatting	Nitin Kulkarni et.al.	2603.26638	null
2026-03-27	Scene Grounding In the Wild	Tamir Cohen et.al.	2603.26584	null
2026-03-27	GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport	Youngju Na et.al.	2603.26181	null
2026-03-27	R-PGA: Robust Physical Adversarial Camouflage Generation via Relightable 3D Gaussian Splatting	Tianrui Lou et.al.	2603.26067	null
2026-03-26	Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting	Yixing Lao et.al.	2603.25745	null
2026-03-26	ViewSplat: View-Adaptive Dynamic Gaussian Splatting for Feed-Forward Synthesis	Moonyeon Jeong et.al.	2603.25265	null
2026-03-26	AirSplat: Alignment and Rating for Robust Feed-Forward 3D Gaussian Splatting	Minh-Quan Viet Bui et.al.	2603.25129	null
2026-03-26	Learning Explicit Continuous Motion Representation for Dynamic Gaussian Splatting from Monocular Videos	Xuankai Zhang et.al.	2603.25058	null
2026-03-27	GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator	Liyuan Zhu et.al.	2603.25053	null
2026-03-26	MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes	Wonjoon Lee et.al.	2603.25042	null
2026-03-26	π, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation	Johnathan Tucker et.al.	2603.25038	null
2026-03-26	Relaxed Rigidity with Ray-based Grouping for Dynamic Gaussian Splatting	Junoh Leea et.al.	2603.24994	null
2026-03-25	Confidence-Based Mesh Extraction from 3D Gaussians	Lukas Radl et.al.	2603.24725	null
2026-03-25	Accurate Point Measurement in 3DGS – A New Alternative to Traditional Stereoscopic-View Based Measurements	Deyan Deng et.al.	2603.24716	null
2026-03-25	ViHOI: Human-Object Interaction Synthesis with Visual Priors	Songjin Cai et.al.	2603.24383	null
2026-03-25	SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision	Avigail Cohen Rimon et.al.	2603.24036	null
2026-03-25	FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting	Yixian Wang et.al.	2603.23891	null
2026-03-24	AdvSplat: Adversarial Attacks on Feed-Forward Gaussian Splatting Models	Yiran Qiao et.al.	2603.23686	null
2026-03-26	Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting	Peiyu Xu et.al.	2603.23637	null
2026-03-26	Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors	Chuanqing Zhuang et.al.	2603.23324	null
2026-03-23	Drop-In Perceptual Optimization for 3D Gaussian Splatting	Ezgi Ozyilkan et.al.	2603.23297	null
2026-03-24	GTLR-GS: Geometry-Texture Aware LiDAR-Regularized 3D Gaussian Splatting for Realistic Scene Reconstruction	Yan Fang et.al.	2603.23192	null
2026-03-24	PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding	Lirong Che et.al.	2603.22796	null
2026-03-25	Instrument-Splatting++: Towards Controllable Surgical Instrument Digital Twin Using Gaussian Splatting	Shuojue Yang et.al.	2603.22792	null
2026-03-24	Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis	Chamuditha Jayanga Galappaththige et.al.	2603.22786	null
2026-03-23	FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario	Hang Dai et.al.	2603.22102	null
2026-03-23	Fast undersampled dynamic MRI reconstruction using explicit representation learning with Gaussian splatting	M. L. Terpstra et.al.	2603.21980	null
2026-03-23	Cross-Instance Gaussian Splatting Registration via Geometry-Aware Feature-Guided Alignment	Roy Amoyal et.al.	2603.21936	null
2026-03-23	Camera-Agnostic Pruning of 3D Gaussian Splats via Descriptor-Based Beta Evidence	Peter Fasogbon et.al.	2603.21933	null
2026-03-23	RefracGS: Novel View Synthesis Through Refractive Water Surfaces with 3D Gaussian Ray Tracing	Yiming Shao et.al.	2603.21695	null
2026-03-22	EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization	Haolan Xu et.al.	2603.21332	null
2026-03-25	F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting	Injae Kim et.al.	2603.21304	null
2026-03-22	CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs	Shanmukha Vellamcheti et.al.	2603.21114	null
2026-03-24	2Xplat: Two Experts Are Better Than One Generalist	Hwasik Jeong et.al.	2603.21064	null
2026-03-22	SGAD-SLAM: Splatting Gaussians at Adjusted Depth for Better Radiance Fields in RGBD SLAM	Pengchong Hu et.al.	2603.21055	null
2026-03-21	Fast and Robust Deformable 3D Gaussian Splatting	Han Jiao et.al.	2603.20857	null
2026-03-21	The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting	Ivan Desiatov et.al.	2603.20714	null
2026-03-21	GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction	Di Kong et.al.	2603.20611	null
2026-03-20	Nevis Digital Twin: Photogrammetry and Immersive Visualization of Historical Sites	Alex Apffel et.al.	2603.20560	null
2026-03-20	TRGS-SLAM: IMU-Aided Gaussian Splatting SLAM for Blurry, Rolling Shutter, and Noisy Thermal Images	Spencer Carmichael et.al.	2603.20443	null
2026-03-20	Fourier Splatting: Generalized Fourier encoded primitives for scalable radiance fields	Mihnea-Bogdan Jurca et.al.	2603.19834	null
2026-03-20	HUGE-Bench: A Benchmark for High-Level UAV Vision-Language-Action Tasks	Jingyu Guo et.al.	2603.19822	null
2026-03-20	3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction	Takeshi Noda et.al.	2603.19682	null
2026-03-20	StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention	Zhongrui Yu et.al.	2603.19552	null
2026-03-20	Matryoshka Gaussian Splatting	Zhilin Guo et.al.	2603.19234	null
2026-03-19	Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting	Yiren Lu et.al.	2603.19193	null
2026-03-19	GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning	Yiren Lu et.al.	2603.19137	null
2026-03-19	GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting	Ahmed Tawfik Aboukhadra et.al.	2603.18912	null
2026-03-19	From ex(p) to poly: Gaussian Splatting with Polynomial Kernels	Joerg H. Mueller et.al.	2603.18707	null
2026-03-19	OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting	Hongjia Zhai et.al.	2603.18510	null
2026-03-19	Inst4DGS: Instance-Decomposed 4D Gaussian Splatting with Multi-Video Label Permutation Learning	Yonghan Lee et.al.	2603.18402	null
2026-03-18	Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting	Guillem Casadesus Vila et.al.	2603.18218	null
2026-03-18	AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors	Aymen Mir et.al.	2603.17975	null
2026-03-18	CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image	Yizheng Song et.al.	2603.17779	null
2026-03-18	ReLaGS: Relational Language Gaussian Splatting	Yaxu Xie et.al.	2603.17605	null
2026-03-18	UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images	Guibiao Liao et.al.	2603.17519	null
2026-03-18	A Tutorial on Learning-Based Radio Map Construction: Data, Paradigms, and Physics-Awareness	Xiucheng Wang et.al.	2603.17499	null
2026-03-18	Adaptive Anchor Policies for Efficient 4D Gaussian Streaming	Ashim Dahal et.al.	2603.17227	null
2026-03-17	SMAL-pets: SMAL Based Avatars of Pets from Single Image	Piotr Borycki et.al.	2603.17131	null
2026-03-16	KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition	Yuhan Chen et.al.	2603.16943	link
2026-03-17	M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM	Kerui Ren et.al.	2603.16844	null
2026-03-17	Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty	Mangyu Kong et.al.	2603.16538	null
2026-03-17	Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation	Yiming Huang et.al.	2603.16211	null
2026-03-17	NanoGS: Training-Free Gaussian Splat Simplification	Butian Xiong et.al.	2603.16103	null
2026-03-16	Feed-forward Gaussian Registration for Head Avatar Creation and Editing	Malte Prinzler et.al.	2603.15811	null
2026-03-16	IRIS: Intersection-aware Ray-based Implicit Editable Scenes	Grzegorz Wilczyński et.al.	2603.15368	null
2026-03-16	NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation	Jiahang Liu et.al.	2603.15186	null
2026-03-16	GeoNVS: Geometry Grounded Video Diffusion for Novel View Synthesis	Minjun Kang et.al.	2603.14965	null
2026-03-16	LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision	Yiming Huang et.al.	2603.14763	null
2026-03-16	E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction	Yunsoo Kim et.al.	2603.14684	null
2026-03-15	Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting	Shuai Guo et.al.	2603.14316	null
2026-03-15	In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels	Tomislav Medic et.al.	2603.14309	null
2026-03-15	4D Synchronized Fields: Motion-Language Gaussian Splatting for Temporal Scene Understanding	Mohamed Rayan Barhdadi et.al.	2603.14301	null
2026-03-15	S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction	Renhe Zhang et.al.	2603.14232	null
2026-03-14	PhyGaP: Physically-Grounded Gaussians with Polarization Cues	Jiale Wu et.al.	2603.14001	null
2026-03-14	Scene Generation at Absolute Scale: Utilizing Semantic and Geometric Guidance From Text for Accurate and Interpretable 3D Indoor Scene Generation	Stefan Ainetter et.al.	2603.13910	null
2026-03-14	RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting	Xuezhen Wang et.al.	2603.13783	null
2026-03-13	NumColor: Precise Numeric Color Control in Text-to-Image Generation	Muhammad Atif Butt et.al.	2603.13547	null
2026-03-13	SldprtNet: A Large-Scale Multimodal Dataset for CAD Generation in Language-Driven 3D Design	Ruogu Li et.al.	2603.13098	null
2026-03-13	Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting	Yang Chen et.al.	2603.12796	null
2026-03-13	LR-SGS: Robust LiDAR-Reflectance-Guided Salient Gaussian Splatting for Self-Driving Scene Reconstruction	Ziyu Chen et.al.	2603.12647	null
2026-03-12	RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution	Ali Mosleh et.al.	2603.12493	null
2026-03-12	AstroSplat: Physics-Based Gaussian Splatting for Rendering and Reconstruction of Small Celestial Bodies	Jennifer Nolan et.al.	2603.11969	null
2026-03-12	Mango-GS: Enhancing Spatio-Temporal Consistency in Dynamic Scenes Reconstruction using Multi-Frame Node-Guided 4D Gaussian Splatting	Tingxuan Huang et.al.	2603.11543	null
2026-03-12	Mobile-GS: Real-time Gaussian Splatting for Mobile Devices	Xiaobiao Du et.al.	2603.11531	null
2026-03-11	InstantHDR: Single-forward Gaussian Splatting for High Dynamic Range 3D Reconstruction	Dingqiang Ye et.al.	2603.11298	null
2026-03-11	S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs	Yuzhou Ji et.al.	2603.10893	link
2026-03-11	PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction	Yufei Han et.al.	2603.10801	null
2026-03-11	Splat2Real: Novel-view Scaling for Physical AI with 3D Gaussian Splatting	Hansol Lim et.al.	2603.10638	null
2026-03-11	P-GSVC: Layered Progressive 2D Gaussian Splatting for Scalable Image and Video	Longan Wang et.al.	2603.10551	null
2026-03-12	SignSparK: Efficient Multilingual Sign Language Production via Sparse Keyframe Learning	Jianhe Low et.al.	2603.10446	null
2026-03-10	ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare	Freeman Cheng et.al.	2603.09968	null
2026-03-10	GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System	Zhiye Tang et.al.	2603.09718	null
2026-03-10	ProGS: Towards Progressive Coding for 3D Gaussian Splatting	Zhiye Tang et.al.	2603.09703	null
2026-03-10	VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM	Anh Thuan Tran et.al.	2603.09673	null
2026-03-10	DiffWind: Physics-Informed Differentiable Modeling of Wind-Driven Object Dynamics	Yuanhang Lei et.al.	2603.09668	null
2026-03-12	X-GS: An Extensible Open Framework for Perceiving and Thinking via 3D Gaussian Splatting	Yueen Ma et.al.	2603.09632	null
2026-03-10	IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework	Feiyu Wang et.al.	2603.09312	null
2026-03-10	DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction	Fuzhen Jiang et.al.	2603.09291	null
2026-03-10	Learning Convex Decomposition via Feature Fields	Yuezhi Yang et.al.	2603.09285	null
2026-03-10	Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists	Jiaqi Liu et.al.	2603.09277	null
2026-03-09	SkipGS: Post-Densification Backward Skipping for Efficient 3DGS Training	Jingxing Li et.al.	2603.08997	null
2026-03-09	SurgCalib: Gaussian Splatting-Based Hand-Eye Calibration for Robot-Assisted Minimally Invasive Surgery	Zijian Wu et.al.	2603.08983	null
2026-03-09	Where, What, Why: Toward Explainable 3D-GS Watermarking	Mingshu Cai et.al.	2603.08809	null
2026-03-09	ImprovedGS+: A High-Performance C++/CUDA Re-Implementation Strategy for 3D Gaussian Splatting	Jordi Muñoz Vicente et.al.	2603.08661	null
2026-03-09	Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction	Zhe Yang et.al.	2603.08503	null
2026-03-09	Improving Continual Learning for Gaussian Splatting based Environments Reconstruction on Commercial Off-the-Shelf Edge Devices	Ivan Zaino et.al.	2603.08499	null
2026-03-09	HDR-NSFF: High Dynamic Range Neural Scene Flow Fields	Shin Dong-Yeon et.al.	2603.08313	null
2026-03-09	DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving	Zhuolin He et.al.	2603.08254	null
2026-03-08	SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation	Zixuan Pan et.al.	2603.07789	null
2026-03-08	Ref-DGS: Reflective Dual Gaussian Splatting	Ningjing Fan et.al.	2603.07664	null
2026-03-08	Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence	Yuanyuan Gao et.al.	2603.07660	null
2026-03-08	EmbedTalk: Triplane-Free Talking Head Synthesis using Embedding-Driven Gaussian Deformation	Arpita Saggar et.al.	2603.07604	null
2026-03-08	3DGS-HPC: Distractor-free 3D Gaussian Splatting with Hybrid Patch-wise Classification	Jiahao Chen et.al.	2603.07587	null
2026-03-08	ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction	Haibao Yu et.al.	2603.07552	null
2026-03-07	MipSLAM: Alias-Free Gaussian Splatting SLAM	Yingzhao Li et.al.	2603.06989	null
2026-03-06	ColonSplat: Reconstruction of Peristaltic Motion in Colonoscopy with Dynamic Gaussian Splatting	Weronika Smolak-Dyżewska et.al.	2603.06860	null
2026-03-06	Active View Selection with Perturbed Gaussian Ensemble for Tomographic Reconstruction	Yulun Wu et.al.	2603.06852	null
2026-03-06	EntON: Eigenentropy-Optimized Neighborhood Densification in 3D Gaussian Splatting	Miriam Jäger et.al.	2603.06216	null
2026-03-06	VG3S: Visual Geometry Grounded Gaussian Splatting for Semantic Occupancy Prediction	Xiaoyang Yan et.al.	2603.06210	null
2026-03-06	Transforming Omnidirectional RGB-LiDAR data into 3D Gaussian Splatting	Semin Bae et.al.	2603.06061	null
2026-03-06	FTSplat: Feed-forward Triangle Splatting Network	Xiong Jinlin et.al.	2603.05932	null
2026-03-06	CylinderSplat: 3D Gaussian Splatting with Cylindrical Triplanes for Panoramic Novel View Synthesis	Qiwei Wang et.al.	2603.05882	null
2026-03-05	Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups	Leif Van Holland et.al.	2603.05507	null
2026-03-05	SSR-GS: Separating Specular Reflection in Gaussian Splatting for Glossy Surface Reconstruction	Ningjing Fan et.al.	2603.05152	null
2026-03-05	GaussTwin: Unified Simulation and Correction with Gaussian Splatting for Robotic Digital Twins	Yichen Cai et.al.	2603.05108	null
2026-03-05	GloSplat: Joint Pose-Appearance Optimization for Faster and More Accurate 3D Reconstruction	Tianyu Xiong et.al.	2603.04847	null
2026-03-05	DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction	Shiyu Zhang et.al.	2603.04770	null
2026-03-03	VIRGi: View-dependent Instant Recoloring of 3D Gaussians Splats	Alessio Mazzucchelli et.al.	2603.02986	null
2026-03-03	Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting	Kaiqiang Xiong et.al.	2603.02893	null
2026-03-04	Generalized non-exponential Gaussian splatting	Sébastien Speierer et.al.	2603.02887	null
2026-03-03	Multimodal-Prior-Guided Importance Sampling for Hierarchical Gaussian Splatting in Sparse-View Novel View Synthesis	Kaiqiang Xiong et.al.	2603.02866	null
2026-03-03	R3GW: Relightable 3D Gaussians for Outdoor Scenes in the Wild	Margherita Lea Corona et.al.	2603.02801	null
2026-03-03	SemGS: Feed-Forward Semantic 3D Gaussian Splatting from Sparse Views for Generalizable Scene Understanding	Sheng Ye et.al.	2603.02548	null
2026-03-03	OnlineX: Unified Online 3D Reconstruction and Understanding with Active-to-Stable State Evolution	Chong Xia et.al.	2603.02134	null
2026-03-02	LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation	Hualiang Wei et.al.	2603.02129	null
2026-03-02	Sparse View Distractor-Free Gaussian Splatting	Yi Gu et.al.	2603.01603	null
2026-03-02	Radiometrically Consistent Gaussian Surfels for Inverse Rendering	Kyu Beom Han et.al.	2603.01491	null
2026-03-01	FLICKER: A Fine-Grained Contribution-Aware Accelerator for Real-Time 3D Gaussian Splatting	Wenhui Ou et.al.	2603.01158	null
2026-03-01	D-REX: Differentiable Real-to-Sim-to-Real Engine for Learning Dexterous Grasping	Haozhe Lou et.al.	2603.01151	null
2026-03-03	HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views	Jiashu Li et.al.	2603.01099	null
2026-03-01	Decoupling Motion and Geometry in 4D Gaussian Splatting	Yi Zhang et.al.	2603.00952	null
2026-02-28	TokenSplat: Token-aligned 3D Gaussian Splatting for Feed-forward Pose-free Reconstruction	Yihui Li et.al.	2603.00697	null
2026-02-28	Zero-Shot Robotic Manipulation via 3D Gaussian Splatting-Enhanced Multimodal Retrieval-Augmented Generation	Zilong Xie et.al.	2603.00500	null
2026-02-28	ArtiFixer: Enhancing and Extending 3D Reconstruction with Auto-Regressive Diffusion Models	Riccardo de Lutio et.al.	2603.00492	null
2026-02-28	Station2Radar: query conditioned gaussian splatting for precipitation field	Doyi Kim et.al.	2603.00418	null
2026-02-27	UFO-4D: Unposed Feedforward 4D Reconstruction from Two Images	Junhwa Hur et.al.	2602.24290	null
2026-02-27	Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives	Haoran Wang et.al.	2602.24136	null
2026-02-27	DiffusionHarmonizer: Bridging Neural Reconstruction and Photorealistic Simulation with Online Diffusion Enhancer	Yuxuan Zhang et.al.	2602.24096	null
2026-02-27	SR3R: Rethinking Super-Resolution 3D Reconstruction With Feed-Forward Gaussian Splatting	Xiang Feng et.al.	2602.24020	null
2026-02-27	Provable Subspace Identification of Nonlinear Multi-view CCA	Zhiwei Han et.al.	2602.23785	null
2026-02-27	No Calibration, No Depth, No Problem: Cross-Sensor View Synthesis with 3D Consistency	Cho-Ying Wu et.al.	2602.23559	null
2026-02-26	Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking	Maximilian Luz et.al.	2602.23172	null
2026-02-26	PackUV: Packed Gaussian UV Maps for 4D Volumetric Video	Aashish Rai et.al.	2602.23040	link
2026-02-26	GSTurb: Gaussian Splatting for Atmospheric Turbulence Mitigation	Hanliang Du et.al.	2602.22800	null
2026-02-26	Sapling-NeRF: Geo-Localised Sapling Reconstruction in Forests for Ecological Monitoring	Miguel Ángel Muñoz-Bañón et.al.	2602.22731	null
2026-02-26	ArtPro: Self-Supervised Articulated Object Reconstruction with Adaptive Integration of Mobility Proposals	Xuelu Li et.al.	2602.22666	null
2026-02-26	BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model	Yuci Han et.al.	2602.22596	null
2026-02-26	GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views	Tianyu Chen et.al.	2602.22571	null
2026-02-26	SwiftNDC: Fast Neural Depth Correction for High-Fidelity 3D Reconstruction	Kang Han et.al.	2602.22565	null
2026-02-25	AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction	Hanyang Liu et.al.	2602.22376	null
2026-02-25	Interactive Augmented Reality-enabled Outdoor Scene Visualization For Enhanced Real-time Disaster Response	Dimitrios Apostolakis et.al.	2602.21874	null
2026-02-25	Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping	Junmyeong Lee et.al.	2602.21668	null
2026-02-27	DAGS-SLAM: Dynamic-Aware 3DGS SLAM via Spatiotemporal Motion Probability and Uncertainty-Aware Scheduling	Li Zhang et.al.	2602.21644	null
2026-02-24	HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles	Yifan Wang et.al.	2602.21333	null
2026-02-24	BrepGaussian: CAD reconstruction from Multi-View Images with Gaussian Splatting	Jiaxing Yu et.al.	2602.21105	null
2026-02-24	Dropping Anchor and Spherical Harmonics for Sparse-view Gaussian Splatting	Shuangkang Fang et.al.	2602.20933	null
2026-02-24	RU4D-SLAM: Reweighting Uncertainty in Gaussian Splatting SLAM for 4D Scene Reconstruction	Yangfan Zhao et.al.	2602.20807	link
2026-02-24	Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization	Yangsen Chen et.al.	2602.20718	null
2026-02-24	Real-time Calibration-free Imaging Through Dynamic and Distinct Multimode Fibers via Spatial Harmonic Invariant Nonlinear Encoding (SHINE)	Zhiyuan Wang et.al.	2602.20562	null
2026-02-24	WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos	Hanhui Li et.al.	2602.20556	null
2026-02-23	Aesthetic Camera Viewpoint Suggestion with 3D Aesthetic Field	Sheyang Tang et.al.	2602.20363	null
2026-02-23	Large-scale Photorealistic Outdoor 3D Scene Reconstruction from UAV Imagery Using Gaussian Splatting Techniques	Christos Maikos et.al.	2602.20342	null
2026-02-23	tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction	Chen Wang et.al.	2602.20160	null
2026-02-23	Augmented Radiance Field: A General Framework for Enhanced Gaussian Splatting	Yixin Yang et.al.	2602.19916	null
2026-02-23	One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image	Pengfei Wang et.al.	2602.19766	null
2026-02-23	RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing	Kaifa Yang et.al.	2602.19753	null
2026-02-22	DefenseSplat: Enhancing the Robustness of 3D Gaussian Splatting via Frequency-Aware Filtering	Yiran Qiao et.al.	2602.19323	null
2026-02-21	Compact Hadamard Latent Codes for Efficient Spectral Rendering	Jiaqi Yu et.al.	2602.18741	null
2026-02-20	Unifying Color and Lightness Correction with View-Adaptive Curve Adjustment for Robust 3D Novel View Synthesis	Ziteng Cui et.al.	2602.18322	null
2026-02-20	Diff2DGS: Reliable Reconstruction of Occluded Surgical Scenes via 2D Gaussian Splatting	Tianyi Song et.al.	2602.18314	null
2026-02-19	4D Monocular Surgical Reconstruction under Arbitrary Camera Motions	Jiwei Shan et.al.	2602.17473	null
2026-02-19	NRGS-SLAM: Monocular Non-Rigid SLAM for Endoscopy via Deformation-Aware 3D Gaussian Splatting	Jiwei Shan et.al.	2602.17182	null
2026-02-19	B$^3$-Seg: Camera-Free, Training-Free 3DGS Segmentation via Analytic EIG and Beta-Bernoulli Bayesian Updates	Hiromichi Kamata et.al.	2602.17134	null
2026-02-19	3D Scene Rendering with Multimodal Gaussian Splatting	Chi-Shiang Gau et.al.	2602.17124	null
2026-02-19	i-PhysGaussian: Implicit Physical Simulation for 3D Gaussian Splatting	Yicheng Cao et.al.	2602.17117	null
2026-02-17	Semantic-Guided 3D Gaussian Splatting for Transient Object Removal	Aditi Prabakaran et.al.	2602.15516	null
2026-02-17	DAV-GSWT: Diffusion-Active-View Sampling for Data-Efficient Gaussian Splatting Wang Tiles	Rong Fu et.al.	2602.15355	null
2026-02-16	Time-Archival Camera Virtualization for Sports and Visual Performances	Yunxiao Zhang et.al.	2602.15181	null
2026-02-16	Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery	Chandrakanth Gudavalli et.al.	2602.14929	null
2026-02-16	Gaussian Mesh Renderer for Lightweight Differentiable Rendering	Xinpeng Liu et.al.	2602.14493	link
2026-02-15	Learnable Multi-level Discrete Wavelet Transforms for 3D Gaussian Splatting Frequency Modulation	Hung Nguyen et.al.	2602.14199	null
2026-02-14	High-fidelity 3D reconstruction for planetary exploration	Alfonso Martínez-Petersen et.al.	2602.13909	null
2026-02-14	Human-Aligned Evaluation of a Pixel-wise DNN Color Constancy Model	Hamed Heidari-Gorji et.al.	2602.13887	null
2026-02-14	Joint Orientation and Weight Optimization for Robust Watertight Surface Reconstruction via Dirichlet-Regularized Winding Fields	Jiaze Li et.al.	2602.13801	null
2026-02-14	Nighttime Autonomous Driving Scene Reconstruction with Physically-Based Gaussian Splatting	Tae-Kyeong Kim et.al.	2602.13549	null
2026-02-13	FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation	Huajian Zeng et.al.	2602.13444	null
2026-02-13	GSM-GS: Geometry-Constrained Single and Multi-view Gaussian Splatting for Surface Reconstruction	Xiao Ren et.al.	2602.12796	null
2026-02-12	LatentAM: Real-Time, Large-Scale Latent Gaussian Attention Mapping via Online Dictionary Learning	Junwoon Lee et.al.	2602.12314	null
2026-02-12	3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting	Wancai Zheng et.al.	2602.12159	null
2026-02-12	GSO-SLAM: Bidirectionally Coupled Gaussian Splatting and Direct Visual Odometry	Jiung Yeon et.al.	2602.11714	null
2026-02-12	TG-Field: Geometry-Aware Radiative Gaussian Fields for Tomographic Reconstruction	Yuxiang Zhong et.al.	2602.11705	null
2026-02-13	Variation-aware Flexible 3D Gaussian Editing	Hao Qin et.al.	2602.11638	null
2026-02-12	LeafFit: Plant Assets Creation from 3D Gaussian Splatting	Chang Luo et.al.	2602.11577	null
2026-02-14	ReaDy-Go: Real-to-Sim Dynamic 3D Gaussian Splatting Simulation for Environment-Specific Visual Navigation with Moving Obstacles	Seungyeon Yoo et.al.	2602.11575	null
2026-02-10	ERGO: Excess-Risk-Guided Optimization for High-Fidelity Monocular 3D Gaussian Splatting	Zehua Ma et.al.	2602.10278	null
2026-02-10	XSPLAIN: XAI-enabling Splat-based Prototype Learning for Attribute-aware INterpretability	Dominik Galus et.al.	2602.10239	null
2026-02-10	ArtisanGS: Interactive Tools for Gaussian Splat Selection with AI and Human in the Loop	Clement Fuji Tsang et.al.	2602.10173	null
2026-02-10	Faster-GS: Analyzing and Improving Gaussian Splatting Optimization	Florian Hahlbohm et.al.	2602.09999	link
2026-02-10	CompSplat: Compression-aware 3D Gaussian Splatting for Real-world Video	Hojun Song et.al.	2602.09816	link
2026-02-10	SciFlow-Bench: Evaluating Structure-Aware Scientific Diagram Generation via Inverse Parsing	Tong Zhang et.al.	2602.09809	null
2026-02-10	Toward Fine-Grained Facial Control in 3D Talking Head Generation	Shaoyang Xie et.al.	2602.09736	null
2026-02-10	Stability and Concentration in Nonlinear Inverse Problems with Block-Structured Parameters: Lipschitz Geometry, Identifiability, and an Application to Gaussian Splatting	Joe-Mei Feng et.al.	2602.09415	null
2026-02-10	Grow with the Flow: 4D Reconstruction of Growing Plants with Gaussian Flow Fields	Weihan Luo et.al.	2602.08958	link
2026-02-09	Analysis of Converged 3D Gaussian Splatting Solutions: Density Effects and Prediction Limit	Zhendong Wang et.al.	2602.08909	null
2026-02-09	GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion	Santiago Montiel-Marín et.al.	2602.08784	link
2026-02-09	Rotated Lights for Consistent and Efficient 2D Gaussians Inverse Rendering	Geng Lin et.al.	2602.08724	link
2026-02-09	Informative Object-centric Next Best View for Object-aware 3D Gaussian Splatting in Cluttered Scenes	Seunghoon Jeong et.al.	2602.08266	null
2026-02-08	Recovering 3D Shapes from Ultra-Fast Motion-Blurred Images	Fei Yu et.al.	2602.07860	null
2026-02-11	Thermal odometry and dense mapping using learned odometry and Gaussian splatting	Tianhao Zhou et.al.	2602.07493	null
2026-02-06	Zero-Shot UAV Navigation in Forests via Relightable 3D Gaussian Splatting	Zinan Lv et.al.	2602.07101	null
2026-02-06	DynFOA: Generating First-Order Ambisonics with Conditional Diffusion for Dynamic and Acoustically Complex 360-Degree Videos	Ziyu Luo et.al.	2602.06846	null
2026-02-06	GaussianPOP: Principled Simplification Framework for Compact 3D Gaussian Splatting via Error Quantification	Soonbin Lee et.al.	2602.06830	null
2026-02-06	Uncertainty-Aware 4D Gaussian Splatting for Monocular Occluded Human Rendering	Weiquan Wang et.al.	2602.06343	null
2026-02-05	From Blurry to Believable: Enhancing Low-quality Talking Heads with 3D Generative Priors	Ding-Jiun Huang et.al.	2602.06122	null
2026-02-05	NVS-HO: A Benchmark for Novel View Synthesis of Handheld Objects	Musawar Ali et.al.	2602.05822	null
2026-02-05	PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction	Ju Shen et.al.	2602.05190	null
2026-02-04	QuantumGS: Quantum Encoding Framework for Gaussian Splatting	Grzegorz Wilczyński et.al.	2602.05047	null
2026-02-04	Nix and Fix: Targeting 1000x Compression of 3D Gaussian Splatting with Diffusion Models	Cem Eteke et.al.	2602.04549	null
2026-02-04	VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image	Teng-Fang Hsiao et.al.	2602.04349	null
2026-02-04	Towards Next-Generation SLAM: A Survey on 3DGS-SLAM Focusing on Performance, Robustness, and Future Directions	Li Wang et.al.	2602.04251	null
2026-02-03	AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting	Joanna Kaleta et.al.	2602.04043	null
2026-02-02	Intellectual Property Protection for 3D Gaussian Splatting Assets: A Survey	Longjie Zhao et.al.	2602.03878	null
2026-02-01	Split&Splat: Zero-Shot Panoptic Segmentation via Explicit Instance Modeling and 3D Gaussian Splatting	Leonardo Monchieri et.al.	2602.03809	null
2026-02-03	Constrained Dynamic Gaussian Splatting	Zihan Zheng et.al.	2602.03538	null
2026-02-03	Pi-GS: Sparse-View Gaussian Splatting with Dense π^3 Initialization	Manuel Hofer et.al.	2602.03327	null
2026-02-03	WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU	Yudong Han et.al.	2602.03207	null
2026-02-05	SharpTimeGS: Sharp and Stable Dynamic Gaussian Splatting via Lifespan Modulation	Zhanfeng Liao et.al.	2602.02989	null
2026-02-01	Position: 3D Gaussian Splatting Watermarking Should Be Scenario-Driven and Threat-Model Explicit	Yangfan Deng et.al.	2602.02602	null
2026-02-02	SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation	Mu Huang et.al.	2602.02402	null
2026-02-02	UrbanGS: A Scalable and Efficient Architecture for Geometrically Accurate Large-Scene Reconstruction	Changbai Li et.al.	2602.02089	null
2026-02-03	SurfSplat: Conquering Feedforward 2D Gaussian Splatting with Surface Continuity Priors	Bing He et.al.	2602.02000	null
2026-02-02	CloDS: Visual-Only Unsupervised Cloth Dynamics Learning in Unknown Conditions	Yuliang Zhan et.al.	2602.01844	null
2026-02-02	CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding	Yuling Shi et.al.	2602.01785	null
2026-02-02	FastPhysGS: Accelerating Physics-based Dynamic 3DGS Simulation via Interior Completion and Adaptive Optimization	Yikun Ma et.al.	2602.01723	null
2026-02-02	VRGaussianAvatar: Integrating 3D Gaussian Avatars into VR	Hail Song et.al.	2602.01674	null
2026-02-02	MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation	Xiaoxi Kong et.al.	2602.01513	null
2026-02-01	Radioactive 3D Gaussian Ray Tracing for Tomographic Reconstruction	Ling Chen et.al.	2602.01057	null
2026-01-31	HPC: Hierarchical Point-based Latent Representation for Streaming Dynamic Gaussian Splatting Compression	Yangzhi Ma et.al.	2602.00671	null
2026-01-31	Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting	Yian Zhao et.al.	2602.00618	null
2026-01-31	PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting	Xin Zhang et.al.	2602.00463	null
2026-01-30	3DGS$^2$-TR: Scalable Second-Order Trust-Region Method for 3D Gaussian Splatting	Roger Hsiao et.al.	2602.00395	null
2026-01-29	Learning Physics-Grounded 4D Dynamics with Neural Gaussian Force Fields	Shiqian Li et.al.	2602.00148	link
2026-01-30	PaperBanana: Automating Academic Illustration for AI Scientists	Dawei Zhu et.al.	2601.23265	null
2026-01-30	Learning Geometrically-Grounded 3D Visual Representations for View-Generalizable Robotic Manipulation	Di Zhang et.al.	2601.22988	null
2026-01-30	Diachronic Stereo Matching for Multi-Date Satellite Imagery	Elías Masquil et.al.	2601.22808	null
2026-01-30	PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction	Changjian Jiang et.al.	2601.22046	null
2026-01-29	Hybrid Foveated Path Tracing with Peripheral Gaussians for Immersive Anatomy	Constantin Kleinbeck et.al.	2601.22026	null
2026-01-28	FreeFix: Boosting 3D Gaussian Splatting via Fine-Tuning-Free Diffusion Models	Hongyu Zhou et.al.	2601.20857	link
2026-01-28	GRTX: Efficient Ray Tracing for 3D Gaussian-Based Rendering	Junseo Lee et.al.	2601.20429	null
2026-01-28	GVGS: Gaussian Visibility-Aware Multi-View Geometry for Accurate Surface Reconstruction	Mai Su et.al.	2601.20331	null
2026-01-27	Graphical X Splatting (GraphiXS): A Graphical Model for 4D Gaussian Splatting under Uncertainty	Doga Yilmaz et.al.	2601.19843	null
2026-01-27	WaterClear-GS: Optical-Aware Gaussian Splatting for Underwater Reconstruction and Restoration	Xinrui Zhang et.al.	2601.19753	null
2026-01-28	Fast Converging 3D Gaussian Splatting for 1-Minute Reconstruction	Ziyu Zhang et.al.	2601.19489	null
2026-01-27	ClipGS-VR: Immersive and Interactive Cinematic Visualization of Volumetric Medical Data in Mobile Virtual Reality	Yuqi Tong et.al.	2601.19310	null
2026-01-27	TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment	Jiarun Liu et.al.	2601.19247	null
2026-01-27	UniMGS: Unifying Mesh and 3D Gaussian Splatting with Single-Pass Rasterization and Proxy-Based Deformation	Zeyu Xiao et.al.	2601.19233	null
2026-01-27	Bridging Visual and Wireless Sensing: A Unified Radiation Field for 3D Radio Map Construction	Chaozheng Wen et.al.	2601.19216	null
2026-01-26	Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting	Tong Shi et.al.	2601.18633	null
2026-01-26	ExoGS: A 4D Real-to-Sim-to-Real Framework for Scalable Manipulation Data Collection	Yiming Wang et.al.	2601.18629	null
2026-01-26	LoD-Structured 3D Gaussian Splatting for Streaming Video Reconstruction	Xinhui Liu et.al.	2601.18475	null
2026-01-27	Geometry-Grounded Gaussian Splatting	Baowen Zhang et.al.	2601.17835	null
2026-01-25	Advancing Structured Priors for Sparse-Voxel Surface Reconstruction	Ting-Hsun Chi et.al.	2601.17720	null
2026-01-28	PocketGS: On-Device Training of 3D Gaussian Splatting for High Perceptual Modeling	Wenzhi Guo et.al.	2601.17354	null
2026-01-23	LGDWT-GS: Local and Global Discrete Wavelet-Regularized 3D Gaussian Splatting for Sparse-View Scene Reconstruction	Shima Salehi et.al.	2601.17185	null
2026-01-26	A Step to Decouple Optimization in 3DGS	Renjie Ding et.al.	2601.16736	null
2026-01-23	ReWeaver: Towards Simulation-Ready and Topology-Accurate Garment Reconstruction	Ming Li et.al.	2601.16672	null
2026-01-22	EVolSplat4D: Efficient Volume-based Gaussian Splatting for 4D Urban Scene Synthesis	Sheng Miao et.al.	2601.15951	null
2026-01-22	ThermoSplat: Cross-Modal 3D Gaussian Splatting with Feature Modulation and Geometry Decoupling	Zhaoqi Su et.al.	2601.15897	null
2026-01-22	LL-GaussianImage: Efficient Image Representation for Zero-shot Low-Light Enhancement with 2D Gaussian Splatting	Yuhan Chen et.al.	2601.15772	null
2026-01-27	LL-GaussianMap: Zero-shot Low-Light Image Enhancement via 2D Gaussian Splatting Guided Gain Maps	Yuhan Chen et.al.	2601.15766	null
2026-01-21	SplatBus: A Gaussian Splatting Viewer Framework via GPU Interprocess Communication	Yinghan Xu et.al.	2601.15431	null
2026-01-21	LuxRemix: Lighting Decomposition and Remixing for Indoor Scenes	Ruofan Liang et.al.	2601.15283	null
2026-01-21	ScenDi: 3D-to-2D Scene Diffusion Cascades for Urban Generation	Hanlei Guo et.al.	2601.15221	null
2026-01-21	POTR: Post-Training 3DGS Compression	Bert Ramlot et.al.	2601.14821	null
2026-01-22	Structured Image-based Coding for Efficient Gaussian Splatting Compression	Pedro Martin et.al.	2601.14510	null
2026-01-20	Rig-Aware 3D Reconstruction of Vehicle Undercarriages using Gaussian Splatting	Nitin Kulkarni et.al.	2601.14208	null
2026-01-20	One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step Diffusion	Yitong Dong et.al.	2601.14161	null
2026-01-20	ParkingTwin: Training-Free Streaming 3D Reconstruction for Parking-Lot Digital Twins	Xinhao Liu et.al.	2601.13706	null
2026-01-19	GaussExplorer: 3D Gaussian Splatting for Embodied Exploration and Reasoning	Kim Yu-Ji et.al.	2601.13132	null
2026-01-19	TreeDGS: Aerial Gaussian Splatting for Distant DBH Measurement	Belal Shaheen et.al.	2601.12823	null
2026-01-19	CSGaussian: Progressive Rate-Distortion Compression and Segmentation for 3D Gaussian Splatting	Yu-Jen Tseng et.al.	2601.12814	null
2026-01-19	KaoLRM: Repurposing Pre-trained Large Reconstruction Models for Parametric 3D Face Reconstruction	Qingtian Zhu et.al.	2601.12736	null
2026-01-17	Active Semantic Mapping of Horticultural Environments Using Gaussian Splatting	Jose Cuaran et.al.	2601.12122	null
2026-01-16	studentSplat: Your Student Model Learns Single-view 3D Gaussian Splatting	Yimu Pan et.al.	2601.11772	null
2026-01-15	RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation	Peng Chen et.al.	2601.10606	null
2026-01-15	Thinking Like Van Gogh: Structure-Aware Style Transfer via Flow-Guided 3D Gaussian Splatting	Zhendong Wang et.al.	2601.10075	null
2026-01-14	Variable Basis Mapping for Real-Time Volumetric Visualization	Qibiao Li et.al.	2601.09417	null
2026-01-19	TIDI-GS: Floater Suppression in 3D Gaussian Splatting for Enhanced Indoor Scene Fidelity	Sooyeun Yang et.al.	2601.09291	null
2026-01-14	GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials	Bei Huang et.al.	2601.09265	null
2026-01-14	A$^2$TG: Adaptive Anisotropic Textured Gaussians for Efficient 3D Scene Representation	Sheng-Chi Hsu et.al.	2601.09243	null
2026-01-12	3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing	Jiahua Dong et.al.	2601.07963	null
2026-01-12	FMAC: a Fair Fiducial Marker Accuracy Comparison Software	Guillaume J. Laurent et.al.	2601.07723	null
2026-01-13	ViewMorpher3D: A 3D-aware Diffusion Framework for Multi-Camera Novel View Synthesis in Autonomous Driving	Farhad G. Zanjani et.al.	2601.07540	null
2026-01-12	Mon3tr: Monocular 3D Telepresence with Pre-built Gaussian Avatars as Amortization	Fangyu Lin et.al.	2601.07518	null
2026-01-12	R3-RECON: Radiance-Field-Free Active Reconstruction via Renderability	Xiaofeng Jin et.al.	2601.07484	null
2026-01-11	SARA: Scene-Aware Reconstruction Accelerator	Jee Won Lee et.al.	2601.06831	link
2026-01-10	SRFlow: A Dataset and Regularization Model for High-Resolution Facial Optical Flow via Splatting Rasterization	JiaLin Zhang et.al.	2601.06479	link
2026-01-09	NAS-GS: Noise-Aware Sonar Gaussian Splatting	Shida Xu et.al.	2601.06285	null
2026-01-08	Akasha 2: Hamiltonian State Space Duality and Visual-Language Joint Embedding Predictive Architecture	Yani Meziani et.al.	2601.06212	null
2026-01-09	LayerGS: Decomposition and Inpainting of Layered 3D Human Avatars via 2D Gaussian Splatting	Yinghan Xu et.al.	2601.05853	link
2026-01-09	FeatureSLAM: Feature-enriched 3D gaussian splatting SLAM in real time	Christopher Thirgood et.al.	2601.05738	null
2026-01-09	GS-DMSR: Dynamic Sensitive Multi-scale Manifold Enhancement for Accelerated High-Quality 3D Gaussian Splatting	Nengbo Lu et.al.	2601.05584	link
2026-01-09	GaussianSwap: Animatable Video Face Swapping with 3D Gaussian Splatting	Xuan Cheng et.al.	2601.05511	null
2026-01-08	MOSAIC-GS: Monocular Scene Reconstruction via Advanced Initialization for Complex Dynamic Environments	Svitlana Morkva et.al.	2601.05368	null
2026-01-08	OceanSplat: Object-aware Gaussian Splatting with Trinocular View Consistency for Underwater Scene Reconstruction	Minseong Kweon et.al.	2601.04984	null
2026-01-08	AgentOCR: Reimagining Agent History via Optical Self-Compression	Lang Feng et.al.	2601.04786	null
2026-01-08	ProFuse: Efficient Cross-View Context Fusion for Open-Vocabulary 3D Gaussian Splatting	Yen-Jen Chiou et.al.	2601.04754	link
2026-01-09	Differential Locally Injective Grid Deformation and Optimization	Julian Knodt et.al.	2601.04494	null
2026-01-07	SCAR-GS: Spatial Context Attention for Residuals in Progressive Gaussian Splatting	Diego Revilla et.al.	2601.04348	null
2026-01-07	IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting	Wei Long et.al.	2601.03824	null
2026-01-07	G2P: Gaussian-to-Point Attribute Alignment for Boundary-Aware 3D Semantic Segmentation	Hojun Song et.al.	2601.03510	null
2026-01-06	RelightAnyone: A Generalized Relightable 3D Gaussian Head Model	Yingyan Xu et.al.	2601.03357	null
2026-01-06	CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature	Eldad Matmon et.al.	2601.03319	null
2026-01-06	A High-Fidelity Digital Twin for Robotic Manipulation Based on 3D Gaussian Splatting	Ziyang Sun et.al.	2601.03200	null
2026-01-06	Stroke Patches: Customizable Artistic Image Styling Using Regression	Ian Jaffray et.al.	2601.03114	null
2026-01-06	SA-ResGS: Self-Augmented Residual 3D Gaussian Splatting for Next Best View Selection	Kim Jun-Seong et.al.	2601.03024	null
2026-01-06	CAMO: Category-Agnostic 3D Motion Transfer from Monocular 2D Videos	Taeyeon Kim et.al.	2601.02716	null
2026-01-05	HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures	Yating Wang et.al.	2601.02103	null
2026-01-05	360-GeoGS: Geometrically Consistent Feed-Forward 3D Gaussian Splatting Reconstruction for 360 Images	Jiaqi Yao et.al.	2601.02102	null
2026-01-05	InpaintHuman: Reconstructing Occluded Humans with Multi-Scale UV Mapping and Identity-Preserving Diffusion Inpainting	Jinlong Fan et.al.	2601.02098	null
2026-01-05	SketchRodGS: Sketch-based Extraction of Slender Geometries for Animating Gaussian Splatting Scenes	Haato Watanabe et.al.	2601.02072	null
2026-01-05	ESGaussianFace: Emotional and Stylized Audio-Driven Facial Animation via 3D Gaussian Splatting	Chuhang Ma et.al.	2601.01847	null
2026-01-04	Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows	Aymen Mir et.al.	2601.01660	null
2026-01-04	ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking	Xiaobao Wei et.al.	2601.01386	null
2026-01-04	ShadowGS: Shadow-Aware 3D Gaussian Splatting for Satellite Imagery	Feng Luo et.al.	2601.00939	null
2026-01-01	Clean-GS: Semantic Mask-Guided Pruning for 3D Gaussian Splatting	Subhankar Mishra et.al.	2601.00913	null
2025-12-28	RGS-SLAM: Robust Gaussian Splatting SLAM with One-Shot Dense Initialization	Wei-Tse Cheng et.al.	2601.00705	null
2026-01-01	SV-GS: Sparse View 4D Reconstruction with Skeleton-Driven Gaussian Splatting	Jun-Jee Chao et.al.	2601.00285	null
2025-12-31	PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes	Luca Collorone et.al.	2512.24986	null
2025-12-31	UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning	Ankit Dhiman et.al.	2512.24763	null
2025-12-31	Splatwizard: A Benchmark Toolkit for 3D Gaussian Splatting Compression	Xiang Liu et.al.	2512.24742	null
2025-12-30	Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression	Huanxiong Liang et.al.	2512.24018	null
2025-12-30	Improved 3D Gaussian Splatting of Unknown Spacecraft Structure Using Space Environment Illumination Knowledge	Tae Ha Park et.al.	2512.23998	null
2025-12-29	Contour Information Aware 2D Gaussian Splatting for Image Representation	Masaya Takabe et.al.	2512.23255	null
2025-12-29	GVSynergy-Det: Synergistic Gaussian-Voxel Representations for Multi-View 3D Object Detection	Yi Zhang et.al.	2512.23176	null
2025-12-30	Differentiable Physics-Driven Human Representation for Millimeter-Wave Based Pose Estimation	Shuntian Zheng et.al.	2512.23054	null
2025-12-28	Hash Grid Feature Pruning	Yangzhi Ma et.al.	2512.22882	null
2025-12-28	Next Best View Selections for Semantic and Dynamic 3D Gaussian Splatting	Yiqian Li et.al.	2512.22771	null
2025-12-27	SCPainter: A Unified Framework for Realistic 3D Asset Insertion and Novel View Synthesis	Paul Dobre et.al.	2512.22706	null
2025-12-30	Tracking by Predicting 3-D Gaussians Over Time	Tanish Baranwal et.al.	2512.22489	null
2025-12-24	AirGS: Real-Time 4D Gaussian Streaming for Free-Viewpoint Video Experiences	Zhe Wang et.al.	2512.20943	null
2025-12-24	Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting	Yoonwoo Jeong et.al.	2512.20927	null
2025-12-23	Nebula: Enable City-Scale 3D Gaussian Splatting in Virtual Reality via Collaborative Rendering and Accelerated Stereo Rasterization	He Zhu et.al.	2512.20495	null
2025-12-23	SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images	Linfei Li et.al.	2512.20377	link
2025-12-23	Enhancing annotations for 5D apple pose estimation through 3D Gaussian Splatting (3DGS)	Robert van de Ven et.al.	2512.20148	link
2025-12-25	Dreamcrafter: Immersive Editing of 3D Radiance Fields Through Flexible, Generative Inputs and Outputs	Cyrus Vachha et.al.	2512.20129	null
2025-12-22	WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion	Hanyang Kong et.al.	2512.19678	null
2025-12-22	4D Gaussian Splatting as a Learned Dynamical System	Arnold Caleb Asiimwe et.al.	2512.19648	null
2025-12-22	GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting	Tiantian Li et.al.	2512.19108	null
2025-12-21	EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images	Jongmin Park et.al.	2512.18692	null
2025-12-21	Geometric-Photometric Event-based 3D Gaussian Ray Tracing	Kai Kohyama et.al.	2512.18640	null
2025-12-21	ChronoDreamer: Action-Conditioned World Model as an Online Simulator for Robotic Planning	Zhenhao Zhou et.al.	2512.18619	null
2025-12-20	MatSpray: Fusing 2D Material World Knowledge on 3D Geometry	Philipp Langsteiner et.al.	2512.18314	null
2025-12-22	Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding	Yue Li et.al.	2512.17817	null
2025-12-19	G3Splat: Geometrically Consistent Generalizable Gaussian Splatting	Mehdi Hosseinzadeh et.al.	2512.17547	link
2025-12-19	FLEG: Feed-Forward Language Embedded Gaussian Splatting from Any Views	Qijian Tian et.al.	2512.17541	null
2025-12-19	Voxel-GS: Quantized Scaffold Gaussian Splatting Compression with Run-Length Coding	Chunyang Fu et.al.	2512.17528	null
2025-12-19	Flying in Clutter on Monocular RGB by Learning in 3D Radiance Fields with Domain Adaptation	Xijie Huang et.al.	2512.17349	null
2025-12-18	Instant Expressive Gaussian Head Avatar via 3D-Aware Expression Distillation	Kaiwen Jiang et.al.	2512.16893	null
2025-12-18	SDFoam: Signed-Distance Foam for explicit surface reconstruction	Antonella Rech et.al.	2512.16706	null
2025-12-18	Using Gaussian Splats to Create High-Fidelity Facial Geometry and Texture	Haodi He et.al.	2512.16397	null
2025-12-17	Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering	Divam Gupta et.al.	2512.15711	null
2025-12-17	Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting	Arthur Moreau et.al.	2512.15508	null
2025-12-19	VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments	Yuze Wu et.al.	2512.15258	null
2025-12-17	MVGSR: Multi-View Consistent 3D Gaussian Super-Resolution via Epipolar Guidance	Kaizhe Zhang et.al.	2512.15048	null
2025-12-17	A Gaussian Parameterization for Direct Atomic Structure Identification in Electron Tomography	Nalini M. Singh et.al.	2512.15034	null
2025-12-16	BridgeNet: A Dataset of Graph-based Bridge Structural Models for Machine Learning Applications	Lazlo Bleker et.al.	2512.14496	null
2025-12-16	Broadening View Synthesis of Dynamic Scenes from Constrained Monocular Videos	Le Jiang et.al.	2512.14406	null
2025-12-16	HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis	Kaizhe Zhang et.al.	2512.14352	null
2025-12-16	Beyond a Single Light: A Large-Scale Aerial Dataset for Urban Scene Reconstruction Under Varying Illumination	Zhuoxiao Li et.al.	2512.14200	null
2025-12-16	Spherical Voronoi: Directional Appearance as a Differentiable Partition of the Sphere	Francesco Di Sario et.al.	2512.14180	null
2025-12-16	GaussianPlant: Structure-aligned Gaussian Splatting for 3D Reconstruction of Plants	Yang Yang et.al.	2512.14087	null
2025-12-16	ASAP-Textured Gaussians: Enhancing Textured Gaussians with Adaptive Sampling and Anisotropic Parameterization	Meng Wei et.al.	2512.14039	null
2025-12-15	Nexels: Neurally-Textured Surfels for Real-Time Novel View Synthesis with Sparse Geometries	Victor Rong et.al.	2512.13796	null
2025-12-15	Computer vision training dataset generation for robotic environments using Gaussian splatting	Patryk Niżeniec et.al.	2512.13411	null
2025-12-15	Light Field Based 6DoF Tracking of Previously Unobserved Objects	Nikolai Goncharov et.al.	2512.13007	null
2025-12-15	Qonvolution: Towards Learning High-Frequency Signals with Queried Convolution	Abhinav Kumar et.al.	2512.12898	null
2025-12-14	Fast 2DGS: Efficient Image Representation with Deep Gaussian Prior	Hao Wang et.al.	2512.12774	null
2025-12-13	Keep the Lights On, Keep the Lengths in Check: Plug-In Adversarial Detection for Time-Series LLMs in Energy Forecasting	Hua Ma et.al.	2512.12154	null
2025-12-12	Moment-Based 3D Gaussian Splatting: Resolving Volumetric Occlusion with Order-Independent Transmittance	Jan U. Müller et.al.	2512.11800	null
2025-12-12	3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation	Zhiguo Lu et.al.	2512.11557	null
2025-12-12	Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video	Meng-Li Shih et.al.	2512.11356	null
2025-12-12	Lightweight 3D Gaussian Splatting Compression via Video Codec	Qi Yang et.al.	2512.11186	null
2025-12-11	GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting	Madhav Agarwal et.al.	2512.10939	link
2025-12-11	MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos	Kehong Gong et.al.	2512.10881	null
2025-12-11	DeMapGS: Simultaneous Mesh Deformation and Surface Attribute Mapping via Gaussian Splatting	Shuyi Zhou et.al.	2512.10572	link
2025-12-11	Neural Hamiltonian Deformation Fields for Dynamic Scene Rendering	Hai-Long Qin et.al.	2512.10424	null
2025-12-11	Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views	Zhankuo Xu et.al.	2512.10369	null
2025-12-11	Physically Aware 360$^\circ$ View Generation from a Single Image using Disentangled Scene Embeddings	Karthikeya KV et.al.	2512.10293	null
2025-12-11	Long-LRM++: Preserving Fine Details in Feed-Forward Wide-Coverage Reconstruction	Chen Ziwen et.al.	2512.10267	null
2025-12-10	TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing	Jiachen Tao et.al.	2512.10095	null
2025-12-10	GAINS: Gaussian-based Inverse Rendering from Sparse Multi-View Captures	Patrick Noras et.al.	2512.09925	null
2025-12-10	Splatent: Splatting Diffusion Latents for Novel View Synthesis	Or Hirschorn et.al.	2512.09923	null
2025-12-10	YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos	Ryan Meegan et.al.	2512.09903	null
2025-12-10	ReMoSPLAT: Reactive Mobile Manipulation Control on a Gaussian Splat	Nicolas Marticorena et.al.	2512.09656	null
2025-12-10	D$^2$GSLAM: 4D Dynamic Gaussian Splatting SLAM	Siting Zhu et.al.	2512.09411	null
2025-12-11	Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video	Seonghwa Choi et.al.	2512.09335	null
2025-12-10	MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification	Sangwoon Kwak et.al.	2512.09270	null
2025-12-09	GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars	Kelian Baert et.al.	2512.09162	null
2025-12-09	OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics	Jisang Yoo et.al.	2512.08625	null
2025-12-09	On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs	Yijia Guo et.al.	2512.08498	null
2025-12-09	Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform	Yuning Gong et.al.	2512.08478	null
2025-12-09	HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting	Chang Liu et.al.	2512.08334	null
2025-12-09	Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation	Srijan Dokania et.al.	2512.08271	null
2025-12-08	Multi-view Pyramid Transformer: Look Coarser to See Broader	Gyeongjin Kang et.al.	2512.07806	null
2025-12-08	Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects	Shuohan Tao et.al.	2512.07381	null
2025-12-08	Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting	Shilong Jin et.al.	2512.07345	null
2025-12-08	AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing	Ziming Hong et.al.	2512.07247	null
2025-12-08	STRinGS: Selective Text Refinement in Gaussian Splatting	Abhinav Raundhal et.al.	2512.07230	null
2025-12-08	SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting	Seokhyun Youn et.al.	2512.07197	null
2025-12-08	MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation	Muyu Xu et.al.	2512.07165	null
2025-12-09	COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision	Jaeyoon Lee et.al.	2512.07107	null
2025-12-07	RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting	Hoang-Nhat Tran et.al.	2512.07052	null
2025-12-07	MeshSplatting: Differentiable Rendering with Opaque Meshes	Jan Held et.al.	2512.06818	null
2025-12-07	RDSplat: Robust Watermarking Against Diffusion Editing for 3D Gaussian Splatting	Longjie Zhao et.al.	2512.06774	null
2025-12-07	EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy	Yumeng He et.al.	2512.06684	null
2025-12-06	AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars	Ramazan Fazylov et.al.	2512.06438	null
2025-12-06	TriaGS: Differentiable Triangulation-Guided Geometric Consistency for 3D Gaussian Splatting	Quan Tran et.al.	2512.06269	null
2025-12-05	Tracking-Guided 4D Generation: Foundation-Tracker Motion Priors for 3D Model Animation	Su Sun et.al.	2512.06158	null
2025-12-05	Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition	Anne Sielemann et.al.	2512.05936	null
2025-12-05	Physically-Based Simulation of Automotive LiDAR	L. Dudzik et.al.	2512.05932	null
2025-12-05	Edit-aware RAW Reconstruction	Abhijith Punnappurath et.al.	2512.05859	null
2025-12-05	3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering	Blanca Inigo et.al.	2512.05803	null
2025-12-05	Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer	Rong Wang et.al.	2512.05593	null
2025-12-05	SCoNE: Spherical Consistent Neighborhoods Ensemble for Effective and Efficient Multi-View Anomaly Detection	Yang Xu et.al.	2512.05540	null
2025-12-05	TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression	Cheng-Yuan Ho et.al.	2512.05446	null
2025-12-05	Image Semantic Communication with Quadtree Partition-based Coding	Yinhuan Huang et.al.	2512.05395	null
2025-12-05	SplatPainter: Interactive Authoring of 3D Gaussians from 2D Edits via Test-Time Training	Yang Zheng et.al.	2512.05354	null
2025-12-04	DEAR: Dataset for Evaluating the Aesthetics of Rendering	Vsevolod Plohotnuk et.al.	2512.05209	null
2025-12-04	Light-X: Generative 4D Video Rendering with Camera and Illumination Control	Tianqi Liu et.al.	2512.05115	link
2025-12-08	Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting	Hao-Jen Chien et.al.	2512.05113	null
2025-12-04	NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation	Yu Zeng et.al.	2512.05106	null
2025-12-04	4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer	Xianfeng Wu et.al.	2512.05060	null
2025-12-04	Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image	Yanran Zhang et.al.	2512.05044	link
2025-12-04	Reflection Removal through Efficient Adaptation of Diffusion Transformers	Daniyar Zakarin et.al.	2512.05000	link
2025-12-04	Federated Learning for Terahertz Wireless Communication	O. Tansel Baydas et.al.	2512.04984	null
2025-12-04	RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS	Chuanyu Fu et.al.	2512.04815	null
2025-12-04	Bridging Simulation and Reality: Cross-Domain Transfer with Semantic 2D Gaussian Splatting	Jian Tang et.al.	2512.04731	null
2025-12-04	Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex	Zhizhen Wu et.al.	2512.04556	null
2025-12-04	Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization	Hong Kuang et.al.	2512.04542	null
2025-12-04	Refaçade: Editing Object with Given Reference Texture	Youze Huang et.al.	2512.04534	null
2025-12-04	UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes	Changhe Liu et.al.	2512.04421	link
2025-12-03	SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting	Yonghan Lee et.al.	2512.04315	null
2025-12-03	Mind-to-Face: Neural-Driven Photorealistic Avatar Synthesis via EEG Decoding	Haolin Xiong et.al.	2512.04313	null
2025-12-03	Machine Learning Pipeline for Denoising Low Signal-To-Noise Ratio and Out-of-Distribution Transmission Electron Microscopy Datasets	Brian Lee et.al.	2512.04045	null
2025-12-03	RELIC: Interactive Video World Model with Long-Horizon Memory	Yicong Hong et.al.	2512.04040	null
2025-12-03	C3G: Learning Compact 3D Representations with 2K Gaussians	Honggyu An et.al.	2512.04021	null
2025-12-03	Collective dynamics of trail-interacting particles	Paul Pineau et.al.	2512.03950	null
2025-12-03	Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding	Haoran Zhou et.al.	2512.03601	null
2025-12-03	CloseUpAvatar: High-Fidelity Animatable Full-Body Avatars with Mixture of Multi-Scale Textures	David Svitov et.al.	2512.03593	null
2025-12-03	Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models	Shojiro Yamabe et.al.	2512.03463	null
2025-12-03	What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models	Tianchen Deng et.al.	2512.03422	null
2025-12-03	ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding	Lingjun Zhao et.al.	2512.03370	null
2025-12-02	Flux4D: Flow-based Unsupervised 4D Reconstruction	Jingkang Wang et.al.	2512.03210	null
2025-12-02	PPTArena: A Benchmark for Agentic PowerPoint Editing	Michael Ofengenden et.al.	2512.03042	null
2025-12-02	SurfFill: Completion of LiDAR Point Clouds via Gaussian Surfel Splatting	Svenja Strobel et.al.	2512.03010	link
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	link
2025-12-02	EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis	Yancheng Zhang et.al.	2512.02932	link
2025-12-02	Adaptive hydrogels with spatiotemporal stiffening using pH-modulating enzymes	Natascha Gray et.al.	2512.02698	null
2025-12-02	PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes	Derui Shan et.al.	2512.02664	null
2025-12-02	PoreTrack3D: A Benchmark for Dynamic 3D Gaussian Splatting in Pore-Scale Facial Trajectory Tracking	Dong Li et.al.	2512.02648	null
2025-12-02	Content-Aware Texturing for Gaussian Splatting	Panagiotis Papantonakis et.al.	2512.02621	link
2025-12-02	G-SHARP: Gaussian Surgical Hardware Accelerated Real-time Pipeline	Vishwesh Nath et.al.	2512.02482	null
2025-12-02	VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM	Zihan Zhu et.al.	2512.02293	null
2025-12-01	SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting	Pranav Asthana et.al.	2512.02172	null
2025-12-01	Flowchart2Mermaid: A Vision-Language Model Powered System for Converting Flowcharts into Editable Diagram Code	Pritam Deka et.al.	2512.02170	null
2025-12-01	ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation	Chenyang Gu et.al.	2512.02013	null
2025-12-01	Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights	Juanxi Tian et.al.	2512.01816	null
2025-12-01	IGen: Scalable Data Generation for Robot Learning from Open-World Images	Chenghao Gu et.al.	2512.01773	null
2025-12-02	SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge	Yumeng He et.al.	2512.01629	null
2025-12-01	Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry Network	Tianyu Luan et.al.	2512.01380	null
2025-12-01	TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking	Hanzhi Guo et.al.	2512.01329	null
2025-12-01	DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy	Jaewoo Song et.al.	2512.01302	null
2025-12-01	EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly	Xiaokun Pan et.al.	2512.01296	null
2025-12-01	Pay Attention Later: From Vector Space Diffusion to Linearithmic Spectral Phase-Locking	Alper Yıldırım et.al.	2512.01208	null
2025-11-30	LISA-3D: Lifting Language-Image Segmentation to 3D via Multi-View Consistency	Zhongbin Guo et.al.	2512.01008	null
2025-11-30	Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation	An Yang et.al.	2512.00944	null
2025-11-30	Feed-Forward 3D Gaussian Splatting Compression with Long-Context Modeling	Zhening Liu et.al.	2512.00877	null
2025-11-30	Smol-GS: Compact Representations for Abstract 3D Gaussian Splatting	Haishan Wang et.al.	2512.00850	null
2025-11-30	PolarGS: Polarimetric Cues for Ambiguity-Free Gaussian Splatting with Accurate Geometry Recovery	Bo Guo et.al.	2512.00794	null
2025-11-30	Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards	Qiang Lyu et.al.	2512.00743	null
2025-11-30	Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer	Dong In Lee et.al.	2512.00677	null
2025-11-29	Asset-Driven Sematic Reconstruction of Dynamic Scene with Multi-Human-Object Interactions	Sandika Biswas et.al.	2512.00547	null
2025-11-29	Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update	Zeyuan An et.al.	2512.00534	null
2025-11-29	SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level Style Control	Ji Gan et.al.	2512.00413	null
2025-11-29	Debate with Images: Detecting Deceptive Behaviors in Multimodal Large Language Models	Sitong Fang et.al.	2512.00349	null
2025-11-28	Object-Centric Data Synthesis for Category-level Object Detection	Vikhyat Agarwal et.al.	2511.23450	null
2025-11-28	FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting	Tianhao Xie et.al.	2511.23292	null
2025-11-28	Robust 3DGS-based SLAM via Adaptive Kernel Smoothing	Shouhe Zhang et.al.	2511.23221	null
2025-11-28	NumeriKontrol: Adding Numeric Control to Diffusion Transformers for Instruction-based Image Editing	Zhenyu Xu et.al.	2511.23105	null
2025-11-28	Geometry-Consistent 4D Gaussian Splatting for Sparse-Input Dynamic View Synthesis	Yiwei Li et.al.	2511.23044	null
2025-11-28	DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management	Casimir Feldmann et.al.	2511.23030	null
2025-11-28	MrGS: Multi-modal Radiance Fields with 3D Gaussian Splatting for RGB-Thermal Novel View Synthesis	Minseong Kweon et.al.	2511.22997	null
2025-11-28	MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation	Yuta Oshima et.al.	2511.22989	null
2025-11-28	Ovis-Image Technical Report	Guo-Hua Wang et.al.	2511.22982	null
2025-11-28	Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM	Shouhe Zhang et.al.	2511.22968	null
2025-11-28	DenoiseGS: Gaussian Reconstruction Model for Burst Denoising	Yongsen Cheng et.al.	2511.22939	null
2025-11-28	FedAU2: Attribute Unlearning for User-Level Federated Recommender Systems with Adaptive and Robust Adversarial Training	Yuyuan Li et.al.	2511.22872	null
2025-11-28	TokCom-UEP: Semantic Importance-Matched Unequal Error Protection for Resilient Image Transmission	Kaizheng Zhang et.al.	2511.22859	null
2025-11-27	GSpaRC: Gaussian Splatting for Real-time Reconstruction of RF Channels	Bhavya Sai Nukapotula et.al.	2511.22793	null
2025-11-27	Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction	Boyao Zhou et.al.	2511.22704	null
2025-11-27	Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer	Z-Image Team et.al.	2511.22699	null
2025-11-27	Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation	Shubhankar Borse et.al.	2511.22690	null
2025-11-27	Bringing Your Portrait to 3D Presence	Jiawei Zhang et.al.	2511.22553	null
2025-11-27	FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning	Yuan Yao et.al.	2511.22265	null
2025-11-27	Can Protective Watermarking Safeguard the Copyright of 3D Gaussian Splatting?	Wenkai Huang et.al.	2511.22262	null
2025-11-26	Resolution Where It Counts: Hash-based GPU-Accelerated 3D Reconstruction via Variance-Adaptive Voxel Grids	Lorenzo De Rebotti et.al.	2511.21459	link
2025-11-26	Endo-G$^{2}$T: Geometry-Guided & Temporally Aware Time-Embedded 4DGS For Endoscopic Scenes	Yangle Liu et.al.	2511.21367	null
2025-11-26	Unlocking Zero-shot Potential of Semi-dense Image Matching via Gaussian Splatting	Juncheng Chen et.al.	2511.21265	null
2025-11-26	The Spheres Dataset: Multitrack Orchestral Recordings for Music Source Separation and Information Retrieval	Jaime Garcia-Martinez et.al.	2511.21247	null
2025-11-26	Transformer Driven Visual Servoing and Dual Arm Impedance Control for Fabric Texture Matching	Fuyuki Tokuda et.al.	2511.21203	null
2025-11-26	Dual Preintegration for Relative State Estimation	Ruican Xia et.al.	2511.21189	null
2025-11-25	MODEST: Multi-Optics Depth-of-Field Stereo Dataset	Nisarg K. Trivedi et.al.	2511.20853	null
2025-11-25	Private Data Imputation	Abdelkarim Kati et.al.	2511.20832	null
2025-11-25	Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI	Xinhao Liu et.al.	2511.20620	null
2025-11-25	PhysChoreo: Physics-Controllable Video Generation with Part-Aware Semantic Grounding	Haoze Zhang et.al.	2511.20562	null
2025-11-25	GS-Checker: Tampering Localization for 3D Gaussian Splatting	Haoliang Han et.al.	2511.20354	null
2025-11-25	Material-informed Gaussian Splatting for 3D World Reconstruction in a Digital Twin	João Malheiro Silva et.al.	2511.20348	null
2025-11-25	Active3D: Active High-Fidelity 3D Reconstruction via Hierarchical Uncertainty Quantification	Yan Li et.al.	2511.20050	null
2025-11-25	Clair Obscur: an Illumination-Aware Method for Real-World Image Vectorization	Xingyue Lin et.al.	2511.20034	null
2025-11-25	GigaWorld-0: World Models as Data Engine to Empower Embodied AI	GigaWorld Team et.al.	2511.19861	null
2025-11-25	Temporal-Visual Semantic Alignment: A Unified Architecture for Transferring Spatial Priors from Vision Models to Zero-Shot Temporal Tasks	Xiangkai Ma et.al.	2511.19856	null
2025-11-25	STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction	Jiankuo Zhao et.al.	2511.19854	link
2025-11-24	ModHiFi: Identifying High Fidelity predictive components for Model Modification	Dhruva Kashyap et.al.	2511.19566	null
2025-11-24	Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation	Jaeyeong Kim et.al.	2511.19542	link
2025-11-24	LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context	Jingzhi Bao et.al.	2511.19437	link
2025-11-24	Efficiency vs. Fidelity: A Comparative Analysis of Diffusion Probabilistic Models and Flow Matching on Low-Resource Hardware	Srishti Gupta et.al.	2511.19379	null
2025-11-24	DensifyBeforehand: LiDAR-assisted Content-aware Densification for Efficient and Quality 3D Gaussian Splatting	Phurtivilai Patt et.al.	2511.19294	null
2025-11-24	IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes	Carl Lindström et.al.	2511.19235	null
2025-11-24	NVGS: Neural Visibility for Occlusion Culling in 3D Gaussian Splatting	Brent Zoomers et.al.	2511.19202	null
2025-11-24	AvatarBrush: Monocular Reconstruction of Gaussian Avatars with Intuitive Local Editing	Mengtian Li et.al.	2511.19189	null
2025-11-24	MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes	Kehua Chen et.al.	2511.19172	null
2025-11-24	Neural Texture Splatting: Expressive 3D Gaussian Splatting for View Synthesis, Geometry, and Dynamic Reconstruction	Yiming Wang et.al.	2511.18873	null
2025-11-24	NI-Tex: Non-isometric Image-based Garment Texture Generation	Hui Shan et.al.	2511.18765	null
2025-11-24	Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing	Xiaotong Huang et.al.	2511.18755	null
2025-11-24	MAGMA-Edu: Multi-Agent Generative Multimodal Framework for Text-Diagram Educational Question Generation	Zhenyu Wu et.al.	2511.18714	null
2025-11-24	Inverse Rendering for High-Genus Surface Meshes from Multi-View Images	Xiang Gao et.al.	2511.18680	null
2025-11-23	NeAR: Coupled Neural Asset-Renderer Stack	Hong Li et.al.	2511.18600	null
2025-11-23	PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation	Samarth Chopra et.al.	2511.18570	null
2025-11-23	Splatblox: Traversability-Aware Gaussian Splatting for Outdoor Robot Navigation	Samarth Chopra et.al.	2511.18525	null
2025-11-23	ReCoGS: Real-time ReColoring for Gaussian Splatting scenes	Lorenzo Rutayisire et.al.	2511.18441	null
2025-11-23	CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images	Avishka Perera et.al.	2511.18424	null
2025-11-23	SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation	Peter Siegel et.al.	2511.18386	null
2025-11-23	Synthetic Curriculum Reinforces Compositional Text-to-Image Generation	Shijian Wang et.al.	2511.18378	null
2025-11-23	Alias-free 4D Gaussian Splatting	Zilong Chen et.al.	2511.18367	null
2025-11-21	Planning with Sketch-Guided Verification for Physics-Aware Video Generation	Yidong Huang et.al.	2511.17450	null
2025-11-21	Refracting Reality: Generating Images with Realistic Transparent Objects	Yue Yin et.al.	2511.17340	null
2025-11-21	QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy	Adam Lilja et.al.	2511.17221	null
2025-11-21	FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception	Shubham Sonarghare et.al.	2511.17210	null
2025-11-21	SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors	Kunyi Li et.al.	2511.17207	null
2025-11-21	PEGS: Physics-Event Enhanced Large Spatiotemporal Motion Reconstruction via 3D Gaussian Splatting	Yijun Xu et.al.	2511.17116	null
2025-11-21	Towards Generative Design Using Optimal Transport for Shape Exploration and Solution Field Interpolation	Sergio Torregrosa et.al.	2511.17111	null
2025-11-21	SPAGS: Sparse-View Articulated Object Reconstruction from Single State via Planar Gaussian Splatting	Di Wu et.al.	2511.17092	null
2025-11-21	REArtGS++: Generalizable Articulation Reconstruction with Temporal Geometry Constraint via Planar Gaussian Splatting	Di Wu et.al.	2511.17059	null
2025-11-21	RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation	Wenzhuo Sun et.al.	2511.17048	null
2025-11-21	Parameter-Free Neural Lens Blur Rendering for High-Fidelity Composites	Lingyan Ruan et.al.	2511.17014	null
2025-11-21	Stable Offline Hand-Eye Calibration for any Robot with Just One Mark	Sicheng Xie et.al.	2511.17001	null
2025-11-21	PhysMorph-GS: Differentiable Shape Morphing via Joint Optimization of Physics and Rendering Objectives	Chang-Yong Song et.al.	2511.16988	null
2025-11-21	Gradient-Driven Natural Selection for Compact 3D Gaussian Splatting	Xiaobin Deng et.al.	2511.16980	null
2025-11-21	One Walk is All You Need: Data-Efficient 3D RF Scene Reconstruction with Human Movements	Yiheng Bian et.al.	2511.16966	null
2025-11-21	MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis	Di Luo et.al.	2511.16957	null
2025-11-21	UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation	Chi Zhang et.al.	2511.16917	null
2025-11-20	Vorion: A RISC-V GPU with Hardware-Accelerated 3D Gaussian Rendering and Training	Yipeng Wang et.al.	2511.16831	null
2025-11-20	SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG	Mengnan Jiang et.al.	2511.16766	null
2025-11-20	EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering	Pierrick Bournez et.al.	2511.16542	null
2025-11-20	Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution	Jaime Álvarez Urueña et.al.	2511.16541	null
2025-11-20	Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation	Zongcai Tan et.al.	2511.16494	null
2025-11-20	Neural Positioning Without External Reference	Till-Yannic Müller et.al.	2511.16352	null
2025-11-20	CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering	Joni Vanherck et.al.	2511.16349	null
2025-11-20	Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling	Minseok Seo et.al.	2511.16301	link
2025-11-20	Optimizing 3D Gaussian Splattering for Mobile GPUs	Md Musfiqur Rahman Sanim et.al.	2511.16298	null
2025-11-20	How Robot Dogs See the Unseeable	Oliver Bimber et.al.	2511.16262	null
2025-11-20	Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers	Jian Ma et.al.	2511.16156	link
2025-11-20	LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM	Sibaek Lee et.al.	2511.16144	link
2025-11-20	Clustered Error Correction with Grouped 4D Gaussian Splatting	Taeho Kang et.al.	2511.16112	link
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-20	Panel-by-Panel Souls: A Performative Workflow for Expressive Faces in AI-Assisted Manga Creation	Qing Zhang et.al.	2511.16038	null
2025-11-20	CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis	Zijian Wu et.al.	2511.16030	null
2025-11-19	Think Visually, Reason Textually: Vision-Language Synergy in ARC	Beichen Zhang et.al.	2511.15703	link
2025-11-19	ChartEditor: A Reinforcement Learning Framework for Robust Chart Editing	Liangyu Chen et.al.	2511.15266	null
2025-11-19	VIRAL: Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation	Tairan He et.al.	2511.15200	null
2025-11-19	Gaussian Blending: Rethinking Alpha Blending in 3D Gaussian Splatting	Junseo Koo et.al.	2511.15102	link
2025-11-19	BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching	Yachuan Huang et.al.	2511.15066	null
2025-11-19	Evaluating Multimodal Large Language Models on Vertically Written Japanese Text	Keito Sasagawa et.al.	2511.15059	null
2025-11-18	X-WIN: Building Chest Radiograph World Model via Predictive Sensing	Zefan Yang et.al.	2511.14918	null
2025-11-18	Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video	Yarin Bekor et.al.	2511.14848	null
2025-11-18	SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction	Meiying Gu et.al.	2511.14633	link
2025-11-18	Interaction-Aware 4D Gaussian Splatting for Dynamic Hand-Object Interaction Reconstruction	Hao Tian et.al.	2511.14540	null
2025-11-18	2D Gaussians Spatial Transport for Point-supervised Density Regression	Miao Shang et.al.	2511.14477	link
2025-11-18	BEDLAM2.0: Synthetic Humans and Cameras in Motion	Joachim Tesch et.al.	2511.14394	null
2025-11-19	Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving	Kangqiao Zhao et.al.	2511.14386	null
2025-11-18	IBGS: Image-Based Gaussian Splatting	Hoang Chuong Nguyen et.al.	2511.14357	null
2025-11-18	Silhouette-to-Contour Registration: Aligning Intraoral Scan Models with Cephalometric Radiographs	Yiyi Miao et.al.	2511.14343	null
2025-11-18	Dental3R: Geometry-Aware Pairing for Intraoral 3D Reconstruction from Sparse-View Photographs	Yiyi Miao et.al.	2511.14315	null
2025-11-18	GEN3D: Generating Domain-Free 3D Scenes from a Single Image	Yuxin Zhang et.al.	2511.14291	null
2025-11-19	Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery	Yiming Zeng et.al.	2511.14270	null
2025-11-19	RoboTidy: A 3D Gaussian Splatting Household Tidying Benchmark for Embodied Navigation and Action	Xiaoquan Sun et.al.	2511.14161	null
2025-11-18	iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion	Hao Wang et.al.	2511.14149	null
2025-11-18	Splat Regression Models	Mara Daniels et.al.	2511.14042	null
2025-11-17	GRLoc: Geometric Representation Regression for Visual Localization	Changyang Li et.al.	2511.13864	null
2025-11-17	Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting	Jiangnan Ye et.al.	2511.13684	null
2025-11-17	Opt3DGS: Optimizing 3D Gaussian Splatting with Adaptive Exploration and Curvature-Aware Exploitation	Ziyang Huang et.al.	2511.13571	null
2025-11-17	Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling	Adam Hazimeh et.al.	2511.13478	null
2025-11-17	SkyReels-Text: Fine-grained Font-Controllable Text Editing for Poster Design	Yunjie Yu et.al.	2511.13285	null
2025-11-17	SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting	Zihan Li et.al.	2511.13278	null
2025-11-17	SymGS: Leveraging Local Symmetries for 3D Gaussian Splatting Compression	Keshav Gupta et.al.	2511.13264	null
2025-11-17	Birth of a Painting: Differentiable Brushstroke Reconstruction	Ying Jiang et.al.	2511.13191	null
2025-11-17	Beyond Darkness: Thermal-Supervised 3D Gaussian Splatting for Low-Light Novel View Synthesis	Qingsen Ma et.al.	2511.13011	null
2025-11-17	TR-Gaussians: High-fidelity Real-time Rendering of Planar Transmission and Reflection with 3D Gaussian Splatting	Yong Liu et.al.	2511.13009	null
2025-11-17	SplatSearch: Instance Image Goal Navigation for Mobile Robots using 3D Gaussian Splatting and Diffusion Models	Siddarth Narasimhan et.al.	2511.12972	null
2025-11-17	GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving	Chunyong Hu et.al.	2511.12941	null
2025-11-17	Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration	Changhun Oh et.al.	2511.12930	null
2025-11-17	Redshifting the Cosmological Constant in Unimodular Gravity via Nonlinear Quantum Mechanics	David E. Kaplan et.al.	2511.12897	null
2025-11-17	Reconstructing 3D Scenes in Native High Dynamic Range	Kaixuan Zhang et.al.	2511.12895	null
2025-11-16	Which Way from B to A: The role of embedding geometry in image interpolation for Stable Diffusion	Nicholas Karris et.al.	2511.12757	null
2025-11-15	Changes in Real Time: Online Scene Change Detection with Multi-View Fusion	Chamuditha Jayanga Galappaththige et.al.	2511.12370	link
2025-11-15	LiDAR-GS++: Improving LiDAR Gaussian Reconstruction via Diffusion Priors	Qifeng Chen et.al.	2511.12304	link
2025-11-15	SRSplat: Feed-Forward Super-Resolution Gaussian Splatting from Sparse Multi-View Images	Xinyuan Hu et.al.	2511.12040	null
2025-11-14	SimTac: A Physics-Based Simulator for Vision-Based Tactile Sensing with Biomorphic Structures	Xuyang Zhang et.al.	2511.11456	null
2025-11-14	Robust inverse material design with physical guarantees using the Voigt-Reuss Net	Sanath Keshav et.al.	2511.11388	null
2025-11-14	Shadow-Induced Warps in Protoplanetary disks	Shangjia Zhang et.al.	2511.11358	null
2025-11-14	RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image	Hengfei Wang et.al.	2511.11289	null
2025-11-14	3D Gaussian and Diffusion-Based Gaze Redirection	Abiram Panchalingam et.al.	2511.11231	null
2025-11-14	RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting	Ruocheng Wu et.al.	2511.11213	null
2025-11-14	Dynamic Gaussian Scene Reconstruction from Unsynchronized Videos	Zhixin Xu et.al.	2511.11175	null
2025-11-14	PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRI	Sun Jo et.al.	2511.11048	null
2025-11-14	Draft and Refine with Visual Experts	Sungheon Jeong et.al.	2511.11005	null
2025-11-13	MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns	Jiarui Zhang et.al.	2511.10390	null
2025-11-13	Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision	Yu Deng et.al.	2511.10316	null
2025-11-13	HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction	Yueran Zhao et.al.	2511.10211	null
2025-11-13	Competing Localizations on Disordered Non-Hermitian Random Graph Lattice	S Rahul et.al.	2511.10156	null
2025-11-13	AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models	Xinyi Wang et.al.	2511.10017	link
2025-11-13	Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching	Uday Bhaskar et.al.	2511.09955	null
2025-11-13	TSPE-GS: Probabilistic Depth Extraction for Semi-Transparent Surface Reconstruction via 3D Gaussian Splatting	Zhiyuan Xu et.al.	2511.09944	null
2025-11-13	AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting	Aymen Mir et.al.	2511.09827	null
2025-11-12	Traversable wormhole with double trace deformations via gravitational shear and sound channels	Fitria Khairunnisa et.al.	2511.09815	null
2025-11-12	A Shared-Autonomy Construction Robotic System for Overhead Works	David Minkwan Kim et.al.	2511.09695	null
2025-11-12	BronchOpt: Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation	Hongchao Shu et.al.	2511.09443	null
2025-11-12	OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS	Haiyi Li et.al.	2511.09397	null
2025-11-12	Computational Caustic Design for Surface Light Source	Sizhuo Zhou et.al.	2511.09361	null
2025-11-12	WDT-MD: Wavelet Diffusion Transformers for Microaneurysm Detection in Fundus Images	Yifei Sun et.al.	2511.08987	null
2025-11-11	RePose-NeRF: Robust Radiance Fields for Mesh Reconstruction under Noisy Camera Poses	Sriram Srinivasan et.al.	2511.08545	null
2025-11-11	3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation	Yunhong He et.al.	2511.08536	null
2025-11-11	SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering	Laura Bragagnolo et.al.	2511.08294	null
2025-11-11	Perceptual Quality Assessment of 3D Gaussian Splatting: A Subjective Dataset and Prediction Metric	Zhaolin Wan et.al.	2511.08032	null
2025-11-11	UltraGS: Gaussian Splatting for Ultrasound Novel View Synthesis	Yuezhe Yang et.al.	2511.07743	null
2025-11-10	Accelerated, Memory-Efficient Far-Field Scattering Computation with Monte Carlo SBR	Samuel Audia et.al.	2511.07586	null
2025-11-10	YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting	Botao Ye et.al.	2511.07321	null
2025-11-10	4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D Generation	Mengmeng Liu et.al.	2511.07241	null
2025-11-10	Sparse4DGS: 4D Gaussian Splatting for Sparse-Frame Dynamic Scene Reconstruction	Changyue Shi et.al.	2511.07122	null
2025-11-10	GFix: Perceptually Enhanced Gaussian Splatting Video Compression	Siyue Teng et.al.	2511.06953	null
2025-11-10	MUGSQA: Novel Multi-Uncertainty-Based Gaussian Splatting Quality Assessment Method, Dataset, and Benchmarks	Tianang Chen et.al.	2511.06830	link
2025-11-10	ConeGS: Error-Guided Densification Using Pixel Cones for Improved Reconstruction with Fewer Primitives	Bartłomiej Baranowski et.al.	2511.06810	link
2025-11-10	Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes	Meijun Guo et.al.	2511.06765	null
2025-11-10	Rethinking Rainy 3D Scene Reconstruction via Perspective Transforming and Brightness Tuning	Qianfeng Yang et.al.	2511.06734	link
2025-11-10	DIAL-GS: Dynamic Instance Aware Reconstruction for Label-free Street Scenes with 4D Gaussian Splatting	Chenpeng Su et.al.	2511.06632	null
2025-11-09	Inpaint360GS: Efficient Object-Aware 3D Inpainting via Gaussian Splatting for 360° Scenes	Shaoxiang Wang et.al.	2511.06457	null
2025-11-09	Physics-Informed Deformable Gaussian Splatting: Towards Unified Constitutive Laws for Time-Evolving Material Field	Haoqin Hong et.al.	2511.06299	null
2025-11-08	StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video	Zhihui Ke et.al.	2511.06046	null
2025-11-07	4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos	Mengqi Guo et.al.	2511.05229	null
2025-11-07	Splatography: Sparse multi-view dynamic Gaussian Splatting for filmmaking challenges	Adrian Azzarelli et.al.	2511.05152	null
2025-11-07	Efficient representation of 3D spatial data for defense-related applications	Benjamin Kahl et.al.	2511.05109	null
2025-11-07	CLM: Removing the GPU Memory Barrier for 3D Gaussian Splatting	Hexu Zhao et.al.	2511.04951	null
2025-11-07	Channel Knowledge Map Construction: Recent Advances and Open Challenges	Zixiang Ren et.al.	2511.04944	null
2025-11-06	3D Gaussian Point Encoders	Jim James et.al.	2511.04797	null
2025-11-10	Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions	Kaifeng Zhang et.al.	2511.04665	link
2025-11-06	FastGS: Training 3D Gaussian Splatting in 100 Seconds	Shiwei Ren et.al.	2511.04283	null
2025-11-06	CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation	Yuwen Tao et.al.	2511.03992	null
2025-11-05	DentalSplat: Dental Occlusion Novel View Synthesis from Sparse Intra-Oral Photographs	Yiyi Miao et.al.	2511.03099	null
2025-11-04	PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing	Antonio Oroz et.al.	2511.02777	null
2025-11-04	Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping	Jiajia Li et.al.	2511.02207	null
2025-11-01	4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Gaussian Splatting	Chun-Tin Wu et.al.	2511.00560	null
2025-10-31	SAGS: Self-Adaptive Alias-Free Gaussian Splatting for Dynamic Surgical Endoscopic Reconstruction	Wenfeng Huang et.al.	2510.27318	null
2025-10-31	WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond	Zhicong Sun et.al.	2510.27133	null
2025-10-30	DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting	Moonsoo Jeong et.al.	2510.26921	null
2025-10-30	HEIR: Learning Graph-Based Motion Hierarchies	Cheng Zheng et.al.	2510.26786	null
2025-10-30	The Impact and Outlook of 3D Gaussian Splatting	Bernhard Kerbl et.al.	2510.26694	null
2025-10-30	AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM	Mirko Usuelli et.al.	2510.26358	link
2025-10-30	6D Channel Knowledge Map Construction via Bidirectional Wireless Gaussian Splatting	Juncong Zhou et.al.	2510.26166	null
2025-10-30	JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting	Yuxuan Li et.al.	2510.26117	null
2025-11-02	D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction	Kejing Xia et.al.	2510.25173	null
2025-10-29	AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians	Xiyu Zhang et.al.	2510.25129	null
2025-10-28	NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation	Mingyu Jeong et.al.	2510.24335	null
2025-10-28	LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation	Haotian Zhou et.al.	2510.24118	null
2025-10-28	A Survey on Collaborative SLAM with 3D Gaussian Splatting	Phuc Nguyen Xuan et.al.	2510.23988	null
2025-10-27	PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors	Xirui Jin et.al.	2510.23930	link
2025-10-27	Explicit Memory through Online 3D Gaussian Splatting Improves Class-Agnostic Video Segmentation	Anthony Opipari et.al.	2510.23521	null
2025-10-27	VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting	Hoonhee Cho et.al.	2510.23205	null
2025-10-27	EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction	Taoyu Wu et.al.	2510.23087	null
2025-10-27	Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method	Bohan Li et.al.	2510.22973	null
2025-10-27	Gen-LangSplat: Generalized Language Gaussian Splatting with Pre-Trained Feature Compression	Pranav Saxena et.al.	2510.22930	null
2025-10-26	Region-Adaptive Learned Hierarchical Encoding for 3D Gaussian Splatting Data	Shashank N. Sridhara et.al.	2510.22812	null
2025-10-26	Edge Collaborative Gaussian Splatting with Integrated Rendering and Communication	Yujie Wan et.al.	2510.22718	null
2025-10-28	LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering	Wenkai Zhu et.al.	2510.22669	null
2025-10-26	RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience	Huilin Yin et.al.	2510.22600	null
2025-10-26	DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss	Jing Yang et.al.	2510.22473	null
2025-10-25	GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation	Phillip Mueller et.al.	2510.22337	null
2025-10-25	DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum	Yaokun Li et.al.	2510.22213	link
2025-10-24	Towards Physically Executable 3D Gaussian for Embodied Navigation	Bingchen Miao et.al.	2510.21307	null
2025-10-23	GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation	Guangqi Jiang et.al.	2510.20813	null
2025-10-23	Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking	Zixuan Wu et.al.	2510.20335	null
2025-10-23	COS3D: Collaborative Open-Vocabulary 3D Segmentation	Runsong Zhu et.al.	2510.20238	null
2025-10-22	Extreme Views: 3DGS Filter for Novel View Synthesis from Out-of-Distribution Camera Poses	Damian Bowness et.al.	2510.20027	null
2025-10-21	Re-Activating Frozen Primitives for 3D Gaussian Splatting	Yuxin Cheng et.al.	2510.19653	null
2025-10-22	VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction	Junhong Lin et.al.	2510.19578	null
2025-10-22	Advances in 4D Representation: Geometry, Motion, and Interaction	Mingrui Zhao et.al.	2510.19255	null
2025-10-22	MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting	In-Hwan Jin et.al.	2510.19210	null
2025-10-22	GRASPLAT: Enabling dexterous grasping through novel view synthesis	Matteo Bortolon et.al.	2510.19200	null
2025-10-21	Moving Light Adaptive Colonoscopy Reconstruction via Illumination-Attenuation-Aware 3D Gaussian Splatting	Hao Wang et.al.	2510.18739	null
2025-10-21	Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos	Jinfeng Liu et.al.	2510.18489	null
2025-10-21	OpenInsGaussian: Open-vocabulary Instance Gaussian Segmentation with Context-aware Cross-view Fusion	Tianyu Huang et.al.	2510.18253	null
2025-10-20	From Volume Rendering to 3D Gaussian Splatting: Theory and Applications	Vitor Pereira Matias et.al.	2510.18101	null
2025-10-20	HouseTour: A Virtual Real Estate A(I)gent	Ata Çelen et.al.	2510.18054	null
2025-10-20	Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats	Simeon Adebola et.al.	2510.17783	null
2025-10-20	Raindrop GS: A Benchmark for 3D Gaussian Splatting under Raindrop Conditions	Zhiqiang Teng et.al.	2510.17719	null
2025-10-20	Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS	Feng Zhou et.al.	2510.17479	link
2025-10-20	GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation	Ruitong Gan et.al.	2510.17095	null
2025-10-19	2DGS-R: Revisiting the Normal Consistency Regularization in 2D Gaussian Splatting	Haofan Ren et.al.	2510.16837	null
2025-10-19	GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation	Junbo Li et.al.	2510.16777	null
2025-10-18	HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars	Haocheng Tang et.al.	2510.16463	null
2025-10-18	REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting	Changyue Shi et.al.	2510.16410	null
2025-10-17	Proactive Scene Decomposition and Reconstruction	Baicheng Li et.al.	2510.16272	null
2025-10-17	PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction	Ting-Yu Yen et.al.	2510.15386	null
2025-10-17	GaussGym: An open-source real-to-sim framework for learning locomotion from pixels	Alejandro Escontrela et.al.	2510.15352	null
2025-10-16	SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images	Jiaxin Guo et.al.	2510.15072	null
2025-10-16	Leveraging Learned Image Prior for 3D Gaussian Compression	Seungjoo Shin et.al.	2510.14705	null
2025-10-16	BalanceGS: Algorithm-System Co-design for Efficient 3D Gaussian Splatting Training on GPU	Junyi Wu et.al.	2510.14564	null
2025-10-16	GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering	Alexander Valverde et.al.	2510.14270	null
2025-10-16	Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures	Yuancheng Xu et.al.	2510.14179	link
2025-10-17	Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images	Emanuel Garbin et.al.	2510.14081	null
2025-10-15	Instant Skinned Gaussian Avatars for Web, Mobile and VR Applications	Naruya Kondo et.al.	2510.13978	null
2025-10-15	VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator	Hyojun Go et.al.	2510.13454	null
2025-10-15	Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering	Siddharth Tourani et.al.	2510.13381	null
2025-10-15	STT-GS: Sample-Then-Transmit Edge Gaussian Splatting with Joint Client Selection and Power Control	Zhen Li et.al.	2510.13186	null
2025-10-14	Uncertainty Matters in Dynamic Gaussian Splatting for Monocular 4D Reconstruction	Fengzhi Guo et.al.	2510.12768	null
2025-10-17	BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring	An Zhao et.al.	2510.12493	null
2025-10-14	Hybrid Gaussian Splatting for Novel Urban View Synthesis	Mohamed Omran et.al.	2510.12308	null
2025-10-14	PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes	Ying A et.al.	2510.12282	null
2025-10-14	UniGS: Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering	Yusen Xie et.al.	2510.12174	null
2025-10-14	G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior	Junfeng Ni et.al.	2510.12099	null
2025-10-13	GS-Verse: Mesh-based Gaussian Splatting for Physics-aware Interaction in Virtual Reality	Anastasiya Pechko et.al.	2510.11878	null
2025-10-13	Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams	Takuya Nakabayashi et.al.	2510.11717	null
2025-10-13	Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation	Maggie Wang et.al.	2510.11689	null
2025-10-13	VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment	Qing Li et.al.	2510.11473	null
2025-10-13	MaterialRefGS: Reflective Gaussian Splatting with Multi-view Consistent Material Inference	Wenyuan Zhang et.al.	2510.11387	null
2025-10-12	Dynamic Gaussian Splatting from Defocused and Motion-blurred Monocular Videos	Xuankai Zhang et.al.	2510.10691	null
2025-10-12	High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting	Haoyu Zhao et.al.	2510.10637	null
2025-10-12	Towards Efficient 3D Gaussian Human Avatar Compression: A Prior-Guided Framework	Shanzhi Yin et.al.	2510.10492	null
2025-10-11	Opacity-Gradient Driven Density Control for Compact and Efficient Few-Shot 3D Gaussian Splatting	Abdelrhman Elrawy et.al.	2510.10257	null
2025-10-11	Color3D: Controllable and Consistent 3D Colorization with Personalized Colorizer	Yecong Wan et.al.	2510.10152	null
2025-10-11	Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting	Jiahui Lu et.al.	2510.10097	null
2025-10-11	P-4DGS: Predictive 4D Gaussian Splatting with 90$\times$ Compression	Henan Wang et.al.	2510.10030	null
2025-10-11	CLoD-GS: Continuous Level-of-Detail via 3D Gaussian Splatting	Zhigang Cheng et.al.	2510.09997	null
2025-10-11	VG-Mapping: Variation-Aware 3D Gaussians for Online Semi-static Scene Mapping	Yicheng He et.al.	2510.09962	null
2025-10-10	LTGS: Long-Term Gaussian Scene Chronology From Sparse View Updates	Minkwan Kim et.al.	2510.09881	null
2025-10-10	Vision Language Models: A Survey of 26K Papers	Fengming Lin et.al.	2510.09586	null
2025-10-10	FLOWING: Implicit Neural Flows for Structure-Preserving Morphing	Arthur Bizzi et.al.	2510.09537	link
2025-10-10	Two-Stage Gaussian Splatting Optimization for Outdoor Scene Reconstruction	Deborah Pintani et.al.	2510.09489	null
2025-10-10	Visibility-Aware Densification for 3D Gaussian Splatting in Dynamic Urban Scenes	Yikang Zhang et.al.	2510.09364	null
2025-10-09	ReSplat: Learning Recurrent Gaussian Splats	Haofei Xu et.al.	2510.08575	null
2025-10-09	D$^2$GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction	Meixi Song et.al.	2510.08566	null
2025-10-09	Splat the Net: Radiance Fields with Splattable Neural Primitives	Xilong Zhou et.al.	2510.08491	null
2025-10-09	Efficient Label Refinement for Face Parsing Under Extreme Poses Using 3D Gaussian Splatting	Ankit Gahlawat et.al.	2510.08096	null
2025-10-09	CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving	Tianrui Zhang et.al.	2510.07944	null
2025-10-09	PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting	Houqiang Zhong et.al.	2510.07830	null
2025-10-09	DEGS: Deformable Event-based 3D Gaussian Splatting from RGB and Event Stream	Junhao He et.al.	2510.07752	null
2025-10-09	ComGS: Efficient 3D Object-Scene Composition via Surface Octahedral Probes	Jian Gao et.al.	2510.07729	null
2025-10-08	Generating Surface for Text-to-3D using 2D Gaussian Splatting	Huanning Dong et.al.	2510.06967	null
2025-10-08	Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity	Islomjon Shukhratov et.al.	2510.06802	null
2025-10-08	SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis	Jipeng Lyu et.al.	2510.06694	null
2025-10-09	RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction	Leshu Li et.al.	2510.06644	null
2025-10-07	Active Next-Best-View Optimization for Risk-Averse Path Planning	Amirhossein Mollaei Khass et.al.	2510.06481	null
2025-10-07	ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars	Peizhi Yan et.al.	2510.05488	null
2025-10-06	Provable Affine Identifiability of Nonlinear CCA under Latent Distributional Priors	Zhiwei Han et.al.	2510.04758	null
2025-10-04	Enhancing Foveated Rendering with Weighted Reservoir Sampling	Ville Cantory et.al.	2510.03964	null
2025-10-04	Optimized Minimal 4D Gaussian Splatting	Minseo Lee et.al.	2510.03857	null
2025-10-03	SketchPlan: Diffusion Based Drone Planning From Human Sketches	Sixten Norelius et.al.	2510.03545	null
2025-09-30	Universal Beta Splatting	Rong Liu et.al.	2510.03312	null
2025-10-03	Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields	Zhiting Mei et.al.	2510.03104	link
2025-10-03	GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting	Xinran Zhang et.al.	2510.02884	null
2025-10-03	From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting	Jianing Chen et.al.	2510.02732	null
2025-10-03	FSFSplatter: Build Surface and Novel Views with Sparse-Views within 3min	Yibin Zhao et.al.	2510.02691	null
2025-10-02	SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting	Sung-Yeon Park et.al.	2510.02469	null
2025-10-02	StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions	Bo-Hsu Ke et.al.	2510.02314	null
2025-10-02	Performance-Guided Refinement for Visual Aerial Navigation using Editable Gaussian Splatting in FalconGym 2.0	Yan Miao et.al.	2510.02248	null
2025-10-02	Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects	Georgios Kouros et.al.	2510.02069	null
2025-10-02	GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing	Mengtian Li et.al.	2510.02034	null
2025-10-02	4DGS-Craft: Consistent and Interactive 4D Gaussian Splatting Editing	Lei Liu et.al.	2510.01991	null
2025-10-02	ROI-GS: Interest-based Local Quality 3D Gaussian Splatting	Quoc-Anh Bui et.al.	2510.01978	null
2025-10-02	GreenhouseSplat: A Dataset of Photorealistic Greenhouse Simulations for Mobile Robotics	Diram Tabaa et.al.	2510.01848	null
2025-10-02	LOBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction	Sheng-Hsiang Hung et.al.	2510.01767	null
2025-10-02	MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics	Changmin Lee et.al.	2510.01619	link
2025-10-01	Instant4D: 4D Gaussian Splatting in Minutes	Zhanpeng Luo et.al.	2510.01119	link
2025-09-30	HART: Human Aligned Reconstruction Transformer	Xiyi Chen et.al.	2509.26621	null
2025-09-30	Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting	Hanzhou Liu et.al.	2509.26455	link
2025-09-30	GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts	Zhenyu Shu et.al.	2509.26055	null
2025-09-30	PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion	Zhiwei Zhang et.al.	2509.26008	null
2025-09-30	LLM-Powered Code Analysis and Optimization for Gaussian Splatting Kernels	Yi Hu et.al.	2509.25626	null
2025-09-29	GaussianLens: Localized High-Resolution Reconstruction via On-Demand Gaussian Densification	Yijia Weng et.al.	2509.25603	null
2025-09-29	Triangle Splatting+: Differentiable Rendering with Opaque Triangles	Jan Held et.al.	2509.25122	null
2025-10-02	GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction	Huaizhi Qu et.al.	2509.25075	link
2025-09-29	LVT: Large-Scale Scene Reconstruction via Local View Transformers	Tooba Imtiaz et.al.	2509.25001	link
2025-09-29	DWGS: Enhancing Sparse-View Gaussian Splatting with Hybrid-Loss Depth Estimation and Bidirectional Warping	Yu Ma et.al.	2509.24893	null
2025-09-29	ExGS: Extreme 3D Gaussian Compression with Diffusion Priors	Jiaqi Chen et.al.	2509.24758	null
2025-10-01	Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh	Yuanyuan Gao et.al.	2509.24421	null
2025-09-29	OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction	Yuhang Cao et.al.	2509.24308	link
2025-09-28	CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting	Dragoş-Andrei Chileban et.al.	2509.23947	link
2025-09-28	From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations	Javed Ahmad et.al.	2509.23555	null
2025-09-27	Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos	Junyi Wu et.al.	2509.23492	null
2025-09-27	OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting	Atakan Topaloglu et.al.	2509.23258	link
2025-09-26	Learning Unified Representation of 3D Gaussian Splatting	Yuelin Xin et.al.	2509.22917	null
2025-09-26	Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting	Yasmine Omri et.al.	2509.22615	null
2025-09-26	GS-2M: Gaussian Splatting for Joint Mesh Reconstruction and Material Decomposition	Dinh Minh Nguyen et.al.	2509.22276	null
2025-09-26	Polysemous Language Gaussian Splatting via Matching-based Mask Lifting	Jiayu Ding et.al.	2509.22225	null
2025-09-26	Large Material Gaussian Model for Relightable 3D Generation	Jingrui Ye et.al.	2509.22112	null
2025-09-26	Drag4D: Align Your Motion with Text-Driven 3D Scene Generation	Minjun Kang et.al.	2509.21888	null
2025-09-30	Dynamic Novel View Synthesis in High Dynamic Range	Kaixuan Zhang et.al.	2509.21853	null
2025-09-25	PowerGS: Display-Rendering Power Co-Optimization for Neural Rendering in Power-Constrained XR Systems	Weikai Lin et.al.	2509.21702	null
2025-09-25	Gaussian splatting holography	Shuhe Zhang et.al.	2509.20774	null
2025-09-25	FreeInsert: Personalized Object Insertion with Geometric and Style Control	Yuhong Zhang et.al.	2509.20756	null
2025-09-23	SeHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing	Yiyu Li et.al.	2509.20400	null
2025-09-24	4D Driving Scene Generation With Stereo Forcing	Hao Lu et.al.	2509.20251	null
2025-09-24	GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes	Guo Chen et.al.	2509.19937	null
2025-09-24	Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering	Jiangxue Yu et.al.	2509.19898	null
2025-09-24	BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting	Yixun Zhang et.al.	2509.19793	null
2025-09-24	PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction	Yufei Han et.al.	2509.19726	null
2025-09-23	VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction	Weijie Wang et.al.	2509.19297	null
2025-09-23	Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation	Sherwin Bahmani et.al.	2509.19296	null
2025-09-23	WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction	Hung Nguyen et.al.	2509.19073	null
2025-09-23	Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting	Zijing Guo et.al.	2509.18956	null
2025-09-23	DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring	Pengteng Li et.al.	2509.18898	null
2025-09-23	FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation	Zhaorui Wang et.al.	2509.18759	null
2025-09-23	SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones	Maximilian Adang et.al.	2509.18610	null
2025-09-23	Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction	Xiaoting Yin et.al.	2509.18566	null
2025-09-23	BridgeSplat: Bidirectionally Coupled CT and Non-Rigid Gaussian Splatting for Deformable Intraoperative Surgical Navigation	Maximilian Fehrentz et.al.	2509.18501	null
2025-09-23	Differentiable Light Transport with Gaussian Surfels via Adapted Radiosity for Efficient Relighting and Geometry Reconstruction	Kaiwen Jiang et.al.	2509.18497	null
2025-09-22	GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction	Jiahe Li et.al.	2509.18090	null
2025-09-22	GaussianPSL: A novel framework based on Gaussian Splatting for exploring the Pareto frontier in multi-criteria optimization	Phuong Mai Dinh et.al.	2509.17889	null
2025-09-22	ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos	Shi Chen et.al.	2509.17864	null
2025-09-22	From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes	Guoxi Huang et.al.	2509.17789	null
2025-09-22	Neural-MMGS: Multi-modal Neural Gaussian Splats for Large-Scale Scene Reconstruction	Sitian Shen et.al.	2509.17762	null
2025-09-23	EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device	Gunjan Chhablani et.al.	2509.17430	link
2025-09-22	FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR	Junzhe Wu et.al.	2509.17390	null
2025-09-22	SmokeSeer: 3D Gaussian Splatting for Smoke Removal and Scene Reconstruction	Neham Jain et.al.	2509.17329	null
2025-09-21	SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views	Ranran Huang et.al.	2509.17246	null
2025-09-23	HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis	Zipeng Wang et.al.	2509.17083	null
2025-09-21	Efficient 3D Scene Reconstruction and Simulation from Sparse Endoscopic Views	Zhenya Yang et.al.	2509.17027	null
2025-09-21	PGSTalker: Real-Time Audio-Driven Talking Head Generation via 3D Gaussian Splatting with Pixel-Aware Density Control	Tianheng Zhu et.al.	2509.16922	null
2025-09-21	ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM	Amanuel T. Dufera et.al.	2509.16863	null
2025-09-20	MedGS: Gaussian Splatting for Multi-Modal 3D Medical Imaging	Kacper Marzol et.al.	2509.16806	null
2025-09-20	ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting	Xiaoyang Yan et.al.	2509.16552	null
2025-09-19	RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars	Weiyi Xiong et.al.	2509.16119	null
2025-09-19	Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval	Liwei Liao et.al.	2509.15871	null
2025-09-19	Camera Splatting for Continuous View Optimization	Gahye Lee et.al.	2509.15677	null
2025-09-19	FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting	Yuwei Jia et.al.	2509.15648	null
2025-09-19	GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading	Donghyun Lee et.al.	2509.15645	null
2025-09-19	MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild	Deming Li et.al.	2509.15548	null
2025-09-18	Causal Reasoning Elicits Controllable 3D Scene Generation	Shen Chen et.al.	2509.15249	null
2025-09-18	FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction	Jinlong Fan et.al.	2509.14739	null
2025-09-18	RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI	Cong Tai et.al.	2509.14687	null
2025-09-17	Perception-Integrated Safety Critical Control via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting	Dario Tscholl et.al.	2509.14421	null
2025-09-17	MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping	Zhihao Cao et.al.	2509.14191	null
2025-09-17	Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction	Yifan Mo et.al.	2509.13938	null
2025-09-17	LamiGauss: Pitching Radiative Gaussian for Sparse-View X-ray Laminography Reconstruction	Chu Chen et.al.	2509.13863	null
2025-09-16	MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM	Yinlong Bai et.al.	2509.13536	null
2025-09-16	Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization	Hao Xu et.al.	2509.13482	link
2025-09-16	Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image	Gaofeng Liu et.al.	2509.13013	null
2025-09-16	Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings	Abdalla Arafa et.al.	2509.12938	null
2025-09-16	Effective Gaussian Management for High-fidelity Object Reconstruction	Jiateng Liu et.al.	2509.12742	null
2025-09-15	Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization	Mengjiao Han et.al.	2509.12138	null
2025-09-15	Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting	Yi-Hsin Li et.al.	2509.11853	null
2025-09-15	A Controllable 3D Deepfake Generation Framework with Gaussian Splatting	Wending Liu et.al.	2509.11624	null
2025-09-14	On the Skinning of Gaussian Avatars	Nikolaos Zioulis et.al.	2509.11411	null
2025-09-14	ROSGS: Relightable Outdoor Scenes With Gaussian Splatting	Lianjun Liao et.al.	2509.11275	null
2025-09-14	SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting	Ashkan Taghipour et.al.	2509.11116	null
2025-09-13	AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting	Gurutva Patle et.al.	2509.11003	null
2025-09-13	Every Camera Effect, Every Time, All at Once: 4D Gaussian Ray Tracing for Physics-based Camera Effect Data Generation	Yi-Ruei Liu et.al.	2509.10759	null
2025-09-12	T2Bs: Text-to-Character Blendshapes via Video Generation	Jiahao Luo et.al.	2509.10678	null
2025-09-15	On the Geometric Accuracy of Implicit and Primitive-based Representations Derived from View Rendering Constraints	Elias De Smijter et.al.	2509.10241	null
2025-09-09	SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting	Mahtab Dahaghin et.al.	2509.07809	null
2025-09-09	HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting	Yimin Pan et.al.	2509.07774	null
2025-09-09	DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning	Wenzhi Guo et.al.	2509.07493	null
2025-09-09	DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation	Ze-Xin Yin et.al.	2509.07435	null
2025-09-07	MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning	Jiarui Chen et.al.	2509.07021	null
2025-09-10	VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes	Shengkai Zhang et.al.	2509.06685	null
2025-09-15	Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation	Ian Page et.al.	2509.06433	null
2025-09-08	3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom	Matthieu Gendrin et.al.	2509.06400	null
2025-09-05	Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting	Sen Wang et.al.	2509.05515	null
2025-09-05	Toward Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization	Mengjiao Han et.al.	2509.05216	null
2025-09-05	Symbolic Graphics Programming with Large Language Models	Yamei Chen et.al.	2509.05208	null
2025-09-05	GeoSplat: A Deep Dive into Geometry-Constrained Gaussian Splatting	Yangming Li et.al.	2509.05075	null
2025-09-05	CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus	Hannah Schieber et.al.	2509.04859	null
2025-09-04	SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer	Jimin Xu et.al.	2509.04379	null
2025-09-03	ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction	Sankeerth Durvasula et.al.	2509.03775	null
2025-09-02	Efficient Geometry Compression and Communication for 3D Gaussian Splatting Point Clouds	Liang Xie et.al.	2509.02232	null
2025-09-02	GRMM: Real-Time High-Fidelity Gaussian Morphable Head Model with Learned Residuals	Mohit Mendiratta et.al.	2509.02141	null
2025-09-02	2D Gaussian Splatting with Semantic Alignment for Image Inpainting	Hongyu Li et.al.	2509.01964	null
2025-09-01	GaussianGAN: Real-Time Photorealistic controllable Human Avatars	Mohamed Ilyes Lakhal et.al.	2509.01681	null
2025-09-01	FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field	Fan Zhu et.al.	2509.01547	null
2025-09-01	Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars	Vanessa Sklyarova et.al.	2509.01469	link
2025-08-31	Towards Integrating Multi-Spectral Imaging with Gaussian Splatting	Josef Grün et.al.	2509.00989	null
2025-09-03	GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency	Joongho Jo et.al.	2509.00911	null
2025-09-03	UPGS: Unified Pose-aware Gaussian Splatting for Dynamic Scene Deblurring	Zhijing Wu et.al.	2509.00831	null
2025-08-31	SWAGSplatting: Semantic-guided Water-scene Augmented Gaussian Splatting	Zhuodong Jiang et.al.	2509.00800	null
2025-08-31	MarkSplatter: Generalizable Watermarking for 3D Gaussian Splatting Model via Splatter Image Structure	Xiufeng Huang et.al.	2509.00757	null
2025-08-31	DyPho-SLAM: Real-time Photorealistic SLAM in Dynamic Environments	Yi Liu et.al.	2509.00741	null
2025-08-30	AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection	Houshu He et.al.	2509.00433	null
2025-08-29	Complete Gaussian Splats from a Single Image with Denoising Diffusion Models	Ziwei Liao et.al.	2508.21542	null
2025-08-12	EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events	Siyu Chen et.al.	2508.07003	null
2025-10-29	GS4: Generalizable Sparse Splatting Semantic SLAM	Mingqi Jiang et.al.	2506.06517	null
2025-06-04	LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM	Roman Titkov et.al.	2506.03073	null
2025-05-16	Large-Scale Gaussian Splatting SLAM	Zhe Xin et.al.	2505.09915	null
2025-11-13	SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM	Samuel Cerezo et.al.	2504.13713	null
2025-03-18	DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes	Runfa Blark Li et.al.	2503.11979	null
2025-02-24	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633	null
2025-01-15	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-07-15	SEGS-SLAM: Structure-enhanced 3D Gaussian Splatting SLAM with Appearance Embedding	Tianci Wen et.al.	2501.05242	null
2024-12-05	RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting	Zhenzhong Cao et.al.	2412.01217	null
2024-11-20	LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2411.12185	null
2025-08-11	MBA-SLAM: Motion Blur Aware Gaussian Splatting SLAM	Peng Wang et.al.	2411.08279	null
2025-03-11	CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM	Dapeng Feng et.al.	2410.00486	null
2024-10-01	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111	null
2025-03-11	Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2409.12518	null
2025-05-05	UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM	Mostafa Mansour et.al.	2409.00362	null
2024-04-02	MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements	Lisong C. Sun et.al.	2404.00923	null
2024-03-26	CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field	Jiarui Hu et.al.	2403.16095	null
2024-04-09	GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting	Chi Yan et.al.	2311.11700	null

(<a href=#updated-on-20260429>back to top</a>)

Autonomous Driving

Publish Date	Title	Authors	PDF	Code
2026-04-28	Control Your Queries: Heterogeneous Query Interaction for Camera-Radar Fusion	Jialong Wu et.al.	2604.25574	null
2026-04-28	Leveraging Previous-Traversal Point Cloud Map Priors for Camera-Based 3D Object Detection and Tracking	Markus Käppeler et.al.	2604.25405	null
2026-04-28	ProDrive: Proactive Planning for Autonomous Driving via Ego-Environment Co-Evolution	Chuyao Fu et.al.	2604.25329	null
2026-04-27	TEACar: An Open-Source Autonomous Driving Platform	Zhongzheng Zhang et.al.	2604.24934	null
2026-04-27	Towards Lawful Autonomous Driving: Deriving Scenario-Aware Driving Requirements from Traffic Laws and Regulations	Bowen Jian et.al.	2604.24562	null
2026-04-27	ARETE: Attention-based Rasterized Encoding for Topology Estimation using HSV-transformed Crowdsourced Vehicle Fleet Data	Daniel Fritz et.al.	2604.24353	null
2026-04-27	Projected Attainable Speed Space: A Driving Efficiency Metric Connecting Instantaneous Evaluation to Travel Time	Xiaohua Zhao et.al.	2604.24295	null
2026-04-27	TopoHR: Hierarchical Centerline Representation for Cyclic Topology Reasoning in Driving Scenes with Point-to-Instance Relations	Yifeng Bai et.al.	2604.24119	null
2026-04-27	CLLAP: Contrastive Learning-based LiDAR-Augmented Pretraining for Enhanced Radar-Camera Fusion	Bingyi Liu et.al.	2604.24044	null
2026-04-27	VLM-VPI: A Vision-Language Reasoning Framework for Improving Automated Vehicle-Pedestrian Interactions	Qingwen Pu et.al.	2604.23934	null
2026-04-26	ESIA: An Energy-Based Spatiotemporal Interaction-Aware Framework for Pedestrian Intention Prediction	Yanping Wu et.al.	2604.23728	null
2026-04-26	Learning to Identify Out-of-Distribution Objects for 3D LiDAR Anomaly Segmentation	Simone Mosco et.al.	2604.23604	null
2026-04-26	Grammar-Constrained Refinement of Safety Operational Rules Using Language in the Loop: What Could Go Wrong	Khouloud Gaaloul et.al.	2604.23523	null
2026-04-26	Large Language Model based Interactive Decision-Making for Autonomous Driving	Xinwei Dong et.al.	2604.23513	null
2026-04-25	UniAda: Universal Adaptive Multi-objective Adversarial Attack for End-to-End Autonomous Driving Systems	Jingyu Zhang et.al.	2604.23362	null
2026-04-25	Empirical Insights of Test Selection Metrics under Multiple Testing Objectives and Distribution Shifts	Jingyu Zhang et.al.	2604.23342	null
2026-04-25	An Interactive Graphical Tool to Check the Coarray Continuity of Two-Fold Redundant Sparse Arrays (TFRSAs) Under Single Sensor Failures	Namya Malik et.al.	2604.23262	null
2026-04-25	Transferable Physical-World Adversarial Patches Against Object Detection in Autonomous Driving	Zihui Zhu et.al.	2604.23105	null
2026-04-23	Vision-Based Lane Following and Traffic Sign Recognition for Resource-Constrained Autonomous Vehicles	Md Tanjemul Islam et.al.	2604.22872	null
2026-04-24	Cross-Stage Coherence in Hierarchical Driving VQA: Explicit Baselines and Learned Gated Context Projectors	Gautam Kumar Jain et.al.	2604.22560	null
2026-04-24	Transferable Physical-World Adversarial Patches Against Pedestrian Detection Models	Shihui Yan et.al.	2604.22552	null
2026-04-24	Towards Safe Mobility: A Unified Transportation Foundation Model enabled by Open-Ended Vision-Language Dataset	Wenhui Huang et.al.	2604.22260	null
2026-04-24	OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space	Zhuding Liang et.al.	2604.22240	null
2026-04-24	GICC: A High-Performance Runtime for GPU-Initiated Communication and Coordination in Modern HPC Systems	Baodi Shan et.al.	2604.22126	null
2026-04-23	MISTY: High-Throughput Motion Planning via Mixer-based Single-step Drifting	Yining Xing et.al.	2604.21489	null
2026-04-24	Frozen LLMs as Map-Aware Spatio-Temporal Reasoners for Vehicle Trajectory Prediction	Yanjiao Liu et.al.	2604.21479	null
2026-04-23	Reasoning About Traversability: Language-Guided Off-Road 3D Trajectory Planning	Byounggun Park et.al.	2604.21249	null
2026-04-22	Enabling Mixed criticality applications for the Versal AI-Engines	Vincent Sprave et.al.	2604.21124	null
2026-04-21	Towards a Systematic Risk Assessment of Deep Neural Network Limitations in Autonomous Driving Perception	Svetlana Pavlitska et.al.	2604.20895	null
2026-04-22	OVPD: A Virtual-Physical Fusion Testing Dataset of OnSite Autonomous Driving Challenge	Yuhang Zhang et.al.	2604.20423	null
2026-04-22	X-Cache: Cross-Chunk Block Caching for Few-Step Autoregressive World Models Inference	Yixiao Zeng et.al.	2604.20289	null
2026-04-22	Lightweight Low-SNR-Robust Semantic Communication System for Autonomous Driving	Ruixing Ren et.al.	2604.20278	null
2026-04-22	Toward Cooperative Driving in Mixed Traffic: An Adaptive Potential Game-Based Approach with Field Test Verification	Shiyu Fang et.al.	2604.20231	null
2026-04-22	From Scene to Object: Text-Guided Dual-Gaze Prediction	Zehong Ke et.al.	2604.20191	null
2026-04-21	CityRAG: Stepping Into a City via Spatially-Grounded Video Generation	Gene Chou et.al.	2604.19741	null
2026-04-21	SpanVLA: Efficient Action Bridging and Learning from Negative-Recovery Samples for Vision-Language-Action Model	Zewei Zhou et.al.	2604.19710	null
2026-04-23	PC2Model: ISPRS benchmark on 3D point cloud to model registration	Mehdi Maboudi et.al.	2604.19596	null
2026-04-21	VCE: A zero-cost hallucination mitigation method of LVLMs via visual contrastive editing	Yanbin Huang et.al.	2604.19412	null
2026-04-21	PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving	Yining Pan et.al.	2604.19379	null
2026-04-21	Unposed-to-3D: Learning Simulation-Ready Vehicles from Real-World Images	Hongyuan Liu et.al.	2604.19257	null
2026-04-21	When Can We Trust Deep Neural Networks? Towards Reliable Industrial Deployment with an Interpretability Guide	Hang-Cheng Dong et.al.	2604.19206	null
2026-04-21	ST-Prune: Training-Free Spatio-Temporal Token Pruning for Vision-Language Models in Autonomous Driving	Lin Sha et.al.	2604.19145	null
2026-04-21	AutoAWG: Adverse Weather Generation with Adaptive Multi-Controls for Automotive Videos	Jiagao Hu et.al.	2604.18993	null
2026-04-21	Localization-Guided Foreground Augmentation in Autonomous Driving	Jiawei Yong et.al.	2604.18940	null
2026-04-20	From Particles to Perils: SVGD-Based Hazardous Scenario Generation for Autonomous Driving Systems Testing	Linfeng Liang et.al.	2604.18918	null
2026-04-20	Feasibility of Indoor Frame-Wise Lidar Semantic Segmentation via Distillation from Visual Foundation Model	Haiyang Wu et.al.	2604.18831	null
2026-04-20	OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation	Jinghui Lu et.al.	2604.18486	null
2026-04-20	SemLT3D: Semantic-Guided Expert Distillation for Camera-only Long-Tailed 3D Object Detection	Hao Vo et.al.	2604.18476	null
2026-04-20	Asset Harvester: Extracting 3D Assets from Autonomous Driving Logs for Simulation	Tianshi Cao et.al.	2604.18468	null
2026-04-20	OneDrive: Unified Multi-Paradigm Driving with Vision-Language-Action Models	Yiwei Zhang et.al.	2604.17915	null
2026-04-20	M100: An Orchestrated Dataflow Architecture Powering General AI Computing	Yan Xie et.al.	2604.17862	null
2026-04-20	Driving risk emerges from the required two-dimensional joint evasive acceleration	Hao Cheng et.al.	2604.17841	null
2026-04-19	Infrastructure-Centric World Models: Bridging Temporal Depth and Spatial Breadth for Roadside Perception	Siyuan Meng et.al.	2604.17651	null
2026-04-19	RISC-V Functional Safety for Autonomous Automotive Systems: An Analytical Framework and Research Roadmap for ML-Assisted Certification	Nick Andreasyan et.al.	2604.17391	null
2026-04-19	Safety-Aware AoI Scheduling for LEO Satellite-Assisted Autonomous Driving	Kangkang Sun et.al.	2604.17281	null
2026-04-18	OptiMVMap: Offline Vectorized Map Construction via Optimal Multi-vehicle Perspectives	Zedong Dan et.al.	2604.17135	null
2026-04-18	Harness as an Asset: Enforcing Determinism via the Convergent AI Agent Framework (CAAF)	Tianbao Zhang et.al.	2604.17025	null
2026-04-17	Camo-M3FD: A New Benchmark Dataset for Cross-Spectral Camouflaged Pedestrian Detection	Henry O. Velesaca et.al.	2604.16582	null
2026-04-17	Real-Time Solution-Seeking for Game-Theoretic Autonomous Driving via Time-Distributed Iterations	Shaoqing Liu et.al.	2604.16184	null
2026-04-17	Stylistic-STORM (ST-STORM): Perceiving the Semantic Nature of Appearance	Hamed Ouattara et.al.	2604.16086	null
2026-04-17	Fed3D: Federated 3D Object Detection	Suyan Dai et.al.	2604.15795	null
2026-04-17	Towards Robust Endogenous Reasoning: Unifying Drift Adaptation in Non-Stationary Tuning	Xiaoyu Yang et.al.	2604.15705	null
2026-04-16	RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework	Hao Gao et.al.	2604.15308	null
2026-04-16	AD4AD: Benchmarking Visual Anomaly Detection Models for Safer Autonomous Driving	Fabrizio Genilotti et.al.	2604.15291	null
2026-04-15	Mosaic: An Extensible Framework for Composing Rule-Based and Learned Motion Planners	Nick Le Large et.al.	2604.13853	null
2026-04-15	Towards Autonomous Driving with Short-Packet Rate Splitting: Age of Information Analysis and Optimization	Zirui Zheng et.al.	2604.13691	null
2026-04-15	Hybrid Architecture Gets Fluid: A New Paradigm for Direction-of-arrival Estimation in 6G Networks	Ye Tian et.al.	2604.13587	null
2026-04-14	Radar-Camera BEV Multi-Task Learning with Cross-Task Attention Bridge for Joint 3D Detection and Segmentation	Ahmet İnanç et.al.	2604.12918	null
2026-04-14	FeaXDrive: Feasibility-aware Trajectory-Centric Diffusion Planning for End-to-End Autonomous Driving	Baoyun Wang et.al.	2604.12656	null
2026-04-14	RACF: A Resilient Autonomous Car Framework with Object Distance Correction	Chieh Tsai et.al.	2604.12418	null
2026-04-14	HyperLiDAR: Adaptive Post-Deployment LiDAR Segmentation via Hyperdimensional Computing	Ivannia Gomez Moreno et.al.	2604.12331	null
2026-04-14	Physics-Grounded Monocular Vehicle Distance Estimation Using Standardized License Plate Typography	Manognya Lokesh Reddy et.al.	2604.12239	null
2026-04-14	Unveiling the Surprising Efficacy of Navigation Understanding in End-to-End Autonomous Driving	Zhihua Hua et.al.	2604.12208	null
2026-04-13	MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving	Haesung Oh et.al.	2604.11854	null
2026-04-13	Representations Before Pixels: Semantics-Guided Hierarchical Video Prediction	Efstathios Karypidis et.al.	2604.11707	null
2026-04-13	OpenDT: Exploring Datacenter Performance and Sustainability with a Self-Calibrating Digital Twin	Radu Nicolae et.al.	2604.11445	null
2026-04-13	MapATM: Enhancing HD Map Construction through Actor Trajectory Modeling	Mingyang Li et.al.	2604.11081	null
2026-04-12	BridgeSim: Unveiling the OL-CL Gap in End-to-End Autonomous Driving	Seth Z. Zhao et.al.	2604.10856	null
2026-04-12	LIDARLearn: A Unified Deep Learning Library for 3D Point Cloud Classification, Segmentation, and Self-Supervised Representation Learning	Said Ohamouddou et.al.	2604.10780	null
2026-04-12	Energy-Efficient Federated Edge Learning For Small-Scale Datasets in Large IoT Networks	Haihui Xie et.al.	2604.10662	null
2026-04-12	SignReasoner: Compositional Reasoning for Complex Traffic Sign Understanding via Functional Structure Units	Ruibin Wang et.al.	2604.10436	null
2026-04-11	MAVEN-T: Multi-Agent enVironment-aware Enhanced Neural Trajectory predictor with Reinforcement Learning	Wenchang Duan et.al.	2604.10169	null
2026-04-13	VAGNet: Vision-based Accident Anticipation with Global Features	Vipooshan Vipulananthan et.al.	2604.09305	null
2026-04-10	Neural Distribution Prior for LiDAR Out-of-Distribution Detection	Zizhao Li et.al.	2604.09232	null
2026-04-10	Long-SCOPE: Fully Sparse Long-Range Cooperative 3D Perception	Jiahao Wang et.al.	2604.09206	null
2026-04-10	Learning Vision-Language-Action World Models for Autonomous Driving	Guoqing Wang et.al.	2604.09059	null
2026-04-09	Quantum Patches: Enhancing Robustness of Quantum Machine Learning Models	Ban Q. Tran et.al.	2604.08827	null
2026-04-09	LMGenDrive: Bridging Multimodal Understanding and Generative World Modeling for End-to-End Driving	Hao Shao et.al.	2604.08719	null
2026-04-09	Fail2Drive: Benchmarking Closed-Loop Driving Generalization	Simon Gerstenecker et.al.	2604.08535	null
2026-04-10	CrashSight: A Phase-Aware, Infrastructure-Centric Video Benchmark for Traffic Crash Scene Understanding and Reasoning	Rui Gan et.al.	2604.08457	null
2026-04-09	Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems	Tolga Dimlioglu et.al.	2604.08366	null
2026-04-09	Orion-Lite: Distilling LLM Reasoning into Efficient Vision-Only Driving Models	Jing Gu et.al.	2604.08266	null
2026-04-09	DinoRADE: Full Spectral Radar-Camera Fusion with Vision Foundation Model Features for Multi-class Object Detection in Adverse Weather	Christof Leitgeb et.al.	2604.08074	null
2026-04-09	Open-Ended Instruction Realization with LLM-Enabled Multi-Planner Scheduling in Autonomous Vehicles	Jiawei Liu et.al.	2604.08031	null
2026-04-09	SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving	Felix Embacher et.al.	2604.08008	null
2026-04-09	MotionScape: A Large-Scale Real-World Highly Dynamic UAV Video Dataset for World Models	Zile Guo et.al.	2604.07991	null
2026-04-09	Object-Centric Stereo Ranging for Autonomous Driving: From Dense Disparity to Census-Based Template Matching	Qihao Huang et.al.	2604.07980	null
2026-04-09	On-Policy Distillation of Language Models for Autonomous Vehicle Motion Planning	Amirhossein Afsharrad et.al.	2604.07944	null
2026-04-09	ParkSense: Where Should a Delivery Driver Park? Leveraging Idle AV Compute and Vision-Language Models	Die Hu et.al.	2604.07912	null
2026-04-08	Geo-EVS: Geometry-Conditioned Extrapolative View Synthesis for Autonomous Driving	Yatong Lan et.al.	2604.07250	null
2026-04-08	Self-Discovered Intention-aware Transformer for Multi-modal Vehicle Trajectory Prediction	Diyi Liu et.al.	2604.07126	null
2026-04-10	Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM	Chengyue Wu et.al.	2604.06832	null
2026-04-08	How Well Do Vision-Language Models Understand Sequential Driving Scenes? A Sensitivity Study	Roberto Brusnicki et.al.	2604.06750	null
2026-04-08	VDPP: Video Depth Post-Processing for Speed and Scalability	Daewon Yoon et.al.	2604.06665	null
2026-04-07	Evolution of Video Generative Foundations	Teng Hu et.al.	2604.06339	null
2026-04-07	Telescope: Learnable Hyperbolic Foveation for Ultra-Long-Range Object Detection	Parker Ewen et.al.	2604.06332	null
2026-04-07	Appearance Decomposition Gaussian Splatting for Multi-Traversal Reconstruction	Yangyi Xiao et.al.	2604.05908	null
2026-04-07	Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion	Yu Xue et.al.	2604.05780	null
2026-04-07	Not All Agents Matter: From Global Attention Dilution to Risk-Prioritized Game Planning	Kang Ding et.al.	2604.05449	null
2026-04-07	ICR-Drive: Instruction Counterfactual Robustness for End-to-End Language-Driven Autonomous Driving	Kaiser Hamid et.al.	2604.05378	null
2026-04-06	Probabilistic Tree Inference Enabled by FDSOI Ferroelectric FETs	Pengyu Ren et.al.	2604.05115	null
2026-04-06	Part-Level 3D Gaussian Vehicle Generation with Joint and Hinge Axis Estimation	Shiyao Qian et.al.	2604.05070	null
2026-04-06	HorizonWeaver: Generalizable Multi-Level Semantic Editing for Driving Scenes	Mauricio Soroco et.al.	2604.04887	null
2026-04-06	The Blind Spot of Adaptation: Quantifying and Mitigating Forgetting in Fine-tuned Driving Models	Runhao Mao et.al.	2604.04857	null
2026-04-06	Multi-Modal Sensor Fusion using Hybrid Attention for Autonomous Driving	Mayank Mayank et.al.	2604.04797	null
2026-04-06	Multimodal Backdoor Attack on VLMs for Autonomous Driving via Graffiti and Cross-Lingual Triggers	Jiancheng Wang et.al.	2604.04630	null
2026-04-06	Reproducibility study on how to find Spurious Correlations, Shortcut Learning, Clever Hans or Group-Distributional non-robustness and how to fix them	Ole Delzer et.al.	2604.04518	null
2026-04-06	Adversarial Robustness Analysis of Cloud-Assisted Autonomous Driving Systems	Maher Al Islam et.al.	2604.04349	null
2026-04-06	GA-GS: Generation-Assisted Gaussian Splatting for Static Scene Reconstruction	Yedong Shen et.al.	2604.04331	null
2026-04-05	DriveVA: Video Action Models are Zero-Shot Drivers	Mengmeng Liu et.al.	2604.04198	null
2026-04-04	InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset	Felix Stillger et.al.	2604.03814	null
2026-04-04	HAD: Combining Hierarchical Diffusion with Metric-Decoupled RL for End-to-End Driving	Wenhao Yao et.al.	2604.03581	null
2026-04-03	Sim2Real-AD: A Modular Sim-to-Real Framework for Deploying VLM-Guided Reinforcement Learning in Real-World Autonomous Driving	Zilin Huang et.al.	2604.03497	null
2026-04-03	SpectralSplat: Appearance-Disentangled Feed-Forward Gaussian Splatting for Driving Scenes	Quentin Herau et.al.	2604.03462	null
2026-04-03	YOLOv11 Demystified: A Practical Guide to High-Performance Object Detection	Nikhileswara Rao Sulake et.al.	2604.03349	null
2026-04-03	BEVPredFormer: Spatio-temporal Attention for BEV Instance Prediction in Autonomous Driving	Miguel Antunes-García et.al.	2604.02930	null
2026-04-03	ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving	Zihao Sheng et.al.	2604.02714	null
2026-04-03	V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views	Junwei You et.al.	2604.02710	null
2026-04-03	Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles	Weimin Liu et.al.	2604.02639	null
2026-04-03	Rascene: High-Fidelity 3D Scene Imaging with mmWave Communication Signals	Kunzhe Song et.al.	2604.02603	null
2026-04-02	Dynamic Risk Generation for Autonomous Driving: Naturalistic Reconstruction of Vehicle-E-Scooter Interactions	Abin Mathew et.al.	2604.02573	null
2026-04-02	Adaptive Learned State Estimation based on KalmanNet	Arian Mehrfard et.al.	2604.02441	null
2026-04-02	Deep Neural Network Based Roadwork Detection for Autonomous Driving	Sebastian Wullrich et.al.	2604.02282	null
2026-04-02	LEO: Graph Attention Network based Hybrid Multi Sensor Extended Object Fusion and Tracking for Autonomous Driving Applications	Mayank Mayank et.al.	2604.02206	null
2026-04-02	UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving	Yongkang Li et.al.	2604.02190	null
2026-04-02	Causal Scene Narration with Runtime Safety Supervision for Vision-Language-Action Driving	Yun Li et.al.	2604.01723	null
2026-04-02	Hi-LOAM: Hierarchical Implicit Neural Fields for LiDAR Odometry and Mapping	Zhiliu Yang et.al.	2604.01720	null
2026-04-02	Riemannian and Symplectic Geometry for Hierarchical Text-Driven Place Recognition	Tianyi Shang et.al.	2604.01598	null
2026-04-01	SECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous Driving	Wenjing Wang et.al.	2604.01337	null
2026-04-01	Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models	Xiaosong Jia et.al.	2604.01259	null
2026-04-01	Simulating Realistic LiDAR Data Under Adverse Weather for Autonomous Vehicles: A Physics-Informed Learning Approach	Vivek Anand et.al.	2604.01254	null
2026-04-01	VRUD: A Drone Dataset for Complex Vehicle-VRU Interactions within Mixed Traffic	Ziyu Wang et.al.	2604.01134	null
2026-04-01	ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction	Yuheng Zhang et.al.	2604.01081	null
2026-04-01	DLWM: Dual Latent World Models enable Holistic Gaussian-centric Pre-training in Autonomous Driving	Yiyao Zhu et.al.	2604.00969	null
2026-04-01	DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale	Sicheng Zuo et.al.	2604.00813	null
2026-04-01	Towards Viewpoint-Robust End-to-End Autonomous Driving with 3D Foundation Model Priors	Hiroki Hashimoto et.al.	2604.00597	null
2026-04-01	SAR/ISAR Imaging in 6G Network	Yanmo Hu et.al.	2604.00583	null
2026-04-01	COTTA: Context-Aware Transfer Adaptation for Trajectory Prediction in Autonomous Driving	Seohyoung Park et.al.	2604.00402	null
2026-04-01	Neural Reconstruction of LiDAR Point Clouds under Jamming Attacks via Full-Waveform Representation and Simultaneous Laser Sensing	Ryo Yoshida et.al.	2604.00371	null
2026-03-31	Better than Average: Spatially-Aware Aggregation of Segmentation Uncertainty Improves Downstream Performance	Vanessa Emanuela Guarino et.al.	2603.29941	null
2026-03-31	C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving	Zhihong Cui et.al.	2603.29908	null
2026-03-31	SparseDriveV2: Scoring is All You Need for End-to-End Autonomous Driving	Wenchao Sun et.al.	2603.29163	null
2026-03-30	AutoWorld: Scaling Multi-Agent Traffic Simulation with Self-Supervised World Models	Mozhgan Pourkeshavatz et.al.	2603.28963	null
2026-03-30	A Semantic Observer Layer for Autonomous Vehicles: Pre-Deployment Feasibility Study of VLMs for Low-Latency Anomaly Detection	Kunal Runwal et.al.	2603.28888	null
2026-03-30	OccSim: Multi-kilometer Simulation with Long-horizon Occupancy World Models	Tianran Liu et.al.	2603.28887	null
2026-03-30	FL-PBM: Pre-Training Backdoor Mitigation for Federated Learning	Osama Wehbi et.al.	2603.28673	null
2026-03-31	RAD-LAD: Rule and Language Grounded Autonomous Driving in Real-Time	Anurag Ghosh et.al.	2603.28522	null
2026-03-30	Video Generation Models as World Models: Efficient Paradigms, Architectures and Algorithms	Muyang He et.al.	2603.28489	null
2026-03-30	TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation	Minh-Khoi Do et.al.	2603.28233	null
2026-03-30	Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal	Kazuma Ikeda et.al.	2603.28224	null
2026-03-30	AutoDrive-P$^3$: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning	Yuqi Ye et.al.	2603.28116	null
2026-03-30	To View Transform or Not to View Transform: NeRF-based Pre-training Perspective	Hyeonjun Jeong et.al.	2603.28090	null
2026-03-30	Effort-Based Criticality Metrics for Evaluating 3D Perception Errors in Autonomous Driving	Sharang Kaul et.al.	2603.28029	null
2026-03-30	Energy-Aware Imitation Learning for Steering Prediction Using Events and Frames	Hu Cao et.al.	2603.28008	null
2026-03-30	UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection	Hongjing Wu et.al.	2603.27995	null
2026-03-29	Benchmarking Multi-View BEV Object Detection with Mixed Pinhole and Fisheye Cameras	Xiangzhong Liu et.al.	2603.27818	null
2026-03-29	TianJi: An autonomous AI meteorologist for discovering physical mechanisms in atmospheric science	Kaikai Zhang et.al.	2603.27738	null
2026-03-29	Annotation-Free Detection of Drivable Areas and Curbs Leveraging LiDAR Point Cloud Maps	Fulong Ma et.al.	2603.27553	null
2026-03-28	HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors	Ke Li et.al.	2603.27371	null
2026-03-28	Guided Lensless Polarization Imaging	Noa Kraicer et.al.	2603.27357	null
2026-03-28	Class-Distribution Guided Active Learning for 3D Occupancy Prediction in Autonomous Driving	Wonjune Kim et.al.	2603.27294	null
2026-03-28	Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving	Qiqi Liu et.al.	2603.27287	null
2026-03-28	Robust Global-Local Behavior Arbitration via Continuous Command Fusion Under LiDAR Errors	Mohamed Elgouhary et.al.	2603.27273	null
2026-03-28	An Instance-Centric Panoptic Occupancy Prediction Benchmark for Autonomous Driving	Yi Feng et.al.	2603.27238	link
2026-03-28	RailVQA: A Benchmark and Framework for Efficient Interpretable Visual Cognition in Automatic Train Operation	Sen Zhang et.al.	2603.27112	null
2026-03-26	Vega: Learning to Drive with Natural Language Instructions	Sicheng Zuo et.al.	2603.25741	null
2026-03-26	Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving	Zehao Wang et.al.	2603.25740	null
2026-03-26	Can Users Specify Driving Speed? Bench2Drive-Speed: Benchmark and Baselines for Desired-Speed Conditioned Autonomous Driving	Yuqian Shao et.al.	2603.25672	null
2026-03-26	Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case	Koldo Basterretxea et.al.	2603.25510	null
2026-03-26	RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models	Yufeng Yang et.al.	2603.25502	null
2026-03-26	Temporally Decoupled Diffusion Planning for Autonomous Driving	Xiang Li et.al.	2603.25462	null
2026-03-26	Denoise and Align: Towards Source-Free UDA for Robust Panoramic Semantic Segmentation	Yaowen Chang et.al.	2603.25131	null
2026-03-26	Learning Rollout from Sampling: An R1-Style Tokenized Traffic Simulation Model	Ziyan Wang et.al.	2603.24989	null
2026-03-26	TIGFlow-GRPO: Trajectory Forecasting via Interaction-Aware Flow Matching and Reward-Driven Optimization	Xuepeng Jing et.al.	2603.24936	null
2026-03-25	DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving	Pengxuan Yang et.al.	2603.24587	null
2026-03-25	Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving	Linbo Wang et.al.	2603.24581	null
2026-03-25	Toward Physically Consistent Driving Video World Models under Challenging Trajectories	Jiawei Zhou et.al.	2603.24506	null
2026-03-25	Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification	Han Sun et.al.	2603.24058	null
2026-03-25	Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration	Guopeng Li et.al.	2603.23889	null
2026-03-24	Rectify, Don’t Regret: Avoiding Pitfalls of Differentiable Simulation in Trajectory Prediction	Harsh Yadav et.al.	2603.23393	null
2026-03-24	Modeling Edge-to-Cloud Offloading Workloads for Autonomous Vehicles	Longkun Li et.al.	2603.23310	null
2026-03-25	PoseDriver: A Unified Approach to Multi-Category Skeleton Detection for Autonomous Driving	Yasamin Borhani et.al.	2603.23215	null
2026-03-24	Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment	Guoyang Zhao et.al.	2603.23034	null
2026-03-24	Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction	Chengxin Lv et.al.	2603.22852	null
2026-03-24	Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems	Manognya Lokesh Reddy et.al.	2603.22781	null
2026-03-23	LRC-WeatherNet: LiDAR, RADAR, and Camera Fusion Network for Real-time Weather-type Classification in Autonomous Driving	Nour Alhuda Albashir et.al.	2603.21987	null
2026-03-23	The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation	Guannan Lai et.al.	2603.21928	null
2026-03-23	Disengagement Analysis and Field Tests of a Prototypical Open-Source Level 4 Autonomous Driving System	Marvin Seegert et.al.	2603.21926	null
2026-03-23	Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis	Kangbo Zhao et.al.	2603.21661	null
2026-03-23	HACMatch Semi-Supervised Rotation Regression with Hardness-Aware Curriculum Pseudo Labeling	Mei Li et.al.	2603.21583	null
2026-03-23	Ultrafast microwave sensing and automatic recognition of dynamic objects in open world using programmable surface plasmonic neural networks	Qian Ma et.al.	2603.21521	null
2026-03-22	Dynasto: Validity-Aware Dynamic-Static Parameter Optimization for Autonomous Driving Testing	Dmytro Humeniuk et.al.	2603.21427	null
2026-03-22	Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving	Haixi Zhang et.al.	2603.21061	null
2026-03-22	KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph	Ye Tian et.al.	2603.21029	null
2026-03-21	OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation	Aarush Aggarwal et.al.	2603.20777	null
2026-03-24	GHOST: Ground-projected Hypotheses from Observed Structure-from-Motion Trajectories	Tomasz Frelek et.al.	2603.20583	null
2026-03-20	Understanding Behavior Cloning with Action Quantization	Haoqun Cao et.al.	2603.20538	null
2026-03-20	Wildfire Spread Scenarios: Increasing Sample Diversity of Segmentation Diffusion Models with Training-Free Methods	Sebastian Gerard et.al.	2603.20188	null
2026-03-20	Uncertainty Matters: Structured Probabilistic Online Mapping for Motion Prediction in Autonomous Driving	Pritom Gogoi et.al.	2603.20076	null
2026-03-20	X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving	Chaoda Zheng et.al.	2603.19979	null
2026-03-23	2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction	Tianbao Zhang et.al.	2603.19964	null
2026-03-20	LIORNet: Self-Supervised LiDAR Snow Removal Framework for Autonomous Driving under Adverse Weather Conditions	Ji-il Park et.al.	2603.19936	null
2026-03-20	Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them	Michael Hubbertz et.al.	2603.19852	null
2026-03-20	DynFlowDrive: Flow-Based Dynamic World Modeling for Autonomous Driving	Xiaolu Liu et.al.	2603.19675	null
2026-03-20	StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention	Zhongrui Yu et.al.	2603.19552	null
2026-03-19	DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding	Dong Zhuo et.al.	2603.19219	null
2026-03-19	Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting	Yiren Lu et.al.	2603.19193	null
2026-03-19	Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving	Huiwen Yan et.al.	2603.19188	null
2026-03-19	Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs	Gaoxiang Cao et.al.	2603.18871	null
2026-03-19	Student views in AI Ethics and Social Impact	Tudor-Dan Mihoc et.al.	2603.18827	null
2026-03-19	Benchmarking Visual Feature Representations for LiDAR-Inertial-Visual Odometry Under Challenging Conditions	Eunseon Choi et.al.	2603.18589	null
2026-03-19	CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention	Jiacheng Tang et.al.	2603.18561	null
2026-03-18	DriveVLM-RL: Neuroscience-Inspired Reinforcement Learning with Vision-Language Models for Safe and Deployable Autonomous Driving	Zilin Huang et.al.	2603.18315	null
2026-03-18	VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events	Mohammad Qazim Bhat et.al.	2603.18178	null
2026-03-18	DarkDriving: A Real-World Day and Night Aligned Dataset for Autonomous Driving in the Dark Environment	Wuqi Wang et.al.	2603.18067	null
2026-03-18	AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception	Jinho Park et.al.	2603.17979	null
2026-03-18	An HMDP-MPC Decision-making Framework with Adaptive Safety Margins and Hysteresis for Autonomous Driving	Siyuan Li et.al.	2603.17802	null
2026-03-18	From Virtual Environments to Real-World Trials: Emerging Trends in Autonomous Driving	A. Humnabadkar et.al.	2603.17714	null
2026-03-18	VectorWorld: Efficient Streaming World Model via Diffusion Flow on Vector Graphs	Chaokang Jiang et.al.	2603.17652	null
2026-03-18	Hierarchical Decision-Making under Uncertainty: A Hybrid MDP and Chance-Constrained MPC Approach	Siyuan Li et.al.	2603.17634	null
2026-03-18	Physics-informed Deep Mixture-of-Koopmans Vehicle Dynamics Model with Dual-branch Encoder for Distributed Electric-drive Trucks	Jinyu Miao et.al.	2603.17416	null
2026-03-18	VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm	Hongbo Lu et.al.	2603.17382	null
2026-03-17	Topology-Preserving Deep Joint Source-Channel Coding for Semantic Communication	Omar Erak et.al.	2603.17126	null
2026-03-16	Joint Optimization of Storage and Loading for High-Performance 3D Point Cloud Data Processing	Ke Wang et.al.	2603.16945	null
2026-03-17	CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection	Junseok Lee et.al.	2603.16439	null
2026-03-17	Poisoning the Pixels: Revisiting Backdoor Attacks on Semantic Segmentation	Guangsheng Zhang et.al.	2603.16405	null
2026-03-17	Learning Human-Object Interaction for 3D Human Pose Estimation from LiDAR Point Clouds	Daniel Sungho Jung et.al.	2603.16343	null
2026-03-17	DriveFix: Spatio-Temporally Coherent Driving Scene Restoration	Heyu Si et.al.	2603.16306	null
2026-03-17	Toward Deep Representation Learning for Event-Enhanced Visual Autonomous Perception: the eAP Dataset	Jinghang Li et.al.	2603.16303	null
2026-03-17	AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection	Hongwei Lin et.al.	2603.16261	null
2026-03-17	PanguMotion: Continuous Driving Motion Forecasting with Pangu Transformers	Quanhao Ren et.al.	2603.16196	null
2026-03-17	HIPO: Instruction Hierarchy via Constrained Reinforcement Learning	Keru Chen et.al.	2603.16152	null
2026-03-17	The Era of End-to-End Autonomy: Transitioning from Rule-Based Driving to Large Driving Models	Eduardo Nebot et.al.	2603.16050	null
2026-03-18	Safety Case Patterns for VLA-based driving systems: Insights from SimLingo	Gerhard Yu et.al.	2603.16013	null
2026-03-16	CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving	Yihong Guo et.al.	2603.15771	null
2026-03-16	CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving	Erick Silva et.al.	2603.15364	null
2026-03-16	ADV-0: Closed-Loop Min-Max Adversarial Training for Long-Tail Robustness in Autonomous Driving	Tong Nie et.al.	2603.15221	null
2026-03-16	What Matters for Scalable and Robust Learning in End-to-End Driving Planners?	David Holtz et.al.	2603.15185	null
2026-03-16	Learning from Mistakes: Post-Training for Driving VLA with Takeover Data	Yinfeng Gao et.al.	2603.14972	null
2026-03-16	Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation	Xingtai Gui et.al.	2603.14948	null
2026-03-16	FAR-Drive: Frame-AutoRegressive Video Generation in Closed-Loop Autonomous Driving	Yaoru Li et.al.	2603.14938	null
2026-03-16	PerlAD: Towards Enhanced Closed-loop End-to-end Autonomous Driving with Pseudo-simulation-based Reinforcement Learning	Yinfeng Gao et.al.	2603.14908	null
2026-03-16	AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture-of-Transformers for End-to-End Autonomous Driving	Wenhui Huang et.al.	2603.14851	null
2026-03-16	RadarXFormer: Robust Object Detection via Cross-Dimension Fusion of 4D Radar Spectra and Images for Autonomous Driving	Yue Sun et.al.	2603.14822	null
2026-03-16	LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision	Yiming Huang et.al.	2603.14763	null
2026-03-16	TrajMamba: An Ego-Motion-Guided Mamba Model for Pedestrian Trajectory Prediction from an Egocentric Perspective	Yusheng Peng et.al.	2603.14739	null
2026-03-15	Learning to Order: Task Sequencing as In-Context Optimization	Jan Kobiolka et.al.	2603.14550	null
2026-03-15	WorldVLM: Combining World Model Forecasting and Vision-Language Reasoning	Stefan Englmeier et.al.	2603.14497	null
2026-03-15	DRCC-LPVMPC: Robust Data-Driven Control for Autonomous Driving and Obstacle Avoidance	Shiming Fang et.al.	2603.14408	null
2026-03-15	Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces	Jiayuan Du et.al.	2603.14354	null
2026-03-14	Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics	Dennis Haitz et.al.	2603.13917	null
2026-03-14	Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving	Zhexi Lian et.al.	2603.13842	null
2026-03-11	FlowAD: Ego-Scene Interactive Modeling for Autonomous Driving	Mingzhe Guo et.al.	2603.13399	null
2026-03-13	Panoramic Multimodal Semantic Occupancy Prediction for Quadruped Robots	Guoqiang Zhao et.al.	2603.13108	null
2026-03-13	VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation	Juhye Park et.al.	2603.12918	null
2026-03-16	Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection	Kadir-Kaan Özer et.al.	2603.12916	null
2026-03-13	Composing Driving Worlds through Disentangled Control for Adversarial Scenario Generation	Yifan Zhan et.al.	2603.12864	null
2026-03-13	Improving critical buildings energy resilience via shared autonomous electric vehicles – A sequential optimization framework	Jinming Liu et.al.	2603.12771	null
2026-03-13	IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud Registration	Dongxu Zhang et.al.	2603.12719	null
2026-03-13	CarPLAN: Context-Adaptive and Robust Planning with Dynamic Scene Awareness for Autonomous Driving	Junyong Yun et.al.	2603.12607	null
2026-03-12	A Neuro-Symbolic Framework Combining Inductive and Deductive Reasoning for Autonomous Driving Planning	Hongyan Wei et.al.	2603.12421	null
2026-03-12	QUARE: Multi-Agent Negotiation for Balancing Quality Attributes in Requirements Engineering	Haowei Cheng et.al.	2603.11890	null
2026-03-12	R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection	Zhongyu Xia et.al.	2603.11566	null
2026-03-12	Risk-Controllable Multi-View Diffusion for Driving Scenario Generation	Hongyi Lin et.al.	2603.11534	null
2026-03-12	Zero-Shot Cross-City Generalization in End-to-End Autonomous Driving: Self-Supervised versus Supervised Representations	Fatemeh Naeinian et.al.	2603.11417	null
2026-03-11	DriveXQA: Cross-modal Visual Question Answering for Adverse Driving Scene Understanding	Mingzhe Tao et.al.	2603.11380	null
2026-03-11	Radiometric fingerprinting of object surfaces using mobile laser scanning and semantic 3D road space models	Benedikt Schwab et.al.	2603.11252	null
2026-03-11	A Survey of Reasoning in Autonomous Driving Systems: Open Challenges and Emerging Paradigms	Kejin Yu et.al.	2603.11093	null
2026-03-13	DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving	Shuyao Shang et.al.	2603.11041	null
2026-03-11	STADA: Specification-based Testing for Autonomous Driving Agents	Joy Saha et.al.	2603.10940	null
2026-03-11	Evaluating randomized smoothing as a defense against adversarial attacks in trajectory prediction	Julian F. Schumann et.al.	2603.10821	null
2026-03-11	Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction	Hao Zhou et.al.	2603.10597	null
2026-03-11	KnowDiffuser: A Knowledge-Guided Diffusion Planner with LM Reasoning and Prior-Informed Trajectory Initialization	Fan Ding et.al.	2603.10441	null
2026-03-11	Motion Forcing: A Decoupled Framework for Robust Video Generation in Motion Dynamics	Tianshuo Xu et.al.	2603.10408	null
2026-03-11	PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner	Eugene Ku et.al.	2603.10330	null
2026-03-10	HG-Lane: High-Fidelity Generation of Lane Scenes under Adverse Weather and Lighting Conditions without Re-annotation	Daichao Zhao et.al.	2603.10128	null
2026-03-10	$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs	Kaixin Lin et.al.	2603.09737	null
2026-03-10	RESBev: Making BEV Perception More Robust	Lifeng Zhuo et.al.	2603.09529	null
2026-03-10	Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning	Chun-Peng Chang et.al.	2603.09512	null
2026-03-10	StyleVLA: Driving Style-Aware Vision Language Action Model for Autonomous Driving	Yuan Gao et.al.	2603.09482	null
2026-03-10	EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation	Jiajun Cao et.al.	2603.09465	null
2026-03-10	Declarative Scenario-based Testing with RoadLogic	Ezio Bartocci et.al.	2603.09455	null
2026-03-10	Open-World Motion Forecasting	Nicolas Schischka et.al.	2603.09420	null
2026-03-10	Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning	Kanishkha Jaisankar et.al.	2603.09255	null
2026-03-09	Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving Architectures	David Fernandez et.al.	2603.08897	null
2026-03-09	OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras	Yongzhi Lin et.al.	2603.08521	null
2026-03-09	Graph Based Semantic Encoder Decoder Framework for Task Oriented Communications in Connected Autonomous Vehicles	Soheyb Ribouh et.al.	2603.08438	null
2026-03-09	DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving	Zhuolin He et.al.	2603.08254	null
2026-03-09	ALOOD: Exploiting Language Representations for LiDAR-based Out-of-Distribution Object Detection	Michael Kösel et.al.	2603.08180	null
2026-03-09	SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving	Zihan You et.al.	2603.08113	null
2026-03-09	RLPR: Radar-to-LiDAR Place Recognition via Two-Stage Asymmetric Cross-Modal Alignment for Autonomous Driving	Zhangshuo Qi et.al.	2603.07920	null
2026-03-09	NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving	Ximeng Tao et.al.	2603.07901	null
2026-03-09	Toward Unified Multimodal Representation Learning for Autonomous Driving	Ximeng Tao et.al.	2603.07874	null
2026-03-08	4DRC-OCC: Robust Semantic Occupancy Prediction Through Fusion of 4D Radar and Camera	David Ninfa et.al.	2603.07794	null
2026-03-08	Fast Attention-Based Simplification of LiDAR Point Clouds for Object Detection and Classification	Z. Rozsa et.al.	2603.07593	null
2026-03-08	ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction	Haibao Yu et.al.	2603.07552	null
2026-03-08	RayD3D: Distilling Depth Knowledge Along the Ray for Robust Multi-View 3D Object Detection	Rui Ding et.al.	2603.07493	null
2026-03-07	Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface	Balint K. Hodossy et.al.	2603.07364	null
2026-03-07	Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving	Jiazhuo Li et.al.	2603.07264	null
2026-03-07	Perception-Aware Multimodal Spatial Reasoning from Monocular Images	Yanchun Cheng et.al.	2603.06985	null
2026-03-06	Feasibility Restoration under Conflicting STL Specifications with Pareto-Optimal Refinement	Tianhao Wu et.al.	2603.06947	null
2026-03-06	VertiAdaptor: Online Kinodynamics Adaptation for Vertically Challenging Terrain	Tong Xu et.al.	2603.06887	null
2026-03-06	Improved Constrained Generation by Bridging Pretrained Generative Models	Xiaoxuan Liang et.al.	2603.06742	null
2026-03-06	BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations	Thomas Monninger et.al.	2603.06576	null
2026-03-06	Modeling and Measuring Redundancy in Multisource Multimodal Data for Autonomous Driving	Yuhan Zhou et.al.	2603.06544	null
2026-03-06	NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving	Kai Luo et.al.	2603.06254	null
2026-03-06	TaPD: Temporal-adaptive Progressive Distillation for Observation-Adaptive Trajectory Forecasting in Autonomous Driving	Mingyu Fan et.al.	2603.06231	null
2026-03-06	VG3S: Visual Geometry Grounded Gaussian Splatting for Semantic Occupancy Prediction	Xiaoyang Yan et.al.	2603.06210	null
2026-03-06	Transforming Omnidirectional RGB-LiDAR data into 3D Gaussian Splatting	Semin Bae et.al.	2603.06061	null
2026-03-06	TADPO: Reinforcement Learning Goes Off-road	Zhouchonghao Wu et.al.	2603.05995	null
2026-03-06	OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous Driving	Kota Shimomura et.al.	2603.05936	null
2026-03-06	Expert Knowledge-driven Reinforcement Learning for Autonomous Racing via Trajectory Guidance and Dynamics Constraints	Bo Leng et.al.	2603.05842	null
2026-03-05	Post Fusion Bird’s Eye View Feature Stabilization for Robust Multimodal 3D Detection	Trung Tien Dong et.al.	2603.05623	null
2026-03-05	Fusion4CA: Boosting 3D Object Detection via Comprehensive Image Exploitation	Kang Luo et.al.	2603.05305	null
2026-03-05	From Code to Road: A Vehicle-in-the-Loop and Digital Twin-Based Framework for Central Car Server Testing in Autonomous Driving	Chengdong Wu et.al.	2603.05279	null
2026-03-05	K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation	Mingxuan Mu et.al.	2603.04868	null
2026-03-05	On the Strengths and Weaknesses of Data for Open-set Embodied Assistance	Pradyumna Tambwekar et.al.	2603.04819	null
2026-03-04	Risk-Aware Rulebooks for Multi-Objective Trajectory Evaluation under Uncertainty	Tichakorn Wongpiromsarn et.al.	2603.04603	null
2026-03-04	PRAM-R: A Perception-Reasoning-Action-Memory Framework with LLM-Guided Modality Routing for Adaptive Autonomous Driving	Yi Zhang et.al.	2603.04222	null
2026-03-04	GSeg3D: A High-Precision Grid-Based Algorithm for Safety-Critical Ground Segmentation in LiDAR Point Clouds	Muhammad Haider Khan Lodhi et.al.	2603.04208	null
2026-03-04	SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling	Jinlong Cui et.al.	2603.04071	null
2026-03-04	Map-Agnostic And Interactive Safety-Critical Scenario Generation via Multi-Objective Tree Search	Wenyun Li et.al.	2603.03978	null
2026-03-04	Spatial Causal Prediction in Video	Yanguang Zhao et.al.	2603.03944	null
2026-03-04	LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving	Qihao Sun et.al.	2603.03765	null
2026-03-03	Analyzing the Impact of Adversarial Attacks on C-V2X-Enabled Road Safety: An Age of Information Perspective	Mahmudul Hassan Ashik et.al.	2603.03462	null
2026-03-03	Radar-based Pose Optimization for HD Map Generation from Noisy Multi-Drive Vehicle Fleet Data	Alexander Blumberg et.al.	2603.03453	null
2026-03-03	Utonia: Toward One Encoder for All Point Clouds	Yujia Zhang et.al.	2603.03283	null
2026-03-03	ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments	Ziyang Gong et.al.	2603.03198	null
2026-03-03	Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving	Tianze Zhu et.al.	2603.02613	null
2026-03-03	VLMFusionOcc3D: VLM Assisted Multi-Modal 3D Semantic Occupancy Prediction	A. Enes Doruk et.al.	2603.02609	null
2026-03-03	CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration	Huichun Liu et.al.	2603.02560	link
2026-03-03	AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation	Zhulin Jiang et.al.	2603.02542	null
2026-03-03	EIMC: Efficient Instance-aware Multi-modal Collaborative Perception	Kang Yang et.al.	2603.02532	link
2026-03-03	LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model	Xiangyu Li et.al.	2603.02528	null
2026-03-03	ModalPatch: A Plug-and-Play Module for Robust Multi-Modal 3D Object Detection under Modality Drop	Shuangzhi Li et.al.	2603.02481	null
2026-03-02	TruckDrive: Long-Range Autonomous Highway Driving Dataset	Filippo Ghilotti et.al.	2603.02413	null
2026-03-02	LAD-Drive: Bridging Language and Trajectory with Action-Aware Diffusion Transformers	Fabian Schmidt et.al.	2603.02035	null
2026-03-02	LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving	Yuechen Luo et.al.	2603.01928	null
2026-03-02	Streaming Real-Time Trajectory Prediction Using Endpoint-Aware Modeling	Alexander Prutsch et.al.	2603.01864	null
2026-03-02	GroupEnsemble: Efficient Uncertainty Estimation for DETR-based Object Detection	Yutong Yang et.al.	2603.01847	null
2026-03-02	WhisperNet: A Scalable Solution for Bandwidth-Efficient Collaboration	Gong Chen et.al.	2603.01708	null
2026-03-02	DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving	Enhui Ma et.al.	2603.01637	null
2026-03-02	Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing	Zijin Yin et.al.	2603.01535	null
2026-03-02	VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models	Duoxun Tang et.al.	2603.01454	null
2026-03-02	Unifying Language-Action Understanding and Generation for Autonomous Driving	Xinyang Wang et.al.	2603.01441	null
2026-03-02	Perspective-Equivariant Fine-tuning for Multispectral Demosaicing without Ground Truth	Andrew Wang et.al.	2603.01332	null
2026-03-01	FoSS: Modeling Long Range Dependencies and Multimodal Uncertainty in Trajectory Prediction via Fourier State Space Integration	Yizhou Huang et.al.	2603.01284	null
2026-03-01	Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures	Yuechen Luo et.al.	2603.01063	null
2026-03-01	An Open-Source Modular Benchmark for Diffusion-Based Motion Planning in Closed-Loop Autonomous Driving	Yun Li et.al.	2603.01023	null
2026-03-01	Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving	Xubo Zhu et.al.	2603.01007	null
2026-03-01	DriveCode: Domain Specific Numerical Encoding for LLM-Based Autonomous Driving	Zhiye Wang et.al.	2603.00919	null
2026-02-28	DRIV-EX: Counterfactual Explanations for Driving LLMs	Amaia Cardiel et.al.	2603.00696	null
2026-02-28	Wild-Drive: Off-Road Scene Captioning and Path Planning via Robust Multi-modal Routing and Efficient Large Language Model	Zihang Wang et.al.	2603.00694	null
2026-02-28	ReMoT: Reinforcement Learning with Motion Contrast Triplets	Cong Wan et.al.	2603.00461	null
2026-02-28	PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models	Yuanhao Su et.al.	2603.00412	null
2026-02-27	TSC: Topology-Conditioned Stackelberg Coordination for Multi-Agent Reinforcement Learning in Interactive Driving	Xiaotong Zhang et.al.	2602.23896	null
2026-02-27	SelfOccFlow: Towards end-to-end self-supervised 3D Occupancy Flow prediction	Xavier Timoneda et.al.	2602.23894	null
2026-02-27	Bandwidth-adaptive Cloud-Assisted 360-Degree 3D Perception for Autonomous Vehicles	Faisal Hawladera et.al.	2602.23871	null
2026-02-27	FPPS: An FPGA-Based Point Cloud Processing System	Xiaofeng Zhou et.al.	2602.23787	null
2026-02-27	CycleBEV: Regularizing View Transformation Networks via View Cycle Consistency for Bird’s-Eye-View Semantic Segmentation	Jeongbin Hong et.al.	2602.23575	null
2026-02-26	TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving	Tugrul Gorgulu et.al.	2602.23499	null
2026-02-26	Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving	Jiangxin Sun et.al.	2602.23259	null
2026-02-27	Towards Intelligible Human-Robot Interaction: An Active Inference Approach to Occluded Pedestrian Scenarios	Kai Chen et.al.	2602.23109	null
2026-02-26	Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving	Yinan Zheng et.al.	2602.22801	null
2026-02-26	Transformer Actor-Critic for Efficient Freshness-Aware Resource Allocation	Maryam Ansarifard et.al.	2602.22774	null
2026-02-27	The Swarm Intelligence Freeway-Urban Trajectories (SWIFTraj) Dataset – Part I: Dataset Description and Applications	Yu Han et.al.	2602.22563	null
2026-02-26	DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation	Zhechao Wang et.al.	2602.22549	null
2026-02-25	WeatherCity: Urban Scene Reconstruction with Controllable Multi-Weather Transformation	Wenhua Wu et.al.	2602.22096	null
2026-02-25	Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos	Matthew Strong et.al.	2602.22091	null
2026-02-25	PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning	Zekai Lin et.al.	2602.21992	null
2026-02-25	MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving	Lingjun Zhang et.al.	2602.21952	null
2026-02-25	GFPL: Generative Federated Prototype Learning for Resource-Constrained and Data-Imbalanced Vision Task	Shiwei Lu et.al.	2602.21873	null
2026-02-25	SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction	Haoxiang Fu et.al.	2602.21589	null
2026-02-25	Unified Unsupervised and Sparsely-Supervised 3D Object Detection by Semantic Pseudo-Labeling and Prototype Learning	Yushen He et.al.	2602.21484	null
2026-02-24	HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles	Yifan Wang et.al.	2602.21333	null
2026-02-24	Uncertainty-Aware Diffusion Model for Multimodal Highway Trajectory Prediction via DDIM Sampling	Marion Neumeier et.al.	2602.21319	null
2026-02-25	NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning	Ishaan Rawal et.al.	2602.21172	null
2026-02-24	UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling	Kaiyuan Tan et.al.	2602.20943	null
2026-02-24	VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving	Jie Wang et.al.	2602.20794	null
2026-02-24	GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation	Hao Zhang et.al.	2602.20673	null
2026-02-24	An LLM-driven Scenario Generation Pipeline Using an Extended Scenic DSL for Autonomous Driving Safety Validation	Fida Khandaker Safa et.al.	2602.20644	null
2026-02-24	Boosting Instance Awareness via Cross-View Correlation with 4D Radar and Camera for 3D Object Detection	Xiaokai Bai et.al.	2602.20632	null
2026-02-24	Efficient and Explainable End-to-End Autonomous Driving via Masked Vision-Language-Action Diffusion	Jiaru Zhang et.al.	2602.20577	null
2026-02-24	An interactive enhanced driving dataset for autonomous driving	Haojie Feng et.al.	2602.20575	null
2026-02-23	MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving	Junli Wang et.al.	2602.20060	null
2026-02-23	Probabilistic Photonic Computing	Frank Brückerhoff-Plückelmann et.al.	2602.19968	null
2026-02-23	VGGT-MPR: VGGT-Enhanced Multimodal Place Recognition in Autonomous Driving Environments	Jingyi Xu et.al.	2602.19735	null
2026-02-22	Safe and Interpretable Multimodal Path Planning for Multi-Agent Cooperation	Haojun Shi et.al.	2602.19304	null
2026-02-22	OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness	Phuc D. A. Nguyen et.al.	2602.19035	null
2026-02-21	Open-Vocabulary Domain Generalization in Urban-Scene Segmentation	Dong Zhao et.al.	2602.18853	null
2026-02-21	Driving with A Thousand Faces: A Benchmark for Closed-Loop Personalized End-to-End Autonomous Driving	Xiaoru Dong et.al.	2602.18757	null
2026-02-20	OODBench: Out-of-Distribution Benchmark for Large Vision-Language Models	Ling Lin et.al.	2602.18094	null
2026-02-20	Dynamic Deception: When Pedestrians Team Up to Fool Autonomous Cars	Masoud Jamshidiyan Tehrani et.al.	2602.18079	null
2026-02-20	Faster Training, Fewer Labels: Self-Supervised Pretraining for Fine-Grained BEV Segmentation	Daniel Busch et.al.	2602.18066	null
2026-02-19	Conditional Flow Matching for Continuous Anomaly Detection in Autonomous Driving on a Manifold-Aware Spectral Space	Antonio Guillen-Perez et.al.	2602.17586	null
2026-02-19	Hybrid System Planning using a Mixed-Integer ADMM Heuristic and Hybrid Zonotopes	Joshua A. Robbins et.al.	2602.17574	link
2026-02-19	HiMAP: History-aware Map-occupancy Prediction with Fallback	Yiming Xu et.al.	2602.17231	null
2026-02-19	Multi-session Localization and Mapping Exploiting Topological Information	Lorenzo Montano-Olivan et.al.	2602.17226	null
2026-02-19	3D Scene Rendering with Multimodal Gaussian Splatting	Chi-Shiang Gau et.al.	2602.17124	null
2026-02-18	Boreas Road Trip: A Multi-Sensor Autonomous Driving Dataset on Challenging Roads	Daniil Lisus et.al.	2602.16870	null
2026-02-18	PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction	Bo Lang et.al.	2602.16669	null
2026-02-18	A Contrastive Learning Framework Empowered by Attention-based Feature Adaptation for Street-View Image Classification	Qi You et.al.	2602.16590	null
2026-02-17	ScenicRules: An Autonomous Driving Benchmark with Multi-Objective Specifications and Abstract Scenarios	Kevin Kai-Chun Chang et.al.	2602.16073	null
2026-02-15	A Comprehensive Survey on Deep Learning-Based LiDAR Super-Resolution for Autonomous Driving	June Moh Goo et.al.	2602.15904	null
2026-02-17	RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution	Youngwan Jin et.al.	2602.15490	link
2026-02-16	Near-Optimal Sample Complexity for Online Constrained MDPs	Chang Liu et.al.	2602.15076	null
2026-02-16	ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery	Ayush Shrivastava et.al.	2602.14989	link
2026-02-16	DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI	En Yu et.al.	2602.14974	null
2026-02-16	DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving	Chenxu Dang et.al.	2602.14577	null
2026-02-16	Multimodal Covariance Steering in Belief Space with Active Probing and Influence for Autonomous Driving	Devodita Chakravarty et.al.	2602.14540	null
2026-02-15	A Generalizable Physics-guided Causal Model for Trajectory Prediction in Autonomous Driving	Zhenyu Zong et.al.	2602.13936	null
2026-02-14	Privacy-Concealing Cooperative Perception for BEV Scene Segmentation	Song Wang et.al.	2602.13555	null
2026-02-14	Nighttime Autonomous Driving Scene Reconstruction with Physically-Based Gaussian Splatting	Tae-Kyeong Kim et.al.	2602.13549	null
2026-02-13	TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios	Wentao Xu et.al.	2602.13040	null
2026-02-13	MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting	Mohammed Amine Bencheikh Lehocine et.al.	2602.13003	null
2026-02-13	RoadscapesQA: A Multitask, Multimodal Dataset for Visual Question Answering on Indian Roads	Vijayasri Iyer et.al.	2602.12877	null
2026-02-13	DPUConfig: Optimizing ML Inference in FPGAs Using Reinforcement Learning	Alexandros Patras et.al.	2602.12847	null
2026-02-13	The Constant Eye: Benchmarking and Bridging Appearance Robustness in Autonomous Driving	Jiabao Wang et.al.	2602.12563	null
2026-02-13	Self-Supervised JEPA-based World Models for LiDAR Occupancy Completion and Forecasting	Haoran Zhu et.al.	2602.12540	null
2026-02-12	DiffPlace: Street View Generation via Place-Controllable Diffusion Model Enhancing Place Recognition	Ji Li et.al.	2602.11875	null
2026-02-12	Talk2DM: Enabling Natural Language Querying and Commonsense Reasoning for Vehicle-Road-Cloud Integrated Dynamic Maps with Large Language Models	Lu Tao et.al.	2602.11860	null
2026-02-12	SToRM: Supervised Token Reduction for Multi-modal LLMs toward efficient end-to-end autonomous driving	Seo Hyun Kim et.al.	2602.11656	null
2026-02-11	DD-MDN: Human Trajectory Forecasting with Diffusion-Based Dual Mixture Density Networks and Uncertainty Self-Calibration	Manuel Hetzel et.al.	2602.11214	null
2026-02-11	Interpretable Vision Transformers in Monocular Depth Estimation via SVDA	Vasileios Arampatzakis et.al.	2602.11005	null
2026-02-11	ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving	Jinqing Zhang et.al.	2602.10884	null
2026-02-11	Viewpoint Recommendation for Point Cloud Labeling through Interaction Cost Modeling	Yu Zhang et.al.	2602.10871	null
2026-02-11	From Steering to Pedalling: Do Autonomous Driving VLMs Generalize to Cyclist-Assistive Spatial Perception and Planning?	Krishna Kanth Nakka et.al.	2602.10771	null
2026-02-11	AurigaNet: A Real-Time Multi-Task Network for Enhanced Urban Driving Perception	Kiarash Ghasemzadeh et.al.	2602.10660	null
2026-02-11	Found-RL: foundation model-enhanced reinforcement learning for autonomous driving	Yansong Qu et.al.	2602.10458	null
2026-02-10	Adaptive Time Step Flow Matching for Autonomous Driving Motion Planning	Ananya Trivedi et.al.	2602.10285	null
2026-02-10	AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems	Ishan Sahu et.al.	2602.10160	null
2026-02-11	Robust Vision Systems for Connected and Autonomous Vehicles: Security Challenges and Attack Vectors	Sandeep Gupta et.al.	2602.09740	null
2026-02-09	Robustness Is a Function, Not a Number: A Factorized Comprehensive Study of OOD Robustness in Vision-Based Driving	Amir Mallak et.al.	2602.09018	null
2026-02-09	Modeling 3D Pedestrian-Vehicle Interactions for Vehicle-Conditioned Pose Forecasting	Guangxun Zhu et.al.	2602.08962	null
2026-02-09	Multi-Staged Framework for Safety Analysis of Offloaded Services in Distributed Intelligent Transportation Systems	Robin Dehler et.al.	2602.08821	null
2026-02-09	A Generic Service-Oriented Function Offloading Framework for Connected Automated Vehicles	Robin Dehler et.al.	2602.08799	null
2026-02-09	Overview and Comparison of AVS Point Cloud Compression Standard	Wei Gao et.al.	2602.08613	null
2026-02-09	Head-to-Head autonomous racing at the limits of handling in the A2RL challenge	Simon Hoffmann et.al.	2602.08571	null
2026-02-09	SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios	Tian Gao et.al.	2602.08440	null
2026-02-09	Vec-QMDP: Vectorized POMDP Planning on CPUs for Real-Time Autonomous Driving	Xuanjin Jin et.al.	2602.08334	null
2026-02-09	Personalized Autonomous Driving via Optimal Control with Clearance Constraints from Questionnaires	Yongjae Lim et.al.	2602.08326	null
2026-02-09	Generating Adversarial Events: A Motion-Aware Point Cloud Framework	Hongwei Ren et.al.	2602.08230	null
2026-02-09	Self-Supervised Bootstrapping of Action-Predictive Embodied Reasoning	Milan Ganai et.al.	2602.08167	null
2026-02-08	MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection	Venkatraman Narayanan et.al.	2602.08126	null
2026-02-08	ForecastOcc: Vision-based Semantic Occupancy Forecasting	Riya Mohan et.al.	2602.08006	null
2026-02-08	Analyzing the Impact of Simulation Fidelity on the Evaluation of Autonomous Driving Motion Control	Simon Sagmeister et.al.	2602.07984	null
2026-02-07	All-Optical Segmentation via Diffractive Neural Networks for Autonomous Driving	Yingjie Li et.al.	2602.07717	null
2026-02-07	Vision and language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning	Ross Greer et.al.	2602.07680	null
2026-02-07	Seeing Roads Through Words: A Language-Guided Framework for RGB-T Driving Scene Segmentation	Ruturaj Reddy et.al.	2602.07343	null
2026-02-07	RAPiD: Real-time Deterministic Trajectory Planning via Diffusion Behavior Priors for Safe and Efficient Autonomous Driving	Ruturaj Reddy et.al.	2602.07339	null
2026-02-09	Temperature Scaling Attack Disrupting Model Confidence in Federated Learning	Kichang Lee et.al.	2602.06638	null
2026-02-06	DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving	Feiyang Jia et.al.	2602.06521	null
2026-02-06	Rebenchmarking Unsupervised Monocular 3D Occupancy Prediction	Zizhan Guo et.al.	2602.06488	null
2026-02-05	Addressing the Waypoint-Action Gap in End-to-End Autonomous Driving via Vehicle Motion Models	Jorge Daniel Rodríguez-Vidal et.al.	2602.06214	null
2026-02-05	Driving with DINO: Vision Foundation Features as a Unified Bridge for Sim-to-Real Generation in Autonomous Driving	Xuyang Chen et.al.	2602.06159	null
2026-02-05	Thinking with Geometry: Active Geometry Integration for Spatial Reasoning	Haoyuan Li et.al.	2602.06037	null
2026-02-05	LSA: Localized Semantic Alignment for Enhancing Temporal Consistency in Traffic Video Generation	Mirlan Karimov et.al.	2602.05966	null
2026-02-05	ROMAN: Reward-Orchestrated Multi-Head Attention Network for Autonomous Driving System Testing	Jianlei Chi et.al.	2602.05629	null
2026-02-05	Unified Sensor Simulation for Autonomous Driving	Nikolay Patakin et.al.	2602.05617	null
2026-02-05	Visual Implicit Geometry Transformer for Autonomous Driving	Arsenii Shirokov et.al.	2602.05573	null
2026-02-06	A Comparative Study of 3D Person Detection: Sensor Modalities and Robustness in Diverse Indoor and Outdoor Environments	Malaz Tamim et.al.	2602.05538	null
2026-02-05	Imagine a City: CityGenAgent for Procedural 3D City Generation	Zishan Liu et.al.	2602.05362	null
2026-02-04	Reinforcement Learning Enhancement Using Vector Semantic Representation and Symbolic Reasoning for Human-Centered Autonomous Emergency Braking	Vinal Asodia et.al.	2602.05079	null
2026-02-04	Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty	Rui Liu et.al.	2602.04763	null
2026-02-06	DRMOT: A Dataset and Framework for RGBD Referring Multi-Object Tracking	Sijia Chen et.al.	2602.04692	null
2026-02-04	Safe and Stylized Trajectory Planning for Autonomous Driving via Diffusion Model	Shuo Pei et.al.	2602.04329	null
2026-02-04	AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models	Yuxuan Han et.al.	2602.04256	null
2026-02-04	Natural Language Instructions for Scene-Responsive Human-in-the-Loop Motion Planning in Autonomous Driving using Vision-Language-Action Models	Angel Martinez-Sanchez et.al.	2602.04184	null
2026-02-04	The Dynamics of Attention across Automated and Manual Driving Modes: A Driving Simulation Study	Yuan Cai et.al.	2602.04164	null
2026-02-03	Multi-Player, Multi-Strategy Quantum Game Model for Interaction-Aware Decision-Making in Autonomous Driving	Karim Essalmi et.al.	2602.03571	null
2026-02-03	HetroD: A High-Fidelity Drone Dataset and Benchmark for Autonomous Driving in Heterogeneous Traffic	Yu-Hsiang Chen et.al.	2602.03447	null
2026-02-03	PlanTRansformer: Unified Prediction and Planning with Goal-conditioned Transformer	Constantin Selzer et.al.	2602.03376	null
2026-02-03	Multi-Resolution Alignment for Voxel Sparsity in Camera-Based 3D Semantic Scene Completion	Zhiwen Yang et.al.	2602.03371	null
2026-02-03	InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation	Zhuoran Yang et.al.	2602.03242	null
2026-02-03	ConsisDrive: Identity-Preserving Driving World Models for Video Generation by Instance Mask	Zhuoran Yang et.al.	2602.03213	null
2026-02-04	A Unified Candidate Set with Scene-Adaptive Refinement via Diffusion for End-to-End Autonomous Driving	Zhengfei Wu et.al.	2602.03112	null
2026-02-03	JRDB-Pose3D: A Multi-person 3D Human Pose and Shape Estimation Dataset for Robotics	Sandika Biswas et.al.	2602.03064	null
2026-02-02	Accelerating Structured Chain-of-Thought in Autonomous Vehicles	Yi Gu et.al.	2602.02864	null
2026-02-02	AROLA: A Modular Layered Architecture for Scaled Autonomous Racing	Fam Shihata et.al.	2602.02730	null
2026-02-03	Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL	Julian Lemmel et.al.	2602.02236	null
2026-02-02	LiFlow: Flow Matching for 3D LiDAR Scene Completion	Andrea Matteazzi et.al.	2602.02232	null
2026-02-02	UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving	Guosheng Zhao et.al.	2602.02002	null
2026-02-02	ForSim: Stepwise Forward Simulation for Traffic Policy Fine-Tuning	Keyu Chen et.al.	2602.01916	null
2026-02-02	UniDWM: Towards a Unified Driving World Model via Multifaceted Representation Learning	Shuai Liu et.al.	2602.01536	null
2026-02-01	TF-Lane: Traffic Flow Module for Robust Lane Perception	Yihan Xie et.al.	2602.01277	null
2026-02-01	OASIS-DC: Generalizable Depth Completion via Output-level Alignment of Sparse-Integrated Monocular Pseudo Depth	Jaehyeon Cho et.al.	2602.01268	null
2026-02-01	LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions	Jingjing Wang et.al.	2602.01118	link
2026-02-01	HERMES: A Holistic End-to-End Risk-Aware Multimodal Embodied System with Vision-Language Models for Long-Tail Autonomous Driving	Weizhe Tang et.al.	2602.00993	null
2026-01-31	A Graph-based Framework for Coverage Analysis in Autonomous Driving	Thomas Muehlenstädt et.al.	2602.00903	null
2026-01-31	VVLoc: Prior-free 3-DoF Vehicle Visual Localization	Ze Huang et.al.	2602.00810	null
2026-01-31	Physics-informed Diffusion Mamba Transformer for Real-world Driving	Hang Zhou et.al.	2602.00808	null
2026-01-31	UniMotion: A Unified Motion Framework for Simulation, Prediction and Planning	Nan Song et.al.	2602.00566	null
2026-01-30	Deep Learning-Based Object Detection for Autonomous Vehicles: A Comparative Study of One-Stage and Two-Stage Detectors on Basic Traffic Objects	Bsher Karbouj et.al.	2602.00385	null
2026-01-30	IRL-DAL: Safe and Adaptive Trajectory Planning for Autonomous Driving via Energy-Guided Diffusion Models	Seyed Ahmad Hosseini Miangoleh et.al.	2601.23266	null
2026-01-30	FlowCalib: LiDAR-to-Vehicle Miscalibration Detection using Scene Flows	Ilir Tahiraj et.al.	2601.23107	null
2026-01-30	MTDrive: Multi-turn Interactive Reinforcement Learning for Autonomous Driving	Xidong Li et.al.	2601.22930	null
2026-01-30	Toward Fully Autonomous Driving: AI, Challenges, Opportunities, and Needs	Lars Ullrich et.al.	2601.22927	null
2026-01-30	A Serverless Edge-Native Data Processing Architecture for Autonomous Driving Training	Fabian Bally et.al.	2601.22919	null
2026-01-30	AutoMerge: Search-Based Model Merging Framework for Effective Model Reuse	You Lu et.al.	2601.22748	null
2026-01-30	GaussianOcc3D: A Gaussian-Based Adaptive Multi-modal 3D Occupancy Prediction	A. Enes Doruk et.al.	2601.22729	null
2026-01-29	FlexMap: Generalized HD Map Construction from Flexible Camera Configurations	Run Wang et.al.	2601.22376	null
2026-01-29	PoSafeNet: Safe Learning with Poset-Structured Neural Nets	Kiwan Wong et.al.	2601.22356	null
2026-01-29	Drive-JEPA: Video JEPA Meets Multimodal Trajectory Distillation for End-to-End Driving	Linhan Wang et.al.	2601.22032	null
2026-01-29	LLM-Driven Scenario-Aware Planning for Autonomous Driving	He Li et.al.	2601.21876	null
2026-01-29	4D-CAAL: 4D Radar-Camera Calibration and Auto-Labeling for Autonomous Driving	Shanliang Yao et.al.	2601.21454	null
2026-01-29	Drive-KD: Multi-Teacher Distillation for VLMs in Autonomous Driving	Weitong Lian et.al.	2601.21288	null
2026-01-28	Li-ViP3D++: Query-Gated Deformable Camera-LiDAR Fusion for End-to-End Perception and Trajectory Prediction	Matej Halinkovic et.al.	2601.20720	null
2026-01-28	Learning Contextual Runtime Monitors for Safe AI-Based Autonomy	Alejandro Luque-Cerpa et.al.	2601.20666	null
2026-01-28	Unsupervised Anomaly Detection in Multi-Agent Trajectory Prediction via Transformer-Based Models	Qing Lyu et.al.	2601.20367	null
2026-01-27	Game-Theoretic Autonomous Driving: A Graphs of Convex Sets Approach	Nikolaj Käfer et.al.	2601.20054	null
2026-01-27	ScenePilot-Bench: A Large-Scale Dataset and Benchmark for Evaluation of Vision-Language Models in Autonomous Driving	Yujin Wang et.al.	2601.19582	null
2026-01-27	Instance-Guided Radar Depth Estimation for 3D Object Detection	Chen-Chou Lo et.al.	2601.19314	null
2026-01-26	Learning the Pareto Space of Multi-Objective Autonomous Driving: A Modular, Data-Driven Approach	Mohammad Elayan et.al.	2601.18913	null
2026-01-26	Towards Safety-Compliant Transformer Architectures for Automotive Systems	Sven Kirchner et.al.	2601.18850	null
2026-01-25	Masked Depth Modeling for Spatial Perception	Bin Tan et.al.	2601.17895	null
2026-01-23	PocketDVDNet: Realtime Video Denoising for Real Camera Noise	Crispian Morris et.al.	2601.16780	null
2026-01-22	DMAVA: Distributed Multi-Autonomous Vehicle Architecture Using Autoware	Zubair Islam et.al.	2601.16336	null
2026-01-22	EVolSplat4D: Efficient Volume-based Gaussian Splatting for 4D Urban Scene Synthesis	Sheng Miao et.al.	2601.15951	null
2026-01-22	DualShield: Safe Model Predictive Diffusion via Reachability Analysis for Interactive Autonomous Driving	Rui Yang et.al.	2601.15729	null
2026-01-22	SuperOcc: Toward Cohesive Temporal Modeling for Superquadric-based Occupancy Prediction	Zichen Yu et.al.	2601.15644	null
2026-01-21	SplatBus: A Gaussian Splatting Viewer Framework via GPU Interprocess Communication	Yinghan Xu et.al.	2601.15431	null
2026-01-29	DrivIng: A Large-Scale Multimodal Driving Dataset with Full Digital Twin Integration	Dominik Rößle et.al.	2601.15260	null
2026-01-21	AutoDriDM: An Explainable Benchmark for Decision-Making of Vision-Language Models in Autonomous Driving	Zecong Tang et.al.	2601.14702	null
2026-01-20	Vision-Based Natural Language Scene Understanding for Autonomous Driving: An Extended Dataset and a New Model for Traffic Scene Description Generation	Danial Sadrian Zadeh et.al.	2601.14438	null
2026-01-20	Correcting and Quantifying Systematic Errors in 3D Box Annotations for Autonomous Driving	Alexandre Justo Miro et.al.	2601.14038	null
2026-01-20	PAtt: A Pattern Attention Network for ETA Prediction Using Historical Speed Profiles	ByeoungDo Kim et.al.	2601.13793	null
2026-01-19	NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness	Ali Shafiee Sarvestani et.al.	2601.13162	null
2026-01-19	AsyncBEV: Cross-modal Flow Alignment in Asynchronous 3D Object Detection	Shiming Wang et.al.	2601.12994	null
2026-01-19	PlannerRFT: Reinforcing Diffusion Planners through Closed-Loop and Sample-Efficient Fine-Tuning	Hongchen Li et.al.	2601.12901	null
2026-01-19	Efficient Local-to-Global Collaborative Perception via Joint Communication and Computation Optimization	Hui Zhang et.al.	2601.12749	null
2026-01-19	VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy Robustness	Qimao Chen et.al.	2601.12672	null
2026-01-18	SGCP: A Self-Organized Game-Theoretic Framework For Collaborative Perception	Zechuan Gong et.al.	2601.12524	null
2026-01-18	HOT-POT: Optimal Transport for Sparse Stereo Matching	Antonin Clerc et.al.	2601.12423	null
2026-01-17	Neural Process-Based Reactive Controller for Autonomous Racing	Devin Hunter et.al.	2601.12143	null
2026-01-17	Listen, Look, Drive: Coupling Audio Instructions for User-aware VLA-based Autonomous Driving	Ziang Guo et.al.	2601.12142	null
2026-01-17	Kernel-Based Learning of Safety Barriers	Oliver Schön et.al.	2601.12002	null
2026-01-17	Beyond Target-Level: ISAC-Enabled Event-Level Sensing for Behavioral Intention Prediction	Haotian Liu et.al.	2601.11894	null
2026-01-16	Toward Human-Centered Human-AI Interaction: Advances in Theoretical Frameworks and Practice	Zaifeng Gao et.al.	2601.11812	null
2026-01-16	Cross-Domain Object Detection Using Unsupervised Image Translation	Vinicius F. Arruda et.al.	2601.11779	null
2026-01-16	Generative Scenario Rollouts for End-to-End Autonomous Driving	Rajeev Yasarla et.al.	2601.11475	null
2026-01-21	SUG-Occ: An Explicit Semantics and Uncertainty Guided Sparse Learning Framework for Real-Time 3D Occupancy Prediction	Hanlin Wu et.al.	2601.11396	null
2026-01-15	A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems	Yizhou Wang et.al.	2601.10819	null
2026-01-15	See Less, Drive Better: Generalizable End-to-End Autonomous Driving via Foundation Models Stochastic Patch Selection	Amir Mallak et.al.	2601.10707	null
2026-01-15	DeepUrban: Interaction-Aware Trajectory Prediction and Planning for Automated Driving by Aerial Imagery	Constantin Selzer et.al.	2601.10554	null
2026-01-15	BikeActions: An Open Platform and Benchmark for Cyclist-Centric VRU Action Recognition	Max A. Buettner et.al.	2601.10521	null
2026-01-15	SatMap: Revisiting Satellite Maps as Prior for Online HD Map Construction	Kanak Mazumder et.al.	2601.10512	null
2026-01-15	OT-Drive: Out-of-Distribution Off-Road Traversable Area Segmentation via Optimal Transport	Zhihua Zhao et.al.	2601.09952	null
2026-01-14	LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving	Carlo Sgaravatti et.al.	2601.09812	null
2026-01-14	MAD: Motion Appearance Decoupling for efficient Driving World Models	Ahmad Rahimi et.al.	2601.09452	null
2026-01-14	Data Scaling for Navigation in Unknown Environments	Lauri Suomela et.al.	2601.09444	null
2026-01-14	ReflexDiffusion: Reflection-Enhanced Trajectory Planning for High-lateral-acceleration Scenarios in Autonomous Driving	Xuemei Yao et.al.	2601.09377	null
2026-01-14	Monte-Carlo Tree Search with Neural Network Guidance for Lane-Free Autonomous Driving	Ioannis Peridis et.al.	2601.09353	null
2026-01-13	SoC: Semantic Orthogonal Calibration for Test-Time Prompt Tuning	Leo Fillioux et.al.	2601.08617	null
2026-01-13	Coverage-Guided Road Selection and Prioritization for Efficient Testing in Autonomous Driving Systems	Qurban Ali et.al.	2601.08609	null
2026-01-14	Large Multimodal Models for Embodied Intelligent Driving: The Next Frontier in Self-Driving?	Long Zhang et.al.	2601.08434	null
2026-01-15	Semantic Misalignment in Vision-Language Models under Perceptual Degradation	Guo Cheng et.al.	2601.08355	null
2026-01-09	An Empirical Study on Knowledge Transfer under Domain and Label Shifts in 3D LiDAR Point Clouds	Subeen Lee et.al.	2601.07855	null
2026-01-12	Leveraging 3D Representation Alignment and RGB Pretrained Priors for LiDAR Scene Generation	Nicolas Sereyjol-Garros et.al.	2601.07692	link
2026-01-13	ViewMorpher3D: A 3D-aware Diffusion Framework for Multi-Camera Novel View Synthesis in Autonomous Driving	Farhad G. Zanjani et.al.	2601.07540	null
2026-01-12	Task Prototype-Based Knowledge Retrieval for Multi-Task Learning from Partially Annotated Data	Youngmin Oh et.al.	2601.07474	null
2026-01-12	Software-Hardware Co-optimization for Modular E2E AV Paradigm: A Unified Framework of Optimization Approaches, Simulation Environment and Evaluation Metrics	Chengzhi Ji et.al.	2601.07393	null
2026-01-12	SC-MII: Infrastructure LiDAR-based 3D Object Detection on Edge Devices for Split Computing with Multiple Intermediate Outputs Integration	Taisuke Noguchi et.al.	2601.07119	null
2026-01-11	Efficient Visual Question Answering Pipeline for Autonomous Driving via Scene Region Compression	Yuliang Cai et.al.	2601.07092	null
2026-01-11	Conditional Normalizing Flows for Forward and Backward Joint State and Parameter Estimation	Luke S. Lagunowich et.al.	2601.07013	null
2026-01-10	SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning	Chenxu Dang et.al.	2601.06474	null
2026-01-10	WHU-PCPR: A cross-platform heterogeneous point cloud dataset for place recognition in complex urban scenes	Xianghong Zou et.al.	2601.06442	null
2026-01-09	Toward Safe and Responsible AI Agents: A Three-Pillar Model for Transparency, Accountability, and Trustworthiness	Edward C. Cheng et.al.	2601.06223	null
2026-01-09	GeoSurDepth: Spatial Geometry-Consistent Self-Supervised Depth Estimation for Surround-View Cameras	Weimin Liu et.al.	2601.05839	null
2026-01-09	Modular Autonomy with Conversational Interaction: An LLM-driven Framework for Decision Making in Autonomous Driving	Marvin Seegert et.al.	2601.05806	null
2026-01-09	Drivora: A Unified and Extensible Infrastructure for Search-based Autonomous Driving Testing	Mingfei Cheng et.al.	2601.05685	null
2026-01-12	SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving	Jingyu Li et.al.	2601.05640	null
2026-01-09	LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction	Chengen Xie et.al.	2601.05611	null
2026-01-08	UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition	Filippo Ghilotti et.al.	2601.05105	null
2026-01-08	Driving on Registers	Ellington Kirby et.al.	2601.05083	link
2026-01-08	SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection	Maximilian Pittner et.al.	2601.04968	null
2026-01-08	ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving	Chang Zhao et.al.	2601.04714	null
2026-01-08	The UnScripted Trip: Fostering Policy Discussion on Future Human-Vehicle Collaboration in Autonomous Driving Through Design-Oriented Methods	Xinyan Yu et.al.	2601.04601	null
2026-01-08	Timeliness-Oriented Scheduling and Resource Allocation in Multi-Region Collaborative Perception	Mengmeng Zhu et.al.	2601.04542	null
2026-01-07	UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving	Zhexiao Xiong et.al.	2601.04453	null
2026-01-07	3D-Agent: Tri-Modal Multi-Agent Collaboration for Scalable 3D Object Annotation	Jusheng Zhang et.al.	2601.04404	null
2026-01-07	A Systematic Mapping Study on the Debugging of Autonomous Driving Systems	Nathan Shaw et.al.	2601.04293	null
2026-01-07	Correcting Autonomous Driving Object Detection Misclassifications with Automated Commonsense Reasoning	Keegan Kimbrell et.al.	2601.04271	null
2026-01-07	Towards Safe Autonomous Driving: A Real-Time Motion Planning Algorithm on Embedded Hardware	Korbinian Moller et.al.	2601.03904	null
2026-01-07	On the Robustness of Fairness Practices: A Causal Framework for Systematic Evaluation	Verya Monjezi et.al.	2601.03621	null
2026-01-07	A Vision-Language-Action Model with Visual Prompt for OFF-Road Autonomous Driving	Liangdong Zhang et.al.	2601.03519	null
2026-01-06	FROST-Drive: Scalable and Efficient End-to-End Driving with a Frozen Vision Encoder	Zeyu Dong et.al.	2601.03460	null
2026-01-06	Enhancing Safety in Automated Ports: A Virtual Reality Study of Pedestrian-Autonomous Vehicle Interactions under Time Pressure, Visual Constraints, and Varying Vehicle Size	Yuan Che et.al.	2601.03218	null
2026-01-06	Towards Efficient 3D Object Detection for Vehicle-Infrastructure Collaboration via Risk-Intent Selection	Li Wang et.al.	2601.03001	null
2026-01-07	HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps	Xuchang Zhong et.al.	2601.02730	null
2026-01-05	VIT-Ped: Visionary Intention Transformer for Pedestrian Behavior Analysis	Aly R. Elkammar et.al.	2601.01989	null
2026-01-05	Sparse Threats, Focused Defense: Criticality-Aware Robust Reinforcement Learning for Safe Autonomous Driving	Qi Wei et.al.	2601.01800	null
2026-01-05	AlignDrive: Aligned Lateral-Longitudinal Planning for End-to-End Autonomous Driving	Yanhao Wu et.al.	2601.01762	null
2026-01-04	LabelAny3D: Label Any Object 3D in the Wild	Jin Yao et.al.	2601.01676	null
2026-01-04	Optically Transparent Meta-Grating Embedded in Rear Windshields for Automotive Radar Detection	Sergey Geyman et.al.	2601.01551	null
2026-01-04	DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving	Yang Zhou et.al.	2601.01528	link
2026-01-04	ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking	Xiaobao Wei et.al.	2601.01386	null
2025-12-31	Dichotomous Diffusion Policy Optimization	Ruiming Liang et.al.	2601.00898	link
2026-01-01	PatchBlock: A Lightweight Defense Against Adversarial Patches for Embedded EdgeAI Devices	Nandish Chattopadhyay et.al.	2601.00367	null
2026-01-01	Rectifying Adversarial Examples Using Their Vulnerabilities	Fumiya Morimoto et.al.	2601.00270	null
2025-12-31	Semi-Supervised Diversity-Aware Domain Adaptation for 3D Object detection	Bartłomiej Olber et.al.	2512.24922	null
2026-01-04	LSRE: Latent Semantic Rule Encoding for Real-Time Semantic Risk Detection in Autonomous Driving	Qian Cheng et.al.	2512.24712	null
2025-12-31	Decentralized No-Regret Frequency-Time Scheduling for FMCW Radar Interference Avoidance	Yunian Pan et.al.	2512.24619	null
2025-12-30	Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning	Zhenghao “Mark” Peng et.al.	2512.24426	null
2025-12-30	Spatial-aware Vision Language Model for Autonomous Driving	Weijie Wei et.al.	2512.24331	null
2025-12-30	MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation	Fuqiang Gu et.al.	2512.24243	null
2025-12-30	Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes	Shuyun Wang et.al.	2512.24227	null
2025-12-30	Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks	Yongtao Chen et.al.	2512.24111	null
2025-12-30	Multi-Scenario Highway Lane-Change Intention Prediction: A Temporal Physics-Informed Multi-Modal Framework	Jiazhao Shi et.al.	2512.24075	null
2025-12-30	DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation	Yuang Jia et.al.	2512.23983	null
2025-12-29	Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception	Xiaoyu Li et.al.	2512.23635	null
2025-12-29	Parallelized Code Generation from Simulink Models for Event-driven and Timer-driven ROS 2 Nodes	Kenshin Obi et.al.	2512.23605	null
2025-12-29	A Kalman Filter-Based Disturbance Observer for Steer-by-Wire Systems	Nikolai Beving et.al.	2512.23593	null
2025-12-29	Unsupervised Learning for Detection of Rare Driving Scenarios	Dat Le et.al.	2512.23585	null
2025-12-29	Model-based Development for Autonomous Driving Software Considering Parallelization	Kenshin Obi et.al.	2512.23575	null
2025-12-29	Assessing behaviour coverage in a multi-agent system simulation for autonomous vehicle testing	Manuel Franco-Vivo et.al.	2512.23445	null
2025-12-31	DriveLaW: Unifying Planning and Video Generation in a Latent Driving World	Tianze Xia et.al.	2512.23421	null
2025-12-29	A Human-Oriented Cooperative Driving Approach: Integrating Driving Intention, State, and Conflict	Qin Wang et.al.	2512.23220	link
2025-12-29	Exploring Syn-to-Real Domain Adaptation for Military Target Detection	Jongoh Jeong et.al.	2512.23208	null
2025-12-29	A Weak Signal Learning Dataset and Its Baseline Method	Xianqi Liu et.al.	2512.23160	null
2025-12-28	Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection	Runwei Guan et.al.	2512.22972	null
2025-12-28	ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving	Qihang Peng et.al.	2512.22939	link
2025-12-27	SCPainter: A Unified Framework for Realistic 3D Asset Insertion and Novel View Synthesis	Paul Dobre et.al.	2512.22706	null
2025-12-27	CoDS: Collaborative Perception via Digital Semantic Communication	Jipeng Gan et.al.	2512.22513	null
2025-12-27	SCAFusion: A Multimodal 3D Detection Framework for Small Object Detection in Lunar Surface Exploration	Xin Chen et.al.	2512.22503	null
2025-12-26	Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models	Zongmin Zhang et.al.	2512.22046	null
2025-12-26	RT-Focuser: A Real-Time Lightweight Model for Edge-side Image Deblurring	Zhuoyu Wu et.al.	2512.21975	null
2025-12-26	TimeBill: Time-Budgeted Inference for Large Language Models	Qi Fan et.al.	2512.21859	null
2025-12-26	End-to-End 3D Spatiotemporal Perception with Multimodal Fusion and V2X Collaboration	Zhenwei Yang et.al.	2512.21831	null
2025-12-25	SymDrive: Realistic and Controllable Driving Simulator via Symmetric Auto-regressive Online Restoration	Zhiyuan Liu et.al.	2512.21618	null
2025-12-24	SparScene: Efficient Traffic Scene Representation via Sparse Graph Learning for Large-Scale Trajectory Generation	Xiaoyu Mo et.al.	2512.21133	null
2025-12-25	Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation	Reeshad Khan et.al.	2512.20815	null
2025-12-23	OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective	Markus Gross et.al.	2512.20770	null
2025-12-23	KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System	Zhongyu Xia et.al.	2512.20299	null
2025-12-23	UrbanV2X: A Multisensory Vehicle-Infrastructure Dataset for Cooperative Navigation in Urban Areas	Qijun Qin et.al.	2512.20224	null
2025-12-23	RESPOND: Risk-Enhanced Structured Pattern for LLM-driven Online Node-level Decision-making	Dan Chen et.al.	2512.20179	null
2025-12-23	LiDARDraft: Generating LiDAR Point Cloud from Versatile Inputs	Haiyun Wei et.al.	2512.20105	null
2025-12-22	Vehicle-centric Perception via Multimodal Structured Pre-training	Wentao Wu et.al.	2512.19934	null
2025-12-22	A Gauss-Newton-Induced Structure-Exploiting Algorithm for Differentiable Optimal Control	Yuankun Chen et.al.	2512.19447	link
2025-12-22	Are All Data Necessary? Efficient Data Pruning for Large-scale Autonomous Driving Dataset via Trajectory Entropy Maximization	Zhaoyang Liu et.al.	2512.19270	null
2025-12-22	AMap: Distilling Future Priors for Ahead-Aware Online HD Map Construction	Ruikai Li et.al.	2512.19150	null
2025-12-22	WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving	Pengxuan Yang et.al.	2512.19133	null
2025-12-22	VOIC: Visible-Occluded Decoupling for Monocular 3D Semantic Scene Completion	Zaidao Han et.al.	2512.18954	null
2025-12-21	CrashChat: A Multimodal Large Language Model for Multitask Traffic Crash Video Analysis	Kaidi Liang et.al.	2512.18878	null
2025-12-21	InDRiVE: Reward-Free World-Model Pretraining for Autonomous Driving via Latent Disagreement	Feeza Khan Khanzada et.al.	2512.18850	null
2025-12-21	Misbehavior Forecasting for Focused Autonomous Driving Systems Testing	M M Abid Naziri et.al.	2512.18823	null
2025-12-21	CauTraj: A Causal-Knowledge-Guided Framework for Lane-Changing Trajectory Planning of Autonomous Vehicles	Cailin Lei et.al.	2512.18703	null
2025-12-21	Offline Reinforcement Learning for End-to-End Autonomous Driving	Chihiro Noguchi et.al.	2512.18662	null
2025-12-20	Systematic Benchmarking of SUMO Against Data-Driven Traffic Simulators	Erdao Liang et.al.	2512.18537	null
2025-12-20	Prioritized Constraints in Optimization-Based Control	Daniel Arnström et.al.	2512.18458	null
2025-12-20	LLaViDA: A Large Language Vision Driving Assistant for Explicit Reasoning and Enhanced Trajectory Planning	Yudong Liu et.al.	2512.18211	null
2025-12-19	Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation	Shreshth Rajan et.al.	2512.18082	null
2025-12-19	StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection	Di Wu et.al.	2512.17620	null
2025-12-19	Learning Safe Autonomous Driving Policies Using Predictive Safety Representations	Mahesh Keswani et.al.	2512.17586	null
2025-12-22	TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data	Deqing Liu et.al.	2512.17370	null
2025-12-18	DVGT: Driving Visual Geometry Transformer	Sicheng Zuo et.al.	2512.16919	null
2025-12-18	Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future	Tianshuai Hu et.al.	2512.16760	link
2025-12-18	The Bi-objective Electric Autonomous Dial-a-Ride Problem	Yue Su et.al.	2512.16605	null
2025-12-18	Autoencoder-based Denoising Defense against Adversarial Attacks on Object Detection	Min Geun Song et.al.	2512.16123	null
2025-12-18	Driving in Corner Case: A Real-World Adversarial Closed-Loop Evaluation Platform for End-to-End Autonomous Driving	Jiaheng Geng et.al.	2512.16055	null
2025-12-17	From Words to Wavelengths: VLMs for Few-Shot Multispectral Object Detection	Manuel Nkegoum et.al.	2512.15971	null
2025-12-17	Human-like Working Memory from Artificial Intrinsic Plasticity Neurons	Jingli Liu et.al.	2512.15829	null
2025-12-17	OccSTeP: Benchmarking 4D Occupancy Spatio-Temporal Persistence	Yu Zheng et.al.	2512.15621	null
2025-12-17	Gaussian Process Dual MPC using Active Inference: An Autonomous Vehicle Usecase	Mohammad Mahmoudi Filabadi et.al.	2512.15381	null
2025-12-17	KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird’s-Eye-View Segmentation	Wenke E et.al.	2512.15311	null
2025-12-17	EPSM: A Novel Metric to Evaluate the Safety of Environmental Perception in Autonomous Driving	Jörg Gamerdinger et.al.	2512.15195	null
2025-12-17	Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network	Zhuoran Li et.al.	2512.15109	null
2025-12-18	LADY: Linear Attention for Autonomous Driving Efficiency without Transformers	Jihao Huang et.al.	2512.15038	null
2025-12-16	DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance	Shreedhar Govil et.al.	2512.14266	link
2025-12-16	OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving	Tao Tang et.al.	2512.14225	null
2025-12-16	CIS-BA: Continuous Interaction Space Based Backdoor Attack for Object Detection in the Real-World	Shuxin Zhao et.al.	2512.14158	null
2025-12-16	OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving	Zhenguo Zhang et.al.	2512.14044	null
2025-12-16	FocalComm: Hard Instance-Aware Multi-Agent Perception	Dereje Shenkut et.al.	2512.13982	null
2025-12-15	A Convex Obstacle Avoidance Formulation	Ricardo Tapia et.al.	2512.13836	null
2025-12-16	MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning	Haoyu Fu et.al.	2512.13636	null
2025-12-15	Post-Training and Test-Time Scaling of Generative Agent Behavior Models for Interactive Autonomous Driving	Hyunki Seong et.al.	2512.13262	null
2025-12-16	MMDrive: Interactive Scene Understanding Beyond Vision with Multi-representational Fusion	Minghui Hou et.al.	2512.13177	null
2025-12-15	Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather	Zhijian He et.al.	2512.13107	null
2025-12-15	Sequence of Expert: Boosting Imitation Planners for Autonomous Driving through Temporal Alternation	Xiang Li et.al.	2512.13094	null
2025-12-15	Machine Learning Architectures for the Estimation of Predicted Occupancy Grids in Road Traffic	Parthasarathy Nadarajan et.al.	2512.12907	null
2025-12-14	GradID: Adversarial Detection via Intrinsic Dimensionality of Gradients	Mohammad Mahdi Razmjoo et.al.	2512.12827	null
2025-12-14	DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning	Zhe Liu et.al.	2512.12799	link
2025-12-14	High Order Control Lyapunov Function - Control Barrier Function - Quadratic Programming Based Autonomous Driving Controller for Bicyclist Safety	Haochong Chen et.al.	2512.12776	null
2025-12-13	From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving	Huan Zheng et.al.	2512.12302	null
2025-12-13	Measuring What Matters: Scenario-Driven Evaluation for Trajectory Predictors in Autonomous Driving	Longchao Da et.al.	2512.12211	null
2025-12-12	A Review of Learning-Based Motion Planning: Toward a Data-Driven Optimal Control Approach	Jia Hu et.al.	2512.11944	null
2025-12-12	TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder	Qinghao Meng et.al.	2512.11926	null
2025-12-12	LUCID: Learning-Enabled Uncertainty-Aware Certification of Stochastic Dynamical Systems	Ernesto Casablanca et.al.	2512.11750	null
2025-12-12	Evaluating Foundation Models’ 3D Understanding Through Multi-View Correspondence Analysis	Valentina Lilova et.al.	2512.11574	null
2025-12-12	CarlaNCAP: A Framework for Quantifying the Safety of Vulnerable Road Users in Infrastructure-Assisted Collective Perception Using EuroNCAP Scenarios	Jörg Gamerdinger et.al.	2512.11551	null
2025-12-12	SATMapTR: Satellite Image Enhanced Online HD Map Construction	Bingyuan Huang et.al.	2512.11319	null
2025-12-12	Elevation Aware 2D/3D Co-simulation Framework for Large-scale Traffic Flow and High-fidelity Vehicle Dynamics	Chandra Raskoti et.al.	2512.11249	null
2025-12-12	FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model	Hongbin Lin et.al.	2512.11226	null
2025-12-12	Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving	Jiawei Yang et.al.	2512.10947	null
2025-12-11	SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving	Peizheng Li et.al.	2512.10719	null
2025-12-11	NaviHydra: Controllable Navigation-guided End-to-end Autonomous Driving with Hydra-distillation	Hanfeng Wu et.al.	2512.10660	null
2025-12-11	UACER: An Uncertainty-Aware Critic Ensemble Framework for Robust Adversarial Reinforcement Learning	Jiaxi Wu et.al.	2512.10492	null
2025-12-11	T-SKM-Net: Trainable Neural Network Framework for Linear Constraint Satisfaction via Sampling Kaczmarz-Motzkin Method	Haoyu Zhu et.al.	2512.10461	null
2025-12-11	Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method	Ge Zhang et.al.	2512.10386	null
2025-12-11	InfoCom: Kilobyte-Scale Communication-Efficient Collaborative Perception with Information Bottleneck	Quanmin Wei et.al.	2512.10305	link
2025-12-11	Latent Chain-of-Thought World Modeling for End-to-End Driving	Shuhan Tan et.al.	2512.10226	null
2025-12-10	UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving	Hao Lu et.al.	2512.09864	null
2025-12-10	COVLM-RL: Critical Object-Oriented Reasoning for Autonomous Driving Using VLM-Guided Reinforcement Learning	Lin Li et.al.	2512.09349	null
2025-12-10	Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving	Songhan Wu et.al.	2512.09296	null
2025-12-09	Understanding Mental States in Active and Autonomous Driving with EEG	Prithila Angkan et.al.	2512.09190	null
2025-12-09	Astra: General Interactive World Model with Autoregressive Denoising	Yixuan Zhu et.al.	2512.08931	null
2025-12-09	A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems	Po-An Shih et.al.	2512.08476	null
2025-12-09	Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection	Haowen Zheng et.al.	2512.08247	null
2025-12-09	Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators	Yuki Kubota et.al.	2512.08163	null
2025-12-08	DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving	Jialv Zou et.al.	2512.07745	null
2025-12-08	VP-AutoTest: A Virtual-Physical Fusion Autonomous Driving Testing Platform	Yiming Cui et.al.	2512.07507	null
2025-12-08	Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood	Gilhyun Nam et.al.	2512.07390	null
2025-12-08	Unified Camera Positional Encoding for Controlled Video Generation	Cheng Zhang et.al.	2512.07237	link
2025-12-09	TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning	Zebin Xing et.al.	2512.07135	null
2025-12-08	Mimir: Hierarchical Goal-Driven Diffusion with Uncertainty Propagation for End-to-End Autonomous Driving	Zebin Xing et.al.	2512.07130	link
2025-12-07	Spatial Retrieval Augmented Autonomous Driving	Xiaosong Jia et.al.	2512.06865	link
2025-12-07	SparseCoop: Cooperative Perception with Kinematic-Grounded Queries	Jiahao Wang et.al.	2512.06838	null
2025-12-07	FedDSR: Federated Deep Supervision and Regularization Towards Autonomous Driving	Wei-Bin Kou et.al.	2512.06676	null
2025-12-07	Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving	Wei-Bin Kou et.al.	2512.06664	null
2025-12-06	UncertaintyZoo: A Unified Toolkit for Quantifying Predictive Uncertainty in Deep Learning Systems	Xianzong Wu et.al.	2512.06406	null
2025-12-06	Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework	Xinhao Xiang et.al.	2512.06376	null
2025-12-06	NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks	Fangzhou Lin et.al.	2512.06251	null
2025-12-05	Situation-Aware Interactive MPC Switching for Autonomous Driving	Shuhao Qi et.al.	2512.06182	null
2025-12-05	WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving	Yifang Xu et.al.	2512.06112	null
2025-12-05	BeLLA: End-to-End Birds Eye View Large Language Assistant for Autonomous Driving	Karthik Mohan et.al.	2512.06096	null
2025-12-05	Representation Learning for Point Cloud Understanding	Siming Yan et.al.	2512.06058	null
2025-12-05	OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning	Xusheng Guo et.al.	2512.05698	null
2025-12-05	LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving	Yiming Shu et.al.	2512.05686	null
2025-12-05	Scenario-aware Uncertainty Quantification for Trajectory Prediction with Statistical Guarantees	Yiming Shu et.al.	2512.05682	null
2025-12-05	Concept-based Explainable Data Mining with VLM for 3D Detection	Mai Tsujimoto et.al.	2512.05482	null
2025-12-05	MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare	Zag ElSayed et.al.	2512.05365	null
2025-12-05	State-Conditional Adversarial Learning: An Off-Policy Visual Domain Transfer Method for End-to-End Imitation Learning	Yuxiang Liu et.al.	2512.05335	null
2025-12-04	WhatsCode: Large-Scale GenAI Deployment for Developer Efficiency at WhatsApp	Ke Mao et.al.	2512.05314	null
2025-12-04	From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model	Kevin Cannons et.al.	2512.05277	null
2025-12-04	Are Your Agents Upward Deceivers?	Dadi Guo et.al.	2512.04864	null
2025-12-04	FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis	Shijie Chen et.al.	2512.04830	null
2025-12-04	MT-Depth: Multi-task Instance feature analysis for the Depth Completion	Abdul Haseeb Nizamani et.al.	2512.04734	null
2025-12-04	E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving	Yihong Tang et.al.	2512.04733	null
2025-12-04	Efficient Safety Verification of Autonomous Vehicles with Neural Network Operator	Lingxiang Fan et.al.	2512.04557	null
2025-12-04	dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning	Yingzi Ma et.al.	2512.04459	null
2025-12-04	MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving	Bin Sun et.al.	2512.04441	null
2025-12-03	DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle	Fangyu Lei et.al.	2512.04324	null
2025-12-03	Driving Beyond Privilege: Distilling Dense-Reward Knowledge into Sparse-Reward Policies	Feeza Khan Khanzada et.al.	2512.04279	null
2025-12-03	PINN vs LSTM: A Comparative Study for Steam Temperature Control in Heat Recovery Steam Generators	Mojtaba Fanoodi et.al.	2512.04183	null
2025-12-03	Fast & Efficient Normalizing Flows and Applications of Image Generative Models	Sandeep Nagar et.al.	2512.04039	null
2025-12-03	DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation	Zexin Lin et.al.	2512.03992	null
2025-12-03	Classification of User Satisfaction in HRI with Social Signals in the Wild	Michael Schiffmann et.al.	2512.03945	null
2025-12-03	Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response	Aron Distelzweig et.al.	2512.03936	null
2025-12-03	Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties	Vineel Tummala et.al.	2512.03931	null
2025-12-03	A Modular Architecture Design for Autonomous Driving Racing in Controlled Environments	Brais Fontan-Costas et.al.	2512.03886	null
2025-12-03	Multi-Agent Deep Reinforcement Learning for UAV-Assisted 5G Network Slicing: A Comparative Study of MAPPO, MADDPG, and MADQN	Ghoshana Bista et.al.	2512.03835	null
2025-12-03	MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving	Jia Hu et.al.	2512.03795	null
2025-12-03	Safety Reinforced Model Predictive Control (SRMPC): Improving MPC with Reinforcement Learning for Motion Planning in Autonomous Driving	Johannes Fischer et.al.	2512.03774	null
2025-12-03	Context-Triggered Contingency Games for Strategic Multi-Agent Interaction	Kilian Schweppe et.al.	2512.03639	null
2025-12-03	Market share maximizing strategies of CAV fleet operators may cause chaos in our cities	Grzegorz Jamróz et.al.	2512.03524	null
2025-12-03	Left shifting analysis of Human-Autonomous Team interactions to analyse risks of autonomy in high-stakes AI systems	Ben Larwood et.al.	2512.03519	null
2025-12-03	CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving	Zhijian Qiao et.al.	2512.03510	null
2025-12-03	Double-Edge-Assisted Computation Offloading and Resource Allocation for Space-Air-Marine Integrated Networks	Zhen Wang et.al.	2512.03487	null
2025-12-03	Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles	Haicheng Liao et.al.	2512.03454	null
2025-12-03	Generalization Evaluation of Deep Stereo Matching Methods for UAV-Based Forestry Applications	Yida Lin et.al.	2512.03427	null
2025-12-03	NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction	Thomas Monninger et.al.	2512.03317	null
2025-12-02	SpatialReasoner: Active Perception for Large-Scale 3D Scene Understanding	Hongpei Zheng et.al.	2512.03284	null
2025-12-02	Flux4D: Flow-based Unsupervised 4D Reconstruction	Jingkang Wang et.al.	2512.03210	null
2025-12-02	AGENTSAFE: A Unified Framework for Ethical Assurance and Governance in Agentic AI	Rafflesia Khan et.al.	2512.03180	null
2025-12-02	The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models	Saeid Jamshidi et.al.	2512.03026	null
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	link
2025-12-02	U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences	Xiang Xu et.al.	2512.02982	link
2025-12-02	Lumos: Let there be Language Model System Certification	Isha Chaudhary et.al.	2512.02966	null
2025-12-02	EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis	Yancheng Zhang et.al.	2512.02932	link
2025-12-02	VLM as Strategist: Adaptive Generation of Safety-critical Testing Scenarios via Guided Diffusion	Xinzheng Wu et.al.	2512.02844	null
2025-12-02	CogDrive: Cognition-Driven Multimodal Prediction-Planning Fusion for Safe Autonomy	Heye Huang et.al.	2512.02777	null
2025-12-02	Adaptive hydrogels with spatiotemporal stiffening using pH-modulating enzymes	Natascha Gray et.al.	2512.02698	null
2025-12-02	ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data	Yuxing Liu et.al.	2512.02686	null
2025-12-02	Wi-Fi Rate Adaptation for Moving Equipment in Industrial Environments	Pietro Chiavassa et.al.	2512.02455	null
2025-12-02	nuScenes Revisited: Progress and Challenges in Autonomous Driving	Whye Kit Fong et.al.	2512.02448	null
2025-12-02	Vehicle Dynamics Embedded World Models for Autonomous Driving	Huiqian Li et.al.	2512.02417	null
2025-12-02	Synthetic Error Injection Fails to Elicit Self-Correction In Language Models	David X. Wu et.al.	2512.02389	null
2025-12-02	Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention	Wenyi Xiong et.al.	2512.02368	null
2025-12-02	Near-Memory Architecture for Threshold-Ordinal Surface-Based Corner Detection of Event Cameras	Hongyang Shang et.al.	2512.02346	null
2025-12-01	RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies	Guillermo Garcia-Cobo et.al.	2512.01993	null
2025-12-01	Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory	Chenyi Wang et.al.	2512.01934	null
2025-12-01	NeuroHJR: Hamilton-Jacobi Reachability-based Obstacle Avoidance in Complex Environments with Physics-Informed Neural Networks	Granthik Halder et.al.	2512.01897	null
2025-12-02	OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic	Songyan Zhang et.al.	2512.01830	link
2025-12-01	AgriLiRa4D: A Multi-Sensor UAV Dataset for Robust SLAM in Challenging Agricultural Fields	Zhihao Zhan et.al.	2512.01753	link
2025-12-01	In-context Inverse Optimality for Fair Digital Twins: A Preference-based approach	Daniele Masti et.al.	2512.01650	null
2025-12-01	Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track	Mo Chen et.al.	2512.01608	null
2025-12-01	Language-Guided Open-World Anomaly Segmentation	Klara Reichard et.al.	2512.01427	null
2025-12-01	Accelerating Probabilistic Response-Time Analysis: Revised Critical Instant and Optimized Convolution	Hiroto Takahashi et.al.	2512.01381	null
2025-12-01	SocialDriveGen: Generating Diverse Traffic Scenarios with Controllable Social Interactions	Jiaguo Tian et.al.	2512.01363	null
2025-12-01	OpenBox: Annotate Any Bounding Boxes in 3D	In-Jae Lee et.al.	2512.01352	null
2025-12-01	CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL	Shinji Mai et.al.	2512.01311	null
2025-12-01	RoboDriveVLM: A Novel Benchmark and Baseline towards Robust Vision-Language Models for Autonomous Driving	Dacheng Liao et.al.	2512.01300	null
2025-12-01	COMET: A Dual Swashplate Autonomous Coaxial Bi-copter AAV with High-Maneuverability and Long-Endurance	Shuai Wang et.al.	2512.01246	null
2025-12-01	RoboLoc: A Benchmark Dataset for Point Place Recognition and Localization in Indoor-Outdoor Integrated Environments	Jaejin Jeon et.al.	2512.01194	null
2025-12-01	DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks	Hyunjun Kim et.al.	2512.01174	null
2025-11-30	Semantic Communications for Vehicle-Based Mission-Critical Services: Challenges and Solutions	Hui Zhou et.al.	2512.01102	null
2025-11-30	SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds	Jiawei Ren et.al.	2512.01078	null
2025-11-30	Autonomous Grasping On Quadruped Robot With Task Level Interaction	Muhtadin et.al.	2512.01052	null
2025-11-30	Approximating Analytically-Intractable Likelihood Densities with Deterministic Arithmetic for Optimal Particle Filtering	Orestis Kaparounakis et.al.	2512.01023	null
2025-11-28	Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation	Bernhard Klein et.al.	2511.23440	null
2025-11-28	SimScale: Learning to Drive via Real-World Simulation at Scale	Haochen Tian et.al.	2511.23369	null
2025-11-28	Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach	Haruki Sakajo et.al.	2511.23311	null
2025-11-28	Seeing before Observable: Potential Risk Reasoning in Autonomous Driving via Vision Language Models	Jiaxin Liu et.al.	2511.22928	null
2025-11-28	DM$^3$T: Harmonizing Modalities via Diffusion for Multi-Object Tracking	Weiran Li et.al.	2511.22896	null
2025-11-28	SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving	Wonjeong Ryu et.al.	2511.22865	null
2025-11-28	Safe Autonomous Lane Changing: Planning with Dynamic Risk Fields and Time-Varying Convex Space Generation	Zhen Tian et.al.	2511.22829	null
2025-11-27	Active flow-driven DNA remodeling generates millimeter-scale mechanical oscillations	Maya Levanon et.al.	2511.22589	null
2025-11-27	BUDD-e: an autonomous robotic guide for visually impaired users	Jinyang Li et.al.	2511.22541	null
2025-11-27	CoT4AD: A Vision-Language-Action Model with Explicit Chain-of-Thought Reasoning for Autonomous Driving	Zhaohui Wang et.al.	2511.22532	null
2025-11-27	Motion-to-Motion Latency Measurement Framework for Connected and Autonomous Vehicle Teleoperation	François Provost et.al.	2511.22467	null
2025-11-27	RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding	Xiyan Liu et.al.	2511.22466	null
2025-11-27	Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning	Bokang Zhang et.al.	2511.22415	null
2025-11-27	LLM-Based Generalizable Hierarchical Task Planning and Execution for Heterogeneous Robot Teams with Event-Driven Replanning	Suraj Borate et.al.	2511.22354	link
2025-11-27	DriveVGGT: Visual Geometry Transformer for Autonomous Driving	Xiaosong Jia et.al.	2511.22264	null
2025-11-27	Co-Evolving Agents: Learning from Failures as Hard Negatives	Yeonsung Jung et.al.	2511.22254	null
2025-11-27	HybridWorldSim: A Scalable and Controllable High-fidelity Simulator for Autonomous Driving	Qiang Li et.al.	2511.22187	null
2025-11-27	MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction	Maitrayee Keskar et.al.	2511.22181	null
2025-11-27	SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions	Aiyinsi Zuo et.al.	2511.22142	null
2025-11-27	Aligning with Human Values to Enhance Interaction: An eHMI-Mediated Lane-Changing Negotiation Strategy Using Bayesian Inference	Boyao Peng et.al.	2511.22061	null
2025-11-26	Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving	Haohong Lin et.al.	2511.21584	null
2025-11-26	Improvement of Collision Avoidance in Cut-In Maneuvers Using Time-to-Collision Metrics	Jamal Raiyn et.al.	2511.21280	null
2025-11-26	LaGen: Towards Autoregressive LiDAR Scene Generation	Sizhuo Zhou et.al.	2511.21256	null
2025-11-25	Hierarchical Evaluation of Software Design Capabilities of Large Language Models of Code	Mootez Saad et.al.	2511.20933	null
2025-11-25	Accelerating Sparse Convolutions in Voxel-Based Point Cloud Networks	Dionysios Adamopoulos et.al.	2511.20834	null
2025-11-25	Learning from Risk: LLM-Guided Generation of Safety-Critical Scenarios with Prior Knowledge	Yuhang Wang et.al.	2511.20726	null
2025-11-25	DeeAD: Dynamic Early Exit of Vision-Language Action for Efficient Autonomous Driving	Haibo HU et.al.	2511.20720	null
2025-11-25	Efficient Parallel Implementation of the Pilot Assignment Problem in Massive MIMO Systems	Eman Alqudah et.al.	2511.20511	null
2025-11-25	AD-R1: Closed-Loop Reinforcement Learning for End-to-End Autonomous Driving with Impartial World Models	Tianyi Yan et.al.	2511.20325	null
2025-11-25	LLM-Driven Transient Stability Assessment: From Automated Simulation to Neural Architecture Design	Lianzhe Hu et.al.	2511.20276	null
2025-11-25	Map-World: Masked Action planning and Path-Integral World Model for Autonomous Driving	Bin Hu et.al.	2511.20156	link
2025-11-25	“Are We Done Yet?”: A Vision-Based Judge for Autonomous Task Completion of Computer Use Agents	Marta Sumyk et.al.	2511.20067	null
2025-11-25	DeLightMono: Enhancing Self-Supervised Monocular Depth Estimation in Endoscopy by Decoupling Uneven Illumination	Mingyang Ou et.al.	2511.20058	null
2025-11-25	Energy Efficient Nonlinear Microscopic Dynamical Model for Autonomous and Electric Vehicles	Yuneil Yeo et.al.	2511.20054	null
2025-11-25	WaymoQA: A Multi-View Visual Question Answering Dataset for Safety-Critical Reasoning in Autonomous Driving	Seungjun Yu et.al.	2511.20022	null
2025-11-25	Cross-Modal Semantic Communication for Heterogeneous Collaborative Perception	Mingyi Lu et.al.	2511.20000	null
2025-11-25	On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices	Lianming Huang et.al.	2511.19986	null
2025-11-25	Hierarchical Spatio-Temporal Attention Network with Adaptive Risk-Aware Decision for Forward Collision Warning in Complex Scenarios	Haoran Hu et.al.	2511.19952	null
2025-11-25	CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model	Dapeng Zhang et.al.	2511.19914	null
2025-11-25	Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving	Dapeng Zhang et.al.	2511.19912	null
2025-11-25	4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models	Yiting Lu et.al.	2511.19836	null
2025-11-24	Normative active inference: A numerical proof of principle for a computational and economic legal analytic approach to AI governance	Axel Constant et.al.	2511.19334	null
2025-11-24	IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes	Carl Lindström et.al.	2511.19235	null
2025-11-24	Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving	Jianhua Han et.al.	2511.19221	null
2025-11-25	VIL2C: Value-of-Information Aware Low-Latency Communication for Multi-Agent Reinforcement Learning	Qian Zhang et.al.	2511.19146	null
2025-11-24	Autonomous Docking of Multi-Rotor UAVs on Blimps under the Influence of Wind Gusts	Pascal Goldschmid et.al.	2511.19135	null
2025-11-24	MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images	Qirui Wang et.al.	2511.19119	link
2025-11-24	Agent Discovery in Internet of Agents: Challenges and Solutions	Shaolong Guo et.al.	2511.19113	null
2025-11-24	HABIT: Human Action Benchmark for Interactive Traffic in CARLA	Mohan Ramesh et.al.	2511.19109	null
2025-11-24	End-to-end Autonomous Vehicle Following System using Monocular Fisheye Camera	Jiale Zhang et.al.	2511.19011	null
2025-11-24	SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation	Nimeshika Udayangani et.al.	2511.18816	link
2025-11-24	From Features to Reference Points: Lightweight and Adaptive Fusion for Cooperative Autonomous Driving	Yongqi Zhu et.al.	2511.18757	null
2025-11-24	Thinking Ahead: Foresight Intelligence in MLLMs and World Models	Zhantao Gong et.al.	2511.18735	null
2025-11-24	GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving	Lin Liu et.al.	2511.18729	null
2025-11-24	DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving	Hongbin Lin et.al.	2511.18713	link
2025-11-24	Online Learning-Enhanced Lie Algebraic MPC for Robust Trajectory Tracking of Autonomous Surface Vehicles	Yinan Dong et.al.	2511.18683	null
2025-11-24	Data Augmentation Strategies for Robust Lane Marking Detection	Flora Lian et.al.	2511.18668	null
2025-11-23	The Evaluation for Usability Methods of Unmanned Surface Vehicles: Are Current Usability Methods Viable for Unmanned Surface Vehicles? Insights from a Multiple Case Study Approach to Human-Robot Interaction	Zitian Peng et.al.	2511.18561	null
2025-11-23	From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence	Jian Yang et.al.	2511.18538	null
2025-11-23	Splatblox: Traversability-Aware Gaussian Splatting for Outdoor Robot Navigation	Samarth Chopra et.al.	2511.18525	null
2025-11-23	Energy-Efficient Task Computation at the Edge for Vehicular Services	Paniz Parastar et.al.	2511.18449	null
2025-11-21	MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments	Zhiyu Huang et.al.	2511.17496	null
2025-11-21	Feasibility of Embodied Dynamics Based Bayesian Learning for Continuous Pursuit Motion Control of Assistive Mobile Robots in the Built Environment	Xiaoshan Zhou et.al.	2511.17401	null
2025-11-21	Vector Cost Behavioral Planning for Autonomous Robotic Systems with Contemporary Validation Strategies	Benjamin R. Toaz et.al.	2511.17375	null
2025-11-21	FORWARD: Dataset of a forwarder operating in rough terrain	Mikael Lundbäck et.al.	2511.17318	null
2025-11-21	Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing	Suchetan G. Uppur et.al.	2511.17269	null
2025-11-21	QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy	Adam Lilja et.al.	2511.17221	null
2025-11-21	Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition	Aditya Mishra et.al.	2511.17183	null
2025-11-21	DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving	Liuhan Yin et.al.	2511.17150	null
2025-11-21	Sparse Reasoning is Enough: Biological-Inspired Framework for Video Anomaly Detection with Large Pre-trained Models	He Huang et.al.	2511.17094	null
2025-11-21	VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions	Qianyi Shao et.al.	2511.16998	null
2025-11-21	MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots	Junseo Kim et.al.	2511.16949	null
2025-11-20	AutoBackdoor: Automating Backdoor Attacks via LLM Agents	Yige Li et.al.	2511.16709	null
2025-11-20	MiMo-Embodied: X-Embodied Foundation Model Technical Report	Xiaoshuai Hao et.al.	2511.16518	null
2025-11-20	Tube-Based Model Predictive Control with Random Fourier Features for Nonlinear Systems	Ákos M. Bokor et.al.	2511.16425	null
2025-11-20	Flow-Aided Flight Through Dynamic Clutters From Point To Motion	Bowen Xu et.al.	2511.16372	null
2025-11-20	DynaMimicGen: A Data Generation Framework for Robot Learning of Dynamic Tasks	Vincenzo Pomponi et.al.	2511.16223	null
2025-11-20	AskDB: An LLM Agent for Natural Language Interaction with Relational Databases	Xuan-Quang Phan et.al.	2511.16131	null
2025-11-20	LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving	Pei Liu et.al.	2511.16049	null
2025-11-19	RE for AI in Practice: Managing Data Annotation Requirements for AI Autonomous Driving Systems	Hina Saeeda et.al.	2511.15859	null
2025-11-19	Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges	Kim N. Nolle et.al.	2511.15652	link
2025-11-19	Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2511.15597	null
2025-11-20	CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking	Sifan Zhou et.al.	2511.15580	null
2025-11-19	Computer-Use Agents as Judges for Generative User Interface	Kevin Qinghong Lin et.al.	2511.15567	null
2025-11-19	Scriboora: Rethinking Human Pose Forecasting	Daniel Bermuth et.al.	2511.15565	null
2025-11-20	UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy	Ruoqu Chen et.al.	2511.15550	null
2025-11-19	Uncoordinated Cooperative OFDM Multi-Hop UAV Relay Networks Using Virtual Channels Based on All-Pass Filters	Noura Sellami et.al.	2511.15545	null
2025-11-19	Driving in Spikes: An Entropy-Guided Object Detector for Spike Cameras	Ziyan Liu et.al.	2511.15459	null
2025-11-19	WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes	Marc-Emmanuel Coupvent des Graviers et.al.	2511.15429	null
2025-11-19	Unveiling Inference Scaling for Difference-Aware User Modeling in LLM Personalization	Suyu Chen et.al.	2511.15389	link
2025-11-19	Symmetry-Breaking in Multi-Agent Navigation: Winding Number-Aware MPC with a Learned Topological Strategy	Tomoki Nakao et.al.	2511.15239	null
2025-11-19	Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth Estimation	Jing Cao et.al.	2511.15167	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	link
2025-11-19	Data-driven control of network systems: Accounting for communication adaptivity and security	Gang Wang et.al.	2511.15044	null
2025-11-18	Z-Merge: Multi-Agent Reinforcement Learning for On-Ramp Merging with Zone-Specific V2X Traffic Information	Yassine Ibork et.al.	2511.14910	null
2025-11-18	Attacking Autonomous Driving Agents with Adversarial Machine Learning: A Holistic Evaluation with the CARLA Leaderboard	Henry Wong et.al.	2511.14876	null
2025-11-18	Uncertainty-Aware Measurement of Scenario Suite Representativeness for Autonomous Systems	Robab Aghazadeh Chakherlou et.al.	2511.14853	null
2025-11-19	Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks	Xianhui Meng et.al.	2511.14592	null
2025-11-18	Enhancing End-to-End Autonomous Driving with Risk Semantic Distillaion from VLM	Jack Qin et.al.	2511.14499	null
2025-11-18	CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring	Mingchen Zhong et.al.	2511.14469	null
2025-11-18	Context-aware, Ante-hoc Explanations of Driving Behaviour	Dominik Grundt et.al.	2511.14428	null
2025-11-18	Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition	Fabian Schmidt et.al.	2511.14391	null
2025-11-18	Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving	Kangqiao Zhao et.al.	2511.14386	null
2025-11-18	Emergent Cooperative Driving Strategies for Stop-and-Go Wave Mitigation via Multi-Agent Reinforcement Learning	Raphael Korbmacher et.al.	2511.14378	null
2025-11-18	PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation	Xiangyu Li et.al.	2511.14185	null
2025-11-18	RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment	Zeyu Cheng et.al.	2511.14107	null
2025-11-18	Cosmological dynamics of interacting dark matter-dark energy in generalized Rastall gravity	Manuel Gonzalez-Espinoza et.al.	2511.14089	null
2025-11-17	LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering	Jielin Qiu et.al.	2511.13998	null
2025-11-17	VLMs Guided Interpretable Decision Making for Autonomous Driving	Xin Hu et.al.	2511.13881	null
2025-11-17	In-memory phononic learning toward cognitive mechanical intelligence	Yuning Zhang et.al.	2511.13543	null
2025-11-17	An Automated Framework for Analyzing Structural Evolution in On-the-fly Non-adiabatic Molecular Dynamics Using Autoencoder and Multiple Molecular Descriptors	Hangxu Liu et.al.	2511.13364	null
2025-11-17	DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving	Kaiwen Cai et.al.	2511.13309	null
2025-11-17	DAP: A Discrete-token Autoregressive Planner for Autonomous Driving	Bowen Ye et.al.	2511.13306	null
2025-11-17	CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving	Enhui Ma et.al.	2511.13297	null
2025-11-17	GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models	Yushuo Zheng et.al.	2511.13259	null
2025-11-17	Event-Triggered Regulation of Mixed-Autonomy Traffic Under Varying Traffic Conditions	Yihuai Zhang et.al.	2511.13206	null
2025-11-17	Difficulty-Aware Label-Guided Denoising for Monocular 3D Object Detection	Soyul Lee et.al.	2511.13195	null
2025-11-17	Autonomous Sensing UAV for Accurate Multi-User Identification and Localization in Cellular Networks	Niccolò Paglierani et.al.	2511.13171	null
2025-11-17	WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection	Longhui Zheng et.al.	2511.13138	null
2025-11-17	Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining	Zhaocheng Yu et.al.	2511.13113	null
2025-11-17	ResAlignNet: A Data-Driven Approach for INS/DVL Alignment	Guy Damari et.al.	2511.13096	null
2025-11-17	Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving	Jiacheng Tang et.al.	2511.13079	link
2025-11-17	Towards 3D Object-Centric Feature Learning for Semantic Scene Completion	Weihua Wang et.al.	2511.13031	null
2025-11-17	T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving	Chen Ma et.al.	2511.12956	null
2025-11-17	GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving	Chunyong Hu et.al.	2511.12941	null
2025-11-17	Yanyun-3: Enabling Cross-Platform Strategy Game Operation with Vision-Language Models	Guoyan Wang et.al.	2511.12937	null
2025-11-17	Text2Traffic: A Text-to-Image Generation and Editing Method for Traffic Scenes	Feng Lv et.al.	2511.12932	null
2025-11-17	Distributed Self-allocated Time Slot Reuse: Multi-hop Communication in Rigid UAV Formations	Amelia Samandari et.al.	2511.12888	null
2025-11-16	Multi-Agent Reinforcement Learning for Heterogeneous Satellite Cluster Resources Optimization	Mohamad A. Hady et.al.	2511.12792	null
2025-11-14	Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy	Asraful Haque et.al.	2511.11558	null
2025-11-14	A Comparative Evaluation of Prominent Methods in Autonomous Vehicle Certification	Mustafa Erdem Kırmızıgül et.al.	2511.11484	null
2025-11-14	Robust and Efficient Communication in Multi-Agent Reinforcement Learning	Zejiao Liu et.al.	2511.11393	null
2025-11-14	Simulating an Autonomous System in CARLA using ROS 2	Joseph Abdo et.al.	2511.11310	null
2025-11-14	GraphPilot: Grounded Scene Graph Conditioning for Language-Based Autonomous Driving	Fabian Schmidt et.al.	2511.11266	null
2025-11-14	UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios	Mohamed Amine Ferrag et.al.	2511.11252	null
2025-11-14	One-to-N Backdoor Attack in 3D Point Cloud via Spherical Trigger	Dongmei Shan et.al.	2511.11210	null
2025-11-14	CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios	Hangyu Li et.al.	2511.11168	null
2025-11-14	Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids	Ke Ma et.al.	2511.11077	null
2025-11-14	Autonomous Vehicle Path Planning by Searching With Differentiable Simulation	Asen Nachkov et.al.	2511.11043	null
2025-11-14	Miniature Testbed for Validating Multi-Agent Cooperative Autonomous Driving	Hyunchul Bae et.al.	2511.11022	null
2025-11-14	Requirements for Aligned, Dynamic Resolution of Conflicts in Operational Constraints	Steven J. Jones et.al.	2511.10952	null
2025-11-13	Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction	Omid Mirzaeedodangeh et.al.	2511.10586	null
2025-11-13	LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction	Benjamin Stoler et.al.	2511.10411	null
2025-11-13	nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation	Mingxing Peng et.al.	2511.10403	null
2025-11-13	AgentEvolver: Towards Efficient Self-Evolving Agent System	Yunpeng Zhai et.al.	2511.10395	null
2025-11-13	Operator Models for Continuous-Time Offline Reinforcement Learning	Nicolas Hoischen et.al.	2511.10383	null
2025-11-13	Physically Interpretable Multi-Degradation Image Restoration via Deep Unfolding and Explainable Convolution	Hu Gao et.al.	2511.10166	null
2025-11-13	Trapped by Their Own Light: Deployable and Stealth Retroreflective Patch Attacks on Traffic Sign Recognition Systems	Go Tsuruoka et.al.	2511.10050	null
2025-11-13	DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection	Feiyang Jia et.al.	2511.10035	null
2025-11-13	Efficient Verification and Falsification of ReLU Neural Barrier Certificates	Dejin Ren et.al.	2511.10015	null
2025-11-13	Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching	Uday Bhaskar et.al.	2511.09955	null
2025-11-12	Coherent Optical Quantum Computing-Aided Resource Optimization for Transportation Digital Twin Construction	Huixiang Zhang et.al.	2511.09760	null
2025-11-12	Baby Sophia: A Developmental Approach to Self-Exploration through Self-Touch and Hand Regard	Stelios Zarifis et.al.	2511.09727	null
2025-11-12	FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection	Jiangyong Yu et.al.	2511.09347	null
2025-11-12	SimPath: Mitigating Motion Sickness in In-vehicle Infotainment Systems via Driving Condition Adaptation	Jinghao Huang et.al.	2511.09240	null
2025-11-12	D-AWSIM: Distributed Autonomous Driving Simulator for Dynamic Map Generation Framework	Shunsuke Ito et.al.	2511.09080	null
2025-11-12	Advancing Autonomous Emergency Response Systems: A Generative AI Perspective	Yousef Emami et.al.	2511.09044	null
2025-11-12	Argus: Resilience-Oriented Safety Assurance Framework for End-to-End ADSs	Dingji Wang et.al.	2511.09032	null
2025-11-12	FLAD: Federated Learning for LLM-based Autonomous Driving in Vehicle-Edge-Cloud Networks	Tianao Xiang et.al.	2511.09025	null
2025-11-12	UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving	Ziyi Song et.al.	2511.09013	null
2025-11-11	Information-Driven Fault Detection and Identification for Multi-Agent Spacecraft Systems: Collaborative On-Orbit Inspection Mission	Akshita Gupta et.al.	2511.08752	null
2025-11-10	PlanT 2.0: Exposing Biases and Structural Flaws in Closed-Loop Driving	Simon Gerstenecker et.al.	2511.07292	null
2025-11-10	MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs	Tianhao Peng et.al.	2511.07250	null
2025-11-10	Leveraging Text-Driven Semantic Variation for Robust OOD Segmentation	Seungheon Song et.al.	2511.07238	null
2025-11-10	Dynamics-Decoupled Trajectory Alignment for Sim-to-Real Transfer in Reinforcement Learning for Autonomous Driving	Thomas Steinecker et.al.	2511.07155	null
2025-11-10	HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving	Zhongyu Xia et.al.	2511.07106	null
2025-11-10	Certified L2-Norm Robustness of 3D Point Cloud Recognition in the Frequency Domain	Liang Zhou et.al.	2511.07029	null
2025-11-10	Relative Energy Learning for LiDAR Out-of-Distribution Detection	Zizhao Li et.al.	2511.06720	null
2025-11-10	Differentiable Semantic Meta-Learning Framework for Long-Tail Motion Forecasting in Autonomous Driving	Bin Rao et.al.	2511.06649	null
2025-11-10	DIAL-GS: Dynamic Instance Aware Reconstruction for Label-free Street Scenes with 4D Gaussian Splatting	Chenpeng Su et.al.	2511.06632	null
2025-11-09	A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving	Keke Long et.al.	2511.06496	null
2025-11-09	VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes	Zhengyu Zou et.al.	2511.06408	null
2025-11-09	LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation	Zijie Wang et.al.	2511.06272	null
2025-11-09	VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving	Ruifei Zhang et.al.	2511.06256	link
2025-11-09	AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving	Ruifei Zhang et.al.	2511.06253	link
2025-11-09	ROAR: Robust Accident Recognition and Anticipation for Autonomous Driving	Xingcheng Liu et.al.	2511.06226	null
2025-11-08	Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration	Umar Rashid et.al.	2511.06087	link
2025-11-08	Runtime Safety Monitoring of Deep Neural Networks for Perception: A Survey	Albert Schotschneider et.al.	2511.05982	null
2025-11-08	Polymap: generating high definition map based on rasterized polygons	Shiyu Gao et.al.	2511.05944	null
2025-11-07	SnowyLane: Robust Lane Detection on Snow-covered Rural Roads Using Infrastructural Elements	Jörg Gamerdinger et.al.	2511.05108	null
2025-11-07	J-SGFT: Joint Spatial and Graph Fourier Domain Learning for Point Cloud Attribute Deblocking	Muhammad Talha et.al.	2511.05047	null
2025-11-07	4D Imaging in ISAC Systems: A Framework Based on 5G NR Downlink Signals	Haoyang Weng et.al.	2511.04913	null
2025-11-06	ReGen: Generative Robot Simulation via Inverse Design	Phat Nguyen et.al.	2511.04769	null
2025-11-06	SAFe-Copilot: Unified Shared Autonomy Framework	Phat Nguyen et.al.	2511.04664	null
2025-11-06	UniSplat: Unified Spatio-Temporal Fusion via 3D Latent Scaffolds for Dynamic Driving Scene Reconstruction	Chen Shi et.al.	2511.04595	null
2025-11-06	A Tool for Benchmarking Large Language Models’ Robustness in Assessing the Realism of Driving Scenarios	Jiahui Wu et.al.	2511.04267	null
2025-11-06	ScaleDL: Towards Scalable and Efficient Runtime Prediction for Distributed Deep Learning Workloads	Xiaokai Wang et.al.	2511.04162	null
2025-11-04	Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data	Syed Mostaquim Ali et.al.	2511.02994	null
2025-11-04	Keeping it Local, Tiny and Real: Automated Report Generation on Edge Computing Devices for Mechatronic-Based Cognitive Systems	Nicolas Schuler et.al.	2511.02507	null
2025-11-04	3D Point Cloud Object Detection on Edge Devices for Split Computing	Taisuke Noguchi et.al.	2511.02293	null
2025-11-04	LLMs as Judges: Toward The Automatic Review of GSN-compliant Assurance Cases	Gerhard Yu et.al.	2511.02203	link
2025-11-03	UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs	Zhe Liu et.al.	2511.01768	null
2025-11-03	LLM-Assisted Tool for Joint Generation of Formulas and Functions in Rule-Based Verification of Map Transformations	Ruidi He et.al.	2511.01423	null
2025-11-03	VeriODD: From YAML to SMT-LIB – Automating Verification of Operational Design Domains	Bassel Rafie et.al.	2511.01417	null
2025-11-03	Risk Aware Safe Control with Cooperative Sensing for Dynamic Obstacle Avoidance	Pei Yu Chang et.al.	2511.01403	null
2025-11-03	Embodied Cognition Augmented End2End Autonomous Driving	Ling Niu et.al.	2511.01334	null
2025-11-02	Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion	Jaehyun Park et.al.	2511.00859	null
2025-11-04	Towards classification-based representation learning for place recognition on LiDAR scans	Maksim Konoplia et.al.	2511.00738	null
2025-11-01	Unveiling Uniform Shifted Power Law in Stochastic Human and Autonomous Driving Behavior	Wang Chen et.al.	2511.00659	link
2025-11-01	RNN-based linear parameter varying adaptive model predictive control for autonomous driving	Yassine Kebbati et.al.	2511.00610	null
2025-10-31	Dynamic Model Selection for Trajectory Prediction via Pairwise Ranking and Meta-Features	Lu Bowen et.al.	2511.00126	link
2025-10-30	Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail	NVIDIA et.al.	2511.00088	null
2025-10-31	Trends and Challenges in Next-Generation GNSS Interference Management	Leatile Marata et.al.	2510.27576	null
2025-10-31	Modified-Emergency Index (MEI): A Criticality Metric for Autonomous Driving in Lateral Conflict	Hao Cheng et.al.	2510.27333	link
2025-10-30	AD-SAM: Fine-Tuning the Segment Anything Vision Foundation Model for Autonomous Driving Perception	Mario Camarena et.al.	2510.27047	null
2025-10-29	VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes	Simon Yu et.al.	2510.26833	null
2025-10-30	Beyond Imitation: Constraint-Aware Trajectory Generation with Flow Matching For End-to-End Autonomous Driving	Lin Liu et.al.	2510.26292	null
2025-10-30	Self-localization on a 3D map by fusing global and local features from a monocular camera	Satoshi Kikuch et.al.	2510.26170	null
2025-11-05	WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios	Runsheng Xu et.al.	2510.26125	null
2025-10-29	Integrating Legal and Logical Specifications in Perception, Prediction, and Planning for Automated Driving: A Survey of Methods	Kumar Manas et.al.	2510.25386	null
2025-10-31	MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding	Runxi Huang et.al.	2510.25327	null
2025-10-29	Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision	Yuyang Xia et.al.	2510.25205	null
2025-11-02	D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction	Kejing Xia et.al.	2510.25173	null
2025-10-28	SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving	Anil Yildiz et.al.	2510.24949	null
2025-10-28	Delay Tolerant Control for Autonomous Driving Using CDOB	Xincheng Cao et.al.	2510.24898	null
2025-10-28	Learning to Drive Safely with Hybrid Options	Bram De Cooman et.al.	2510.24674	null
2025-10-28	Enhancing Vision-Language Models for Autonomous Driving through Task-Specific Prompting and Spatial Reasoning	Aodi Wu et.al.	2510.24152	null
2025-10-28	ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory Scoring	Zhenxin Li et.al.	2510.24108	null
2025-10-28	SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration	Jongsuk Kim et.al.	2510.24052	null
2025-10-27	Modeling and Scheduling of Fusion Patterns in Autonomous Driving Systems (Extended Version)	Hoora Sobhani et.al.	2510.23895	null
2025-10-27	VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting	Hoonhee Cho et.al.	2510.23205	null
2025-10-27	Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method	Bohan Li et.al.	2510.22973	null
2025-10-26	Uncertainty-Aware Autonomous Vehicles: Predicting the Road Ahead	Shireen Kudukkil Manchingal et.al.	2510.22680	null
2025-10-26	DAMap: Distance-aware MapNet for High Quality HD Map Construction	Jinpeng Dong et.al.	2510.22675	null
2025-10-25	3D Roadway Scene Object Detection with LIDARs in Snowfall Conditions	Ghazal Farhani et.al.	2510.22436	null
2025-10-25	Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework	Amir Mohammad Khadem Hosseini et.al.	2510.22243	null
2025-10-25	HARMONY: Hidden Activation Representations and Model Output-Aware Uncertainty Estimation for Vision-Language Models	Erum Mushtaq et.al.	2510.22171	null
2025-10-23	Addressing Corner Cases in Autonomous Driving: A World Model-based Approach with Mixture of Experts and LLMs	Haicheng Liao et.al.	2510.21867	null
2025-10-24	Learning Neural Control Barrier Functions from Expert Demonstrations using Inverse Constraint Learning	Yuxuan Yang et.al.	2510.21560	null
2025-10-24	Scalpel: Automotive Deep Learning Framework Testing via Assembling Model Components	Yinglong Zou et.al.	2510.21451	null
2025-10-24	Track-to-Track Association for Collective Perception based on Stochastic Optimization	Laura M. Wolf et.al.	2510.21278	null
2025-10-24	Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study	Guanlin Wu et.al.	2510.21160	null
2025-10-24	Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility	Hezam Albagami et.al.	2510.21112	null
2025-10-23	Adversary-Aware Private Inference over Wireless Channels	Mohamed Seif et.al.	2510.20518	null
2025-10-23	Behavior-Aware Online Prediction of Obstacle Occupancy using Zonotopes	Alvaro Carrizosa-Rendon et.al.	2510.20437	null
2025-10-23	Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses	Wu Yichao et.al.	2510.20314	null
2025-10-23	Privacy Protection of Automotive Location Data Based on Format-Preserving Encryption of Geographical Coordinates	Haojie Ji et.al.	2510.20300	null
2025-10-23	Seeing the Unseen: Mask-Driven Positional Encoding and Strip-Convolution Context Modeling for Cross-View Object Geo-Localization	Shuhan Hu et.al.	2510.20247	null
2025-10-23	Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists	Eduardo R. Corral-Soto et.al.	2510.20158	null
2025-10-22	VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction	Junhong Lin et.al.	2510.19578	null
2025-10-22	AutoMT: A Multi-Agent LLM Framework for Automated Metamorphic Testing of Autonomous Driving Systems	Linfeng Liang et.al.	2510.19438	null
2025-10-22	SFGFusion: Surface Fitting Guided 3D Object Detection with 4D Radar and Camera Fusion	Xiaozhi Li et.al.	2510.19215	null
2025-10-24	Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks	Kai Zeng et.al.	2510.19195	link
2025-10-21	Robust Driving QA through Metadata-Grounded Context and Task-Specific Prompts	Seungjun Yu et.al.	2510.19001	null
2025-10-23	Occluded nuScenes: A Multi-Sensor Dataset for Evaluating Perception Robustness in Automated Driving	Sanjay Kumar et.al.	2510.18552	null
2025-10-21	MMRHP: A Miniature Mixed-Reality HIL Platform for Auditable Closed-Loop Evaluation	Mingxin Li et.al.	2510.18371	null
2025-10-21	ViSE: A Systematic Approach to Vision-Only Street-View Extrapolation	Kaiyuan Tan et.al.	2510.18341	link
2025-10-24	OmniNWM: Omniscient Driving Navigation World Models	Bohan Li et.al.	2510.18313	link
2025-10-21	OpenInsGaussian: Open-vocabulary Instance Gaussian Segmentation with Context-aware Cross-view Fusion	Tianyu Huang et.al.	2510.18253	link
2025-10-21	BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal Pretraining	Ajinkya Khoche et.al.	2510.18244	null
2025-10-20	SPACeR: Self-Play Anchoring with Centralized Reference Models	Wei-Jer Chang et.al.	2510.18060	null
2025-10-20	SAVANT: Semantic Analysis with Vision-Augmented Anomaly deTection	Roberto Brusnicki et.al.	2510.18034	null
2025-10-20	4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads	Ling Liu et.al.	2510.17664	null
2025-10-20	Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models	Katie Luo et.al.	2510.17274	null
2025-10-20	Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations	Shahin Atakishiyev et.al.	2510.17256	null
2025-10-20	SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving	Peiru Zheng et.al.	2510.17191	null
2025-10-21	DiffVLA++: Bridging Cognitive Reasoning and End-to-End Driving through Metric-Guided Alignment	Yu Gao et.al.	2510.17148	null
2025-10-20	ProDAT: Progressive Density-Aware Tail-Drop for Point Cloud Coding	Zhe Luo et.al.	2510.17068	null
2025-10-19	UNDREAM: Bridging Differentiable Rendering and Photorealistic Simulation for End-to-end Adversarial Attacks	Mansi Phute et.al.	2510.16923	null
2025-10-19	Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry	Sara Hatami Rostami et.al.	2510.16790	null
2025-10-19	A Comprehensive Survey on World Models for Embodied AI	Xinqing Li et.al.	2510.16732	link
2025-10-19	Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models	Jianbiao Mei et.al.	2510.16729	null
2025-10-18	Advancing Off-Road Autonomous Driving: The Large-Scale ORAD-3D Dataset and Comprehensive Benchmarks	Chen Min et.al.	2510.16500	null
2025-10-18	Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance	Chien Thai et.al.	2510.16445	null
2025-10-17	ObjectTransforms for Uncertainty Quantification and Reduction in Vision-Based Perception for Autonomous Vehicles	Nishad Sahu et.al.	2510.16118	null
2025-10-17	LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal	Shr-Ruei Tsai et.al.	2510.15868	null
2025-10-17	Perfect Prediction or Plenty of Proposals? What Matters Most in Planning for Autonomous Driving	Aron Distelzweig et.al.	2510.15505	null
2025-10-17	VDRive: Leveraging Reinforced VLA and Diffusion Policy for End-to-end Autonomous Driving	Ziang Guo et.al.	2510.15446	null
2025-10-17	FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers	Haisheng Su et.al.	2510.15385	null
2025-10-15	David vs. Goliath: A comparative study of different-sized LLMs for code generation in the domain of automotive scenario generation	Philipp Bauerfeind et.al.	2510.14115	null
2025-10-15	Provably Invincible Adversarial Attacks on Reinforcement Learning Systems: A Rate-Distortion Information-Theoretic Approach	Ziqing Lu et.al.	2510.13792	null
2025-10-15	XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation	Huawei Sun et.al.	2510.13565	null
2025-10-17	CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation	Yushan Han et.al.	2510.13432	null
2025-10-15	Partitioned Scheduling for DAG Tasks Considering Probabilistic Execution Time	Fuma Omori et.al.	2510.13279	null
2025-10-15	SAJA: A State-Action Joint Attack Framework on Multi-Agent Deep Reinforcement Learning	Weiqi Guo et.al.	2510.13262	null
2025-10-16	CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation	Li Liang et.al.	2510.13245	null
2025-10-15	An Analytical Framework to Enhance Autonomous Vehicle Perception for Smart Cities	Jalal Khan et.al.	2510.13230	null
2025-10-15	Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion	Rongtao Xu et.al.	2510.13198	null
2025-10-15	Safe Driving in Occluded Environments	Zhuoyuan Wang et.al.	2510.13114	null
2025-10-15	DriveCritic: Towards Context-Aware, Human-Aligned Evaluation for Autonomous Driving with Vision-Language Models	Jingyu Song et.al.	2510.13108	null
2025-10-15	ADPerf: Investigating and Testing Performance in Autonomous Driving Systems	Tri Minh-Triet Pham et.al.	2510.13078	null
2025-10-16	SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms	Haithem Turki et.al.	2510.12901	null
2025-10-14	DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving	Yingyan Li et.al.	2510.12796	null
2025-10-14	CAMNet: Leveraging Cooperative Awareness Messages for Vehicle Trajectory Prediction	Mattia Grasselli et.al.	2510.12703	null
2025-10-14	CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving	Xiaoji Zheng et.al.	2510.12560	null
2025-10-14	Biased-Attention Guided Risk Prediction for Safe Decision-Making at Unsignalized Intersections	Chengyang Dong et.al.	2510.12428	null
2025-10-14	CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion	Jinzhou Lin et.al.	2510.12362	null
2025-10-14	PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes	Ying A et.al.	2510.12282	null
2025-10-14	AngularFuse: A Closer Look at Angle-based Perception for Spatial-Sensitive Multi-Modality Image Fusion	Xiaopeng Liu et.al.	2510.12260	null
2025-10-14	Hierarchical Reasoning with Vision-Language Models for Incident Reports from Dashcam Videos	Shingo Yokoi et.al.	2510.12190	null
2025-10-13	Context-Aware Model-Based Reinforcement Learning for Autonomous Racing	Emran Yasser Moustafa et.al.	2510.11501	null
2025-10-13	A Faster and More Reliable Middleware for Autonomous Driving Systems	Yuankai He et.al.	2510.11448	null
2025-10-13	Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution	Bozhou Zhang et.al.	2510.11092	null
2025-10-13	Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling	Tianyi Tan et.al.	2510.11083	link
2025-10-13	Game-Theoretic Risk-Shaped Reinforcement Learning for Safe Autonomous Driving	Dong Hu et.al.	2510.10960	link
2025-10-13	Neutral Agent-based Adversarial Policy Learning against Deep Reinforcement Learning in Multi-party Open Systems	Qizhou Peng et.al.	2510.10937	null
2025-10-13	rareboost3d: a synthetic lidar dataset with enhanced rare classes	Shutong Lin et.al.	2510.10876	null
2025-10-12	Stability Under Scrutiny: Benchmarking Representation Paradigms for Online HD Mapping	Hao Shan et.al.	2510.10660	null
2025-10-12	A Machine Learning Perspective on Automated Driving Corner Cases	Sebastian Schmidt et.al.	2510.10653	null
2025-10-12	Reinforcement Learning-based Dynamic Adaptation for Sampling-Based Motion Planning in Agile Autonomous Driving	Alexander Langmann et.al.	2510.10567	null
2025-10-12	Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving	Kanishkha Jaisankar et.al.	2510.10503	null
2025-10-12	Risk-Budgeted Control Framework for Balanced Performance and Safety in Autonomous Vehicles	Pei Yu Chang et.al.	2510.10442	null
2025-10-11	Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking	Markus Käppeler et.al.	2510.10287	null
2025-10-11	A Style-Based Metric for Quantifying the Synthetic-to-Real Gap in Autonomous Driving Image Datasets	Dingyi Yao et.al.	2510.10203	null
2025-10-11	Beyond ADE and FDE: A Comprehensive Evaluation Framework for Safety-Critical Prediction in Multi-Agent Autonomous Driving Scenarios	Feifei Liu et.al.	2510.10086	null
2025-10-11	Probabilistic Hyper-Graphs using Multiple Randomly Masked Autoencoders for Semi-supervised Multi-modal Multi-task Learning	Pîrvu Mihai-Cristian et.al.	2510.10068	null
2025-10-11	Explainable Human-in-the-Loop Segmentation via Critic Feedback Signals	Pouya Shaeri et.al.	2510.09945	null
2025-10-10	SpaceVista: All-Scale Visual Spatial Reasoning from mm to km	Peiwen Sun et.al.	2510.09606	link
2025-10-10	Autonomous Soft Robotic Guidewire Navigation via Imitation Learning	Noah Barnes et.al.	2510.09497	null
2025-10-10	Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation	Vijay M. Galshetwar et.al.	2510.09228	null
2025-10-10	Towards Safer and Understandable Driver Intention Prediction	Mukilan Karuppasamy et.al.	2510.09200	null
2025-10-10	TARO: Toward Semantically Rich Open-World Object Detection	Yuchen Zhang et.al.	2510.09173	null
2025-10-10	Robust Driving Control for Autonomous Vehicles: An Intelligent General-sum Constrained Adversarial Reinforcement Learning Approach	Junchao Fan et.al.	2510.09041	null
2025-10-10	Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels	Weitong Kong et.al.	2510.09035	null
2025-10-09	Scalable Offline Metrics for Autonomous Driving	Animikh Aich et.al.	2510.08571	null
2025-10-09	ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving	Zhiyu Zheng et.al.	2510.08562	null
2025-10-09	Approximate Domain Unlearning for Vision-Language Models	Kodai Kawamura et.al.	2510.08132	null
2025-10-09	LinguaSim: Interactive Multi-Vehicle Testing Scenario Generation via Natural Language Instruction Based on Large Language Models	Qingyuan Shi et.al.	2510.08046	null
2025-10-09	RayFusion: Ray Fusion Enhanced Collaborative Visual Perception	Shaohong Wang et.al.	2510.08017	link
2025-10-09	CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving	Tianrui Zhang et.al.	2510.07944	null
2025-10-09	MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding	Peiran Wu et.al.	2510.07915	null
2025-10-10	GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models	Qinghongbing Xie et.al.	2510.07791	null
2025-10-08	VeMo: A Lightweight Data-Driven Approach to Model Vehicle Dynamics	Girolamo Oddo et.al.	2510.07447	null
2025-10-08	HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving	Donald Pfaffmann et.al.	2510.07210	null
2025-10-08	A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model	Tony Zhang et.al.	2510.07133	null
2025-10-08	Learning Global Representation from Queries for Vectorized HD Map Construction	Shoumeng Qiu et.al.	2510.06969	null
2025-10-08	OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects	Bing Li et.al.	2510.06952	null
2025-10-08	DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning	Ke Guo et.al.	2510.06913	null
2025-10-08	Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion	Jie Luo et.al.	2510.06687	null
2025-10-08	AIM 2025 Challenge on Real-World RAW Image Denoising	Feiran Li et.al.	2510.06601	null
2025-10-07	Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models	Jiahao Wang et.al.	2510.06209	null
2025-10-07	From Learning to Mastery: Achieving Safe and Efficient Real-World Autonomous Driving with Human-In-The-Loop Reinforcement Learning	Li Zeqiao et.al.	2510.06038	null
2025-10-07	The Safety Challenge of World Models for Embodied AI Agents: A Review	Lorenzo Baraldi et.al.	2510.05865	null
2025-10-07	ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving	Yongxuan Lyu et.al.	2510.05752	null
2025-10-07	Precise and Efficient Collision Prediction under Uncertainty in Autonomous Driving	Marc Kaufeld et.al.	2510.05729	null
2025-10-06	Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context	Ngeyen Yinkfu et.al.	2510.04912	null
2025-10-08	Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction	Chi Yan et.al.	2510.04759	link
2025-10-05	Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction	Yuhao Luo et.al.	2510.04365	null
2025-10-04	From Filters to VLMs: Benchmarking Defogging Methods through Object Detection and Segmentation Performance	Ardalan Aryashad et.al.	2510.03906	null
2025-10-04	Referring Expression Comprehension for Small Objects	Kanoko Goto et.al.	2510.03701	null
2025-10-04	Safety-Oriented Dynamic Path Planning for Automated Vehicles	Mostafa Emam et.al.	2510.03640	null
2025-10-03	Training-Free Out-Of-Distribution Segmentation With Foundation Models	Laith Nayal et.al.	2510.02909	null
2025-10-03	GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting	Xinran Zhang et.al.	2510.02884	null
2025-10-03	Action Deviation-Aware Inference for Low-Latency Wireless Robots	Jeyoung Park et.al.	2510.02851	null
2025-10-03	Work Zones challenge VLM Trajectory Planning: Toward Mitigation and Robust Autonomous Driving	Yifan Liao et.al.	2510.02803	null
2025-10-03	High Pixel Resolution Visible to Extended Shortwave Infrared Single Pixel Imaging with a black Phosphorus-Molybdenum disulfide (bP-MoS2) photodiode	Seyed Saleh Mousavi Khaleghi et.al.	2510.02673	null
2025-10-03	A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios	Ruining Yang et.al.	2510.02627	null
2025-10-02	Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving	Cornelius Schröder et.al.	2510.01829	null
2025-10-02	Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving	Haibo Hu et.al.	2510.01795	null
2025-10-02	Predictive Preference Learning from Human Interventions	Haoyuan Cai et.al.	2510.01545	null
2025-10-01	Strategic Fusion of Vision Language Models: Shapley-Credited Context-Aware Dawid-Skene for Multi-Label Tasks in Autonomous Driving	Yuxiang Feng et.al.	2510.01126	null
2025-10-01	Datasets for Valence and Arousal Inference: A Survey	Helen Schneider et.al.	2510.00738	null
2025-09-30	CHAI: Command Hijacking against embodied AI	Luis Burbano et.al.	2510.00181	null
2025-09-30	Adaptive and Resource-efficient Agentic AI Systems for Mobile and Embedded Devices: A Survey	Sicong Liu et.al.	2510.00078	null
2025-10-03	Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving	Sheng Yang et.al.	2510.00060	null
2025-09-30	PRISM: Progressive Rain removal with Integrated State-space Modeling	Pengze Xue et.al.	2509.26413	null
2025-09-30	Beyond Overall Accuracy: Pose- and Occlusion-driven Fairness Analysis in Pedestrian Detection for Autonomous Driving	Mohammad Khoshkdahan et.al.	2509.26166	null
2025-09-30	NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving	Yuan Gao et.al.	2509.25944	null
2025-09-30	Preemptive Spatiotemporal Trajectory Adjustment for Heterogeneous Vehicles in Highway Merging Zones	Yuan Li et.al.	2509.25929	null
2025-09-30	MuSLR: Multimodal Symbolic Logical Reasoning	Jundong Xu et.al.	2509.25851	null
2025-09-30	Cooperative Autonomous Driving in Diverse Behavioral Traffic: A Heterogeneous Graph Reinforcement Learning Approach	Qi Liu et.al.	2509.25751	null
2025-09-29	Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments	Zihan Zhang et.al.	2509.25542	null
2025-09-29	StreamForest: Efficient Online Video Understanding with Persistent Event Memory	Xiangyu Zeng et.al.	2509.24871	null
2025-09-29	TACO-Net: Topological Signatures Triumph in 3D Object Classification	Anirban Ghosh et.al.	2509.24802	null
2025-09-29	FuncPoison: Poisoning Function Library to Hijack Multi-agent Autonomous Driving Systems	Yuzhen Long et.al.	2509.24408	null
2025-09-29	Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning	Korbinian Moller et.al.	2509.24313	null
2025-09-29	Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds	Yongqiang Wang et.al.	2509.24273	null
2025-09-28	Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning	Muleilan Pei et.al.	2509.23993	null
2025-09-28	AutoPrune: Each Complexity Deserves a Pruning Policy	Hanshi Wang et.al.	2509.23931	null
2025-09-30	DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation	Haibao Yu et.al.	2509.23922	null
2025-09-28	Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios	Jinghan Xu et.al.	2509.23895	null
2025-09-28	From Static to Dynamic: a Survey of Topology-Aware Perception in Autonomous Driving	Yixiao Chen et.al.	2509.23641	null
2025-09-28	Foundation Model-Based Adaptive Semantic Image Transmission for Dynamic Wireless Environments	Fangyu Liu et.al.	2509.23590	null
2025-09-28	BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving	Shu Liu et.al.	2509.23589	null
2025-09-27	WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving	Ziyue Zhu et.al.	2509.23402	link
2025-09-27	Preventing Robotic Jailbreaking via Multimodal Domain Adaptation	Francesco Marchiori et.al.	2509.23281	null
2025-09-26	Persistent Autoregressive Mapping with Traffic Rules for Autonomous Driving	Shiyi Liang et.al.	2509.22756	null
2025-09-26	Self-driving cars: Are we there yet?	Merve Atasever et.al.	2509.22754	null
2025-09-26	An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment	Xiaoyun Qiu et.al.	2509.22550	null
2025-09-26	EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model	Andrii Litvynchuk et.al.	2509.22527	null
2025-09-29	A Multi-Modality Evaluation of the Reality Gap in Autonomous Driving Systems	Stefano Carlo Lambertenghi et.al.	2509.22379	null
2025-09-26	UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data	Yujian Yuan et.al.	2509.22262	link
2025-09-26	An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose	Qifeng Wang et.al.	2509.22058	null
2025-09-25	PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines	Zhixin Zhang et.al.	2509.21563	null
2025-09-25	Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement	Jianbo Zhao et.al.	2509.20938	null
2025-09-25	MTRDrive: Memory-Tool Synergistic Reasoning for Robust Autonomous Driving in Corner Cases	Ziang Luo et.al.	2509.20843	null
2025-09-25	DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation	Ved Umrajkar et.al.	2509.20792	null
2025-09-25	MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM	Yuxuan Zhou et.al.	2509.20757	null
2025-09-25	Cyber Racing Coach: A Haptic Shared Control Framework for Teaching Advanced Driving Skills	Congkai Shen et.al.	2509.20653	null
2025-09-26	AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving	Jinhao Chai et.al.	2509.20253	null
2025-09-24	Universal Camouflage Attack on Vision-Language Models for Autonomous Driving	Dehong Kong et.al.	2509.20196	null
2025-09-24	Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving	Pengxiang Li et.al.	2509.20109	null
2025-09-25	Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models	Juana Valeria Hurtado et.al.	2509.20107	null
2025-09-24	Steerable Adversarial Scenario Generation through Test-Time Preference Alignment	Tong Nie et.al.	2509.20102	null
2025-09-25	OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving	Pei Liu et.al.	2509.19973	link
2025-09-24	BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting	Yixun Zhang et.al.	2509.19793	null
2025-09-24	RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving	Carlo Bosio et.al.	2509.19789	null
2025-09-24	EfficienT-HDR: An Efficient Transformer-Based Framework via Multi-Exposure Fusion for HDR Reconstruction	Yu-Shen Huang et.al.	2509.19779	null
2025-09-23	The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar	William L. Muckelroy III et.al.	2509.19644	null
2025-09-23	Coordinated PSO-PID based longitudinal control with LPV-MPC based lateral control for autonomous vehicles	Yassine Kebbati et.al.	2509.19529	link
2025-09-23	Autonomous driving using an optimized neural network based adaptive LPV-MPC controller	Yassine Kebbati et.al.	2509.19523	link
2025-09-23	Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation	Sherwin Bahmani et.al.	2509.19296	null
2025-09-24	An on-chip Pixel Processing Approach with 2.4μs latency for Asynchronous Read-out of SPAD-based dToF Flash LiDARs	Yiyang Liu et.al.	2509.19192	null
2025-09-23	TriFusion-AE: Language-Guided Depth and LiDAR Fusion for Robust Point Cloud Processing	Susmit Neogi et.al.	2509.18743	null
2025-09-23	The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving	Jay Patrikar et.al.	2509.18626	null
2025-09-23	MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving	Yuzhi Wu et.al.	2509.18613	null
2025-09-23	PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving	Chengran Yuan et.al.	2509.18609	null
2025-09-23	Spatial Envelope MPC: High Performance Driving without a Reference	Siyuan Yu et.al.	2509.18506	null
2025-09-22	AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback	Yunhao Yang et.al.	2509.18384	null
2025-09-23	V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts	Hsu-kuang Chiu et.al.	2509.18053	link
2025-09-22	DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving	Shuyao Shang et.al.	2509.17940	null
2025-09-22	SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model	Xiao Zhou et.al.	2509.17850	null
2025-09-22	RSU-Assisted Resource Allocation for Collaborative Perception	Guowei Liu et.al.	2509.17691	null
2025-09-22	Predicting Depth Maps from Single RGB Images and Addressing Missing Information in Depth Estimation	Mohamad Mofeed Chaar et.al.	2509.17686	null
2025-09-22	Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method	Gregory Schroeder et.al.	2509.17620	null
2025-09-22	Vision-Based Driver Drowsiness Monitoring: Comparative Analysis of YOLOv5-v11 Models	Dilshara Herath et.al.	2509.17498	null
2025-09-22	FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR	Junzhe Wu et.al.	2509.17390	null
2025-09-22	Multi-Scenario Highway Lane-Change Intention Prediction: A Physics-Informed AI Framework for Three-Class Classification	Jiazhao Shi et.al.	2509.17354	null
2025-09-21	Optimized adaptive MPC for lateral control of autonomous vehicles	Yassine Kebbati et.al.	2509.17215	null
2025-09-21	CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving	Ruiguo Zhong et.al.	2509.17080	null
2025-09-21	Orchestrate, Generate, Reflect: A VLM-Based Multi-Agent Collaboration Framework for Automated Driving Policy Learning	Zengqi Peng et.al.	2509.17042	null
2025-09-21	Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving	Xuan Chen et.al.	2509.16950	null
2025-09-21	End2Race: Efficient End-to-End Imitation Learning for Real-Time F1Tenth Racing	Zhijie Qiao et.al.	2509.16894	null
2025-09-20	Improve bounding box in Carla Simulator	Mohamad Mofeed Chaar et.al.	2509.16773	null
2025-09-20	Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?	Xin Chen et.al.	2509.16654	null
2025-09-20	ADVEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents	Yichen Wang et.al.	2509.16645	null
2025-09-20	SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving	Haiming Zhang et.al.	2509.16588	null
2025-09-20	ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting	Xiaoyang Yan et.al.	2509.16552	null
2025-09-20	RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation	Tianyi Yan et.al.	2509.16500	null
2025-09-19	RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars	Weiyi Xiong et.al.	2509.16119	null
2025-09-19	CoPAD: Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios	Kangyu Wu et.al.	2509.15984	null
2025-09-19	CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine	Shiyu Fang et.al.	2509.15968	link
2025-09-19	RangeSAM: Leveraging Visual Foundation Models for Range-View represented LiDAR segmentation	Paul Julius Kühn et.al.	2509.15886	null
2025-09-19	ThermalGuardian: Temperature-Aware Testing of Automotive Deep Learning Frameworks	Yinglong Zou et.al.	2509.15815	null
2025-09-19	CBPNet: A Continual Backpropagation Prompt Network for Alleviating Plasticity Loss on Edge Devices	Runjie Shao et.al.	2509.15785	null
2025-09-19	Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution	Chang Soo Lim et.al.	2509.15781	null
2025-09-18	Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing	Christopher Oeltjen et.al.	2509.15423	null
2025-09-18	Out-of-Sight Trajectories: Tracking, Fusion, and Prediction	Haichao Zhang et.al.	2509.15219	null
2025-09-18	Digital Twin-based Cooperative Autonomous Driving in Smart Intersections: A Multi-Agent Reinforcement Learning Approach	Taoyuan Yu et.al.	2509.15099	null
2025-09-18	Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression	Xuan Deng et.al.	2509.14591	null
2025-09-18	DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising	Li Gao et.al.	2509.14565	null
2025-09-17	FlowDrive: Energy Flow Field for End-to-End Autonomous Driving	Hao Jiang et.al.	2509.14303	link
2025-09-17	MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping	Zhihao Cao et.al.	2509.14191	null
2025-09-17	BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection	Rongyu Zhang et.al.	2509.14151	null
2025-09-17	SEG-Parking: Towards Safe, Efficient, and Generalizable Autonomous Parking via End-to-End Offline Reinforcement Learning	Zewei Yang et.al.	2509.13956	null
2025-09-17	MAP: End-to-End Autonomous Driving with Map-Assisted Planning	Huilin Yin et.al.	2509.13926	null
2025-09-17	Ensemble of Pre-Trained Models for Long-Tailed Trajectory Prediction	Divya Thuremella et.al.	2509.13914	null
2025-09-17	Data-Efficient Spectral Classification of Hyperspectral Data Using MiniROCKET and HDC-MiniROCKET	Nick Theisen et.al.	2509.13809	null
2025-09-17	AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving	Yuechen Luo et.al.	2509.13769	null
2025-09-17	UM-Depth: Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry	Tae-Wook Um et.al.	2509.13713	null
2025-09-17	FishBEV: Distortion-Resilient Bird’s Eye View Segmentation with Surround-View Fisheye Cameras	Hang Li et.al.	2509.13681	null
2025-09-16	TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning	Momchil S. Tomov et.al.	2509.13579	null
2025-09-16	Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving	Artem Savkin et.al.	2509.13507	null
2025-09-16	Road Obstacle Video Segmentation	Shyam Nandan Rai et.al.	2509.13181	null
2025-09-17	TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving	Jiawei Wang et.al.	2509.13164	null
2025-09-16	An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios	Zhihao Zhang et.al.	2509.13132	null
2025-09-16	Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving	Ruibo Li et.al.	2509.13116	null
2025-09-16	4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar	Xiao Tang et.al.	2509.12931	null
2025-09-16	StereoCarla: A High-Fidelity Driving Dataset for Generalizable Stereo	Xianda Guo et.al.	2509.12683	null
2025-09-16	Maps for Autonomous Driving: Full-process Survey and Frontiers	Pengxin Chen et.al.	2509.12632	null
2025-09-16	DisorientLiDAR: Physical Attacks on LiDAR-based Localization	Yizhen Lao et.al.	2509.12595	null
2025-09-15	Approaches to Analysis and Design of AI-Based Autonomous Vehicles	Tao Yan et.al.	2509.12169	null
2025-09-16	Embodied Navigation Foundation Model	Jiazhao Zhang et.al.	2509.12129	null
2025-09-15	Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network	Navid Hashemi et.al.	2509.11838	null
2025-09-15	HeLoFusion: An Efficient and Scalable Encoder for Modeling Heterogeneous and Multi-Scale Interactions in Trajectory Prediction	Bingqing Wei et.al.	2509.11719	null
2025-09-14	SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion	Zhiwen Yang et.al.	2509.11171	null
2025-09-13	Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios	Simone Mosco et.al.	2509.10841	null
2025-09-11	Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey	Wei Dai et.al.	2509.10570	null
2025-09-17	DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training	Jianxin Shi et.al.	2509.10426	null
2025-09-12	Multimodal SAM-adapter for Semantic Segmentation	Iacopo Curti et.al.	2509.10408	null
2025-09-12	CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion	Santiago Montiel-Marín et.al.	2509.10139	null
2025-09-12	BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals	Minsang Kong et.al.	2509.10080	null
2025-09-11	MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network	Ge Sun et.al.	2509.09200	null
2025-09-10	LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations	Payal Varshney et.al.	2509.08422	null
2025-09-10	Sparse BEV Fusion with Self-View Consistency for Multi-View Detection and Tracking	Keisuke Toida et.al.	2509.08421	null
2025-09-10	InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection	Zhongyu Xia et.al.	2509.08374	null
2025-09-10	Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities	Rajendramayavan Sathyam et.al.	2509.08302	null
2025-09-10	A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator	Elahe Delavari et.al.	2509.08221	null
2025-09-09	Mean Field Game-Based Interactive Trajectory Planning Using Physics-Inspired Unified Potential Fields	Zhen Tian et.al.	2509.08147	null
2025-09-09	TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models	Zongzheng Zhang et.al.	2509.07962	null
2025-09-09	Bias-Aware Machine Unlearning: Towards Fairer Vision Models via Controllable Forgetting	Sai Siddhartha Chary Aylapuram et.al.	2509.07456	null
2025-09-09	Attention and Risk-Aware Decision Framework for Safe Autonomous Driving	Zhen Tian et.al.	2509.07412	null
2025-09-09	TEGRA: A Flexible & Scalable NextGen Mobile Core	Bilal Saleem et.al.	2509.07410	null
2025-09-08	SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis	Zhengqing Chen et.al.	2509.06798	null
2025-09-08	Adaptive Evolution Factor Risk Ellipse Framework for Reliable and Safe Autonomous Driving	Fujiang Yuan et.al.	2509.06375	null
2025-09-07	Asymmetry Vulnerability and Physical Attacks on Online Map Construction for Autonomous Driving	Yang Lou et.al.	2509.06071	null
2025-09-06	Scenario-based Decision-making Using Game Theory for Interactive Autonomous Driving: A Survey	Zhihao Lin et.al.	2509.05777	null
2025-09-06	Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh	Ha Meem Hossain et.al.	2509.05652	null
2025-09-06	OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision	Ruixun Liu et.al.	2509.05578	null
2025-09-08	LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation	Yinglin Duan et.al.	2509.05263	null
2025-09-05	Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet	Mohammad Saeid et.al.	2509.05198	link
2025-09-05	A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing	Chengkai Xu et.al.	2509.04853	null
2025-09-05	Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization	Dharsan Ravindran et.al.	2509.04735	null
2025-09-04	Bootstrapping Reinforcement Learning with Sub-optimal Policies for Autonomous Driving	Zhihao Zhang et.al.	2509.04712	null
2025-09-04	Domain Adaptation for Different Sensor Configurations in 3D Object Detection	Satoshi Tanaka et.al.	2509.04711	null
2025-09-04	In-Context Policy Adaptation via Cross-Domain Skill Diffusion	Minjong Yoo et.al.	2509.04535	null
2025-09-09	One Flight Over the Gap: A Survey from Perspective to Panoramic Vision	Xin Lin et.al.	2509.04444	null
2025-09-04	TriLiteNet: Lightweight Model for Multi-Task Visual Perception	Quang-Huy Che et.al.	2509.04092	null
2025-09-04	SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation	Han Huang et.al.	2509.03999	null
2025-09-03	SAM-LLM: Interpretable Lane Change Trajectory Prediction via Parametric Finetuning	Zhuo Cao et.al.	2509.03462	null
2025-09-03	Rashomon in the Streets: Explanation Ambiguity in Scene Understanding	Helge Spieker et.al.	2509.03169	null
2025-09-03	Automatically Generating High-Precision Simulated Road Networking in Traffic Scenario	Liang Xie et.al.	2509.02990	null
2025-09-03	KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models	Yujin Wang et.al.	2509.02966	null
2025-09-02	Do LLM Modules Generalize? A Study on Motion Generation for Autonomous Driving	Mingyi Wang et.al.	2509.02754	null
2025-09-02	2nd Place Solution for CVPR2024 E2E Challenge: End-to-End Autonomous Driving Using Vision Language Model	Zilong Guo et.al.	2509.02659	null
2025-09-02	Omnidirectional Spatial Modeling from Correlated Panoramas	Xinshen Zhang et.al.	2509.02164	null
2025-09-02	Txt2Sce: Scenario Generation for Autonomous Driving System Testing Based on Textual Reports	Pin Ji et.al.	2509.02150	null
2025-09-02	Curiosity-Driven Testing for Sequential Decision-Making Process	Junda He et.al.	2509.02025	null
2025-09-02	Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions	Beibei Zhou et.al.	2509.02011	null
2025-09-01	2COOOL: 2nd Workshop on the Challenge Of Out-Of-Label Hazards in Autonomous Driving	Ali K. AlShami et.al.	2508.21080	null
2025-10-22	Interpretable Decision-Making for End-to-End Autonomous Driving	Mona Mirzaie et.al.	2508.18898	null
2025-02-18	OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Shuo Xing et.al.	2412.15208	null
2024-12-05	DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving	Dingrui Wang et.al.	2409.18053	null
2024-04-16	Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap	Carl Lindström et.al.	2403.16092	null
2023-05-30	Selective Communication for Cooperative Perception in End-to-End Autonomous Driving	Hsu-kuang Chiu et.al.	2305.17181	null
2023-09-14	Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges	Yushan Han et.al.	2301.06262	link
2022-06-28	A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning	Balint Gyevnar et.al.	2206.08783	null
2023-07-31	Pushing the Limits of Learning-based Traversability Analysis for Autonomous Driving on CPU	Daniel Fusaro et.al.	2206.03083	null
2021-11-16	A Scenario-Based Platform for Testing Autonomous Vehicle Behavior Prediction Models in Simulation	Francis Indaheng et.al.	2110.14870	null
2021-11-03	Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement	Tianyu Shi et.al.	2110.07067	null
2021-11-22	ViSTA: a Framework for Virtual Scenario-based Testing of Autonomous Vehicles	Andrea Piazzoni et.al.	2109.02529	null
2021-08-10	Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge	Songyang Zhang et.al.	2108.04230	null
2021-04-23	Multi-task Learning with Attention for End-to-end Autonomous Driving	Keishi Ishihara et.al.	2104.10753	null
2021-03-31	Multi-modal Trajectory Prediction for Autonomous Driving with Semantic Map and Dynamic Graph Attention Network	Bo Dong et.al.	2103.16273	null
2023-05-16	Control Strategies for Autonomous Vehicles	Chinmay Vilas Samak et.al.	2011.08729	null
2019-12-03	Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles	Pin Wang et.al.	1912.00074	null
2019-09-18	A*3D Dataset: Towards Autonomous Driving in Challenging Environments	Quang-Hieu Pham et.al.	1909.07541	null
2017-04-11	Deep Reinforcement Learning framework for Autonomous Driving	Ahmad El Sallab et.al.	1704.02532	null

(<a href=#updated-on-20260429>back to top</a>)

Map

Publish Date	Title	Authors	PDF	Code
2009-10-21	Quantum Error Correction Beyond Completely Positive Maps	A. Shabani et.al.	quant-ph/0610028	null
2026-04-28	Leveraging Previous-Traversal Point Cloud Map Priors for Camera-Based 3D Object Detection and Tracking	Markus Käppeler et.al.	2604.25405	null
2026-04-23	SLAM as a Stochastic Control Problem with Partial Information: Optimal Solutions and Rigorous Approximations	Ilir Gusija et.al.	2604.21693	null
2026-04-17	Environment-Adaptive Solid-State LiDAR-Inertial Odometry	Zhi Zhang et.al.	2604.15864	null
2026-04-17	GaussianFlow SLAM: Monocular Gaussian Splatting SLAM Guided by GaussianFlow	Dong-Uk Seo et.al.	2604.15612	null
2026-04-15	BIEVR-LIO: Robust LiDAR-Inertial Odometry through Bump-Image-Enhanced Voxel Maps	Patrick Pfreundschuh et.al.	2604.14421	null
2026-04-13	Bridging the RGB-IR Gap: Consensus and Discrepancy Modeling for Text-Guided Multispectral Detection	Jiaqi Wu et.al.	2604.11234	null
2026-04-13	Ψ-Map: Panoptic Surface Integrated Mapping Enables Real2Sim Transfer	Xuan Yu et.al.	2604.10982	null
2026-04-10	HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation	Chengjie Fan et.al.	2604.08883	null
2026-04-08	RichMap: A Reachability Map Balancing Precision, Efficiency, and Flexibility for Rich Robot Manipulation Tasks	Yupu Lu et.al.	2604.06778	null
2026-04-26	TOL: Textual Localization with OpenStreetMap	Youqi Liao et.al.	2604.01644	null
2026-04-01	Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM	Monica M. Q. Li et.al.	2604.00804	null
2026-03-30	Pandora: Articulated 3D Scene Graphs from Egocentric Vision	Alan Yu et.al.	2603.28732	null
2026-03-24	Active Robotic Perception for Disease Detection and Mapping in Apple Trees	Hayden Feddock et.al.	2603.23112	null
2026-03-18	Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting	Guillem Casadesus Vila et.al.	2603.18218	null
2026-03-17	ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery	Weiqin Jiao et.al.	2603.16616	null
2026-03-06	Word-Anchored Temporal Forgery Localization	Tianyi Wang et.al.	2603.06220	null
2026-03-03	Probabilistic Occupancy Grid for Radio-Based SLAM	Xuhong Li et.al.	2603.03559	null
2026-03-02	Randomized Neural Networks for Partial Differential Equation on Static and Evolving Surfaces	Jingbo Sun et.al.	2603.01689	null
2026-03-02	B$^2$F-Map: Crowd-sourced Mapping with Bayesian B-spline Fusion	Yiping Xie et.al.	2603.01673	null
2026-02-25	Tacmap: Bridging the Tactile Sim-to-Real Gap via Geometry-Consistent Penetration Depth Map	Lei Su et.al.	2602.21625	null
2026-02-22	Beyond Behavioural Trade-Offs: Mechanistic Tracing of Pain-Pleasure Decisions in an LLM	Francesca Bianco et.al.	2602.19159	null
2026-03-15	H.265/HEVC Video Steganalysis Based on CU Block Structure Gradients and IPM Mapping	Xiang Zhang et.al.	2602.11547	null
2026-02-08	SPOT: Spatio-Temporal Obstacle-free Trajectory Planning for UAVs in an Unknown Dynamic Environment	Astik Srivastava et.al.	2602.01189	null
2026-02-03	MapDream: Task-Driven Map Learning for Vision-Language Navigation	Guoxin Lian et.al.	2602.00222	null
2026-01-18	OpenNavMap: Structure-Free Topometric Mapping via Large-Scale Collaborative Localization	Jianhao Jiao et.al.	2601.12291	null
2025-12-16	ACE-SLAM: Scene Coordinate Regression for Neural Implicit Real-Time SLAM	Ignacio Alzugaray et.al.	2512.14032	null
2025-12-05	Categorifying isomonodromic deformations via Lie groupoids I: Logarithmic singularities	Waleed Qaisar et.al.	2512.05966	null
2025-12-05	AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement	Munsif Ali et.al.	2512.05960	null
2025-12-05	World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty	Zhiting Mei et.al.	2512.05927	null
2025-12-05	Invariant polynomials, gaps, and sparseness	John P. D’Angelo et.al.	2512.05892	null
2025-12-05	Machine-learning-enabled interpretation of tribological deformation patterns in large-scale MD data	Hendrik J. Ehrich et.al.	2512.05818	null
2025-12-05	Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation	Fabian Konstantinidis et.al.	2512.05812	null
2025-12-05	Emergence of Language in the Developing Brain	Linnea Evanson et.al.	2512.05718	null
2025-12-05	Physics-Informed Graph Neural Network with Frequency-Aware Learning for Optical Aberration Correction	Yong En Kok et.al.	2512.05683	null
2025-12-05	Scenario-aware Uncertainty Quantification for Trajectory Prediction with Statistical Guarantees	Yiming Shu et.al.	2512.05682	null
2025-12-05	The Power of Network Pluralism: Multi-Perspective Modeling of Heterogeneous Legal Document Networks	Titus Pünder et.al.	2512.05679	null
2025-12-05	Over-the-Air Semantic Alignment with Stacked Intelligent Metasurfaces	Mario Edoardo Pandolfo et.al.	2512.05657	null
2025-12-05	Modular Jets for Supervised Pipelines: Diagnosing Mirage vs Identifiability	Suman Sanyal et.al.	2512.05638	null
2025-12-05	Experts-Guided Unbalanced Optimal Transport for ISP Learning from Unpaired and/or Paired Data	Georgy Perevozchikov et.al.	2512.05635	null
2025-12-05	Sticky eigenstates in systems with sharply-divided phase space	Hua Yan et.al.	2512.05627	null
2025-12-05	Refined HLA Linkage Disequilibrium Architectures of World Populations by a Novel Allelic Correlation Measure	Fei Zhang et.al.	2512.05573	null
2025-12-05	Knowing Your Uncertainty – On the application of LLM in social sciences	Bolun Zhang et.al.	2512.05461	null
2025-12-05	On-Orbit Calibration of Danuri/PolCam. I. Geometric Calibration	Kilho Baek et.al.	2512.05330	null
2025-12-04	Restriction of the metaplectic representation over a $p$-adic field to an anisotropic torus	Khemais Maktouf et.al.	2512.05317	null
2026-01-16	Systematically Evaluating Equivalent Purpose for Digital Maps	Brandon Biggs et.al.	2512.05310	null
2025-12-04	Seabed-to-Sky Mapping of Maritime Environments with a Dual Orthogonal SONAR and LiDAR Sensor Suite	Christian Westerdahl et.al.	2512.05303	null
2025-12-04	Stable Single-Pixel Contrastive Learning for Semantic and Geometric Tasks	Leonid Pogorelyuk et.al.	2512.04970	null
2025-12-04	Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty	Kailiang Liu et.al.	2512.04918	null
2025-12-04	VNS Tokamak OpenMC-Serpent Validation for Medical Isotope Studies	Christopher Ehrich et.al.	2512.04873	null
2025-12-04	LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation	Huynh Trinh Ngoc et.al.	2512.04821	null
2025-12-04	TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards	Mauro Martini et.al.	2512.04772	null
2025-12-04	Spectral micro-CT for quantitative analysis of calcification in fibrocartilage	Vittoria Mazzini et.al.	2512.04662	null
2025-12-04	Standard audiogram classification from loudness scaling data using unsupervised, supervised, and explainable machine learning techniques	Chen Xu et.al.	2512.04616	null
2025-12-04	Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot	Sheng Hang et.al.	2512.04599	link
2025-12-04	Prompt2Craft: Generating Functional Craft Assemblies with LLMs	Vitor Hideyo Isume et.al.	2512.04568	null
2025-12-04	Convergence Dynamics and Scaling Laws in the Dissipative Relativistic Kicked Rotator	Daniel Borin et.al.	2512.04471	null
2025-12-04	MAFNet:Multi-frequency Adaptive Fusion Network for Real-time Stereo Matching	Ao Xu et.al.	2512.04358	null
2025-12-03	UniLight: A Unified Representation for Lighting	Zitian Zhang et.al.	2512.04267	null
2025-12-03	Warped & Hooked: Mapping the Magellanic Clouds in 3D using Red Clump stars	Slater J. Oden et.al.	2512.04200	null
2025-12-03	SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows	Qinyu Zhao et.al.	2512.04084	null
2025-12-03	The Loss Landscape of Powder X-Ray Diffraction-Based Structure Optimization Is Too Rough for Gradient Descent	Nofit Segal et.al.	2512.04036	null
2025-12-03	Learning Group Actions In Disentangled Latent Image Representations	Farhana Hossain Swarnali et.al.	2512.04015	null
2025-12-03	MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction	Guole Shen et.al.	2512.03939	null
2025-12-03	Rethinking Collapse: Coupling Quantum States to Classical Bits with quasi-probabilities	Dagomir Kaszlikowski et.al.	2512.03929	null
2025-12-03	Feature-aware Modulation for Learning from Temporal Tabular Data	Hao-Run Cai et.al.	2512.03678	null
2025-12-03	Multi-Scale Visual Prompting for Lightweight Small-Image Classification	Salim Khazem et.al.	2512.03663	null
2025-12-03	MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms	Jiahao Zhang et.al.	2512.03640	null
2025-12-03	From fractional Chern insulators to topological electronic crystals in moiré MoTe2: quantum geometry tuning via remote layer	Feng Liu et.al.	2512.03622	null
2025-12-03	Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding	Haoran Zhou et.al.	2512.03601	null
2025-12-03	Quantum Hash Function Based on Spectral Properties of Graphs and Discrete Walker Dynamics	Mohana Priya Thinesh Kumar et.al.	2512.03581	null
2025-12-03	GeoVideo: Introducing Geometric Regularization into Video Generation Model	Yunpeng Bai et.al.	2512.03453	null
2025-12-03	What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models	Tianchen Deng et.al.	2512.03422	null
2025-12-03	Surfel-LIO: Fast LiDAR-Inertial Odometry with Pre-computed Surfels and Hierarchical Z-order Voxel Hashing	Seungwon Choi et.al.	2512.03397	null
2025-12-03	New linear invariants of hypergraphs	Peter A. Brooksbank et.al.	2512.03342	null
2025-12-03	Epistemic Substitution: How Grokipedia’s AI-Generated Encyclopedia Restructures Authority	Aliakbar Mehdizadeh et.al.	2512.03337	null
2025-12-03	When does Gaussian equivalence fail and how to fix it: Non-universal behavior of random features with quadratic scaling	Garrett G. Wen et.al.	2512.03325	null
2025-12-03	NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction	Thomas Monninger et.al.	2512.03317	null
2025-12-02	Retrofitting Earth System Models with Cadence-Limited Neural Operator Updates	Aniruddha Bora et.al.	2512.03309	null
2025-12-02	Learning Network Sheaves for AI-native Semantic Communication	Enrico Grimaldi et.al.	2512.03248	null
2025-12-02	CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models	Minkyung Kwon et.al.	2512.03045	link
2025-12-02	U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences	Xiang Xu et.al.	2512.02982	link
2025-12-02	Unipotent quantum coordinate ring and minuscule prefundamental representations: twisted case	Il-Seung Jang et.al.	2512.02946	null
2025-12-02	MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding	Fan Yang et.al.	2512.02906	null
2025-12-02	FAIRY2I: Universal Extremely-Low Bit QAT framework via Widely-Linear Representation and Phase-Aware Quantization	Feiyu Wang et.al.	2512.02901	link
2025-12-02	Assessing the performance of correlation-based multi-fidelity neural emulators	Cristian J. Villatoro et.al.	2512.02868	null
2025-12-02	Revisiting Theory of Contrastive Learning for Domain Generalization	Ali Alvandi et.al.	2512.02831	null
2025-12-02	Implementation and Analysis of Quantum Majority Rules under Noisy Conditions	Gal Amit et.al.	2512.02813	null
2025-12-02	Exploring Definitions of Quality and Diversity in Sonic Measurement Spaces	Björn Þór Jónsson et.al.	2512.02783	null
2025-12-02	Digit-Indexed q-ary SEC-DED Codes with Near-Hamming Overhead	Jiaxu Hu et.al.	2512.02747	null
2025-12-02	Efficient Simulation of the 2D Hubbard Model via Hilbert Space-Filling Curve Mapping	Ashkan Abedi et.al.	2512.02666	null
2025-12-02	PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes	Derui Shan et.al.	2512.02664	null
2025-12-02	Content-Aware Texturing for Gaussian Splatting	Panagiotis Papantonakis et.al.	2512.02621	null
2025-12-02	Quantum LLMs Using Quantum Computing to Analyze and Process Semantic Information	Timo Aukusti Laine et.al.	2512.02619	null
2025-12-02	Interface Correlators in Symmetric Product Orbifolds	Sebastian Harris et.al.	2512.02616	null
2025-12-02	Updates on dipolar anisotropy in local measurements of the Hubble constant from Cosmicflows-4	Vincenzo Salzano et.al.	2512.02526	null
2025-12-02	Individual-specific precision neuroimaging of learning-related plasticity	Simon Leipold et.al.	2512.02503	null
2025-12-02	Quantum-Based Self-Attention Mechanism for Hardware-Aware Differentiable Quantum Architecture Search	Yuxiang Liu et.al.	2512.02476	null
2025-12-02	nuScenes Revisited: Progress and Challenges in Autonomous Driving	Whye Kit Fong et.al.	2512.02448	null
2025-12-02	Vehicle Dynamics Embedded World Models for Autonomous Driving	Huiqian Li et.al.	2512.02417	null
2025-12-01	ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation	Chenyang Gu et.al.	2512.02013	null
2025-12-01	JWST & the Waz Arc I: Spatially Resolving the Physical Conditions within a Post-Starburst Galaxy at Redshift 5 with NIRSpec IFS	Taylor A. Hutchison et.al.	2512.02000	null
2025-12-01	The Lebesgue constant for uniform approximation of differential forms	Ludovico Bruni Bruno et.al.	2512.01944	null
2025-12-01	SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception	Gurmeher Khurana et.al.	2512.01908	null
2025-12-01	Decision Tree Embedding by Leaf-Means	Cencheng Shen et.al.	2512.01819	link
2025-12-01	Secure Over-the-Air Computation Against Multiple Eavesdroppers using Correlated Artificial Noise	David Nordlund et.al.	2512.01778	null
2025-12-01	DiG-Flow: Discrepancy-Guided Flow Matching for Robust VLA Models	Wanpeng Zhang et.al.	2512.01715	link
2025-12-01	A unified framework for geometry-independent operator learning in cardiac electrophysiology simulations	Bei Zhou et.al.	2512.01702	null
2025-12-01	Integrating Artificial Intelligence and Mixed Integer Linear Programming: Explainable Graph-Based Instance Space Analysis in Air Transportation	Artur Guerra Rosa et.al.	2512.01698	null
2026-03-12	The Spin-MInt Algorithm: an Accurate and Symplectic Propagator for the Spin-Mapping Representation of Nonadiabatic Dynamics	Lauren E. Cook et.al.	2512.01579	null
2025-12-01	QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions	Can Polat et.al.	2512.01519	null
2025-12-01	RadioPiT: Radio Map Generation with Pixel Transformer Driven by Ultra-Sparse Real-World Data	Zeyao Sun et.al.	2512.01451	null
2025-12-01	Consistency Flow Model Achieves One-step Denoising Error Correction Codes	Haoyu Lei et.al.	2512.01389	null
2025-12-01	Reversible Inversion for Training-Free Exemplar-guided Image Editing	Yuke Li et.al.	2512.01382	null
2025-12-01	EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly	Xiaokun Pan et.al.	2512.01296	link
2025-12-01	Diffusion Model in Latent Space for Medical Image Segmentation Task	Huynh Trinh Ngoc et.al.	2512.01292	null
2025-12-01	Knowledge Graph Augmented Large Language Models for Next-Visit Disease Prediction	Ruiyu Wang et.al.	2512.01210	null
2025-12-01	Pay Attention Later: From Vector Space Diffusion to Linearithmic Spectral Phase-Locking	Alper Yıldırım et.al.	2512.01208	null
2025-11-30	Generalized Medical Phrase Grounding	Wenjun Zhang et.al.	2512.01085	null
2025-11-30	Stability analysis of action potential generation using Markov models of voltage-gated sodium channel isoforms	Youssof Abdullah et.al.	2512.01058	null
2025-11-28	Detection of the Pairwise Kinematic Sunyaev-Zel’dovich Effect and Pairwise Velocity with DESI DR1 Galaxies and ACT DR6 and Planck CMB Data	Yulin Gong et.al.	2511.23417	null
2025-11-28	Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon	Isabel Whiteley Tscherniak et.al.	2511.23384	null
2025-11-28	DAONet-YOLOv8: An Occlusion-Aware Dual-Attention Network for Tea Leaf Pest and Disease Detection	Yefeng Wu et.al.	2511.23222	null
2025-11-28	Robust 3DGS-based SLAM via Adaptive Kernel Smoothing	Shouhe Zhang et.al.	2511.23221	null
2025-11-28	Quantum graphs in infinite-dimensions: Hilbert–Schmidts and Hilbert modules	Matthew Daws et.al.	2511.23121	null
2025-11-28	Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM	Shouhe Zhang et.al.	2511.22968	null
2025-11-28	Scalable Diffusion Transformer for Conditional 4D fMRI Synthesis	Jungwoo Seo et.al.	2511.22870	null
2025-11-28	CoordSpeaker: Exploiting Gesture Captioning for Coordinated Caption-Empowered Co-Speech Gesture Generation	Fengyi Fang et.al.	2511.22863	null
2025-11-28	Plumbings of lens spaces and crepant resolutions of compound $A_n$ singularities	Bilun Xie et.al.	2511.22837	null
2025-11-27	A Functional Field Theorem: An Explicit Proof of Axioms and Equations for Applying iSAFT in Polymer Field Theory	Maximo T. Estrada et.al.	2511.22760	null
2025-11-27	Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction	Boyao Zhou et.al.	2511.22704	null
2025-11-27	Emergent Extreme-View Geometry in 3D Foundation Models	Yiwen Zhang et.al.	2511.22686	null
2025-11-27	Spatially Aware Dictionary-Free Eigenfunction Identification for Modeling and Control of Nonlinear Dynamical Systems	David Grasev et.al.	2511.22648	null
2025-11-27	Non-Gaussianity in SMICA	M. Citran et.al.	2511.22641	null
2025-11-27	Bringing Your Portrait to 3D Presence	Jiawei Zhang et.al.	2511.22553	link
2025-11-27	DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA	Ahmad Mohammadshirazi et.al.	2511.22521	null
2025-11-27	Design of Cycles by Impulsive Feedback: Application to Discrete Dosing	Alexander Medvedev et.al.	2511.22417	null
2025-11-27	FADiff: Fusion-Aware Differentiable Optimization for DNN Scheduling on Tensor Accelerators	Shuao Jia et.al.	2511.22348	null
2025-11-27	NOMA Assisted Downlink Power Allocation in Pinching Antenna Systems Using Convolutional Neural Network	Saeed Mohammadzadeh et.al.	2511.22328	null
2025-11-27	UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries	Hoang-Bao Le et.al.	2511.22253	null
2025-11-26	Machine Learning Approaches to Clinical Risk Prediction: Multi-Scale Temporal Alignment in Electronic Health Records	Wei-Chen Chang et.al.	2511.21561	null
2025-11-26	CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation	Shizhe Sun et.al.	2511.21503	link
2025-11-26	Scaling limits of critical FK-decorated random planar maps with $q=4$	William Da Silva et.al.	2511.21480	null
2025-11-26	$\texttt{CRLS}$: Convolutional Regularized Least Squares Framework for Reduced Order Modeling of Transonic Flows	Muhammad Bilal et.al.	2511.21425	null
2025-11-26	Bombyx: OpenCilk Compilation for FPGA Hardware Acceleration	Mohamed Shahawy et.al.	2511.21346	null
2025-11-26	Discovery and recovery of crystalline materials with property-conditioned transformers	Cyprien Bone et.al.	2511.21299	link
2025-11-26	Rigidity of bounded-type Siegel polynomials	Kostiantyn Drach et.al.	2511.21246	null
2025-11-26	When barchan dunes move over craters	Paulo Vitor Ribeiro Plácido et.al.	2511.21177	null
2025-11-26	Referring Video Object Segmentation with Cross-Modality Proxy Queries	Baoli Sun et.al.	2511.21139	link
2025-11-26	MNM : Multi-level Neuroimaging Meta-analysis with Hyperbolic Brain-Text Representations	Seunghun Baek et.al.	2511.21092	null
2025-11-26	Witness wedges in fidelity-deviation plane: separating teleportation advantage and Bell-inequality violation	Kyoungho Cho et.al.	2511.21079	null
2025-11-25	Exploring Time-Step Size in Reinforcement Learning for Sepsis Treatment	Yingchuan Sun et.al.	2511.20913	null
2025-11-25	Restoring a Missing Meta-Symmetry of Quantum Mechanics	Sheng Ran et.al.	2511.20907	null
2025-11-25	Primal: A Unified Deterministic Framework for Quasi-Orthogonal Hashing and Manifold Learning	Vladimer Khasia et.al.	2511.20839	null
2025-11-25	Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model	Ziyue Wang et.al.	2511.20636	null
2025-11-25	Quantum Key Distribution: Bridging Theoretical Security Proofs, Practical Attacks, and Error Correction for Quantum-Augmented Networks	Nitin Jha et.al.	2511.20602	null
2025-11-25	Time-Domain Linear Model-based Framework for Passive Acoustic Mapping of Cavitation Activity	Tatiana Gelvez-Barrera et.al.	2511.20551	null
2025-11-25	Wide Area Surface Dosimetry with Conformal Scintillator Array for External Beam Radiotherapy	Roman Vasyltsiv et.al.	2511.20472	null
2025-11-25	MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts	Zilong Huang et.al.	2511.20415	null
2025-11-26	VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild	Xin Ming et.al.	2511.20366	null
2025-11-25	Quality-guided UAV Surface Exploration for 3D Reconstruction	Benjamin Sportich et.al.	2511.20353	null
2025-11-25	Plumbing Analog of Molecular Computation	Roger D. Jones et.al.	2511.20339	null
2025-11-25	Data Augmentation Techniques to Reverse-Engineer Neural Network Weights from Input-Output Queries	Alexander Beiser et.al.	2511.20312	null
2025-11-25	In-Context Compositional Learning via Sparse Coding Transformer	Wei Chen et.al.	2511.20194	null
2025-11-25	Alzheimers Disease Progression Prediction Based on Manifold Mapping of Irregularly Sampled Longitudinal Data	Xin Hong et.al.	2511.20154	null
2025-11-25	Designs on the Tautological bundle	Ikeda Yuya et.al.	2511.20114	null
2025-11-25	ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction	Yuanzhe Li et.al.	2511.20020	null
2025-11-25	iRadioDiff: Physics-Informed Diffusion Model for Indoor Radio Map Construction and Localization	Xiucheng Wang et.al.	2511.20015	null
2025-11-25	HybriDLA: Hybrid Generation for Document Layout Analysis	Yufan Chen et.al.	2511.19919	null
2025-11-25	MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization	Chengyue Huang et.al.	2511.19878	null
2025-11-25	DOGE: Differentiable Bezier Graph Optimization for Road Network Extraction	Jiahui Sun et.al.	2511.19850	null
2025-11-25	Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation	Xuewen Liu et.al.	2511.19835	link
2025-11-24	Rigidity of $\mathbf{SU(2)}$ and $\mathbf{SO(3)}$ quantum representations of mapping class groups at prime levels	Pierre Godfard et.al.	2511.19795	null
2025-11-24	Flow Map Distillation Without Data	Shangyuan Tong et.al.	2511.19428	null
2025-11-24	Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection	Zixuan Wang et.al.	2511.19306	null
2025-11-24	IOMMU Support for Virtual-Address Remote DMA in an ARMv8 environment	Antonis Psistakis et.al.	2511.19258	null
2025-11-24	SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control	Yuxuan Wang et.al.	2511.19236	null
2025-11-27	Learning Plug-and-play Memory for Guiding Video Diffusion Models	Selena Song et.al.	2511.19229	null
2025-11-24	In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings	Teresa Guallart-Naval et.al.	2511.19226	null
2025-11-24	MambaRefine-YOLO: A Dual-Modality Small Object Detector for UAV Imagery	Shuyu Cao et.al.	2511.19134	null
2025-11-24	Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation	Siqi Ding et.al.	2511.19114	null
2025-11-24	The TAG array of a multiple sequence alignment	Jannik Olbrich et.al.	2511.19068	null
2025-11-26	Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors	Yuchen Zhou et.al.	2511.19031	null
2025-11-24	3D Dynamic Radio Map Prediction Using Vision Transformers for Low-Altitude Wireless Networks	Nguyen Duc Minh Quang et.al.	2511.19019	null
2025-11-24	Web of Non-invertible Dualities for (2+1) Dimensional Models with Subsystem Symmetries	Avijit Maity et.al.	2511.18969	null
2025-11-24	GContextFormer: A global context-aware hybrid multi-head attention approach with scaled additive aggregation for multimodal trajectory prediction	Yuzhi Chen et.al.	2511.18874	null
2025-11-24	Mitigating Long-Tail Bias in HOI Detection via Adaptive Diversity Cache	Yuqiu Jiang et.al.	2511.18811	null
2025-11-24	Sentiment Analysis of Financial Text Using Quantum Language Processing QDisCoCirc	Takayuki Sakuma et.al.	2511.18804	null
2025-11-24	ChronoGS: Disentangling Invariants and Changes in Multi-Period Scenes	Zhongtao Wang et.al.	2511.18794	null
2025-11-24	SAOT: An Enhanced Locality-Aware Spectral Transformer for Solving PDEs	Chenhong Zhou et.al.	2511.18777	null
2025-11-24	From Features to Reference Points: Lightweight and Adaptive Fusion for Cooperative Autonomous Driving	Yongqi Zhu et.al.	2511.18757	null
2025-11-24	Robust Multimodal Sentiment Analysis with Distribution-Based Feature Recovery and Fusion	Daiqing Wu et.al.	2511.18751	null
2025-11-24	Seeing What Matters: Visual Preference Policy Optimization for Visual Generation	Ziqi Ni et.al.	2511.18719	null
2025-11-21	DSeq-JEPA: Discriminative Sequential Joint-Embedding Predictive Architecture	Xiangteng He et.al.	2511.17354	link
2025-11-21	Learning Latent Transmission and Glare Maps for Lens Veiling Glare Removal	Xiaolong Qian et.al.	2511.17353	null
2025-11-21	Phase-adjusted realification of a $\mathbb{C}^3$ Kochen-Specker configuration into $\mathbb{R}^6$	Andrei Khrennikov et.al.	2511.17223	null
2025-11-21	FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception	Shubham Sonarghare et.al.	2511.17210	null
2025-11-21	SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors	Kunyi Li et.al.	2511.17207	null
2025-11-21	A lightweight detector for real-time detection of remote sensing images	Qianyi Wang et.al.	2511.17147	null
2025-11-21	Generative MIMO Beam Map Construction for Location Recovery and Beam Tracking	Wangqian Chen et.al.	2511.17007	null
2025-11-21	MirrorMind: Empowering OmniScientist with the Expert Perspectives and Collective Knowledge of Human Scientists	Qingbin Zeng et.al.	2511.16997	null
2025-11-21	MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis	Di Luo et.al.	2511.16957	null
2025-11-21	UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation	Chi Zhang et.al.	2511.16917	null
2025-11-21	A deep ALMA Band 3 survey of HDFS/MUSE3D: Survey description and initial results	Hugo Messias et.al.	2511.16909	null
2025-11-20	Evolution mapping III: A new recipe for the halo mass function	Andrea Fiorilli et.al.	2511.16730	null
2025-11-20	Quasiparticle Variational Quantum Eigensolver	Saavanth Velury et.al.	2511.16721	null
2025-11-20	SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation	Zhenyuan Qin et.al.	2511.16666	null
2025-11-20	Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems	Elias Lumer et.al.	2511.16654	null
2025-11-20	Toward Artificial Palpation: Representation Learning of Touch on Soft Bodies	Zohar Rimon et.al.	2511.16596	null
2025-11-21	POMA-3D: The Point Map Way to 3D Scene Understanding	Ye Mao et.al.	2511.16567	null
2025-11-20	TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval	Özay Ezerceli et.al.	2511.16528	null
2025-11-20	Flow-Aided Flight Through Dynamic Clutters From Point To Motion	Bowen Xu et.al.	2511.16372	null
2025-11-20	Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling	Minseok Seo et.al.	2511.16301	null
2025-11-20	Optimizing 3D Gaussian Splattering for Mobile GPUs	Md Musfiqur Rahman Sanim et.al.	2511.16298	null
2025-11-20	Exponential map in DT theory	Sarunas Kaubrys et.al.	2511.16261	null
2025-11-20	Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning	Yibin Huang et.al.	2511.16160	null
2025-11-20	VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation	Chenyang Wu et.al.	2511.16124	null
2025-11-20	Clustered Error Correction with Grouped 4D Gaussian Splatting	Taeho Kang et.al.	2511.16112	null
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-20	JWST observations of cosmic-ray-excited H$_2$ in Barnard 68: spatial variations and constraints on cosmic-ray attenuation	David A. Neufeld et.al.	2511.16003	null
2025-11-20	Self-supervised and Multi-fidelity Learning for Extended Predictive Soil Spectroscopy	Luning Sun et.al.	2511.15965	null
2025-11-20	A Simple and Robust Multi-Fidelity Data Fusion Method for Effective Modeling of Citizen-Science Air Pollution Data	Camilla Andreozzi et.al.	2511.15942	null
2025-11-19	A scattering perspective on gravitational lensing	Mariana Carrillo Gonzalez et.al.	2511.15797	null
2025-11-19	Joint Semantic-Channel Coding and Modulation for Token Communications	Jingkai Ying et.al.	2511.15699	null
2025-11-19	The JWST weather report from the nearest brown dwarfs III: Heterogeneous clouds and Thermochemical instabilities as possible drivers of WISE 1049AB’s spectroscopic variability	Natalia Oliveros-Gomez et.al.	2511.15667	null
2025-11-19	Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography	Shourov Joarder et.al.	2511.15640	null
2025-11-19	Cartan meets Cramér-Rao	Sunder Ram Krishnan et.al.	2511.15612	null
2025-11-19	From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers	Huiyuan Tian et.al.	2511.15572	null
2025-11-19	RS-CA-HSICT: A Residual and Spatial Channel Augmented CNN Transformer Framework for Monkeypox Detection	Rashid Iqbal et.al.	2511.15476	null
2025-11-19	Fidelity-Preserving Quantum Encoding for Quantum Neural Networks	Yuhu Lu et.al.	2511.15363	null
2025-11-19	Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection	Spyridon Loukovitis et.al.	2511.15343	null
2025-11-19	Physics-Based Benchmarking Metrics for Multimodal Synthetic Images	Kishor Datta Gupta et.al.	2511.15204	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	null
2025-11-19	CASPER: Cross-modal Alignment of Spatial and single-cell Profiles for Expression Recovery	Amit Kumar et.al.	2511.15139	null
2025-11-19	Proper derivation of subspace mapping from whole space mapping in boson expansion theory	Kimikazu Taniguchi et.al.	2511.15129	null
2025-11-19	WiCo-MG: Wireless Channel Foundation Model for Multipath Generation via Synesthesia of Machines	Zengrui Han et.al.	2511.15026	null
2025-11-18	EGSA-PT:Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects	Gbenga Omotara et.al.	2511.14970	null
2025-11-18	From minimal-length quantum theory to modified gravity	Rocco D’Agostino et.al.	2511.14869	null
2025-11-18	Geometry of Generalized Density Functional Theories	Chih-Chun Wang et.al.	2511.14822	null
2025-11-18	High-resolution weak lensing mass mapping from DES-Y3 data using diffusion-based prior	Supranta S. Boruah et.al.	2511.14667	null
2025-11-18	Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains	Qingwei Ben et.al.	2511.14625	null
2025-11-18	A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder	Dengyun Huang et.al.	2511.14600	null
2025-11-18	A Bayesian INLA-SPDE Approach to Spatio-Temporal Point-Grid Fusion with Change-of-Support and Misaligned Covariates	Weiyue Zheng et.al.	2511.14535	null
2025-11-18	A Generative Data Framework with Authentic Supervision for Underwater Image Restoration and Enhancement	Yufeng Tian et.al.	2511.14521	null
2025-11-18	Adversarial Learning-Based Radio Map Reconstruction for Fingerprinting Localization	Jiaming Zhang et.al.	2511.14495	null
2025-11-18	Covariance-based Imaging and Multi-View Fusion for Networked Sensing	Junyuan Gao et.al.	2511.14490	null
2025-11-18	Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation	Aditi Agarwal et.al.	2511.14481	null
2025-11-18	Going Places: Place Recognition in Artificial and Natural Systems	Michael Milford et.al.	2511.14341	null
2025-11-18	MA-SLAM: Active SLAM in Large-Scale Unknown Environment using Map Aware Deep Reinforcement Learning	Yizhen Yin et.al.	2511.14330	null
2025-11-18	LSP-YOLO: A Lightweight Single-Stage Network for Sitting Posture Recognition on Embedded Devices	Nanjun Li et.al.	2511.14322	null
2025-11-18	Secure parameter identification of ARX systems with CKKS cryptosystem	Jialong Chen et.al.	2511.14267	null
2025-11-18	Harnessing Deep LLM Participation for Robust Entity Linking	Jiajun Hou et.al.	2511.14181	null
2025-11-18	SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts	Fan Zhang et.al.	2511.14093	null
2025-11-17	Structural Flexibility of the TCF7L2-DNA Complex with the Type 2 Diabetes SNP rs7903146	Karthik Venuturimilli et.al.	2511.13916	null
2025-11-17	TaoSearchEmb: A Multi-Objective Reinforcement Learning Framework for Dense Retrieval in Taobao Search	Xingxian Liu et.al.	2511.13885	null
2025-11-17	Dynamic state estimation of hybrid systems: Inverters that switch between grid-following and grid-forming control schemes	Bukunmi G. Odunlami et.al.	2511.13872	null
2025-11-17	GRLoc: Geometric Representation Regression for Visual Localization	Changyang Li et.al.	2511.13864	null
2025-11-17	RSPose: Ranking Based Losses for Human Pose Estimation	Muhammed Can Keles et.al.	2511.13857	null
2025-11-17	Aletheia: Emulating the non-linear matter power spectrum in the context of evolution mapping	Ariel G. Sanchez et.al.	2511.13826	null
2025-11-17	Bosonisation Cohomology: Spin Structure Summation in Every Dimension	Philip Boyle Smith et.al.	2511.13718	null
2025-11-17	Composition and Coherence: The Syntax of Operator Networks	Shih-Yu Chang et.al.	2511.13706	null
2025-11-17	Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting	Jiangnan Ye et.al.	2511.13684	null
2025-11-17	HilbMult: A Banach-Enriched Multicategory for Operator Algebras	Shih-Yu Chang et.al.	2511.13674	null
2025-11-17	Universal Kernel Models for Iterated Completely Positive Maps	James Tian et.al.	2511.13599	null
2025-11-17	Electron Correlation by Exchange Mapping in Electronic Structure Calculations	Jerry L. Whitten et.al.	2511.13570	null
2025-11-17	Sequences of Bivariate Bicycle Codes from Covering Graphs	Benjamin C. B. Symons et.al.	2511.13560	null
2025-11-17	Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew	Farhin Farhad Riya et.al.	2511.13535	null
2025-11-17	FUSE: A Flow-based Mapping Between Shapes	Lorenzo Olearo et.al.	2511.13431	null
2025-11-17	Learning Cosmology from Nearest Neighbour Statistics	Atrideb Chatterjee et.al.	2511.13393	null
2025-11-17	Cognitive Maps in Language Models: A Mechanistic Analysis of Spatial Planning	Caroline Baumgartner et.al.	2511.13371	null
2025-11-17	Unifying points of interest taxonomies: mapping OpenStreetMap tags to the Foursquare category system	Lilou Soulas et.al.	2511.13369	null
2025-11-17	Computer Vision based group activity detection and action spotting	Narthana Sivalingam et.al.	2511.13315	null
2025-11-17	PyPeT: A Python Perfusion Tool for Automated Quantitative Brain CT and MR Perfusion Analysis	Marijn Borghouts et.al.	2511.13310	null
2025-11-17	The free Banach $f$-algebra generated by a Banach space	David Muñoz-Lahoz et.al.	2511.13299	null
2025-11-17	Vortex creep heating in neutron star cooling with direct Urca processes in heavy neutron stars	Yoonhak Nam et.al.	2511.13263	null
2025-11-17	GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry	Chiyun Noh et.al.	2511.13216	null
2025-11-17	Collision-Free Navigation of Mobile Robots via Quadtree-Based Model Predictive Control	Osama Al Sheikh Ali et.al.	2511.13188	null
2025-11-17	Region-Point Joint Representation for Effective Trajectory Similarity Learning	Hao Long et.al.	2511.13125	null
2025-11-17	A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features	Hanzhe Liang et.al.	2511.13115	null
2025-11-14	Estimating Total Effects in Bipartite Experiments with Spillovers and Partial Eligibility	Albert Tan et.al.	2511.11564	null
2025-11-14	Terrain Costmap Generation via Scaled Preference Conditioning	Luisa Mao et.al.	2511.11529	null
2025-11-14	Lipschitz modulus of the argmin mapping in convex quadratic optimization	María Josefa Cánovas et.al.	2511.11455	null
2025-11-14	VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation	Maximilian Rokuss et.al.	2511.11450	null
2025-11-14	Robust inverse material design with physical guarantees using the Voigt-Reuss Net	Sanath Keshav et.al.	2511.11388	null
2025-11-14	SCL Decoding of Non-Binary Linear Block Codes	Jingyu Lin et.al.	2511.11256	null
2025-11-14	Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs	Jitesh Chavan et.al.	2511.11243	null
2025-11-14	RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting	Ruocheng Wu et.al.	2511.11213	null
2025-11-14	Galactic foreground residual biases in CMB lensing convergence reconstruction and delensing of B-mode maps	Kishan Deka et.al.	2511.11147	null
2025-11-14	Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation	Zhiwei Zhang et.al.	2511.11011	null
2025-11-14	Binary Verification for Zero-Shot Vision	Jeffrey Liu et.al.	2511.10983	null
2025-11-14	A proposal to construct the dark-matter-only counterpart of the observed Universe combining weak lensing and baryon censuses	Shuren Zhou et.al.	2511.10975	null
2025-11-14	Heterogeneous Complementary Distillation	Liuchi Xu et.al.	2511.10942	null
2025-11-14	A Compilation Framework for Quantum Circuits with Mid-Circuit Measurement Error Awareness	Ming Zhong et.al.	2511.10921	null
2025-11-13	The Map of Misbelief: Tracing Intrinsic and Extrinsic Hallucinations Through Attention Patterns	Elyes Hajji et.al.	2511.10837	null
2025-11-13	Neural Local Wasserstein Regression	Inga Girshfeld et.al.	2511.10824	null
2025-11-13	Universal Thermodynamic Uncertainty Relation for Quantum $f$-Divergences	Domingos S. P. Salazar et.al.	2511.10817	null
2025-11-13	Transformers know more than they can tell – Learning the Collatz sequence	François Charton et.al.	2511.10811	null
2025-11-13	Semantic Property Maps for Driving Applications	Marcus Greiff et.al.	2511.10798	null
2025-11-13	Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces	Farhan Sheth et.al.	2511.10793	null
2025-11-13	Domination between non-Fuchsian representations and anti-de Sitter geometry	Farid Diaf et.al.	2511.10570	null
2025-11-13	OmniVGGT: Omni-Modality Driven Visual Geometry Grounded	Haosong Peng et.al.	2511.10560	link
2025-11-13	A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space	Huijie Liu et.al.	2511.10555	link
2025-11-13	Noncommutative tensor triangular geometry: modules, bimodules, and unipotent Hopf algebras	Øyvind Solberg et.al.	2511.10531	null
2025-11-13	Learning Post-Newtonian Corrections from Numerical Relativity	Jooheon Yoo et.al.	2511.10522	null
2025-11-13	LLM-YOLOMS: Large Language Model-based Semantic Interpretation and Fault Diagnosis for Wind Turbine Components	Yaru Li et.al.	2511.10394	null
2025-11-13	Modeling Layout Abstractions Using Integer Set Relations	Somashekaracharya G Bhaskaracharya et.al.	2511.10374	null
2025-11-13	Ancilla-Free Fast-Forwarding Lindbladian Simulation Algorithms by Hamiltonian Twirling	Minbo Gao et.al.	2511.10253	null
2025-11-13	VISTA: A Vision and Intent-Aware Social Attention Framework for Multi-Agent Trajectory Prediction	Stephane Da Silva Martins et.al.	2511.10203	null
2025-11-13	GPR: Towards a Generative Pre-trained One-Model Paradigm for Large-Scale Advertising Recommendation	Jun Zhang et.al.	2511.10138	link
2025-11-13	Tailored Three Dimensional Betatron Dynamics in UltraStable Hybrid Laser Plasma RF Accelerators	A. A. Molavi Choobini et.al.	2511.10096	null
2025-11-13	Tree-Based Stochastic Optimization for Solving Large-Scale Urban Network Security Games	Shuxin Zhuang et.al.	2511.10072	null
2025-11-13	DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection	Feiyang Jia et.al.	2511.10035	null
2025-11-13	Convergent series of Stokes wave of arbitrary height in deep water via machine learning	Chong Lin et.al.	2511.09927	null
2025-11-13	Compensating Distribution Drifts in Class-incremental Learning of Pre-trained Vision Transformers	Xuan Rao et.al.	2511.09926	null
2025-11-12	A Smooth Penalty-Based Feedback Law for Reactive Obstacle Avoidance with Convergence Guarantees	Lyes Smaili et.al.	2511.09799	null
2025-11-12	Towards model-free stellar chemical abundances. Potential applications in the search for chemically peculiar stars in large spectroscopic surveys	Theosamuele Signor et.al.	2511.09733	null
2025-11-12	Efficient Hyperdimensional Computing with Modular Composite Representations	Marco Angioli et.al.	2511.09708	null
2025-11-12	Entanglement, Yang-Mills, and the Scattering Matrix as an SU(N)-equivariant Kernel	Kun-Feng Lyu et.al.	2511.09623	null
2025-11-12	Probing the Critical Behavior of a Sign-Problematic Model with Monte Carlo Simulations	Ye Ling et.al.	2511.09356	null
2025-11-05	SENT Map – Semantically Enhanced Topological Maps with Foundation Models	Raj Surya Rajendran Kathirvel et.al.	2511.03165	null
2025-11-03	An Adjoint Method for Differentiable Fluid Simulation on Flow Maps	Zhiqi Li et.al.	2511.01259	null
2025-11-15	OmniNWM: Omniscient Driving Navigation World Models	Bohan Li et.al.	2510.18313	null
2025-10-17	HEADER: Hierarchical Robot Exploration via Attention-Based Deep Reinforcement Learning with Expert-Guided Reward	Yuhong Cao et.al.	2510.15679	null
2025-11-13	UniGS: Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering	Yusen Xie et.al.	2510.12174	null
2025-10-13	ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training	Leonard Bruns et.al.	2510.11605	null
2025-10-10	Robust Visual Teach-and-Repeat Navigation with Flexible Topo-metric Graph Map Representation	Jikai Wang et.al.	2510.09089	null
2025-10-06	OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS	Simon Boche et.al.	2510.04612	null
2025-10-05	Constructing coherent spatial memory in LLM agents through graph rectification	Puzhen Zhang et.al.	2510.04195	null
2025-10-01	A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features	Axel Barroso-Laguna et.al.	2510.00978	null
2025-09-30	Updates to the WFC3/UVIS Saturation Map	Mitchell Revalski et.al.	2510.00097	null
2025-09-30	Memory-Efficient 2D/3D Shape Assembly of Robot Swarms	Shuoyu Yue et.al.	2509.26518	null
2025-09-30	Classical feature map surrogates and metrics for quantum control landscapes	Martino Calzavara et.al.	2509.25930	null
2025-09-25	Neural Integrated Sensing and Communication for the MIMO-OFDM Downlink	Ziyi Wang et.al.	2509.21118	null
2025-09-18	Semantic-LiDAR-Inertial-Wheel Odometry Fusion for Robust Localization in Large-Scale Dynamic Environments	Haoxuan Jiang et.al.	2509.14999	null
2025-09-17	Charting trajectories of human thought using large language models	Matthew M Nour et.al.	2509.14455	null
2025-10-30	FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph	Xiaolin Zhou et.al.	2509.13733	null
2025-09-16	Maps for Autonomous Driving: Full-process Survey and Frontiers	Pengxin Chen et.al.	2509.12632	null
2025-09-15	Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing	Bingyu Li et.al.	2509.12040	null
2025-09-11	ObjectReact: Learning Object-Relative Control for Visual Navigation	Sourav Garg et.al.	2509.09594	null
2025-09-01	Hierarchical Motion Captioning Utilizing External Text Data Source	Clayton Leite et.al.	2509.01471	null
2025-08-19	MMIS-Net for Retinal Fluid Segmentation and Detection	Nchongmaje Ndipenocha et.al.	2508.13936	null
2025-08-03	DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion	Zhigang Sun et.al.	2508.01778	null
2025-07-29	MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving	Thomas Monninger et.al.	2507.21423	null
2025-09-15	RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction	Yuqing Lan et.al.	2507.17594	null
2025-07-15	Mapping Fusion: Improving FPGA Technology Mapping with ASIC Mapper	Cunxi Yu et.al.	2507.10912	null
2025-07-07	Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR	Tao Du et.al.	2507.04662	null
2025-07-11	Learning to Generate Vectorized Maps at Intersections with Multiple Roadside Cameras	Quanxin Zheng et.al.	2507.02899	null
2025-06-27	Norm-dependent Lamperti-type MAP representations of stable processes and Brownian motions in the orthant	Andreas E. Kyprianou et.al.	2506.22020	null
2025-06-26	CURL-SLAM: Continuous and Compact LiDAR Mapping	Kaicheng Zhang et.al.	2506.21077	null
2025-06-25	Communication-Aware Map Compression for Online Path-Planning: A Rate-Distortion Approach	Ali Reza Pedram et.al.	2506.20579	null
2025-07-16	Cross-Layer Discrete Concept Discovery for Interpreting Language Models	Ankur Garg et.al.	2506.20040	null
2025-06-17	TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping	Jeewon Kim et.al.	2506.14178	null
2025-06-16	Complexity of Coexistence Regions in the GRHT Map	Sishu Shankar Muni et.al.	2506.13515	null
2025-06-09	ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving	Yongkang Li et.al.	2506.08052	null
2025-06-07	Multimodal Spatial Language Maps for Robot Navigation and Manipulation	Chenguang Huang et.al.	2506.06862	null
2025-08-19	Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU	Wenhao Dai et.al.	2506.06095	null
2025-07-31	X-ray Polarization Detection of the Pulsar Wind Nebula in G21.5-0.9 with IXPE	Niccolò Di Lalla et.al.	2506.05630	null
2025-08-13	DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes	Jiajun Jiang et.al.	2506.01950	null
2025-06-05	ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real	Youwei Yu et.al.	2506.01759	null
2025-06-01	Globally Consistent RGB-D SLAM with 2D Gaussian Splatting	Xingguang Zhong et.al.	2506.00970	null
2025-05-29	Bridging Scales in Map Generation: A scale-aware cascaded generative mapping framework for seamless and consistent multi-scale cartographic representation	Chenxing Sun et.al.	2502.04991	null
2025-02-10	MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction	Xiaoshuai Hao et.al.	2502.04377	null
2025-02-07	Construction of an invertible mapping to boundary conforming coordinates for arbitrarily shaped toroidal domains	Robert Babin et.al.	2411.04683	null
2024-12-30	Local Map Construction with SDMap: A Comprehensive Survey	Jiaqi Li et.al.	2409.02415	null
2024-10-15	MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping	Jiacheng Chen et.al.	2403.15951	null
2023-06-08	NeMO: Neural Map Growing System for Spatiotemporal Fusion in Bird’s-Eye-View and BDD-Map Benchmark	Xi Zhu et.al.	2306.04540	null
2023-03-07	Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit Representation	Xingrui Yang et.al.	2210.15858	null
2022-10-17	Fast genomic optical map assembly algorithm using binary representation	Przemysław Stawczyk et.al.	2210.06865	null
2023-08-17	Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping	Bodong Zhou et.al.	2208.11061	link
2023-05-18	LiDAR Road-Atlas: An Efficient Map Representation for General 3D Urban Environment	Banghe Wu et.al.	2204.05727	link
2023-08-08	NeuralBlox: Real-Time Neural Representation Fusion for Robust Volumetric Mapping	Stefan Lionar et.al.	2110.09415	null
2021-04-08	VGF-Net: Visual-Geometric Fusion Learning for Simultaneous Drone Navigation and Height Mapping	Yilin Liu et.al.	2104.03109	null
2022-09-23	Distributed Dynamic Map Fusion via Federated Learning for Intelligent Networked Vehicles	Zijian Zhang et.al.	2103.03786	null
2020-03-13	Learning word-referent mappings and concepts from raw inputs	Wai Keen Vong et.al.	2003.05573	null
2019-08-01	Recovery Map for Fermionic Gaussian Channels	Brian Swingle et.al.	1811.04956	null
2017-11-15	Finiteness of Mapping Class Group Representations from Twisted Dijkgraaf-Witten Theory	Paul Gustafson et.al.	1610.06069	null
2017-10-18	The moment map on symplectic vector space and oscillator representation	Takashi Hashimoto et.al.	1408.6597	null
2015-06-15	Maximum Likelihood Fusion of Stochastic Maps	Brandon Jones et.al.	1303.6170	null
2008-03-13	Quantum Reference Frames and the Classification of Rotationally-Invariant Maps	J. -C. Boileau et.al.	0709.0142	null

(<a href=#updated-on-20260429>back to top</a>)

Non-rigid Registration

Publish Date	Title	Authors	PDF	Code
2026-04-16	One-shot Compositional 3D Head Avatars with Deformable Hair	Yuan Sun et.al.	2604.14782	null
2026-03-22	Domain Elastic Transform: Bayesian Function Registration for High-Dimensional Scientific Data	Osamu Hirose et.al.	2603.21235	null
2026-03-20	Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement	Chunlei Zhang et.al.	2603.19623	null
2026-03-02	Preoperative-to-intraoperative Liver Registration for Laparoscopic Surgery via Latent-Grounded Correspondence Constraints	Ruize Cui et.al.	2603.01720	null
2026-01-22	Coarse-to-Fine Non-rigid Multi-modal Image Registration for Historical Panel Paintings based on Crack Structures	Aline Sindel et.al.	2601.16348	null
2026-01-09	Deformation-Aware Observation Modeling for Radar-Based Human Sensing via 3D Scan-Depth Sequence Fusion	Guangqi Shi et.al.	2601.05676	null
2025-12-27	Multimodal Diffeomorphic Registration with Neural ODEs and Structural Descriptors	Salvador Rodriguez-Sanz et.al.	2512.22689	null
2025-12-16	Test Time Optimized Generalized AI-based Medical Image Registration Method	Sneha Sree C. et.al.	2512.14556	null
2025-12-01	Robust Rigid and Non-Rigid Medical Image Registration Using Learnable Edge Kernels	Ahsan Raza Siyal et.al.	2512.01771	null
2025-11-19	Coarse-to-Fine Non-Rigid Registration for Side-Scan Sonar Mosaicking	Can Lei et.al.	2512.00052	null
2025-11-06	Systematic Evaluation of Preprocessing Techniques for Accurate Image Registration in Digital Pathology	Fatemehzahra Darzi et.al.	2511.04171	null
2025-11-25	CORE – A Cell-Level Coarse-to-Fine Image Registration Engine for Multi-stain Image Alignment	Esha Sadia Nasir et.al.	2511.03826	null
2025-12-06	Structural Stress as a Predictor of the Rate and Spatial Location of Aortic Growth in Uncomplicated Type B Aortic Dissection	Yuhang Du et.al.	2511.03287	null
2025-10-30	Simultaneous optimization of non-coplanar beam orientations and cumulative EQD2 distribution for high-dose reirradiation of locoregionally recurrent non-small cell lung cancer	Nathan Torelli et.al.	2510.26272	null
2025-10-21	MorphModes: Non-rigid Registration via Adaptive Skinning Eigenmodes	Gabrielle Browne et.al.	2510.18658	null
2025-10-17	ERNet: Efficient Non-Rigid Registration Network for Point Sequences	Guangzhao He et.al.	2510.15800	null
2026-02-24	A geometric feature tracking approach for noninvasive patient specific estimation of leaflet strain from 3D images of heart valves	Wensi Wu et.al.	2510.06578	null
2025-09-12	Human Body Segment Volume Estimation with Two RGB-D Cameras	Giulia Bassani et.al.	2509.10429	null
2025-09-09	A Comprehensive Pipeline for Aortic Segmentation and Shape Analysis	Nairouz Shehata et.al.	2509.09718	null
2025-08-19	Shape-from-Template with Generalised Camera	Agniva Sengupta et.al.	2508.13791	null
2025-08-24	FractMorph: A Fractional Fourier-Based Multi-Domain Transformer for Deformable Image Registration	Shayan Kebriti et.al.	2508.12445	null
2025-07-27	PIVOTS: Aligning unseen Structures using Preoperative to Intraoperative Volume-To-Surface Registration for Liver Navigation	Peng Liu et.al.	2507.20337	null
2025-07-10	X-RAFT: Cross-Modal Non-Rigid Registration of Blue and White Light Neurosurgical Hyperspectral Images	Charlie Budd et.al.	2507.07747	null
2025-11-12	Geo-Registration of Terrestrial LiDAR Point Clouds with Satellite Images without GNSS	Xinyu Wang et.al.	2507.05999	null
2025-07-06	Robot-assisted Transcranial Magnetic Stimulation (Robo-TMS): A Review	Wenzhi Bai et.al.	2507.04345	null
2025-07-28	ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction	Juming Xiong et.al.	2506.21923	null
2025-06-26	A Novel Framework for Integrating 3D Ultrasound into Percutaneous Liver Tumour Ablation	Shuwei Xing et.al.	2506.21162	null
2025-05-29	VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration	Ben Li et.al.	2505.23439	null
2025-05-28	NFR: Neural Feature-Guided Non-Rigid Shape Registration	Puhua Jiang et.al.	2505.22445	null
2025-05-19	GuidedMorph: Two-Stage Deformable Registration for Breast MRI	Yaqian Chen et.al.	2505.13414	null
2025-05-28	GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats	Simeon Adebola et.al.	2505.10923	link
2025-04-21	Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection	Jun Zhou et.al.	2504.15152	null
2025-04-11	X2BR: High-Fidelity 3D Bone Reconstruction from a Planar X-Ray Image with Hybrid Neural Implicit Methods	Gokce Guven et.al.	2504.08675	null
2025-03-22	MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability	Paul Hill et.al.	2503.17700	null
2025-02-26	An anatomically-informed correspondence initialisation method to improve learning-based registration for radiotherapy	Edward G. A. Henderson et.al.	2502.19101	null
2025-02-15	Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy	Mingyang Zhao et.al.	2502.10704	null
2025-04-03	MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography	Xinkai Wang et.al.	2502.08786	null
2025-01-10	A Steerable Deep Network for Model-Free Diffusion MRI Registration	Gianfranco Cortes et.al.	2501.04794	null
2024-10-31	UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration	Geng Li et.al.	2410.22909	null
2025-06-06	SynBench: A Synthetic Benchmark for Non-rigid 3D Point Cloud Registration	Sara Monji-Azad et.al.	2409.14474	null
2025-10-01	SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration	Yuxin Yao et.al.	2405.20188	link
2024-04-19	DeeperHistReg: Robust Whole Slide Images Registration Framework	Marek Wodzinski et.al.	2404.14434	null
2024-04-26	RegWSI: Whole Slide Image Registration using Combined Deep Feature- and Intensity-Based Methods: Winner of the ACROBAT 2023 Challenge	Marek Wodzinski et.al.	2404.13108	null
2024-01-05	Partition-based Nonrigid Registration for 3D Face Model	Yuping Ye et.al.	2401.02607	null
2023-03-20	Deep Graph-based Spatial Consistency for Robust Non-rigid Point Cloud Registration	Zheng Qin et.al.	2303.09950	link
2023-02-21	Fast and Robust Non-Rigid Registration Using Accelerated Majorization-Minimization	Yuxin Yao et.al.	2206.03410	null
2022-10-06	Non-rigid Point Cloud Registration with Neural Deformation Pyramid	Yang Li et.al.	2205.12796	null
2022-05-21	Myocardial Segmentation of Late Gadolinium Enhanced MR Images by Propagation of Contours from Cine MR Images	Dong Wei et.al.	2205.10595	null
2022-03-18	A Survey of Non-Rigid 3D Registration	Bailin Deng et.al.	2203.07858	null
2021-12-23	Geodesic squared exponential kernel for non-rigid shape registration	Florent Jousse et.al.	2112.11853	null
2021-04-27	Deep Convolutional Neural Network for Non-rigid Image Registration	Eduard F. Durech et.al.	2104.12034	null
2020-06-15	Nonrigid registration using Gaussian processes and local likelihood estimation	Ashton Wiens et.al.	2006.06864	null
2020-07-28	Cortical surface registration using unsupervised learning	Jieyu Cheng et.al.	2004.04617	null
2020-04-10	Quasi-Newton Solver for Robust Non-Rigid Registration	Yuxin Yao et.al.	2004.04322	link
2020-01-14	A Comparative Study for Non-rigid Image Registration and Rigid Image Registration	Xiaoran Zhang et.al.	2001.03831	null
2019-04-02	Automatic Nonrigid Histological Image Registration with Adaptive Multistep Algorithm	Marek Wodzinski et.al.	1904.00982	null
2019-04-07	Symmetry-guided nonrigid registration: the case for distortion correction in multidimensional photoemission spectroscopy	Rui Patrick Xian et.al.	1901.00312	null
2018-12-25	A Survey on Non-rigid 3D Shape Analysis	Hamid Laga et.al.	1812.10111	null
2019-06-20	Robust Non-Rigid Registration with Reweighted Position and Transformation Sparsity	Kun Li et.al.	1703.04861	null
2015-04-15	A Multicomponent Approach to Nonrigid Registration of Diffusion Tensor Images	Mohammed Khader et.al.	1504.01800	null
2014-03-27	Optimized imaging using non-rigid registration	Benjamin Berkels et.al.	1403.6774	null
2013-04-03	Scale Selection of Adaptive Kernel Regression by Joint Saliency Map for Nonrigid Image Registration	Zhuangming Shen et.al.	1303.0479	null
2013-04-15	Local Structure Matching Driven by Joint-Saliency-Structure Adaptive Kernel Regression	Binjie Qin et.al.	1302.0494	null
2011-04-22	A Meshless Method for Variational Nonrigid 2-D Shape Registration	Wei Liu et.al.	1104.4168	null

(<a href=#updated-on-20260429>back to top</a>)

MoE

Publish Date	Title	Authors	PDF	Code
2026-04-28	Marco-MoE: Open Multilingual Mixture-of-Expert Language Models with Efficient Upcycling	Fan Jiang et.al.	2604.25578	null
2026-04-28	The Attention Market: Interpreting Online Fair Re-ranking as Manifold Optimization under Walrasian Equilibrium	Chen Xu et.al.	2604.25577	null
2026-04-28	SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton	Xuzheng He et.al.	2604.25498	null
2026-04-28	The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents	Yuwei Sun et.al.	2604.25299	null
2026-04-28	CroSearch-R1: Better Leveraging Cross-lingual Knowledge for Retrieval-Augmented Generation	Rui Qi et.al.	2604.25182	null
2026-04-27	Power Foam: Unifying Real-Time Differentiable Ray Tracing and Rasterization	Shrisudhan Govindarajan et.al.	2604.24994	null
2026-04-27	Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity	Bojie Li et.al.	2604.24827	null
2026-04-27	SWE-QA: A Dataset and Benchmark for Complex Code Understanding	Laïla Elkoussy et.al.	2604.24814	null
2026-04-28	Agent-Centric Visual Reinforcement Learning under Dynamic Perturbations	Zhengru Fang et.al.	2604.24661	null
2026-04-27	Cortex-Inspired Continual Learning: Unsupervised Instantiation and Recovery of Functional Task Networks	Kevin McKee et.al.	2604.24637	null
2026-04-27	Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models	Yuxing Tian et.al.	2604.24608	null
2026-04-27	Vib2Conf: AI-driven discrimination of molecular conformations from vibrational spectra	Xin-Yu Lu et.al.	2604.24310	null
2026-04-27	SVOM/C-GFT: Instrumentation and Performances on the SVOM Alerts	Chao Wu et.al.	2604.24272	null
2026-04-27	SVOM/VT: On-ground processing of VT-VHF data	Chao Wu et.al.	2604.24271	null
2026-04-27	SVOM/VT: Overview of data processing and GRB identifications with X-band data	Hua-Li Li et.al.	2604.24266	null
2026-04-27	SVOM Science User Support Services at Chinese Science Center	Xu-hui Han et.al.	2604.24251	null
2026-04-27	Defusing the Trigger: Plug-and-Play Defense for Backdoored LLMs via Tail-Risk Intrinsic Geometric Smoothing	Kaisheng Fan et.al.	2604.24162	null
2026-04-27	SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs	Zi-Hao Bo et.al.	2604.23996	null
2026-04-27	LearnPruner: Rethinking Attention-based Token Pruning in Vision Language Models	Rinyoichi Takezoe et.al.	2604.23950	null
2026-04-26	AMAVA: Adaptive Motion-Aware Video-to-Audio Framework for Visually-Impaired Assistance	Benjamin Klein et.al.	2604.23909	null
2026-04-26	Transformer as an Euler Discretization of Score-based Variational Flow	Huadong Liao et.al.	2604.23740	null
2026-04-26	MetaGAI: A Large-Scale and High-Quality Benchmark for Generative AI Model and Data Card Generation	Haoxuan Zhang et.al.	2604.23539	null
2026-04-25	Scaling Multi-Node Mixture-of-Experts Inference Using Expert Activation Patterns	Abhimanyu Bambhaniya et.al.	2604.23150	null
2026-04-25	Mixture of Heterogeneous Grouped Experts for Language Modeling	Zhicheng Ma et.al.	2604.23108	null
2026-04-24	Preserving Long-Tailed Expert Information in Mixture-of-Experts Tuning	Haoze He et.al.	2604.23036	null
2026-04-24	Synchrotron polarization of anisotropic electron distribution in GRB prompt emission	Kang-Fa Cheng et.al.	2604.22598	null
2026-04-24	Towards Adaptive Continual Model Merging via Manifold-Aware Expert Evolution	Haiyun Qiu et.al.	2604.22464	null
2026-04-24	The Cathaya argyrophylla Genome Reveals the Evolutionary Trade-offs of a Living Fossil	Yun Wang et.al.	2604.22440	null
2026-04-24	QAssemble: A Pure Python Package for Quantum Many-Body Theory	Seongjun Mo et.al.	2604.22223	null
2026-04-23	Direct observation of surface bandgap shrinkage and negative electronic compressibility in SrTiO3	Warakorn Jindata et.al.	2604.21783	null
2026-04-23	Rethinking Cross-Domain Evaluation for Face Forgery Detection with Semantic Fine-grained Alignment and Mixture-of-Experts	Yuhan Luo et.al.	2604.21478	null
2026-04-23	Decoupled DiLoCo for Resilient Distributed Pre-training	Arthur Douillard et.al.	2604.21428	null
2026-04-23	Teacher-Guided Routing for Sparse Vision Mixture-of-Experts	Masahiro Kada et.al.	2604.21330	null
2026-04-23	Enhancing Online Recruitment with Category-Aware MoE and LLM-based Data Augmentation	Minping Chen et.al.	2604.21264	null
2026-04-22	LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model	Inclusion AI et.al.	2604.20796	null
2026-04-22	On Bayesian Softmax-Gated Mixture-of-Experts Models	Nicola Bariletto et.al.	2604.20551	null
2026-04-22	XRF 241001A/SN 2024aiiq: A Faint Soft X-ray Transient Detected by SVOM with a Broad-Line Type Ic Supernova Revealed by JWST	B. Schneider et.al.	2604.20346	null
2026-04-22	MD-Face: MoE-Enhanced Label-Free Disentangled Representation for Interactive Facial Attribute Editing	Xuan Cui et.al.	2604.20317	null
2026-04-22	Multi-Perspective Evidence Synthesis and Reasoning for Unsupervised Multimodal Entity Linking	Mo Zhou et.al.	2604.20283	null
2026-04-22	All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG	Dan Wang et.al.	2604.20199	null
2026-04-22	Aligning Human-AI-Interaction Trust for Mental Health Support: Survey and Position for Multi-Stakeholders	Xin Sun et.al.	2604.20166	null
2026-04-22	Temporally Extended Mixture-of-Experts Models	Zeyu Shen et.al.	2604.20156	null
2026-04-21	Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts	Chaitanya Dwivedi et.al.	2604.19835	null
2026-04-21	FEPLB: Exploiting Copy Engines for Nearly Free MoE Load Balancing in Distributed Training	Shuyao Qi et.al.	2604.19654	null
2026-04-21	CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation	Xiangyang Luo et.al.	2604.19636	null
2026-04-21	LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction	Jiakai Tang et.al.	2604.19550	null
2026-04-22	ReaLB: Real-Time Load Balancing for Multimodal MoE Inference	Yingping Wang et.al.	2604.19503	null
2026-04-21	Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input	Michael Ziegltrum et.al.	2604.19344	null
2026-04-21	UniEP: Unified Expert-Parallel MoE MegaKernel for LLM Training	Size Zheng et.al.	2604.19241	null
2026-04-21	SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning	Boyan Shi et.al.	2604.19048	null
2026-04-21	STK-Adapter: Incorporating Evolving Graph and Event Chain for Temporal Knowledge Graph Extrapolation	Shuyuan Zhao et.al.	2604.19042	null
2026-04-20	Multi-Domain Learning with Global Expert Mapping	Pourya Shamsolmoali et.al.	2604.18842	null
2026-04-20	Efficient Mixture-of-Experts LLM Inference with Apple Silicon NPUs	Afsara Benazir et.al.	2604.18788	null
2026-04-20	CAHAL: Clinically Applicable resolution enHAncement for Low-resolution MRI scans	Sergio Morell-Ortega et.al.	2604.18781	null
2026-04-20	A multimodal and temporal foundation model for virtual patient representations at healthcare system scale	Andrew Zhang et.al.	2604.18570	null
2026-04-20	GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling	Alireza Dadgarnia et.al.	2604.18556	null
2026-04-20	SemLT3D: Semantic-Guided Expert Distillation for Camera-only Long-Tailed 3D Object Detection	Hao Vo et.al.	2604.18476	null
2026-04-20	Train Separately, Merge Together: Modular Post-Training with Mixture-of-Experts	Jacob Morrison et.al.	2604.18473	null
2026-04-20	Domain-Specialized Object Detection via Model-Level Mixtures of Experts	Svetlana Pavlitska et.al.	2604.18256	null
2026-04-20	WiFo-MiSAC: A Wireless Foundation Model for Multimodal Sensing and Communication Integration via Synesthesia of Machines (SoM)	Xuanyu Liu et.al.	2604.18255	null
2026-04-20	Multi-LLM Token Filtering and Routing for Sequential Recommendation	Wuhan Chen et.al.	2604.18200	null
2026-04-20	Audio-DeepThinker: Progressive Reasoning-Aware Reinforcement Learning for High-Quality Chain-of-Thought Emergence in Audio Language Models	Xiang He et.al.	2604.18187	null
2026-04-20	RASP-Tuner: Retrieval-Augmented Soft Prompts for Context-Aware Black-Box Optimization in Non-Stationary Environments	Enze Pan et.al.	2604.18026	null
2026-04-20	MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene	Wenjie Mu et.al.	2604.17965	null
2026-04-20	Polysemantic Experts, Monosemantic Paths: Routing as Control in MoEs	Charles Ye et.al.	2604.17837	null
2026-04-20	MoE-nD: Per-Layer Mixture-of-Experts Routing for Multi-Axis KV Cache Compression	Libo Sun et.al.	2604.17695	null
2026-04-20	A Hamilton-Jacobi Reachability-Guided Search Framework for Efficient and Safe Indoor Planar Robot Navigation	Hanyang Hu et.al.	2604.17679	null
2026-04-19	Representation-Guided Parameter-Efficient LLM Unlearning	Zeguan Xiao et.al.	2604.17396	null
2026-04-19	When Text Hijacks Vision: Benchmarking and Mitigating Text Overlay-Induced Hallucination in Vision Language Models	Cui Yakun et.al.	2604.17375	null
2026-04-19	Poisson Flow Model of Cortical Folding Pattern	Moo K. Chung et.al.	2604.17291	null
2026-04-19	From Language to Action: Enhancing LLM Task Efficiency with Task-Aware MCP Server Recommendation	Shiyu He et.al.	2604.17234	null
2026-04-19	Cloud-native and Distributed Systems for Efficient and Scalable Large Language Models – A Research Agenda	Minxian Xu et.al.	2604.17227	null
2026-04-19	Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy	Shun-ichiro Hayashi et.al.	2604.17182	null
2026-04-18	Causality as a Minimum Energy Principle	Moo K. Chung et.al.	2604.17151	null
2026-04-18	IMA-MoE: An Interpretable Modality-Aware Mixture-of-Experts Framework for Characterizing the Neurobiological Signatures of Binge Eating Disorder	Lin Zhao et.al.	2604.17028	null
2026-04-18	D-QRELO: Training- and Data-Free Delta Compression for Large Language Models via Quantization and Residual Low-Rank Approximation	Junlin Li et.al.	2604.16940	null
2026-04-18	CoGR-MoE: Concept-Guided Expert Routing with Consistent Selection and Flexible Reasoning for Visual Question Answering	Xiyin Zeng et.al.	2604.16930	null
2026-04-17	Towards Trustworthy Depression Estimation via Disentangled Evidential Learning	Fangyuan Liu et.al.	2604.16579	null
2026-04-17	FL-MHSM: Spatially-adaptive Fusion and Ensemble Learning for Flood-Landslide Multi-Hazard Susceptibility Mapping at Regional Scale	Aswathi Mundayatt et.al.	2604.16265	null
2026-04-17	Joint-Centric Dual Contrastive Alignment with Structure-Preserving and Information-Balanced Regularization	Habibeh Naderi et.al.	2604.16247	null
2026-04-17	MOMENTA: Mixture-of-Experts Over Multimodal Embeddings with Neural Temporal Aggregation for Misinformation Detection	Yeganeh Abdollahinejad et.al.	2604.16172	null
2026-04-17	Breaking the Training Barrier of Billion-Parameter Universal Machine Learning Interatomic Potentials	Yuanchang Zhou et.al.	2604.15821	null
2026-04-17	Qwen3.5-Omni Technical Report	Qwen Team et.al.	2604.15804	null
2026-04-16	Electronic Signature of Melting Onset in Polycrystalline Copper at Extreme Conditions	Edna R. Toro et.al.	2604.15491	null
2026-04-16	StoSignSGD: Unbiased Structural Stochasticity Fixes SignSGD for Training Large Language Models	Dingzhi Yu et.al.	2604.15416	null
2026-04-16	OmniLight: One Model to Rule All Lighting Conditions	Youngjin Oh et.al.	2604.15170	null
2026-04-16	Towards Faster Language Model Inference Using Mixture-of-Experts Flow Matching	Aihua Li et.al.	2604.15009	null
2026-04-16	Switching Efficiency: A Novel Framework for Dissecting AI Data Center Network Efficiency	Niangen Ye et.al.	2604.14690	null
2026-04-16	ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving	Yuseon Choi et.al.	2604.14626	null
2026-04-16	WILD-SAM: Phase-Aware Expert Adaptation of SAM for Landslide Detection in Wrapped InSAR Interferograms	Yucheng Pan et.al.	2604.14540	null
2026-04-16	Geometric Metrics for MoE Specialization: From Fisher Information to Early Failure Detection	Dongxin Guo et.al.	2604.14500	null
2026-04-15	Geometric Routing Enables Causal Expert Control in Mixture of Experts	Ivan Ternovtsii et.al.	2604.14434	null
2026-04-15	Equifinality in Mixture of Experts: Routing Topology Does Not Determine Language Modeling Quality	Ivan Ternovtsii et.al.	2604.14419	null
2026-04-15	Awakening Dormant Experts: Counterfactual Routing to Mitigate MoE Hallucinations	Wentao Hu et.al.	2604.14246	null
2026-04-15	Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation	Svetlana Pavlitska et.al.	2604.13761	null
2026-04-17	Enhancing Mixture-of-Experts Specialization via Cluster-Aware Upcycling	Sanghyeok Chu et.al.	2604.13508	null
2026-04-15	Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning	Shentong Mo et.al.	2604.13504	null
2026-04-14	PolicyLLM: Towards Excellent Comprehension of Public Policy for Large Language Models	Han Bao et.al.	2604.12995	null
2026-04-14	Tree Learning: A Multi-Skill Continual Learning Framework for Humanoid Robots	Yifei Yan et.al.	2604.12909	null
2026-04-14	Stable Fine-Time-Step Long-Horizon Turbulence Prediction with a Multi-Stepsize Mixture-of-Experts Neural Operator	Guanyu Pan et.al.	2604.12794	null
2026-04-14	AffectAgent: Collaborative Multi-Agent Reasoning for Retrieval-Augmented Multimodal Emotion Recognition	Zeheng Wang et.al.	2604.12735	null
2026-04-14	Brain-DiT: A Universal Multi-state fMRI Foundation Model with Metadata-Conditioned Pretraining	Junfeng Xia et.al.	2604.12683	null
2026-04-15	Observation of the Exotic State $π_{1}(1600)$ in $ψ(2S)\rightarrow γχ_{c1}, χ_{c1}\rightarrow π^{+}π^{-}η'$	BESIII Collaboration et.al.	2604.12524	null
2026-04-14	SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker	Junbin Su et.al.	2604.12502	null
2026-04-14	Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning	NVIDIA et.al.	2604.12374	null
2026-04-14	Nucleus-Image: Sparse MoE for Image Generation	Chandan Akiti et.al.	2604.12163	null
2026-04-13	TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction	Seungik Cho et.al.	2604.12026	null
2026-04-14	Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale	Liujie Zhang et.al.	2604.11554	null
2026-04-13	Learning How Much to Think: Difficulty-Aware Dynamic MoEs for Graph Node Classification	Jiajun Zhou et.al.	2604.11473	null
2026-04-14	Judge Like Human Examiners: A Weighted Importance Multi-Point Evaluation Framework for Generative Tasks with Long-form Answers	Guoxin Yu et.al.	2604.11246	null
2026-04-13	Sparse Hypergraph-Enhanced Frame-Event Object Detection with Fine-Grained MoE	Wei Bao et.al.	2604.11140	null
2026-04-13	Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds	Pierre Jourlin et.al.	2604.11104	null
2026-04-13	Quantitative propagation of chaos for particle systems with bounded kernels and multiplicative noise	Ning Jiang et.al.	2604.11084	null
2026-04-12	MoEITS: A Green AI approach for simplifying MoE-LLMs	Luis Balderas et.al.	2604.10603	null
2026-04-12	WaveMoE: A Wavelet-Enhanced Mixture-of-Experts Foundation Model for Time Series Forecasting	Shunyu Wu et.al.	2604.10544	null
2026-04-12	Measurement of the branching fractions of $χ_{cJ} \to π^{+}π^{-}π^{0}π^{0}$ via $ψ(3686) \to γχ_{cJ}$	BESIII Collaboration et.al.	2604.10523	null
2026-04-12	How Many Tries Does It Take? Iterative Self-Repair in LLM Code Generation Across Model Scales and Benchmarks	Johin Johny Arimbur et.al.	2604.10508	null
2026-04-12	CodeQuant: Unified Clustering and Quantization for Enhanced Outlier Smoothing in Low-Precision Mixture-of-Experts	Xiangyang Yin et.al.	2604.10496	null
2026-04-12	First Observation of $D^+ \to a_0(980)ρ$ and $D^+ \to a_0(980)^+ f_0(500)$ in $D^+ \to π^+π^+π^-η$ and $D^+ \to π^+π^0π^0η$ Decays	BESIII Collaboration et.al.	2604.10444	null
2026-04-11	DREAMuS: Dark matter REsearch with Advanced Muon Source	Xiang Chen et.al.	2604.10257	null
2026-04-11	Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis	Yang Yu et.al.	2604.10233	null
2026-04-11	SpecMoE: A Fast and Efficient Mixture-of-Experts Inference via Self-Assisted Speculative Decoding	Jehyeon Bang et.al.	2604.10152	null
2026-04-10	The Myth of Expert Specialization in MoEs: Why Routing Reflects Geometry, Not Necessarily Domain Expertise	Xi Wang et.al.	2604.09780	null
2026-04-10	SafeMind: A Risk-Aware Differentiable Control Framework for Adaptive and Safe Quadruped Locomotion	Zukun Zhang et.al.	2604.09474	null
2026-04-10	Compositional-Degradation UAV Image Restoration: Conditional Decoupled MoE Network and A Benchmark	Jinquan Yan et.al.	2604.09313	null
2026-04-10	Generalization and Scaling Laws for Mixture-of-Experts Transformers	Mansour Zoubeirou a Mayaki et.al.	2604.09175	null
2026-04-10	Text-Conditioned Multi-Expert Regression Framework for Fully Automated Multi-Abutment Design	Mianjie Zheng et.al.	2604.09047	null
2026-04-10	Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication Networks	Wen Qiu et.al.	2604.09028	null
2026-04-10	M-IDoL: Information Decomposition for Modality-Specific and Diverse Representation Learning in Medical Foundation Model	Yihang Liu et.al.	2604.08936	null
2026-04-10	StaRPO: Stability-Augmented Reinforcement Policy Optimization	Jinghan Zhang et.al.	2604.08905	null
2026-04-09	HiFloat4 Format for Language Model Pre-training on Ascend NPUs	Mehran Taghian et.al.	2604.08826	null
2026-04-09	Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts	Haolei Xu et.al.	2604.08541	null
2026-04-09	Lost in the Hype: Revealing and Dissecting the Performance Degradation of Medical Multimodal Large Language Models in Image Classification	Xun Zhu et.al.	2604.08333	null
2026-04-09	Towards Identification and Intervention of Safety-Critical Parameters in Large Language Models	Weiwei Qi et.al.	2604.08297	null
2026-04-09	SciFigDetect: A Benchmark for AI-Generated Scientific Figure Detection	You Hu et.al.	2604.08211	null
2026-04-09	Alloc-MoE: Budget-Aware Expert Activation Allocation for Efficient Mixture-of-Experts Inference	Baihui Liu et.al.	2604.08133	null
2026-04-09	Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator	Luozheng Qin et.al.	2604.08121	null
2026-04-09	HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation	Shuanghao Bai et.al.	2604.07993	null
2026-04-09	QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training–Inference Mismatch	Hao Gu et.al.	2604.07853	null
2026-04-09	Lightweight LLM Agent Memory with Small Language Models	Jiaquan Zhang et.al.	2604.07798	null
2026-04-09	Symbiotic-MoE: Unlocking the Synergy between Generation and Understanding	Xiangyue Liu et.al.	2604.07753	null
2026-04-08	From LLM to Silicon: RL-Driven ASIC Architecture Exploration for On-Device AI Inference	Ravindra Ganti et.al.	2604.07526	null
2026-04-08	SPAMoE: Spectrum-Aware Hybrid Operator Framework for Full-Waveform Inversion	Zhenyu Wang et.al.	2604.07421	null
2026-04-08	Region-Graph Optimal Transport Routing for Mixture-of-Experts Whole-Slide Image Classification	Xin Tian et.al.	2604.07298	null
2026-04-08	VersaVogue: Visual Expert Orchestration and Preference Alignment for Unified Fashion Synthesis	Jian Yu et.al.	2604.07210	null
2026-04-08	InfiniLoRA: Disaggregated Multi-LoRA Serving for Large Language Models	Hongyu Chen et.al.	2604.07173	null
2026-04-08	The Impact of Steering Large Language Models with Persona Vectors in Educational Applications	Yongchao Wu et.al.	2604.07102	null
2026-04-08	Gemma 4, Phi-4, and Qwen3: Accuracy-Efficiency Tradeoffs in Dense and MoE Reasoning Language Models	Md Motaleb Hossen Manik et.al.	2604.07035	null
2026-04-08	MoE Routing Testbed: Studying Expert Specialization and Routing Behavior at Small Scale	Tobias Falke et.al.	2604.07030	null
2026-04-08	Stress Estimation in Elderly Oncology Patients Using Visual Wearable Representations and Multi-Instance Learning	Ioannis Kyprakis et.al.	2604.06990	null
2026-04-08	MoBiE: Efficient Inference of Mixture of Binary Experts under Post-Training Quantization	Zhixiong Zhao et.al.	2604.06798	null
2026-04-08	HQF-Net: A Hybrid Quantum-Classical Multi-Scale Fusion Network for Remote Sensing Image Segmentation	Md Aminur Hossain et.al.	2604.06715	null
2026-04-08	Heterogeneous Mixture-of-Experts for Energy-Efficient Multimodal ISAC in Highly Mobile Networks	Wenqi Fan et.al.	2604.06697	null
2026-04-08	Foundry: Template-Based CUDA Graph Context Materialization for Fast LLM Serving Cold Start	Xueshen Liu et.al.	2604.06664	null
2026-04-08	Short proofs in combinatorics, probability and number theory II	Boris Alexeev et.al.	2604.06609	null
2026-04-08	Does a Global Perspective Help Prune Sparse MoEs Elegantly?	Zeliang Zhang et.al.	2604.06542	null
2026-04-07	Soft-Quantum Algorithms	Basil Kyriacou et.al.	2604.06523	null
2026-04-07	Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees	Mohammed Nowaz Rabbani Chowdhury et.al.	2604.06515	null
2026-04-07	State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation	Navan Preet Singh et.al.	2604.06421	null
2026-04-07	TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models	Lin Mu et.al.	2604.06291	null
2026-04-07	A Mixture of Experts Foundation Model for Scanning Electron Microscopy Image Analysis	Sk Miraj Ahmed et.al.	2604.05960	null
2026-04-07	Precise measurement of the CKM angle $γ$ with a novel approach	The BESIII et.al.	2604.05712	null
2026-04-08	QA-MoE: Towards a Continuous Reliability Spectrum with Quality-Aware Mixture of Experts for Robust Multimodal Sentiment Analysis	Yitong Zhu et.al.	2604.05704	null
2026-04-07	Measurement of the CKM angle $γ$ in $B^{\pm} \rightarrow D(\rightarrow K^{0}_{\rm S} h^{\prime+}h^{\prime-})h^{\pm}$ decays with a novel approach	The BESIII et.al.	2604.05701	null
2026-04-07	A Unified Foundation Model for All-in-One Multi-Modal Remote Sensing Image Restoration and Fusion with Language Prompting	Yongchuan Cui et.al.	2604.05629	null
2026-04-07	From Pixels to Personas: Tracking the Evolution of Anime Characters	Rongze Liu et.al.	2604.05507	null
2026-04-07	Task Ecologies and the Evolution of World-Tracking Representations in Large Language Models	Giulio Valentino Dalla Riva et.al.	2604.05469	null
2026-04-07	Do Domain-specific Experts exist in MoE-based LLMs?	Giang Do et.al.	2604.05267	null
2026-04-06	HI-MoE: Hierarchical Instance-Conditioned Mixture-of-Experts for Object Detection	Vadim Vashkelis et.al.	2604.04908	null
2026-04-06	LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection	Cheng Xu et.al.	2604.04815	null
2026-04-06	Galaxy Populations in Groups and Clusters: II. Conditional Luminosity Functions at Redshifts from z ~ 1 to z ~ 0	Ce Gao et.al.	2604.04794	null
2026-04-06	DeepStack: Scalable and Accurate Design Space Exploration for Distributed 3D-Stacked AI Accelerators	Zhiwen Mo et.al.	2604.04750	null
2026-04-06	Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale	Zhengcen Li et.al.	2604.04634	null
2026-04-06	Quantum-inspired Ising machine using sparsified spin connectivity	Moe Shimada et.al.	2604.04606	null
2026-04-06	REAM: Merging Improves Pruning of Experts in LLMs	Saurav Jha et.al.	2604.04356	null
2026-04-06	OmniSonic: Towards Universal and Holistic Audio Generation from Video and Text	Weiguo Pian et.al.	2604.04348	null
2026-04-05	3D-Stacked NMP, LLM Decoding, Systolic Array Microarchitecture, Multi-Core Scheduling	Chenyang Ai et.al.	2604.04253	null
2026-04-05	Three Phases of Expert Routing: How Load Balance Evolves During Mixture-of-Experts Training	Charafeddine Mouzouni et.al.	2604.04230	null
2026-04-05	SARES-DEIM: Sparse Mixture-of-Experts Meets DETR for Robust SAR Ship Detection	Fenghao Song et.al.	2604.04127	null
2026-04-05	Bootstrap-Aggregated Method-of-Moments Estimation of the Copula Correlation Parameter for Marginal Survival Inference under Dependent Censoring	Hyun-Soo Zhang et.al.	2604.04032	null
2026-04-04	SPARK-IL: Spectral Retrieval-Augmented RAG for Knowledge-driven Deepfake Detection via Incremental Learning	Hessen Bougueffa Eutamene et.al.	2604.03833	null
2026-04-04	Love Me, Love My Label: Rethinking the Role of Labels in Prompt Retrieval for Visual In-Context Learning	Tianci Luo et.al.	2604.03657	null
2026-04-04	Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation	Kening Zheng et.al.	2604.03592	null
2026-04-03	Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking	Haotian Xiang et.al.	2604.03404	null
2026-04-03	Mixture-of-Experts in Remote Sensing: A Survey	Yongchuan Cui et.al.	2604.03342	null
2026-04-03	CAMEO: A Conditional and Quality-Aware Multi-Agent Image Editing Orchestrator	Yuhan Pu et.al.	2604.03156	null
2026-04-03	JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency	Aichen Cai et.al.	2604.03044	null
2026-04-03	PolyReal: A Benchmark for Real-World Polymer Science Workflows	Wanhao Liu et.al.	2604.02934	null
2026-04-03	Council Mode: Mitigating Hallucination and Bias in LLMs via Multi-Agent Consensus	Shuai Wu et.al.	2604.02923	null
2026-04-03	Multi-Turn Reinforcement Learning for Tool-Calling Agents with Iterative Reward Calibration	Wachiravit Modecrua et.al.	2604.02869	null
2026-04-03	FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving	Qingxiu Liu et.al.	2604.02715	null
2026-04-03	V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views	Junwei You et.al.	2604.02710	null
2026-04-03	Adaptive Semantic Communication for Wireless Image Transmission Leveraging Mixture-of-Experts Mechanism	Haowen Wan et.al.	2604.02691	null
2026-04-02	The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level	Jeremy Herbst et.al.	2604.02178	null
2026-04-02	FlatAttention: Dataflow and Fabric Collectives Co-Optimization for Large Attention-Based Model Inference on Tile-Based Accelerators	Chi Zhang et.al.	2604.02110	null
2026-04-02	SURE: Synergistic Uncertainty-aware Reasoning for Multimodal Emotion Recognition in Conversations	Yiqiang Cai et.al.	2604.01916	null
2026-04-02	FourierMoE: Fourier Mixture-of-Experts Adaptation of Large Language Models	Juyong Jiang et.al.	2604.01762	null
2026-04-02	M3D-BFS: a Multi-stage Dynamic Fusion Strategy for Sample-Adaptive Multi-Modal Brain Network Analysis	Rui Dong et.al.	2604.01667	null
2026-04-02	Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models	Shuibai Zhang et.al.	2604.01622	null
2026-04-02	DWDP: Distributed Weight Data Parallelism for High-Performance LLM Inference on NVL72	Wanqian Li et.al.	2604.01621	null
2026-04-01	Learning When to See and When to Feel: Adaptive Vision-Torque Fusion for Contact-Aware Manipulation	Jiuzhou Lei et.al.	2604.01414	null
2026-04-01	Sparse Spectral LoRA: Routed Experts for Medical VLMs	Omid Nejati Manzari et.al.	2604.01310	null
2026-04-01	Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning	Mohammad R. Abu Ayyash et.al.	2604.01152	null
2026-04-02	Asymptotically Optimal Sequential Testing with Heterogeneous LLMs	Guokai Li et.al.	2604.01086	null
2026-04-01	PHASOR: Anatomy- and Phase-Consistent Volumetric Diffusion for CT Virtual Contrast Enhancement	Zilong Li et.al.	2604.01053	null
2026-04-01	KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection	Abdullah Al Shafi et.al.	2604.00878	null
2026-04-01	Cost-Penalized Fitness in FMA-Orchestrated Mixture of Experts: Experimental Evidence for Molecular Memory in Domain Adaptation	Martin Jaraiz et.al.	2604.00812	null
2026-04-01	Routing-Free Mixture-of-Experts	Yilun Liu et.al.	2604.00801	null
2026-04-01	Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer	Dharma Teja Vooturi et.al.	2604.00785	null
2026-04-01	Toward Optimal Sampling Rate Selection and Unbiased Classification for Precise Animal Activity Recognition	Axiu Mao et.al.	2604.00517	null
2026-04-01	Self-Routing: Parameter-Free Expert Routing from Hidden States	Jama Hussein Mohamud et.al.	2604.00421	null
2026-03-31	From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters	Jinghan Yao et.al.	2604.00317	null
2026-03-31	Directly visualizing the energy level structure of quantum dot molecules	Heun Mo Yoo et.al.	2604.00232	null
2026-03-31	Short proofs in combinatorics and number theory	Boris Alexeev et.al.	2603.29961	null
2026-03-31	First energy scan measurement of $e^{+}e^{-}\to K^{+}K^{-}$ around the $ψ(2S)$ resonance	BESIII Collaboration et.al.	2603.29854	null
2026-03-31	Counterfactual Analysis of Brain Network Dynamics	Moo K. Chung et.al.	2603.29843	null
2026-03-31	Training-Free Dynamic Upcycling of Expert Language Models	Eros Fanì et.al.	2603.29765	null
2026-03-31	TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification	Qing He et.al.	2603.29520	null
2026-03-31	Aligning Multimodal Sequential Recommendations via Robust Direct Preference Optimization with Sparse MoE	Hejin Huang et.al.	2603.29259	null
2026-03-31	Route-Induced Density and Stability (RIDE): Controlled Intervention and Mechanism Analysis of Routing-Style Meta Prompts on LLM Internal States	Dianxing Zhang et.al.	2603.29206	null
2026-03-31	BiMoE: Brain-Inspired Experts for EEG-Dominant Affective State Recognition	Hongyu Zhu et.al.	2603.29205	null
2026-03-30	Rethinking Language Model Scaling under Transferable Hypersphere Optimization	Liliang Ren et.al.	2603.28743	null
2026-03-30	StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation	Yiran Shi et.al.	2603.28565	null
2026-03-30	Observation of $Λ^+_c\to nπ^+η$ and search for $Λ^+_c\to na_0(980)^+$	BESIII Collaboration et.al.	2603.28232	null
2026-03-30	Graph Vector Field: A Unified Framework for Multimodal Health Risk Assessment from Heterogeneous Wearable and Environmental Data Streams	Silvano Coletti et.al.	2603.28115	null
2026-03-30	ExFusion: Efficient Transformer Training via Multi-Experts Fusion	Jiacheng Ruan et.al.	2603.27965	null
2026-03-31	MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation	Ruiyao Liu et.al.	2603.27959	null
2026-03-29	KAT-Coder-V2 Technical Report	Fengxiang Li et.al.	2603.27703	null
2026-03-29	LVRPO: Language-Visual Alignment with GRPO for Multimodal Understanding and Generation	Shentong Mo et.al.	2603.27693	null
2026-03-29	PRBench: End-to-end Paper Reproduction in Physics Research	Shi Qiu et.al.	2603.27646	null
2026-03-29	Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling	Songchen Ma et.al.	2603.27624	null
2026-03-29	Fully Spiking Neural Networks with Target Awareness for Energy-Efficient UAV Tracking	Pengzhi Zhong et.al.	2603.27493	null
2026-03-29	On Token’s Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models	Chongyang Zhao et.al.	2603.27481	null
2026-03-28	Unveiling Code Clones in the Eclipse IIoT Software Ecosystem	Zengyang Li et.al.	2603.27308	null
2026-03-28	Persistent Memory Through Triple-Loop Consolidation in a Non-Gradient Dissipative Cognitive Architecture	Jianwei Lou et.al.	2603.27188	null
2026-03-28	Routing Sensitivity Without Controllability: A Diagnostic Study of Fairness in MoE Language Models	Junhyeok Lee et.al.	2603.27141	null
2026-03-27	TAPS: Task Aware Proposal Distributions for Speculative Sampling	Mohamad Zbib et.al.	2603.27027	null
2026-03-27	Learning to Commit: Generating Organic Pull Requests via Online Repository Memory	Mo Li et.al.	2603.26664	null
2026-03-27	Sustainability Is Not Linear: Quantifying Performance, Energy, and Privacy Trade-offs in On-Device Intelligence	Eziyo Ehsani et.al.	2603.26603	null
2026-03-26	Can Small Models Reason About Legal Documents? A Comparative Study	Snehit Vaddi et.al.	2603.25944	null
2026-03-26	Narrowband searches for continuous gravitational waves from known pulsars in the first two parts of the fourth LIGO–Virgo–KAGRA observing run	The LIGO Scientific Collaboration et.al.	2603.25938	null
2026-03-26	AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer’s Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study	Wenlong Hou et.al.	2603.25322	null
2026-03-26	SliderQuant: Accurate Post-Training Quantization for LLMs	Shigeng Wang et.al.	2603.25284	null
2026-03-26	A Wireless World Model for AI-Native 6G Networks	Ziqi Chen et.al.	2603.25216	null
2026-03-26	MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation	Ranxu Zhang et.al.	2603.25126	null
2026-03-26	MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting	Huyen Ngoc Tran et.al.	2603.25046	null
2026-03-26	MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models	Dohwan Ko et.al.	2603.24984	null
2026-03-26	CROSS: A Mixture-of-Experts Reinforcement Learning Framework for Generalizable Large-Scale Traffic Signal Control	Xibei Chen et.al.	2603.24930	null
2026-03-25	OptiSAR-Net++: A Large-Scale Benchmark and Transformer-Free Framework for Cross-Domain Remote Sensing Visual Grounding	Xiaoyu Tang et.al.	2603.24876	null
2026-03-25	Enes Causal Discovery	Alexis Kafantaris et.al.	2603.24436	null
2026-03-25	Cross Section Measurements of $\bar{n}p \rightarrow K^{+}K^{-}π^{+}(π^{0})$ via Antineutrons Produced by $J/ψ\to p π^{-} \bar{n}$ Decays	BESIII Collaboration et.al.	2603.24272	null
2026-03-25	B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition	Nishit Poddar et.al.	2603.24245	null
2026-03-25	Sequence-aware Large Language Models for Explainable Recommendation	Gangyi Zhang et.al.	2603.24136	null
2026-03-25	PCHC: Enabling Preference Conditioned Humanoid Control via Multi-Objective Reinforcement Learning	Huanyu Li et.al.	2603.24047	null
2026-03-25	LGEST: Dynamic Spatial-Spectral Expert Routing for Hyperspectral Image Classification	Jiawen Wen et.al.	2603.24045	null
2026-03-25	MoE-Sieve: Routing-Guided LoRA for Efficient MoE Fine-Tuning	Andrea Manzoni et.al.	2603.24044	null
2026-03-25	SiftMoE: Similarity-Aware Energy-Efficient Expert Selection for Wireless Distributed MoE Inference	Qian Chen et.al.	2603.23888	null
2026-03-24	Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters	Nan Cui et.al.	2603.23780	null
2026-03-24	The Diminishing Returns of Early-Exit Decoding in Modern LLMs	Rui Wei et.al.	2603.23701	null
2026-03-24	VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs	Haoran Yuan et.al.	2603.23481	null
2026-03-24	Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning	Connor Mclaughlin et.al.	2603.23436	null
2026-03-24	Amplitude Analysis of the Isospin-Violating Decay $J/ψ\rightarrow γηπ^{0}$	BESIII Collaboration et.al.	2603.23081	null
2026-03-24	IntentWeave: A Progressive Entry Ladder for Multi-Surface Browser Agents in Cloud Portals	Wanying Mo et.al.	2603.22917	null
2026-03-24	Search for the radiative decays $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$	BESIII Collaboration et.al.	2603.22804	null
2026-03-24	KALAVAI: Predicting When Independent Specialist Fusion Works – A Quantitative Model for Post-Hoc Cooperative LLM Training	Ramchand Kumaresan et.al.	2603.22755	null
2026-03-24	Why Database Manuals Are Not Enough: Efficient and Reliable Configuration Tuning for DBMSs via Code-Driven LLM Agents	Xinyi Zhang et.al.	2603.22708	null
2026-03-23	Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning	Jihyun Janice Ahn et.al.	2603.22619	null
2026-03-23	FullCircle: Effortless 3D Reconstruction from Casual 360$^\circ$ Captures	Yalda Foroutan et.al.	2603.22572	null
2026-03-23	3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing	Haoyu Zhen et.al.	2603.22279	null
2026-03-23	A bending in the size-mass relation of star-forming galaxies across $0.5 < z < 6.0$ at a critical stellar mass of $10^{10}M_\odot$ revealed by JWST	Longyue Chen et.al.	2603.22239	null
2026-03-23	Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning	Daniel Shao et.al.	2603.22198	null
2026-03-23	ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive Text-to-Image Retrieval	Zhuocheng Zhang et.al.	2603.21886	null
2026-03-23	Holistic Scaling Laws for Optimal Mixture-of-Experts Architecture Optimization	Weilin Wan et.al.	2603.21862	null
2026-03-23	DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers	Tianyu Cao et.al.	2603.21608	null
2026-03-22	Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity	Zihan Fang et.al.	2603.21276	null
2026-03-22	QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression	Zhongyang Li et.al.	2603.21232	null
2026-03-22	MI-DPG: Decomposable Parameter Generation Network Based on Mutual Information for Multi-Scenario Recommendation	Wenzhuo Cheng et.al.	2603.21209	null
2026-03-22	Diffusion-based Probabilistic Air Quality Forecasting with Mechanistic Insight	Ao Ding et.al.	2603.21131	null
2026-03-22	Mixture of Chapters: Scaling Learnt Memory in Transformers	Tasmay Pankaj Tibrewal et.al.	2603.21096	null
2026-03-22	CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models	Nan Zhou et.al.	2603.21077	null
2026-03-22	LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning	Jianing Wang et.al.	2603.21065	null
2026-03-21	Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models	Yifan Yang et.al.	2603.20697	null
2026-03-21	CFNN: Continued Fraction Neural Network	Chao Wang et.al.	2603.20634	null
2026-03-21	A 4R-supported circular product-service system for luxury branded events	Ke Ma et.al.	2603.20613	null
2026-03-20	AE-LLM: Adaptive Efficiency Optimization for Large Language Models	Kaito Tanaka et.al.	2603.20492	null
2026-03-20	Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation	Marcus Armstrong et.al.	2603.20406	null
2026-03-20	Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?	Lokesh Kumar et.al.	2603.19831	null
2026-03-20	Making Video Models Adhere to User Intent with Minor Adjustments	Daniel Ajisafe et.al.	2603.19672	null
2026-03-20	Structured Prompting for Arabic Essay Proficiency: A Trait-Centric Evaluation Approach	Salim Al Mandhari et.al.	2603.19668	null
2026-03-20	CS-MUNet: A Channel-Spatial Dual-Stream Mamba Network for Multi-Organ Segmentation	Yuyang Zheng et.al.	2603.19659	null
2026-03-20	UniBioTransfer: A Unified Framework for Multiple Biometrics Transfer	Caiyi Sun et.al.	2603.19637	null
2026-03-19	Scalable Prompt Routing via Fine-Grained Latent Task Discovery	Yunyi Zhang et.al.	2603.19415	null
2026-03-22	Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation	Zhuolin Yang et.al.	2603.19220	null
2026-03-19	DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge	Yuegui Huang et.al.	2603.19172	null
2026-03-19	ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning	Weihang Huang et.al.	2603.19029	null
2026-03-19	GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants	The LIGO Scientific Collaboration et.al.	2603.19021	null
2026-03-19	GWTC-4.0: Tests of General Relativity. II. Parameterized Tests	The LIGO Scientific Collaboration et.al.	2603.19020	null
2026-03-19	GWTC-4.0: Tests of General Relativity. I. Overview and General Tests	The LIGO Scientific Collaboration et.al.	2603.19019	null
2026-03-19	DriftGuard: Mitigating Asynchronous Data Drift in Federated Learning	Yizhou Han et.al.	2603.18872	null
2026-03-19	Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision–Language–Motion Diffusion Architecture	Fuze Sun et.al.	2603.18771	null
2026-03-19	Observation of $D_s^+ \to a_0(980)^+f_0(500)$ in the Amplitude Analysis of $D_s^+ \to π^+ π^0 π^0 η$	BESIII Collaboration et.al.	2603.18521	null
2026-03-19	AIMER: Calibration-Free Task-Agnostic MoE Pruning	Zongfang Liu et.al.	2603.18492	null
2026-03-19	AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba	Yan Li et.al.	2603.18462	null
2026-03-19	Spatially Indirect Exciton Condensation in Two-Dimensional Strongly Correlated Semimetals	Yao Zeng et.al.	2603.18445	null
2026-03-18	Path-Constrained Mixture-of-Experts	Zijin Gu et.al.	2603.18297	null
2026-03-18	CORE: Robust Out-of-Distribution Detection via Confidence and Orthogonal Residual Scoring	Jin Mo Yang et.al.	2603.18290	null
2026-03-18	Resonance-enhanced integrated acousto-optic beam steering	Yue Yu et.al.	2603.18191	null
2026-03-18	Understanding Task Aggregation for Generalizable Ultrasound Foundation Models	Fangyijie Wang et.al.	2603.18123	null
2026-03-18	DebugLM: Learning Traceable Training Data Provenance for LLMs	Wenjie Jacky Mo et.al.	2603.17884	null
2026-03-18	The 1/W Law: An Analytical Study of Context-Length Routing Topology and GPU Generation Gains for LLM Inference Energy Efficiency	Huamin Chen et.al.	2603.17280	null
2026-03-17	Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency	Lucas Bandarkar et.al.	2603.17102	null
2026-03-17	Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection	Haitian Wang et.al.	2603.17069	null
2026-03-17	SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding	D. Darankoum et.al.	2603.16739	null
2026-03-17	HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture	Aojie Yuan et.al.	2603.16679	null
2026-03-19	Mixture of Style Experts for Diverse Image Stylization	Shihao Zhu et.al.	2603.16649	null
2026-03-17	Tarab: A Multi-Dialect Corpus of Arabic Lyrics and Poetry	Mo El-Haj et.al.	2603.16601	null
2026-03-17	Visual Distraction Undermines Moral Reasoning in Vision-Language Models	Xinyi Yang et.al.	2603.16445	null
2026-03-18	EngGPT2: Sovereign, Efficient and Open Intelligence	G. Ciarfaglia et.al.	2603.16430	null
2026-03-17	PlotTwist: A Creative Plot Generation Framework with Small Language Models	Abhinav Thorat et.al.	2603.16410	null
2026-03-17	DynamicGate MLP Conditional Computation via Learned Structural Dropout and Input Dependent Gating for Functional Plasticity	Yong Il Choi et.al.	2603.16367	null
2026-03-17	Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits	Jia Qing Yap et.al.	2603.16335	null
2026-03-17	AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection	Hongwei Lin et.al.	2603.16261	null
2026-03-17	Accelerating Approximate Analytical Join Queries over Unstructured Data with Statistical Guarantees	Yuxuan Zhu et.al.	2603.16153	null
2026-03-16	Confidently Wrong: Why Ignoring Binaries Biases IMF Inference at Large Sample Sizes	Anna L. Rosen et.al.	2603.15779	null
2026-03-16	Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning	Ye Wang et.al.	2603.15708	null
2026-03-16	Bridging Local and Global Knowledge: Cascaded Mixture-of-Experts Learning for Near-Shortest Path Routing	Yung-Fu Chen et.al.	2603.15541	null
2026-03-16	Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis	Penny Chong et.al.	2603.15483	null
2026-03-16	A Closer Look into LLMs for Table Understanding	Jia Wang et.al.	2603.15402	null
2026-03-16	MoE-ACT: Scaling Multi-Task Bimanual Manipulation with Sparse Language-Conditioned Mixture-of-Experts Transformers	Kangjun Guo et.al.	2603.15265	null
2026-03-17	Tracking the Discriminative Axis: Dual Prototypes for Test-Time OOD Detection Under Covariate Shift	Wooseok Lee et.al.	2603.15213	null
2026-03-16	ForceVLA2: Unleashing Hybrid Force-Position Control with Force Awareness for Contact-Rich Manipulation	Yang Li et.al.	2603.15169	null
2026-03-16	M2IR: Proactive All-in-One Image Restoration via Mamba-style Modulation and Mixture-of-Experts	Shiwei Wang et.al.	2603.14816	null
2026-03-16	Genetic Algorithms in Regression	Mo Li et.al.	2603.14801	null
2026-03-16	Universe Routing: Why Self-Evolving Agents Need Epistemic Control	Zhaohui Geoffrey Wang et.al.	2603.14799	null
2026-03-15	TopoCL: Topological Contrastive Learning for Medical Imaging	Guangyu Meng et.al.	2603.14647	null
2026-03-15	A measurement of gas rotation in galaxy groups via the kinetic Sunyaev-Zeldovich effect	Tianyi Yang et.al.	2603.14494	null
2026-03-15	Towards One-for-All Anomaly Detection for Tabular Data	Shiyuan Li et.al.	2603.14407	null
2026-03-15	WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems	Yuchen Wang et.al.	2603.14392	null
2026-03-15	M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling	Mayank Mishra et.al.	2603.14360	null
2026-03-15	A Physically-Grounded Attack and Adaptive Defense Framework for Real-World Low-Light Image Enhancement	Tongshun Zhang et.al.	2603.14304	null
2026-03-15	All-sky Searches for Continuous Gravitational Waves from Isolated Neutron Stars in the Data from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run	The LIGO Scientific Collaboration et.al.	2603.14168	null
2026-03-14	PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting	Xinyu Xiao et.al.	2603.13818	null
2026-03-14	Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control	Grayson Lee et.al.	2603.13733	null
2026-03-14	Sparse-Dense Mixture of Experts Adapter for Multi-Modal Tracking	Yabin Zhu et.al.	2603.13719	null
2026-03-13	NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL	Amos Goldman et.al.	2603.13606	null
2026-03-13	MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models	Md. Abdul Awal et.al.	2603.13213	null
2026-03-13	Reference-Free Image Quality Assessment for Virtual Try-On via Human Feedback	Yuki Hirakawa et.al.	2603.13057	null
2026-03-13	Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach	Elena Ryumina et.al.	2603.13056	null
2026-03-13	Multimodal Protein Language Models for Enzyme Kinetic Parameters: From Substrate Recognition to Conformational Adaptation	Fei Wang et.al.	2603.12845	null
2026-03-13	Serving Hybrid LLM Loads with SLO Guarantees Using CPU-GPU Attention Piggybacking	Zizhao Mo et.al.	2603.12831	null
2026-03-13	LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing	Jiawei Hao et.al.	2603.12645	null
2026-03-13	CarPLAN: Context-Adaptive and Robust Planning with Dynamic Scene Awareness for Autonomous Driving	Junyong Yun et.al.	2603.12607	null
2026-03-13	Spectral Dataset of Stripped-Envelope Supernovae from the Tsinghua Supernova Group	Danfeng Xiang et.al.	2603.12604	null
2026-03-13	Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation	Jia-Chen Zhang et.al.	2603.12577	null
2026-03-13	Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation	Alaa Dalaq et.al.	2603.12538	null
2026-03-12	TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition	Prabhu Vellaisamy et.al.	2603.12465	null
2026-03-12	NeuroLoRA: Context-Aware Neuromodulation for Parameter-Efficient Multi-Task Adaptation	Yuxin Yang et.al.	2603.12378	null
2026-03-12	A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition	Jiajun Sun et.al.	2603.12221	null
2026-03-12	CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation	Ziqi Ye et.al.	2603.12008	null
2026-03-12	AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization	Qiyang Li et.al.	2603.11873	null
2026-03-12	Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing	Hanchi Sun et.al.	2603.11535	null
2026-03-11	Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers	Mynampati Sri Ranganadha Avinash et.al.	2603.11114	null
2026-03-11	Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High dimensions	Kangke Cheng et.al.	2603.10721	link
2026-03-11	UniStitch: Unifying Semantic and Geometric Features for Image Stitching	Yuan Mei et.al.	2603.10568	link
2026-03-11	Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design	Junzhuo Li et.al.	2603.10379	null
2026-03-12	The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance	Jesse Yu et.al.	2603.10323	null
2026-03-10	Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions	Mingyang Song et.al.	2603.09938	null
2026-03-10	Quantifying the Necessity of Chain of Thought through Opaque Serial Depth	Jonah Brown-Cohen et.al.	2603.09786	null
2026-03-10	MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants	Zuhao Zhang et.al.	2603.09652	null
2026-03-10	MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning	Xiang Yuan et.al.	2603.09478	link
2026-03-12	Multi-tasking through quantum annealing	Jargalsaikhan Artag et.al.	2603.09468	null
2026-03-10	Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers	Albus Yizhuo Li et.al.	2603.09453	null
2026-03-10	Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking	Shilei Wang et.al.	2603.09287	null
2026-03-10	Acoustic and Semantic Modeling of Emotion in Spoken Language	Soumya Dutta et.al.	2603.09212	null
2026-03-10	GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models	Md Selim Sarowar et.al.	2603.09079	null
2026-03-09	The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference	Vignesh Adhinarayanan et.al.	2603.08960	null
2026-03-09	ConFu: Contemplate the Future for Better Speculative Sampling	Zongyue Qin et.al.	2603.08899	null
2026-03-09	Microwave response of electrically driven spins in a three-qubit quantum processor	Tanner M. Janda et.al.	2603.08577	null
2026-03-09	LAR-MoE: Latent-Aligned Routing for Mixture of Experts in Robotic Imitation Learning	Ariel Rodriguez et.al.	2603.08476	null
2026-03-09	Amplitude Analysis of Singly Cabibbo-Suppressed Decay $Λ^{+}_{c}\to p K^{+} K^{-}$	BESIII Collaboration et.al.	2603.08469	null
2026-03-09	IronEngine: Towards General AI Assistant	Xi Mo et.al.	2603.08425	null
2026-03-09	Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows	Shentong Mo et.al.	2603.08126	null
2026-03-09	An improved measurement of $η^\prime\rightarrow e^{+}e^{-}ω$	BESIII Collaboration et.al.	2603.08120	null
2026-03-09	SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving	Zihan You et.al.	2603.08113	null
2026-03-09	Deterministic Differentiable Structured Pruning for Large Language Models	Weiyu Huang et.al.	2603.08065	null
2026-03-09	Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization	Jingwei Li et.al.	2603.08022	null
2026-03-09	Scaling Machine Learning Interatomic Potentials with Mixtures of Experts	Yuzhi Liu et.al.	2603.07977	null
2026-03-09	Structural Design and Performance Analysis of Laser Transmitting Telescope for Space Gravitational Wave Detection	Long Yongtao et.al.	2603.07967	null
2026-03-09	SGG-R$^{\rm 3}$: From Next-Token Prediction to End-to-End Unbiased Scene Graph Generation	Jiaye Feng et.al.	2603.07961	null
2026-03-09	SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans	Hansi Zeng et.al.	2603.07853	null
2026-03-08	Scalable Training of Mixture-of-Experts Models with Megatron Core	Zijie Yan et.al.	2603.07685	null
2026-03-08	AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots	Likui Zhang et.al.	2603.07648	null
2026-03-08	Mixed Effects Mixture of Experts: Modeling Double Heterogeneous Trajectories	Xinkai Yue et.al.	2603.07479	null
2026-03-08	UnSCAR: Universal, Scalable, Controllable, and Adaptable Image Restoration	Debabrata Mandal et.al.	2603.07406	null
2026-03-07	Scheduling Parallel Optical Circuit Switches for AI Training	Kevin Liang et.al.	2603.07373	null
2026-03-07	Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures	Shuqing Luo et.al.	2603.07006	null
2026-03-06	Swimba: Switch Mamba Model Scales State Space Models	Zhixu Du et.al.	2603.06938	null
2026-03-06	PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection	Zhengjian Kang et.al.	2603.06917	null
2026-03-06	PICS: Pairwise Image Compositing with Spatial Interactions	Hang Zhou et.al.	2603.06873	null
2026-03-06	ButterflyViT: 354$\times$ Expert Compression for Edge Vision Transformers	Aryan Karmore et.al.	2603.06746	null
2026-03-06	RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering	Gaia A. Bertolino et.al.	2603.06542	null
2026-03-06	A Mixture-of-Experts Framework for Practical Hybrid-Quantum Models in Credit Card Fraud Detection	Rodrigo Chaves et.al.	2603.06473	null
2026-03-06	MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis	Dongqing Xie et.al.	2603.06378	null
2026-03-06	MoEless: Efficient MoE LLM Serving via Serverless Computing	Hanfei Yu et.al.	2603.06350	null
2026-03-06	WMoE-CLIP: Wavelet-Enhanced Mixture-of-Experts Prompt Learning for Zero-Shot Anomaly Detection	Peng Chen et.al.	2603.06313	null
2026-03-06	GazeMoE: Perception of Gaze Target with Mixture-of-Experts	Zhuangzhuang Dai et.al.	2603.06256	null
2026-03-06	EvoESAP: Non-Uniform Expert Pruning for Sparse MoE	Zongfang Liu et.al.	2603.06003	link
2026-03-06	MoE Lens – An Expert Is All You Need	Marmik Chaudhari et.al.	2603.05806	null
2026-03-06	Sparse Crosscoders for diffing MoEs and Dense models	Marmik Chaudhari et.al.	2603.05805	null
2026-03-05	Change Point Detection for Cell Populations Measured via Flow Cytometry	Yik Lun Kei et.al.	2603.05700	null
2026-03-05	FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation	Hung Nguyen Huy et.al.	2603.05690	null
2026-03-05	Multi-channel joint analysis of the exotic charmonium-like state $T_{c\bar{c}}(4020)$	BESIII Collaboration et.al.	2603.05564	null
2026-03-05	VietJobs: A Vietnamese Job Advertisement Dataset	Hieu Pham Dinh et.al.	2603.05262	null
2026-03-05	NeuronMoE: Neuron-Guided Mixture-of-Experts for Efficient Multilingual LLM Extension	Rongzhi Li et.al.	2603.05046	null
2026-03-05	Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation	Yilong Chen et.al.	2603.04971	null
2026-03-05	Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling	Yong Liu et.al.	2603.04791	null
2026-03-05	TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings	Yebo Wu et.al.	2603.04772	null
2026-03-04	ECG-MoE: Mixture-of-Expert Electrocardiogram Foundation Model	Yuhao Xu et.al.	2603.04589	null
2026-03-04	Augmenting representations with scientific papers	Nicolò Oreste Pinciroli Vago et.al.	2603.04516	null
2026-03-04	RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation	Yixin Chen et.al.	2603.04348	null
2026-03-04	CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation	Jinfeng Xu et.al.	2603.04320	null
2026-03-04	Precise measurement of the form factors in $D^0\rightarrow K^*(892)^-\ell^+ν_{\ell}$ and observation of $D^0\rightarrow K_2^*(1430)^-\ell^+ν_{\ell}$	BESIII Collaboration et.al.	2603.04136	null
2026-03-04	UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization	Qianfeng Yang et.al.	2603.03967	null
2026-03-04	Glass Segmentation with Fusion of Learned and General Visual Features	Risto Ojala et.al.	2603.03718	null
2026-03-04	Plasmonic polaron in self-intercalated 1T-TiS2	Byoung Ki Choi et.al.	2603.03663	null
2026-03-03	Modeling Cross-vision Synergy for Unified Large Vision Model	Shengqiong Wu et.al.	2603.03564	null
2026-03-03	Beyond Language Modeling: An Exploration of Multimodal Pretraining	Shengbang Tong et.al.	2603.03276	null
2026-03-03	Search for a massless particle beyond the Standard Model in the $Ξ^0\toΛ+ \text{invisible}$ decay	BESIII Collaboration et.al.	2603.03199	null
2026-03-04	MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection	Jun Yeong Park et.al.	2603.03101	null
2026-03-03	CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots	Shihao Ma et.al.	2603.03067	null
2026-03-03	EduVQA: Benchmarking AI-Generated Video Quality Assessment for Education	Baoliang Chen et.al.	2603.03066	null
2026-03-03	Practical FP4 Training for Large-Scale MoE Models on Hopper GPUs	Wuyue Zhang et.al.	2603.02731	null
2026-03-03	TenExp: Mixture-of-Experts-Based Tensor Decomposition Structure Search Framework	Ting-Wei Zhou et.al.	2603.02720	null
2026-03-03	MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration	Lingshun Kong et.al.	2603.02710	null
2026-03-03	Addressing Missing and Noisy Modalities in One Solution: Unified Modality-Quality Framework for Low-quality Multimodal Data	Sijie Mai et.al.	2603.02695	null
2026-03-03	Robust Heterogeneous Analog-Digital Computing for Mixture-of-Experts Models with Theoretical Generalization Guarantees	Mohammed Nowaz Rabbani Chowdhury et.al.	2603.02633	null
2026-03-02	Search for the charmonium weak decay $ψ(2S)\to D_s^-π^+ + c.c.$ and $ψ(2S)\to D_s^-ρ^+ + c.c.$	BESIII Collaboration et.al.	2603.01777	null
2026-03-02	DynaMoE: Dynamic Token-Level Expert Activation with Layer-Wise Adaptive Capacity for Mixture-of-Experts Neural Networks	Gökdeniz Gülmez et.al.	2603.01697	null
2026-03-02	PathMoE: Interpretable Multimodal Interaction Experts for Pediatric Brain Tumor Classification	Jian Yu et.al.	2603.01547	null
2026-03-02	Multimodal Mixture-of-Experts with Retrieval Augmentation for Protein Active Site Identification	Jiayang Wu et.al.	2603.01511	null
2026-03-02	DOCFORGE-BENCH: A Comprehensive Benchmark for Document Forgery Detection and Analysis	Zengqi Zhao et.al.	2603.01433	null
2026-03-03	UETrack: A Unified and Efficient Framework for Single Object Tracking	Ben Kang et.al.	2603.01412	null
2026-03-02	Fed-GAME: Personalized Federated Learning with Graph Attention Mixture-of-Experts For Time-Series Forecasting	Yi Li et.al.	2603.01363	null
2026-03-01	Truth as a Trajectory: What Internal Representations Reveal About Large Language Model Reasoning	Hamed Damirchi et.al.	2603.01326	null
2026-03-01	Fast Confidence-Aware Human Prediction via Hardware-accelerated Bayesian Inference for Safe Robot Navigation	Michael Lu et.al.	2603.01122	null
2026-03-01	TriMoE: Augmenting GPU with AMX-Enabled CPU and DIMM-NDP for High-Throughput MoE Inference via Offloading	Yudong Pan et.al.	2603.01058	null
2026-03-01	Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving	Xubo Zhu et.al.	2603.01007	null
2026-02-28	MME: Mixture of Mesh Experts with Random Walk Transformer Gating	Amir Belder et.al.	2603.00828	null
2026-02-28	First Amplitude Analysis of $D^0\rightarrow K^-π^0e^+ν_e$ and Observation of $D^0\rightarrow K^*_2(1430)^-e^+ν_e$	BESIII Collaboration et.al.	2603.00743	null
2026-02-28	K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control	Zhe Wu et.al.	2603.00676	null
2026-02-28	Precise Measurement and Control of Radon Progeny on Detector Surfaces	C. B. Z. Luo et.al.	2603.00647	null
2026-02-28	CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging	Jie Cao et.al.	2603.00573	null
2026-02-27	CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning	Yuxuan Liu et.al.	2602.24142	null
2026-02-27	Precision Studies and Searches for CP Asymmetries in the Inclusive Decay $Λ_{c}^{+}\to ΛX$	BESIII Collaboration et.al.	2602.24089	null
2026-02-27	Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization	Chenwei Jia et.al.	2602.24059	null
2026-02-27	Measurement of Born Cross Sections for $e^+e^-\toΣ^-\barΣ^+$ at $\sqrt{s}=3.51-4.95$ GeV and Observation of $ψ(3770)\toΣ^-\barΣ^+$	BESIII Collaboration et.al.	2602.23835	null
2026-02-27	ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation	Jiangyuan Wang et.al.	2602.23716	null
2026-02-26	Brain-OF: An Omnifunctional Foundation Model for fMRI, EEG and MEG	Hanning Guo et.al.	2602.23410	null
2026-02-26	A Mixture-of-Experts Model for Multimodal Emotion Recognition in Conversations	Soumya Dutta et.al.	2602.23300	null
2026-02-26	Learning Physical Operators using Neural Operators	Vignesh Gopakumar et.al.	2602.23113	null
2026-02-26	Residual Koopman Spectral Profiling for Predicting and Preventing Transformer Training Instability	Bum Jun Kim et.al.	2602.22988	null
2026-02-26	pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation	Shentong Mo et.al.	2602.22938	null
2026-02-26	MEDNA-DFM: A Dual-View FiLM-MoE Model for Explainable DNA Methylation Prediction	Yi He et.al.	2602.22850	null
2026-02-26	DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation	Hao Zheng et.al.	2602.22839	null
2026-02-26	Productivity and Collaboration in Hybrid Agile Teams: An Interview Study	Elisabeth Mo et.al.	2602.22835	null
2026-02-26	Measurements of branching fractions of $Λ_{c}^{+}\toΣ^{0}K_{S}^{0}π^{+}$ and $Λ_{c}^{+}\toΣ^{0}K_{S}^{0}K^{+}$	BESIII Collaboration et.al.	2602.22754	null
2026-02-26	IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation	Yanpei Guo et.al.	2602.22700	null
2026-02-26	Switch-Hurdle: A MoE Encoder with AR Hurdle Decoder for Intermittent Demand Forecasting	Fabian Muşat et.al.	2602.22685	null
2026-02-26	Accelerating LLM Pre-Training through Flat-Direction Dynamics Enhancement	Shuchen Zhu et.al.	2602.22681	null
2026-02-26	Predictive variational inference for flexible regression models	Lucas Kock et.al.	2602.22582	null
2026-02-26	Towards Dynamic Dense Retrieval with Routing Strategy	Zhan Su et.al.	2602.22547	null
2026-02-25	NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training	Dengdi Sun et.al.	2602.22059	null
2026-02-25	Excitation: Momentum For Experts	Sagi Shaier et.al.	2602.21798	null
2026-02-25	Learning from Yesterday’s Error: An Efficient Online Learning Method for Traffic Demand Prediction	Xiannan Huang et.al.	2602.21757	null
2026-02-25	TiMi: Empower Time Series Transformers with Multimodal Mixture of Experts	Jiafeng Lin et.al.	2602.21693	null
2026-02-25	Multi-Layer Scheduling for MoE-Based LLM Reasoning	Yifan Sun et.al.	2602.21626	null
2026-02-24	A Path to an All-Sky Survey with Roman	Jiwon Jesse Han et.al.	2602.21280	null
2026-02-24	On infinite sets with no $3$ on a line	Moe Putterman et.al.	2602.21275	null
2026-02-24	ReviveMoE: Fast Recovery for Hardware Failures in Large-Scale MoE LLM Inference Deployments	Haley Li et.al.	2602.21140	null
2026-02-24	MUSE: Harnessing Precise and Diverse Semantics for Few-Shot Whole Slide Image Classification	Jiahao Xu et.al.	2602.20873	null
2026-02-25	GeCo-SRT: Geometry-aware Continual Adaptation for Robotic Cross-Task Sim-to-Real Transfer	Wenbo Yu et.al.	2602.20871	null
2026-02-24	Multi-time Loewner energy: rate function for large deviation	Mo Chen et.al.	2602.20642	null
2026-02-24	Precise Measurement of Matter-Antimatter Asymmetry with Entangled Hyperon Antihyperon Pairs	BESIII Collaboration et.al.	2602.20524	null
2026-02-24	Search for Light-Mass Fractionally Charged Particles in Space with DAMPE Experiment	F. Alemanno et.al.	2602.20519	null
2026-02-24	Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA	Nuocheng Yang et.al.	2602.20492	null
2026-02-23	Learning Discriminative and Generalizable Anomaly Detector for Dynamic Graph with Limited Supervision	Yuxing Tian et.al.	2602.20019	null
2026-02-23	Counterfactual Understanding via Retrieval-aware Multimodal Modeling for Time-to-Event Survival Prediction	Ha-Anh Hoang Nguyen et.al.	2602.19987	link
2026-02-23	ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting	Yuxing Tian et.al.	2602.19969	null
2026-02-23	A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs	Zijie Liu et.al.	2602.19938	null
2026-02-23	Towards Dexterous Embodied Manipulation via Deep Multi-Sensory Fusion and Sparse Expert Scaling	Yirui Sun et.al.	2602.19764	null
2026-02-23	Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis	Junhyeok Choi et.al.	2602.19756	null
2026-02-23	RAID: Retrieval-Augmented Anomaly Detection	Mingxiu Cai et.al.	2602.19611	null
2026-02-23	EMS-FL: Federated Tuning of Mixture-of-Experts in Satellite-Terrestrial Networks via Expert-Driven Model Splitting	Angzi Xu et.al.	2602.19485	null
2026-02-22	RegionRoute: Regional Style Transfer with Diffusion Model	Bowen Chen et.al.	2602.19254	null
2026-02-22	Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts	Toshihide Ubukata et.al.	2602.19244	null
2026-02-22	SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation	Yujie Lu et.al.	2602.19213	null
2026-02-22	JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation	Kai Liu et.al.	2602.19163	null
2026-02-22	K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model	Shiyi Cao et.al.	2602.19128	null
2026-02-22	Routing-Aware Explanations for Mixture of Experts Graph Models in Malware Detection	Hossein Shokouhinejad et.al.	2602.19025	null
2026-02-21	NeuroWise: A Multi-Agent LLM “Glass-Box” System for Practicing Double-Empathy Communication with Autistic Partners	Albert Tang et.al.	2602.18962	null
2026-02-21	Give Users the Wheel: Towards Promptable Recommendation Paradigm	Fuyuan Lyu et.al.	2602.18929	null
2026-02-21	Diverse properties of electron Forbush decreases revealed by the Dark Matter Particle Explorer	F. Alemanno et.al.	2602.18743	null
2026-02-21	Comprehensive measurement of $η^\prime$ photoproduction off the proton at $E_γ< 2.4$ $\mathrm{GeV}$	N. Muramatsu et.al.	2602.18675	null
2026-02-20	Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory	Vatsal Agarwal et.al.	2602.18434	null
2026-02-20	RamanSeg: Interpretability-driven Deep Learning on Raman Spectra for Cancer Diagnosis	Chris Tomy et.al.	2602.18119	null
2026-02-20	DeepSVU: Towards In-depth Security-oriented Video Understanding via Unified Physical-world Regularized MoE	Yujie Jin et.al.	2602.18019	null
2026-02-19	Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds	Ibne Farabi Shihab et.al.	2602.17798	null
2026-02-19	Phase-Aware Mixture of Experts for Agentic Reinforcement Learning	Shengtian Yang et.al.	2602.17038	null
2026-02-19	Arcee Trinity Large Technical Report	Varun Singh et.al.	2602.17004	null
2026-02-19	Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation	Yan Wang et.al.	2602.16990	null
2026-02-18	Claim Automation using Large Language Model	Zhengda Mo et.al.	2602.16836	null
2026-02-18	Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning	Zifan Wang et.al.	2602.16796	null
2026-02-18	Geometric Neural Operators via Lie Group-Constrained Latent Dynamics	Jiaquan Zhang et.al.	2602.16209	null
2026-02-18	OmniCT: Towards a Unified Slice-Volume LVLM for Comprehensive CT Analysis	Tianwei Lin et.al.	2602.16110	null
2026-02-18	Federated Graph AGI for Cross-Border Insider Threat Intelligence in Government Financial Schemes	Srikumar Nayak et.al.	2602.16109	null
2026-02-17	MoE-Spec: Expert Budgeting for Efficient Speculative Decoding	Bradley McDanel et.al.	2602.16052	null
2026-02-17	ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns	Ziyu Zhao et.al.	2602.15521	null
2026-02-17	GMAIL: Generative Modality Alignment for generated Image Learning	Shentong Mo et.al.	2602.15368	null
2026-02-16	Mixture-of-Experts under Finite-Rate Gating: Communication–Generalization Trade-offs	Ali Khalesi et.al.	2602.15091	null
2026-02-13	RynnBrain: Open Embodied Foundation Models	Ronghao Dang et.al.	2602.14979	null
2026-02-16	Topological and arithmetic characteristics about products of projective lines with complex tori	Jia-Li Mo et.al.	2602.14745	null
2026-02-16	DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving	Chenxu Dang et.al.	2602.14577	null
2026-02-15	DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices	Songyuan Li et.al.	2602.14301	null
2026-02-15	MILD: Multi-Intent Learning and Disambiguation for Proactive Failure Prediction in Intent-based Networking	Md. Kamrul Hossain et.al.	2602.14283	null
2026-02-15	Multi-Agent Debate: A Unified Agentic Framework for Tabular Anomaly Detection	Pinqiao Wang et.al.	2602.14251	null
2026-02-15	Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws	Jinbo Wang et.al.	2602.14208	null
2026-02-15	Synergistic Intra- and Cross-Layer Regularization Losses for MoE Expert Specialization	Rizhen Hu et.al.	2602.14159	null
2026-02-15	REAL: Resolving Knowledge Conflicts in Knowledge-Intensive Visual Question Answering via Reasoning-Pivot Alignment	Kai Ye et.al.	2602.14065	null
2026-02-15	LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts	Yang Liu et.al.	2602.14060	null
2026-02-15	Geometry-Preserving Aggregation for Mixture-of-Experts Embedding Models	Sajjad Kachuee et.al.	2602.14039	null
2026-02-15	Eureka-Audio: Triggering Audio Intelligence in Compact Language Models	Dan Zhang et.al.	2602.13954	null
2026-02-14	Assessing Cybersecurity Risks and Traffic Impact in Connected Autonomous Vehicles	Saurav Silwal et.al.	2602.13898	null
2026-02-14	Mixture-of-experts Wishart model for covariance matrices with an application to Cancer drug screening	The Tien Mai et.al.	2602.13888	null
2026-02-13	Dyad: a binary-star dynamics and statistics library for Python	Amery Gration et.al.	2602.13388	null
2026-02-13	Improved measurements of the coherence factors and strong-phase differences in $D\to K^-π^+π^+π^-$ and $D\to K^-π^+π^0$ with quantum-correlated $D\bar{D}$ decays	BESIII Collaboration et.al.	2602.13002	null
2026-02-13	Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews	Hamidreza Kazemi Taskooh et.al.	2602.12778	null
2026-02-13	Mixture of Predefined Experts: Maximizing Data Usage on Vertical Federated Learning	Jon Irureta et.al.	2602.12708	null
2026-02-13	Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers	Anrui Chen et.al.	2602.12587	null
2026-02-13	SD-MoE: Spectral Decomposition for Effective Expert Specialization	Ruijun Huang et.al.	2602.12556	null
2026-02-13	Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR	Jaeyoung Lee et.al.	2602.12546	null
2026-02-12	Query-focused and Memory-aware Reranker for Long Context Processing	Yuqing Li et.al.	2602.12192	null
2026-02-12	Measurement of the singly Cabibbo-suppressed decay $Λ_c^+\to pη'$ with Deep Learning	BESIII Collaboration et.al.	2602.11974	null
2026-02-12	Extending Puzzle for Mixture-of-Experts Reasoning Models with Application to GPT-OSS Acceleration	Akhiad Bercovich et.al.	2602.11937	null
2026-02-12	Deep Kernel Fusion for Transformers	Zixi Zhang et.al.	2602.11808	null
2026-02-12	LAER-MoE: Load-Adaptive Expert Re-layout for Efficient Mixture-of-Experts Training	Xinyi Liu et.al.	2602.11686	null
2026-02-12	Evolutionary Router Feature Generation for Zero-Shot Graph Anomaly Detection with Mixture-of-Experts	Haiyang Jiang et.al.	2602.11622	null
2026-02-12	Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm	Jinrui Zhang et.al.	2602.11543	null
2026-02-12	Adaptive Milestone Reward for GUI Agents	Congmin Zheng et.al.	2602.11524	null
2026-02-12	Observation of a New Excited $Σ$ State in $ψ(3686)\to\bar{p}K^+Σ^0+c.c.$	BESIII Collaboration et.al.	2602.11501	null
2026-02-11	Charting Empirical Laws for LLM Fine-Tuning in Scientific Multi-Discipline Learning	Lintao Wang et.al.	2602.11215	null
2026-02-11	MoEEdit: Efficient and Routing-Stable Knowledge Editing for Mixture-of-Experts LLMs	Yupu Gu et.al.	2602.10965	null
2026-02-11	CMAD: Cooperative Multi-Agent Diffusion via Stochastic Optimal Control	Riccardo Barbano et.al.	2602.10933	null
2026-02-11	VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training	Guobin Shen et.al.	2602.10693	null
2026-02-11	Multimodal Priors-Augmented Text-Driven 3D Human-Object Interaction Generation	Yin Wang et.al.	2602.10659	null
2026-02-11	A Vision-Language Foundation Model for Zero-shot Clinical Collaboration and Automated Concept Discovery in Dermatology	Siyuan Yan et.al.	2602.10624	null
2026-02-11	Supercharging Packet-level Network Simulation of Large Model Training via Memoization and Fast-Forwarding	Fei Long et.al.	2602.10615	null
2026-02-11	Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters	Ailin Huang et.al.	2602.10604	null
2026-02-11	Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity	Guangzhi Xiong et.al.	2602.10585	null
2026-02-12	3DXTalker: Unifying Identity, Lip Sync, Emotion, and Spatial Dynamics in Expressive 3D Talking Avatars	Zhongju Wang et.al.	2602.10516	null
2026-02-10	Area-Efficient In-Memory Computing for Mixture-of-Experts via Multiplexing and Caching	Hanyuan Gao et.al.	2602.10254	link
2026-02-10	TDE 2025abcr: A Tidal Disruption Event in the Outskirts of a Massive Galaxy	Robert Stein et.al.	2602.10180	null
2026-02-10	MalMoE: Mixture-of-Experts Enhanced Encrypted Malicious Traffic Detection Under Graph Drift	Yunpeng Tan et.al.	2602.10157	null
2026-02-10	Diverse Skill Discovery for Quadruped Robots via Unsupervised Learning	Ruopeng Cui et.al.	2602.09767	null
2026-02-10	Revealing the Challenges of Attention-FFN Disaggregation for Modern MoE Models and Hardware Systems	Guowei Liu et.al.	2602.09721	null
2026-02-10	First observation of the $η_{c}\toΞ^{0} \barΞ^{0}$ decay	BESIII Collaboration et.al.	2602.09652	null
2026-02-10	DR.Experts: Differential Refinement of Distortion-Aware Experts for Blind Image Quality Assessment	Bohan Fu et.al.	2602.09531	null
2026-02-10	SMES: Towards Scalable Multi-Task Recommendation via Expert Sparsity	Yukun Zhang et.al.	2602.09386	null
2026-02-10	Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density	Zhendong Mi et.al.	2602.09316	null
2026-02-09	Generalizing GNNs with Tokenized Mixture of Experts	Xiaoguang Guo et.al.	2602.09258	null
2026-02-09	UI-Venus-1.5 Technical Report	Venus-Team et.al.	2602.09082	null
2026-02-09	DirMoE: Dirichlet-routed Mixture of Experts	Amirhossein Vahidi et.al.	2602.09001	null
2026-02-09	OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation	Yehua Huang et.al.	2602.08896	link
2026-02-09	FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models	Annemette Brok Pirchert et.al.	2602.08818	null
2026-02-10	MOVA: Towards Scalable and Synchronized Video-Audio Generation	SII-OpenMOSS Team et.al.	2602.08794	null
2026-02-10	Redundancy-Free View Alignment for Multimodal Human Activity Recognition with Arbitrarily Missing Views	Duc-Anh Nguyen et.al.	2602.08755	null
2026-02-09	Large Language Lobotomy: Jailbreaking Mixture-of-Experts via Expert Silencing	Jona te Lintelo et.al.	2602.08741	null
2026-02-09	6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks	Mohamed Amine Ferrag et.al.	2602.08675	null
2026-02-10	Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models	Mingzi Cao et.al.	2602.08658	null
2026-02-09	Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs	Yukun Jiang et.al.	2602.08621	null
2026-02-09	Giant Magnetocaloric Effect in a High-Spin Shastry-Sutherland Dipolar Magnet	Jianjian Gong et.al.	2602.08497	null
2026-02-09	TEAM: Temporal-Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration	Linye Wei et.al.	2602.08404	null
2026-02-09	Tighnari v2: Mitigating Label Noise and Distribution Shift in Multimodal Plant Distribution Prediction via Mixture of Experts and Weakly Supervised Learning	Haixu Liu et.al.	2602.08282	null
2026-02-09	Large Language Models in Peer-Run Community Behavioral Health Services: Understanding Peer Specialists and Service Users’ Perspectives on Opportunities, Risks, and Mitigation Strategies	Cindy Peng et.al.	2602.08187	null
2026-02-08	Multimodal normative modeling in Alzheimer's Disease with introspective variational autoencoders	Sayantan Kumar et.al.	2602.08077	null
2026-02-08	Efficient and Adaptable Detection of Malicious LLM Prompts via Bootstrap Aggregation	Shayan Ali Hassan et.al.	2602.08062	null
2026-02-08	Enhanced Mixture 3D CGAN for Completion and Generation of 3D Objects	Yahia Hamdi et.al.	2602.08046	null
2026-02-08	The Rise of Sparse Mixture-of-Experts: A Survey from Algorithmic Foundations to Decentralized Architectures and Vertical Domain Applications	Dong Pan et.al.	2602.08019	null
2026-02-08	Fast Model Selection and Stable Optimization for Softmax-Gated Multinomial-Logistic Mixture of Experts Models	TrungKhang Tran et.al.	2602.07997	null
2026-02-08	Thinking in Structures: Evaluating Spatial Intelligence through Reasoning on Constrained Manifolds	Chen Yang et.al.	2602.07864	null
2026-02-07	SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models	Juntong Wu et.al.	2602.07616	null
2026-02-07	MSN: A Memory-based Sparse Activation Scaling Framework for Large-scale Industrial Recommendation	Shikang Wu et.al.	2602.07526	null
2026-02-07	From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection	Mo Wang et.al.	2602.07497	null
2026-02-07	Wavelet-Domain Masked Image Modeling for Color-Consistent HDR Video Reconstruction	Yang Zhang et.al.	2602.07393	link
2026-02-07	When the Model Said ‘No Comment’, We Knew Helpfulness Was Dead, Honesty Was Alive, and Safety Was Terrified	Gautam Siddharth Kashyap et.al.	2602.07381	null
2026-02-07	Semantic Search At LinkedIn	Fedor Borisyuk et.al.	2602.07309	null
2026-02-06	XShare: Collaborative in-Batch Expert Sharing for Faster MoE Inference	Daniil Vankov et.al.	2602.07265	null
2026-02-06	DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos	Shenyuan Gao et.al.	2602.06949	null
2026-02-06	Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing	Meng Lou et.al.	2602.06862	null
2026-02-06	POP: Online Structural Pruning Enables Efficient Inference of Large Foundation Models	Yi Chen et.al.	2602.06822	null
2026-02-06	SaDiT: Efficient Protein Backbone Design via Latent Structural Tokenization and Diffusion Transformers	Shentong Mo et.al.	2602.06706	null
2026-02-06	Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making	Baichuan-M3 Team et.al.	2602.06570	null
2026-02-06	TokenMixer-Large: Scaling Up Large Ranking Models in Industrial Recommenders	Yuchen Jiang et.al.	2602.06563	null
2026-02-06	HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction	Shengxuan Qiu et.al.	2602.06527	null
2026-02-05	GRP-Obliteration: Unaligning LLMs With a Single Unlabeled Prompt	Mark Russinovich et.al.	2602.06258	null
2026-02-05	To 2:4 Sparsity and Beyond: Neuron-level Activation Function to Accelerate LLM Pre-Training	Meghana Madhyastha et.al.	2602.06183	null
2026-02-05	MoSE: Mixture of Slimmable Experts for Efficient and Adaptive Language Models	Nurbek Tastan et.al.	2602.06154	null
2026-02-05	OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale	Jingze Shi et.al.	2602.05711	null
2026-02-05	Hidden simplicity in AdS spinning Mellin amplitudes via scaffolding	Song He et.al.	2602.05568	null
2026-02-05	M$^2$-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining	Rui Lv et.al.	2602.05429	null
2026-02-05	Mergers Drive Structural Complexity but Not Starbursts in Lyman-$α$ Emitters at $3 < z < 4$: A JWST Spatially Resolved View	Qi Song et.al.	2602.05411	null
2026-02-05	Decision-Focused Sequential Experimental Design: A Directional Uncertainty-Guided Approach	Beichen Wan et.al.	2602.05340	null
2026-02-05	Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink	Guozhi Liu et.al.	2602.05228	null
2026-02-04	Rule-Based Spatial Mixture-of-Experts U-Net for Explainable Edge Detection	Bharadwaj Dogga et.al.	2602.05100	null
2026-02-04	Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism	Chenwei Cui et.al.	2602.04870	null
2026-02-04	PDF-HR: Pose Distance Fields for Humanoid Robots	Yi Gu et.al.	2602.04851	null
2026-02-04	ERNIE 5.0 Technical Report	Haifeng Wang et.al.	2602.04705	null
2026-02-04	Let Experts Feel Uncertainty: A Multi-Expert Label Distribution Approach to Probabilistic Time Series Forecasting	Zhen Zhou et.al.	2602.04678	null
2026-02-04	RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models	Jiacheng Liang et.al.	2602.04448	null
2026-02-04	Mixture of Masters: Sparse Chess Language Models with Player Routing	Giacomo Frisoni et.al.	2602.04447	null
2026-02-04	Study of $\barΛ$-$p$ Annihilation into Light Mesons	BESIII Collaboration et.al.	2602.04276	null
2026-02-04	Universal Quantized Berry-Dipole Flat Bands	Qingyang Mo et.al.	2602.04194	null
2026-02-04	OMG-Agent: Toward Robust Missing Modality Generation with Decoupled Coarse-to-Fine Agentic Workflows	Ruiting Dai et.al.	2602.04144	null
2026-02-04	Expert Selections In MoE Models Reveal (Almost) As Much As Text	Amir Nuriyev et.al.	2602.04105	null
2026-02-03	SpecMD: A Comprehensive Study On Speculative Expert Prefetching	Duc Hoang et.al.	2602.03921	null
2026-02-03	UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining	Changhao Wang et.al.	2602.03772	null
2026-02-03	HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing	Yizhao Gao et.al.	2602.03560	null
2026-02-03	DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs	Zeyu Zhu et.al.	2602.03495	null
2026-02-03	Scaling Continual Learning with Bi-Level Routing Mixture-of-Experts	Meng Lou et.al.	2602.03473	null
2026-02-03	VIRAL: Visual In-Context Reasoning via Analogy in Diffusion Transformers	Zhiwen Li et.al.	2602.03210	null
2026-02-03	Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry	Ye Su et.al.	2602.03204	null
2026-02-03	Aligning Forest and Trees in Images and Long Captions for Visually Grounded Understanding	Byeongju Woo et.al.	2602.02977	null
2026-02-02	Decision-Focused Optimal Transport	Suhan Liu et.al.	2602.02800	null
2026-02-02	Loss mechanisms of microwave frequency acoustic waves in thin film lithium niobate	Qixuan Lin et.al.	2602.02797	null
2026-02-02	SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning	Qifan Yu et.al.	2602.02472	null
2026-02-02	Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE	Yuanteng Chen et.al.	2602.02443	null
2026-02-02	DFKI-Speech System for WildSpoof Challenge: A robust framework for SASV In-the-Wild	Arnab Das et.al.	2602.02286	null
2026-02-02	MoLF: Mixture-of-Latent-Flow for Pan-Cancer Spatial Gene Expression Prediction from Histology	Susu Hu et.al.	2602.02282	null
2026-02-02	Kimi K2.5: Visual Agentic Intelligence	Kimi Team et.al.	2602.02276	null
2026-02-02	vLLM-Omni: Fully Disaggregated Serving for Any-to-Any Multimodal Models	Peiqi Yin et.al.	2602.02204	null
2026-02-02	No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs	Liyan Xu et.al.	2602.02103	null
2026-02-02	Edge-Aligned Initialization of Kernels for Steered Mixture-of-Experts	Martin Determann et.al.	2602.02031	null
2026-02-02	SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning	Zhen-Hao Xie et.al.	2602.01990	null
2026-02-02	Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition	Wonjun Lee et.al.	2602.01967	null
2026-02-02	SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures	Liangtao Lin et.al.	2602.01858	null
2026-02-02	From Knowing to Doing Precisely: A General Self-Correction and Termination Framework for VLA models	Wentao Zhang et.al.	2602.01811	null
2026-02-02	Mutual-Guided Expert Collaboration for Cross-Subject EEG Classification	Zhi Zhang et.al.	2602.01728	null
2026-02-02	AdNanny: One Reasoning LLM for All Offline Ads Recommendation Tasks	Nan Hu et.al.	2602.01563	null
2026-02-01	A Statistical Theory of Gated Attention through the Lens of Hierarchical Mixture of Experts	Viet Nguyen et.al.	2602.01468	null
2026-02-01	Rethinking Multinomial Logistic Mixture of Experts with Sigmoid Gating Function	Tuan Minh Pham et.al.	2602.01466	null
2026-02-01	Exposing and Defending the Achilles’ Heel of Video Mixture-of-Experts	Songping Wang et.al.	2602.01369	null
2026-02-01	Observation of $\barΛp\to K^{+}π^{+}π^{-}π^{0}$ and $\barΛp\to K^{+}π^{+}π^{-}2π^{0}$	BESIII Collaboration et.al.	2602.01282	null
2026-02-01	MiTA Attention: Efficient Fast-Weight Scaling via a Mixture of Top-$k$ Activations	Qishuai Wen et.al.	2602.01219	null
2026-02-01	Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse	Zizhuo Fu et.al.	2602.01203	null
2026-01-30	Omni-fMRI: A Universal Atlas-Free fMRI Foundation Model	Mo Wang et.al.	2601.23090	null
2026-01-30	UrbanMoE: A Sparse Multi-Modal Mixture-of-Experts Framework for Multi-Task Urban Region Profiling	Pingping Liu et.al.	2601.22746	null
2026-01-30	A Cross-Domain Graph Learning Protocol for Single-Step Molecular Geometry Refinement	Chengchun Liu et.al.	2601.22723	null
2026-01-30	A Step Back: Prefix Importance Ratio Stabilizes Policy Optimization	Shiye Lei et.al.	2601.22718	null
2026-01-30	A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation	Haonan He et.al.	2601.22708	null
2026-01-30	Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments	Jinwoo Jang et.al.	2601.22647	null
2026-01-30	SpanNorm: Reconciling Training Stability and Performance in Deep Transformers	Chao Wang et.al.	2601.22580	null
2026-01-30	SHED Light on Segmentation for Dense Prediction	Seung Hyun Lee et.al.	2601.22529	null
2026-01-30	Continual Policy Distillation from Distributed Reinforcement Learning Teachers	Yuxuan Li et.al.	2601.22475	null
2026-01-29	ECO: Quantized Training without Full-Precision Master Weights	Mahdi Nikdan et.al.	2601.22101	null
2026-01-29	Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference	Yiren Zhao et.al.	2601.22001	null
2026-01-29	MoE-ACT: Improving Surgical Imitation Learning Policies through Supervised Mixture-of-Experts	Lorenzo Mazza et.al.	2601.21971	null
2026-01-29	MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts	Evandro S. Ortigossa et.al.	2601.21866	null
2026-01-29	OneMall: One Model, More Scenarios – End-to-End Generative Recommender Family at Kuaishou E-Commerce	Kun Zhang et.al.	2601.21770	null
2026-01-29	Seg-MoE: Multi-Resolution Segment-wise Mixture-of-Experts for Time Series Forecasting Transformers	Evandro S. Ortigossa et.al.	2601.21641	null
2026-01-29	Depth-Recurrent Attention Mixtures: Giving Latent Reasoning the Attention it Deserves	Jonas Knupp et.al.	2601.21582	null
2026-01-29	Multi-Modal Time Series Prediction via Mixture of Modulated Experts	Lige Zhang et.al.	2601.21547	null
2026-01-29	ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory	Yang Zhao et.al.	2601.21545	null
2026-01-30	L$^3$: Large Lookup Layers	Albert Tseng et.al.	2601.21461	null
2026-01-29	ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation	Zihao Huang et.al.	2601.21420	null
2026-01-29	L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts	Minghao Yang et.al.	2601.21349	null
2026-01-29	Abstracting Robot Manipulation Skills via Mixture-of-Experts Diffusion Policies	Ce Hao et.al.	2601.21251	null
2026-01-29	Scaling Embeddings Outperforms Scaling Experts in Language Models	Hong Liu et.al.	2601.21204	null
2026-01-29	ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling	Yuchen Yang et.al.	2601.21198	null
2026-01-29	Precise measurements of $D^0 \to K^-\ell^+ν_\ell$ and $D^+ \to \bar K^0\ell^+ν_\ell$ decays	BESIII Collaboration et.al.	2601.21196	null
2026-01-29	Search for $ψ_0(4360)\rightarrow ηψ(2S)$ through the process $e^+e^- \rightarrow ηηψ(2S)$	BESIII Collaboration et.al.	2601.21190	null
2026-01-29	First Experimental Constraint on the Scalar Current in the $D^{0(+)}\to \bar K\ell^+ν_{\ell}$ Transition	BESIII Collaboration et.al.	2601.21185	null
2026-01-29	BrainStack: Neuro-MoE with Functionally Guided Expert Routing for EEG-Based Language Decoding	Ziyi Zhao et.al.	2601.21148	null
2026-01-29	TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning	Shicheng Fan et.al.	2601.21135	null
2026-01-28	ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler	Bohua Zou et.al.	2601.20755	null
2026-01-28	ShieldedCode: Learning Robust Representations for Virtual Machine Protected Code	Mingqiao Mo et.al.	2601.20679	null
2026-01-28	Unsupervised Ensemble Learning Through Deep Energy-based Models	Ariel Maymon et.al.	2601.20556	null
2026-01-28	OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution	Le Zhang et.al.	2601.20380	null
2026-01-28	OSDEnhancer: Taming Real-World Space-Time Video Super-Resolution with One-Step Diffusion	Shuoyan Wei et.al.	2601.20308	null
2026-01-28	MiLorE-SSL: Scaling Multilingual Capabilities in Self-Supervised Models without Forgetting	Jing Xu et.al.	2601.20300	null
2026-01-28	HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-BENCH	Yueyang Wang et.al.	2601.20255	null
2026-01-28	Hyperparameter Transfer with Mixture-of-Expert Layers	Tianze Jiang et.al.	2601.20205	null
2026-01-28	Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery	Zhipeng Zhang et.al.	2601.20193	null
2026-01-27	Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts	TrungKhang Tran et.al.	2601.19811	null
2026-01-27	Component-Level Lesioning of Language Models Reveals Clinically Aligned Aphasia Phenotypes	Yifan Wang et.al.	2601.19723	null
2026-01-27	LoPRo: Enhancing Low-Rank Quantization via Permuted Block-Wise Rotation	Hongyaoxing Gu et.al.	2601.19675	null
2026-01-27	GMS-CAVP: Improving Audio-Video Correspondence with Multi-Scale Contrastive and Generative Pretraining	Shentong Mo et.al.	2601.19606	null
2026-01-27	Search for the isospin-violating decays $\boldsymbol{χ_{cJ}\toΛ\barΣ^{0}+c.c.}$ and $\boldsymbol{η_{c}\toΛ\barΣ^{0}+c.c.}$	BESIII Collaboration et.al.	2601.19493	null
2026-01-27	Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition	Isha Pandey et.al.	2601.19451	null
2026-01-26	Superlinear Multi-Step Attention	Yufeng Huang et.al.	2601.18401	null
2026-01-26	FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning	Zhaopeng Qiu et.al.	2601.18150	null
2026-01-26	Beyond Static Datasets: Robust Offline Policy Optimization via Vetted Synthetic Transitions	Pedram Agand et.al.	2601.18107	null
2026-01-26	OneVoice: One Model, Triple Scenarios - Towards Unified Zero-shot Voice Conversion	Zhichao Wang et.al.	2601.18094	null
2026-01-26	LatentMoE: Toward Optimal Accuracy per FLOP and Parameter in Mixture of Experts	Venmugil Elango et.al.	2601.18089	null
2026-01-25	Domain-Expert-Guided Hybrid Mixture-of-Experts for Medical AI: Integrating Data-Driven Learning with Clinical Priors	Jinchen Gu et.al.	2601.17977	null
2026-01-25	EntWorld: A Holistic Environment and Benchmark for Verifiable Enterprise GUI Agents	Ying Mo et.al.	2601.17722	null
2026-01-25	$\infty$-MoE: Generalizing Mixture of Experts to Infinite Experts	Shota Takashiro et.al.	2601.17680	null
2026-01-25	Health-ORSC-Bench: A Benchmark for Measuring Over-Refusal and Safety Completion in Health Context	Zhihao Zhang et.al.	2601.17642	null
2026-01-24	PILOT: A Perceptive Integrated Low-level Controller for Loco-manipulation over Unstructured Scenes	Xinru Cui et.al.	2601.17440	null
2026-01-24	Topological Protection by Local Support Symmetry and Destructive Interference	Jun-Won Rhim et.al.	2601.17272	null
2026-01-23	Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts	Xuan-Phi Nguyen et.al.	2601.17111	null
2026-01-23	First evidence for $D_s^+ \to f_1(1420) e^+ν_e$ and search for $D_s^+ \to f_1(1285) e^+ν_e$	BESIII Collaboration et.al.	2601.16938	null
2026-01-23	Coarse-Grained Geometric Quantum Dynamics in the Tensor Network Representation	Mo Sha et.al.	2601.16913	null
2026-01-23	GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints	Andy Zhu et.al.	2601.16905	null
2026-01-23	Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation	Tims Pecerskis et.al.	2601.16863	null
2026-01-23	SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents	Yuhang Wang et.al.	2601.16746	null
2026-01-23	LongCat-Flash-Thinking-2601 Technical Report	Meituan LongCat Team et.al.	2601.16725	null
2026-01-23	Search for the radiative decay $D^+_s \to γK^*(892)^+$	BESIII Collaboration et.al.	2601.16476	null
2026-01-22	proto-Lightspeed: a high-speed, ultra-low read noise imager on the Magellan Clay Telescope	Christopher Layden et.al.	2601.16268	null
2026-01-22	Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning	Moo Jin Kim et.al.	2601.16163	null
2026-01-22	Universal Refusal Circuits Across LLMs: Cross-Model Transfer via Trajectory Replay and Concept-Basis Reconstruction	Tony Cristofano et.al.	2601.16034	null
2026-01-22	Search for the reaction channel $e^+ e^- \to ηη\,J/ψ$ and the isospin partner of the $Z_c(3900)$ at center-of-mass energies $\sqrt{s} = 4.226-4.950$ GeV	BESIII Collaboration et.al.	2601.15882	null
2026-01-22	LL-GaussianImage: Efficient Image Representation for Zero-shot Low-Light Enhancement with 2D Gaussian Splatting	Yuhan Chen et.al.	2601.15772	link
2026-01-22	Redshift-Binned Constraints on the Hubble Constant under $Λ$CDM, CPL, and Padé Cosmography	Zhi-Yuan Mo et.al.	2601.15765	null
2026-01-21	On the diagonal of low bidegree hypersurfaces	Morten Lüders et.al.	2601.15409	null
2026-01-21	Improving MoE Compute Efficiency by Composing Weight and Data Sparsity	Maciej Kilian et.al.	2601.15370	null
2026-01-21	Pb4U-GNet: Resolution-Adaptive Garment Simulation via Propagation-before-Update Graph Network	Aoran Liu et.al.	2601.15110	null
2026-01-21	Mixture-of-Experts Models in Vision: Routing, Optimization, and Generalization	Adam Rokah et.al.	2601.15021	null
2026-01-21	SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction	Kaixuan Zhang et.al.	2601.14910	null
2026-01-21	Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation	Rui Qi et.al.	2601.14896	null
2026-01-21	UBATrack: Spatio-Temporal State Space Model for General Multi-Modal Tracking	Qihua Liang et.al.	2601.14799	null
2026-01-21	UniRoute: Unified Routing Mixture-of-Experts for Modality-Adaptive Remote Sensing Change Detection	Qingling Shu et.al.	2601.14797	null
2026-01-21	Robustness of Mixtures of Experts to Feature Noise	Dong Sun et.al.	2601.14792	null
2026-01-21	Online Linear Programming with Replenishment	Yuze Chen et.al.	2601.14629	null
2026-01-20	$π$MPC: A Parallel-in-horizon and Construction-free NMPC Solver	Liang Wu et.al.	2601.14414	null
2026-01-20	Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models	YuanLab.ai et.al.	2601.14327	null
2026-01-20	LLMOrbit: A Circular Taxonomy of Large Language Models - From Scaling Walls to Agentic AI Systems	Badri N. Patro et.al.	2601.14053	null
2026-01-20	Understanding Multilingualism in Mixture-of-Experts LLMs: Routing Mechanism, Expert Specialization, and Layerwise Steering	Yuxin Chen et.al.	2601.14050	null
2026-01-20	DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging	Adrien Meyer et.al.	2601.13954	null
2026-01-20	The R2Pub Telescopes for Surveying: An Overview and Performance Evaluation of the System	Xuan Song et.al.	2601.13587	null
2026-01-20	ButterflyMoE: Sub-Linear Ternary Experts via Structured Butterfly Orbits	Aryan Karmore et.al.	2601.13563	null
2026-01-20	MN-TSG: Continuous Time Series Generation with Irregular Observations	Xu Zhang et.al.	2601.13534	null
2026-01-19	CLIP-Guided Adaptable Self-Supervised Learning for Human-Centric Visual Tasks	Mingshuang Luo et.al.	2601.13133	null
2026-01-19	Agentic Conversational Search with Contextualized Reasoning via Reinforcement Learning	Fengran Mo et.al.	2601.13115	null
2026-01-19	Polychronous Wave Computing: Timing-Native Address Selection in Spiking Networks	Natalia G. Berloff et.al.	2601.13079	null
2026-01-19	Synthesizing Strong-Coupling Kohn-Luttinger Superconductivity in 2D Van der Waals materials	Shi-Cong Mo et.al.	2601.13074	null
2026-01-19	PASs-MoE: Mitigating Misaligned Co-drift among Router and Experts via Pathway Activation Subspaces for Continual Learning	Zhiyan Hou et.al.	2601.13020	null
2026-01-19	HT-GNN: Hyper-Temporal Graph Neural Network for Customer Lifetime Value Prediction in Baidu Ads	Xiaohui Zhao et.al.	2601.13013	null
2026-01-19	OFA-MAS: One-for-All Multi-Agent System Topology Design based on Mixture-of-Experts Graph Generative Models	Shiyuan Li et.al.	2601.12996	null
2026-01-19	PhyG-MoE: A Physics-Guided Mixture-of-Experts Framework for Energy-Efficient GNSS Interference Recognition	Zhihan Zeng et.al.	2601.12798	null
2026-01-19	Topology-Aware Multiscale Mixture of Experts for Efficient Molecular Property Prediction	Long D. Nguyen et.al.	2601.12637	null
2026-01-18	A Mixture of Experts Vision Transformer for High-Fidelity Surface Code Decoding	Hoang Viet Nguyen et.al.	2601.12483	null
2026-01-18	Learning Diverse Skills for Behavior Models with Mixture of Experts	Wangtian Shen et.al.	2601.12397	null
2026-01-18	NADIR: Differential Attention Flow for Non-Autoregressive Transliteration in Indic Languages	Lakshya Tomar et.al.	2601.12389	null
2026-01-18	GazeFormer-MoE: Context-Aware Gaze Estimation via CLIP and MoE Transformer	Xinyuan Zhao et.al.	2601.12316	null
2026-01-18	Facet-Aware Multi-Head Mixture-of-Experts Model with Text-Enhanced Pre-training for Sequential Recommendation	Mingrui Liu et.al.	2601.12301	null
2026-01-16	Reasoning in Trees: Improving Retrieval-Augmented Generation for Multi-Hop Question Answering	Yuling Shi et.al.	2601.11255	null
2026-01-16	First Measurement of the Absolute Branching Fraction of $η_c \to γγ$	BESIII Collaboration et.al.	2601.11236	null
2026-01-16	Self-Augmented Mixture-of-Experts for QoS Prediction	Kecheng Cai et.al.	2601.11036	null
2026-01-16	RobuMTL: Enhancing Multi-Task Learning Robustness Against Weather Conditions	Tasneem Shaffee et.al.	2601.10921	null
2026-01-15	Search for sub-GeV dark particles in $η\toπ^0+\rm{invisible}$ decay	BESIII Collaboration et.al.	2601.10597	null
2026-01-15	Deterministic and scalable generation of large Fock states	Mo Xiong et.al.	2601.10559	null
2026-01-15	Algebraic Farkas Lemma and Strong Duality for Perturbed Conic Linear Programming	P. D. Khanh et.al.	2601.10390	null
2026-01-15	MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts	Yuxuan Lou et.al.	2601.10272	null
2026-01-15	A Highly Magnetic Ultra Massive White Dwarf with a 23-minute Rotation Period	Jincheng Guo et.al.	2601.10188	null
2026-01-15	What Gets Activated: Uncovering Domain and Driver Experts in MoE Language Models	Guimin Hu et.al.	2601.10159	null
2026-01-15	MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning	Yusong Wang et.al.	2601.10157	null
2026-01-15	Extremum Seeking Nonovershooting Control of Strict-Feedback Systems Under Unknown Control Direction	Kaixin Lu et.al.	2601.09998	null
2026-01-14	Progressive Mixture-of-Experts with autoencoder routing for continual RANS turbulence modelling	Haoyu Ji et.al.	2601.09305	null
2026-01-14	A Raman-Gas Spectral Compressor for High-Energy Femtosecond Laser Pulses	Zegui Wang et.al.	2601.09234	null
2026-01-15	A.X K1 Technical Report	Sung Jun Cheon et.al.	2601.09200	null
2026-01-14	WiFo-E: A Scalable Wireless Foundation Model for End-to-End FDD Precoding in Communication Networks	Weibo Wen et.al.	2601.09186	null
2026-01-14	Horseshoe Mixtures-of-Experts (HS-MoE)	Nick Polson et.al.	2601.09043	null
2026-01-13	OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG	Fengran Mo et.al.	2601.09028	null
2026-01-12	TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts	Yu Xu et.al.	2601.08881	null
2026-01-13	MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm	Bowen Zhou et.al.	2601.08800	null
2026-01-13	LWM-Spectro: A Foundation Model for Wireless Baseband Signal Spectrograms	Namhyun Kim et.al.	2601.08780	null
2026-01-13	M$^2$FMoE: Multi-Resolution Multi-View Frequency Mixture-of-Experts for Extreme-Adaptive Time Series Forecasting	Yaohui Huang et.al.	2601.08631	null
2026-01-13	Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances	Ziqi Ding et.al.	2601.08516	null
2026-01-13	Taxon: Hierarchical Tax Code Prediction with Semantically Aligned LLM Expert Guidance	Jihang Li et.al.	2601.08418	null
2026-01-13	Controlled LLM Training on Spectral Sphere	Tian Xie et.al.	2601.08393	null
2026-01-13	Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models	Bo Wang et.al.	2601.08383	null
2026-01-13	Towards Principled Design of Mixture-of-Experts Language Models under Memory and Inference Constraints	Seng Pei Liew et.al.	2601.08215	null
2026-01-12	Towards Specialized Generalists: A Multi-Task MoE-LoRA Framework for Domain-Specific LLM Adaptation	Yuxin Yang et.al.	2601.07935	null
2026-01-12	An eclipsing 8.56 minute orbital period mass-transferring binary	Emma T. Chickles et.al.	2601.07925	null
2026-01-12	Emotional Support Evaluation Framework via Controllable and Diverse Seeker Simulator	Chaewon Heo et.al.	2601.07698	null
2026-01-12	Amplitude analysis and branching fraction measurement of $J/ψ\to Λ\barΣ^0η+\mathrm{c.c}$	BESIII Collaboration et.al.	2601.07617	null
2026-01-12	Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models	Xin Cheng et.al.	2601.07372	null
2026-01-11	PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation	Yuanzhe Liu et.al.	2601.07060	null
2026-01-11	Solar Open Technical Report	Sungrae Park et.al.	2601.07022	null
2026-01-11	Deep Learning Based Channel Extrapolation for Dual-Band Massive MIMO Systems	Qikai Xiao et.al.	2601.06858	null
2026-01-11	MoE-DisCo: Low Economy Cost Training Mixture-of-Experts Models	Xin Ye et.al.	2601.06857	null
2026-01-11	MoEScore: Mixture-of-Experts-Based Text-Audio Relevance Score Prediction for Text-to-Audio System Evaluation	Bochao Sun et.al.	2601.06829	null
2026-01-11	SecMoE: Communication-Efficient Secure MoE Inference via Select-Then-Compute	Bowen Shen et.al.	2601.06790	null
2026-01-11	AutoTour: Automatic Photo Tour Guide with Smartphones and LLMs	Huatao Xu et.al.	2601.06781	null
2026-01-11	MTMCS-Bench: Evaluating Contextual Safety of Multimodal Large Language Models in Multi-Turn Dialogues	Zheyuan Liu et.al.	2601.06757	null
2026-01-10	R-Estimation with Right-Censored Data	Glen A. Satten et.al.	2601.06685	null
2026-01-10	Efficient and Reliable Estimation of Named Entity Linking Quality: A Case Study on GutBrainIE	Marco Martinelli et.al.	2601.06624	null
2026-01-10	Hellinger Multimodal Variational Autoencoders	Huyen Khanh Vo et.al.	2601.06572	null
2026-01-10	Physics-guided foundation model for universal speckle removal in ultrathin multimode fiber imaging	Xianrui Zeng et.al.	2601.06448	null
2026-01-10	The Promise of Time-Series Foundation Models for Agricultural Forecasting: Evidence from Marketing Year Average Prices	Le Wang et.al.	2601.06371	null
2026-01-09	Monkey Jump: MoE-Style PEFT for Efficient Multi-Task Learning	Nusrat Jahan Prottasha et.al.	2601.06356	null
2026-01-09	AIConfigurator: Lightning-Fast Configuration Optimization for Multi-Framework LLM Serving	Tianhao Xu et.al.	2601.06288	null
2026-01-09	Orchestrating Tokens and Sequences: Dynamic Hybrid Policy Optimization for RLVR	Zijun Min et.al.	2601.05607	null
2026-01-09	Buffered AUC maximization for scoring systems via mixed-integer optimization	Moe Shiina et.al.	2601.05544	null
2026-01-09	Scalable Heterogeneous Graph Learning via Heterogeneous-aware Orthogonal Prototype Experts	Wei Zhou et.al.	2601.05537	null
2026-01-08	MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs	Jiyuan Zhang et.al.	2601.05296	null
2026-01-08	MoE3D: A Mixture-of-Experts Module for 3D Reconstruction	Zichen Wang et.al.	2601.05208	null
2026-01-08	FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts	Yiji Zhao et.al.	2601.05174	link
2026-01-08	How to Set the Learning Rate for Large-Scale Pre-training?	Yunhua Zhou et.al.	2601.05049	null
2026-01-08	CuMA: Aligning LLMs with Sparse Cultural Values via Demographic-Aware Mixture of Adapters	Ao Sun et.al.	2601.04885	null
2026-01-08	DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation	Guanzhi Deng et.al.	2601.04823	null
2026-01-08	Users Mispredict Their Own Preferences for AI Writing Assistance	Vivian Lai et.al.	2601.04461	null
2026-01-08	Re-Rankers as Relevance Judges	Chuan Meng et.al.	2601.04455	null
2026-01-07	Transitive Expert Error and Routing Problems in Complex AI Systems	Forest Mars et.al.	2601.04416	null
2026-01-06	Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language Models	Brady Steele et.al.	2601.04254	null
2026-01-07	When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life	Xinyue Lou et.al.	2601.04043	null
2026-01-07	A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems	Qi Wu et.al.	2601.03992	null
2026-01-07	Spectral Manifold Regularization for Stable and Modular Routing in Deep MoE Architectures	Ibrahim Delibasoglu et.al.	2601.03889	null
2026-01-07	PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation	Wenlong Huang et.al.	2601.03782	null
2026-01-07	Variational Inference, Entropy, and Orthogonality: A Unified Theory of Mixture-of-Experts	Ye Su et.al.	2601.03577	null
2026-01-07	CALM: Culturally Self-Aware Language Models	Lingzhi Shen et.al.	2601.03483	link
2026-01-06	The Illusion of Specialization: Unveiling the Domain-Invariant “Standing Committee” in Mixture-of-Experts Models	Yan Wang et.al.	2601.03425	null
2026-01-06	AT2024wpp: An Extremely Luminous Fast Ultraviolet Transient Powered by Accretion onto a Black Hole	Daniel A. Perley et.al.	2601.03337	null
2026-01-06	ReCCur: A Recursive Corner-Case Curation Framework for Robust Vision-Language Understanding in Open and Edge Scenarios	Yihan Wei et.al.	2601.03011	null
2026-01-08	MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free	Yishu Lei et.al.	2601.02967	null
2026-01-06	MixTTE: Multi-Level Mixture-of-Experts for Scalable and Adaptive Travel Time Estimation	Wenzhao Jiang et.al.	2601.02943	null
2026-01-06	MiMo-V2-Flash Technical Report	Bangjun Xiao et.al.	2601.02780	null
2026-01-05	Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts	Boxuan Lyu et.al.	2601.02144	null
2026-01-05	Cross section measurement of $e^{+}e^{-}\rightarrow π^{0}π^{0}ψ(3686)$ from $\sqrt{s}=$ 4.008 GeV to 4.951 GeV	BESIII Collaboration et.al.	2601.02136	null
2026-01-07	FormuLLA: A Large Language Model Approach to Generating Novel 3D Printable Formulations	Adeshola Okubena et.al.	2601.02071	null
2026-01-05	GCR: Geometry-Consistent Routing for Task-Agnostic Continual Anomaly Detection	Joongwon Chae et.al.	2601.01856	null
2026-01-05	First Observation of $D^{0(+)}\to \bar Kωe^+ν_e$ and Determination of the Branching Fraction of $\bar K_1(1270)\to \bar K ω$	BESIII Collaboration et.al.	2601.01817	null
2026-01-05	Causality-Aware Temporal Projection for Video Understanding in Video-LLMs	Zhengjian Kang et.al.	2601.01804	null
2026-01-05	Measurements of the branching fractions of $χ_{cJ}\to 2K^+ 2K^- ω$ and $φK^+ K^- ω$ decays	BESIII Collaboration et.al.	2601.01758	null
2026-01-05	K-EXAONE Technical Report	Eunbi Choi et.al.	2601.01739	null
2026-01-05	Yuan3.0 Flash: An Open Multimodal Large Language Model for Enterprise Applications	YuanLab.ai et.al.	2601.01718	null
2026-01-05	Varying-Coefficient Mixture of Experts Model	Qicheng Zhao et.al.	2601.01699	null
2026-01-06	Measurements of the absolute branching fractions of the $Λ_{c}^{+}$ hadronic decays	BESIII Collaboration et.al.	2601.01503	null
2026-01-04	Multi-Subspace Multi-Modal Modeling for Diffusion Models: Estimation, Convergence and Mixture of Experts	Ruofeng Yang et.al.	2601.01475	null
2026-01-06	Making MoE-based LLM Inference Resilient with Tarragon	Songyu Zhang et.al.	2601.01310	null
2026-01-03	MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Efficient Clinical Assistance	Hamad Khan et.al.	2601.01260	null
2026-01-02	Reliability Under Randomness: An Empirical Analysis of Sparse and Dense Language Models Across Decoding Temperatures	Kabir Grover et.al.	2601.00942	null
2026-01-02	HFedMoE: Resource-aware Heterogeneous Federated Learning with Mixture-of-Experts	Zihan Fang et.al.	2601.00583	null
2026-01-02	A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR	Yuang Zheng et.al.	2601.00557	null
2026-01-01	Geometric Regularization in Mixture-of-Experts: The Disconnect Between Weights and Activations	Hyunjun Kim et.al.	2601.00457	null
2026-01-01	Traffic-MoE: A Sparse Foundation Model for Network Traffic Analysis	Jiajun Zhou et.al.	2601.00357	null
2026-01-01	Identification and Estimation under Multiple Versions of Treatment: Mixture-of-Experts Approach	Kohei Yoshikawa et.al.	2601.00287	null
2025-12-31	Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem	Weixun Wang et.al.	2512.24873	null
2025-12-31	Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models	Ákos Prucs et.al.	2512.24776	null
2025-12-30	Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning	Ziqing Fan et.al.	2512.24265	null
2025-12-30	Training Report of TeleChat3-MoE	Xinzhang Liu et.al.	2512.24157	null
2025-12-30	Skyrmion and Meron Crystals in Intermetallic Gd$_3$Ru$_4$Al$_{12}$: Microscopic Model Insights into Chiral Phases	Jiajun Mo et.al.	2512.24071	null
2025-12-30	RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress	Ruixuan Huang et.al.	2512.23995	null
2025-12-30	Towards a bottom-up formulation of spin kinetic theory	Zonglin Mo et.al.	2512.23960	null
2026-01-02	Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling	Chulun Zhou et.al.	2512.23959	null
2025-12-30	Learnable Query Aggregation with KV Routing for Cross-view Geo-localisation	Hualin Ye et.al.	2512.23938	null
2025-12-29	Observations of the Fermi bubbles and the Galactic center excess with the DArk Matter Particle Explorer	F. Alemanno et.al.	2512.23458	null
2025-12-29	Dynamic Subspace Composition: Efficient Adaptation via Contractive Basis Expansion	Vladimer Khasia et.al.	2512.23448	null
2025-12-29	Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss	Ang Lv et.al.	2512.23447	null
2025-12-29	Bitcoin-IPC: Scaling Bitcoin with a Network of Proof-of-Stake Subnets	Marko Vukolić et.al.	2512.23439	null
2025-12-29	Study of $\bar{K}^*(892)^0 η$ and $K_S^0 a_0(980)^0$ in the $D^{0} \to K_{S}^{0}π^0η$ decay	BESIII Collaboration et.al.	2512.23389	null
2025-12-30	YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection	Xu Lin et.al.	2512.23273	null
2025-12-28	Trust Region Masking for Long-Horizon LLM Reinforcement Learning	Yingru Li et.al.	2512.23075	null
2025-12-28	FLEX-MoE: Federated Mixture-of-Experts with Load-balanced Expert Assignment	Boyang Zhang et.al.	2512.23070	null
2025-12-28	Viability and Performance of a Private LLM Server for SMBs: A Benchmark Analysis of Qwen3-30B on Consumer-Grade Hardware	Alex Khalil et.al.	2512.23029	null
2025-12-28	Reach-Avoid Differential game with Reachability Analysis for UAVs: A decomposition approach	Minh Bui et.al.	2512.22793	null
2025-12-28	Text-Routed Sparse Mixture-of-Experts Model with Explanation and Temporal Alignment for Multi-Modal Sentiment Analysis	Dongning Rao et.al.	2512.22741	null
2025-12-27	RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure	Wei Gao et.al.	2512.22560	null
2025-12-27	Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection	Zihan Liu et.al.	2512.22483	null
2025-12-27	Bright 4B: Scaling Hyperspherical Learning for Segmentation in 3D Brightfield Microscopy	Amil Khan et.al.	2512.22423	null
2025-12-26	FUSCO: High-Performance Distributed Data Shuffling via Transformation-Communication Fusion	Zhuoran Zhu et.al.	2512.22036	null
2025-12-26	SWE-RM: Execution-free Feedback For Software Engineering Agents	KaShun Shum et.al.	2512.21919	null
2025-12-26	Accelerate Speculative Decoding with Sparse Computation in Verification	Jikai Wang et.al.	2512.21911	null
2025-12-26	MMCTOP: A Multimodal Textualization and Mixture-of-Experts Framework for Clinical Trial Outcome Prediction	Carolina Aparício et.al.	2512.21897	null
2025-12-26	CrownGen: Patient-customized Crown Generation via Point Diffusion Model	Juyoung Bae et.al.	2512.21890	null
2025-12-26	SLIM-Brain: A Data- and Training-Efficient Foundation Model for fMRI Data Analysis	Mo Wang et.al.	2512.21881	null
2025-12-25	Spatiotemporal-Untrammelled Mixture of Experts for Multi-Person Motion Prediction	Zheng Yin et.al.	2512.21707	null
2025-12-25	Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism	Xinglin Pan et.al.	2512.21487	null
2025-12-24	DeepCQ: General-Purpose Deep-Surrogate Framework for Lossy Compression Quality Prediction	Khondoker Mirazul Mumenin et.al.	2512.21433	null
2025-12-24	SparScene: Efficient Traffic Scene Representation via Sparse Graph Learning for Large-Scale Trajectory Generation	Xiaoyu Mo et.al.	2512.21133	null
2025-12-26	Identification with Orthogonal Basis Functions: Convergence Speed, Asymptotic Bias, and Rate-Optimal Pole Selection	Jiayun Li et.al.	2512.21096	null
2025-12-25	GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs	Lichao Wu et.al.	2512.21008	null
2025-12-24	SACodec: Asymmetric Quantization with Semantic Anchoring for Low-Bitrate High-Fidelity Neural Speech Codecs	Zhongren Dong et.al.	2512.20944	null
2025-12-24	RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks	Ningyuan Liu et.al.	2512.20920	null
2025-12-24	NVIDIA Nemotron 3: Efficient and Open Intelligence	NVIDIA et.al.	2512.20856	null
2025-12-23	Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning	NVIDIA et.al.	2512.20848	null
2025-12-23	Defending against adversarial attacks using mixture of experts	Mohammad Meymani et.al.	2512.20821	null
2025-12-23	MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts	Alexandros Christoforos et.al.	2512.20604	null
2025-12-23	Branch Learning in MRI: More Data, More Models, More Training	Yuyang Li et.al.	2512.20330	null
2025-12-23	Mixture-of-Experts with Gradient Conflict-Driven Subspace Topology Pruning for Emergent Modularity	Yuxing Gan et.al.	2512.20291	null
2025-12-23	Degradation-Aware Metric Prompting for Hyperspectral Image Restoration	Binfeng Wang et.al.	2512.20251	link
2025-12-23	AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model	Sofian Chaybouti et.al.	2512.20157	null
2025-12-23	Fun-Audio-Chat Technical Report	Qian Chen et.al.	2512.20156	null
2025-12-23	Bring My Cup! Personalizing Vision-Language-Action Models with Visual Attentive Prompting	Sangoh Lee et.al.	2512.20014	null
2025-12-23	Observation and branching fraction measurements of $χ_{cJ}\to p \bar p K^0_S K^0_S$	BESIII Collaboration et.al.	2512.19993	null
2025-12-22	UCCL-EP: Portable Expert-Parallel Communication	Ziming Mao et.al.	2512.19849	null
2025-12-21	How Many Experts Are Enough? Towards Optimal Semantic Specialization for Mixture-of-Experts	Sumin Park et.al.	2512.19765	null
2025-12-22	Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios	Jiawen Wang et.al.	2512.19551	null
2025-12-22	EGM: Efficiently Learning General Motion Tracking Policy for High Dynamic Humanoid Whole-Body Control	Chao Yang et.al.	2512.19043	null
2025-12-21	Tempo as the Stable Cue: Hierarchical Mixture of Tempo and Beat Experts for Music to 3D Dance Generation	Guangtao Lyu et.al.	2512.18804	null
2025-12-21	Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts	Linwei Qiu et.al.	2512.18718	null
2025-12-21	Remoe: Towards Efficient and Low-Cost MoE Inference in Serverless Computing	Wentao Liu et.al.	2512.18674	null
2025-12-21	Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach	Zhe Li et.al.	2512.18597	null
2025-12-20	Secret mixtures of experts inside your LLM	Enric Boix-Adsera et.al.	2512.18452	link
2025-12-20	MoE Pathfinder: Trajectory-driven Expert Pruning	Xican Yang et.al.	2512.18425	link
2025-12-20	MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation	Kaixing Yang et.al.	2512.18181	null
2025-12-20	Cross section and parametrization of charmonium decay	Xiao-Hu Mo et.al.	2512.18154	null
2025-12-19	MoE-TransMov: A Transformer-based Model for Next POI Prediction in Familiar & Unfamiliar Movements	Ruichen Tan et.al.	2512.17985	null
2025-12-19	Interpreting the strong clustering of ultra-diffuse galaxies by halo spin bias	Qinglin Ma et.al.	2512.17742	null
2025-12-19	Cross sections measurement of $e^+e^-\to Ξ(1530)^0\barΞ^0 + c.c.$ and search for $ψ(3770)\toΞ(1530)^0\barΞ^0 + c.c.$	BESIII Collaboration et.al.	2512.17275	null
2025-12-19	Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding	Yuqing Li et.al.	2512.17220	null
2025-12-19	Capturing Arbitrary Waveform without Absorption with Synthesis of Complex Frequencies	Zhaohua Tian et.al.	2512.17156	null
2025-12-18	Bandwidth-Efficient Adaptive Mixture-of-Experts via Low-Rank Compensation	Zhenyu Liu et.al.	2512.17073	null
2025-12-18	Compression is Routing: Reconstruction Error as an Intrinsic Signal for Modular Language Models	Zhongpan Tang et.al.	2512.16963	null
2025-12-18	LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation	Haichao Zhang et.al.	2512.16891	null
2025-12-18	The WINTER Observatory: A One-Degree InGaAs Survey Camera to study the Transient Infrared Sky	Danielle Frostig et.al.	2512.16753	null
2025-12-18	PoseMoE: Mixture-of-Experts Network for Monocular 3D Human Pose Estimation	Mengyuan Liu et.al.	2512.16494	null
2025-12-18	Efficient CPU-GPU Collaborative Inference for MoE-based LLMs on Memory-Limited Systems	En-Ming Huang et.al.	2512.16473	null
2025-12-18	Pretrained Battery Transformer (PBT): A battery life prediction foundation model	Ruifeng Tan et.al.	2512.16334	null
2025-12-19	Sigma-MoE-Tiny Technical Report	Qingguo Hu et.al.	2512.16248	null
2025-12-18	Open Ad-hoc Categorization with Contextualized Feature Learning	Zilin Wang et.al.	2512.16202	link
2025-12-18	INTELLECT-3: Technical Report	Prime Intellect Team et.al.	2512.16144	null
2025-12-17	Wake instability past a sphere settling in a strongly stratified flow	Chang-Fan Mo et.al.	2512.15626	null
2025-12-17	Measurements of the Absolute Branching Fraction of the Semileptonic Decay $\mathbf{Ξ^{-}\rightarrow Λe^- \barν_{e}}$ and the Axial Charge of the $\mathbfΞ^{-}$	BESIII Collaboration et.al.	2512.15273	null
2025-12-19	VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments	Yuze Wu et.al.	2512.15258	null
2025-12-17	Search for the decays $X(3872)\to K_{S}^{0}K^{\pm}π^{\mp}$ and $K^*(892)\bar{K}$ at BESIII	BESIII Collaboration et.al.	2512.15091	null
2025-12-19	Let the Barbarians In: How AI Can Accelerate Systems Performance Research	Audrey Cheng et.al.	2512.14806	null
2025-12-15	SocialNav-MoE: A Mixture-of-Experts Vision Language Model for Socially Compliant Navigation with Reinforcement Fine-Tuning	Tomohito Kawabata et.al.	2512.14757	null
2025-12-16	Measurements of the branching fractions of $χ_{cJ}\to φφη, φφη^{\prime}$ and $φK^+K^-η$	BESIII Collaboration et.al.	2512.14369	null
2025-12-16	SketchAssist: A Practical Assistant for Semantic Edits and Precise Local Redrawing	Han Zou et.al.	2512.14140	null
2025-12-16	SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations	Wentao Guo et.al.	2512.14080	null
2025-12-16	Sparsity-Controllable Dynamic Top-p MoE for Large Foundation Model Pre-training	Can Jin et.al.	2512.13996	null
2025-12-15	Connection between galaxy morphology and dark-matter halo structure II: predicting disk structure from dark-matter halo properties	Jinning Liang et.al.	2512.13822	null
2025-12-13	RAST-MoE-RL: A Regime-Aware Spatio-Temporal MoE Framework for Deep Reinforcement Learning in Ride-Hailing	Yuhan Tang et.al.	2512.13727	null
2025-12-15	StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion	Guransh Singh et.al.	2512.13632	null
2025-12-16	Janus: Disaggregating Attention and Experts for Scalable MoE Inference	Zhexiang Zhang et.al.	2512.13525	null
2025-12-15	SIGMA: An AI-Empowered Training Stack on Early-Life Hardware	Lei Qu et.al.	2512.13488	null
2025-12-15	Automated Information Flow Selection for Multi-scenario Multi-task Recommendation	Chaohua Yang et.al.	2512.13396	null
2025-12-15	Sharpen the Spec, Cut the Code: A Case for Generative File System with SYSSPEC	Qingyuan Liu et.al.	2512.13047	null
2025-12-15	Safe Control of Multi-Agent Systems with Minimal Communication	Mo Yang et.al.	2512.13021	null
2025-12-15	SliceMoE: Bit-Sliced Expert Caching under Miss-Rate Constraints for Efficient MoE Inference	Yuseon Choi et.al.	2512.12990	null
2025-12-14	Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution	Boyang Yan et.al.	2512.12806	null
2025-12-14	Bayesian Optimization Parameter Tuning Framework for a Lyapunov Based Path Following Controller	Zhewen Zheng et.al.	2512.12649	null
2025-12-13	Amplitude Analysis and Branching Fraction Measurement of $D^+ \to π^+π^0π^0$	BESIII Collaboration et.al.	2512.12397	null
2025-12-13	Fine-Grained Zero-Shot Learning with Attribute-Centric Representations	Zhi Chen et.al.	2512.12219	null
2025-12-13	ALERT Open Dataset and Input-Size-Agnostic Vision Transformer for Driver Activity Recognition using IR-UWB	Jeongjun Park et.al.	2512.12206	null
2025-12-13	MixtureKit: A General Framework for Composing, Training, and Visualizing Mixture-of-Experts Models	Ahmad Chamma et.al.	2512.12121	null
2025-12-12	Measurement of the cosmic ray nickel energy spectrum from 10 GeV/n to 2 TeV/n with the DAMPE	F. Alemanno et.al.	2512.11425	null
2025-12-11	Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration	Sicheng Mo et.al.	2512.10954	null
2025-12-11	Unleashing Degradation-Carrying Features in Symmetric U-Net: Simpler and Stronger Baselines for All-in-One Image Restoration	Wenlong Jiao et.al.	2512.10581	null
2025-12-11	Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment	Han Li et.al.	2512.10450	null
2025-12-12	Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge	Junjie Bai et.al.	2512.10071	null
2025-12-10	Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach	Salvador Carrión et.al.	2512.09910	null
2025-12-10	DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation	Zhizhong Wang et.al.	2512.09814	null
2025-12-10	M3Net: A Multi-Metric Mixture of Experts Network Digital Twin with Graph Neural Networks	Blessed Guda et.al.	2512.09797	null
2025-12-10	First measurement of the absolute branching fractions of $Σ^+$ nonleptonic decays and test of the $ΔI = 1/2$ rule	BESIII Collaboration et.al.	2512.09628	null
2025-12-10	FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model	Xiang Chen et.al.	2512.09282	null
2025-12-10	Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens	Yanpeng Yu et.al.	2512.09277	null
2025-12-10	Bug Priority Change Prediction: An Exploratory Study on Apache Software	Guangzong Cai et.al.	2512.09216	null
2025-12-09	Ask, Answer, and Detect: Role-Playing LLMs for Personality Detection with Question-Conditioned Mixture-of-Experts	Yifan Lyu et.al.	2512.08814	null
2025-12-09	What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance	Athena Psalta et.al.	2512.08697	null
2025-12-09	Prismatic World Model: Learning Compositional Dynamics for Planning in Hybrid Systems	Mingwei Li et.al.	2512.08411	null
2025-12-09	FastBEV++: Fast by Algorithm, Deployable by Design	Yuanpeng Chen et.al.	2512.08237	null
2025-12-08	Relational Visual Similarity	Thao Nguyen et.al.	2512.07833	null
2025-12-08	Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE	Anxiang Zeng et.al.	2512.07710	null
2025-12-08	LongCat-Image Technical Report	Meituan LongCat Team et.al.	2512.07584	null
2025-12-12	MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer	Penghui Liu et.al.	2512.07500	null
2025-12-08	Equivariant Diffusion for Crystal Structure Prediction	Peijia Lin et.al.	2512.07289	null
2025-12-08	Measurement of the branching fraction of $η\to μ^+ μ^-$ and search for $η\to e^+ e^-$	BESIII Collaboration et.al.	2512.07144	null
2025-12-09	TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning	Zebin Xing et.al.	2512.07135	null
2025-12-08	PlantBiMoE: A Bidirectional Foundation Model with SparseMoE for Plant Genomes	Kepeng Lin et.al.	2512.07113	null
2025-12-07	Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding	MinCheol Jeon et.al.	2512.06929	null
2025-12-07	Stable-MoE: Lyapunov-based Token Routing for Distributed Mixture-of-Experts Training over Edge Networks	Long Shi et.al.	2512.06784	null
2025-12-07	Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving	Wei-Bin Kou et.al.	2512.06664	null
2025-12-06	Enhancing Medical Cross-Modal Hashing Retrieval using Dropout-Voting Mixture-of-Experts Fusion	Jaewon Ahn et.al.	2512.06449	null
2025-12-04	The SAM2-to-SAM3 Gap in the Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation	Ranjan Sapkota et.al.	2512.06032	null
2025-12-05	HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies	Zhiying Du et.al.	2512.05693	link
2025-12-05	ProPhy: Progressive Physical Alignment for Dynamic World Simulation	Zijun Wang et.al.	2512.05564	null
2025-12-04	Evidence for the semileptonic decays $Λ_c^{+} \to Σ^{\pm} π^{\mp} e^+ ν_e$	BESIII Collaboration et.al.	2512.05178	null
2025-12-09	EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture	Xin He et.al.	2512.04810	null
2025-12-04	Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild	Yigui Feng et.al.	2512.04728	null
2025-12-04	Study of the reaction $Ξ^{0}n\rightarrowΛΛX$ using $Ξ^{0}$-nucleus scattering	BESIII Collaboration et.al.	2512.04701	null
2025-12-04	Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space	Joey Hong et.al.	2512.04601	null
2025-12-04	The Binary Fraction of Stars in the Dwarf Galaxy Ursa Minor via Dark Energy Spectroscopic Instrument	Tian Qiu et.al.	2512.04477	null
2025-12-04	Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems	Zehao Fan et.al.	2512.04476	null
2025-12-03	Small Models Achieve Large Language Model Performance: Evaluating Reasoning-Enabled AI for Secure Child Welfare Research	Zia Qi et.al.	2512.04261	null
2025-12-03	Decoding Large Language Diffusion Models with Foreseeing Movement	Yichuan Mo et.al.	2512.04135	null
2025-12-03	Stable Signer: Hierarchical Sign Language Generative Model	Sen Fang et.al.	2512.04048	null
2025-12-03	OD-MoE: On-Demand Expert Loading for Cacheless Edge-Distributed MoE Inference	Liujianfu Wang et.al.	2512.03927	null
2025-12-04	A Theoretical Framework for Auxiliary-Loss-Free Load Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models	X. Y. Han et.al.	2512.03915	null
2025-12-03	Parsimonious Clustering of Covariance Matrices	Yixi Xu et.al.	2512.03912	null
2025-12-03	Measurement of the hyperon weak radiative decay $Ξ^0\toγΣ^0$ at BESIII	BESIII Collaboration et.al.	2512.03877	null
2025-12-03	Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation	Subin Kim et.al.	2512.03534	null
2025-12-03	CellScout: Visual Analytics for Mining Biomarkers in Cell State Discovery	Rui Sheng et.al.	2512.03485	null
2025-12-03	Unconventional Magneto-Optical Effects in Altermagnets	Yongpan Li et.al.	2512.03435	null
2025-12-03	SSLfmm: An R Package for Semi-Supervised Learning with a Mixed-Missingness Mechanism in Finite Mixture Models	Geoffrey J. McLachlan et.al.	2512.03322	null
2025-12-02	Intrinsic Second-Order Topological Superconductors with Tunable Majorana Zero Modes	Xiao-Jiao Wang et.al.	2512.02775	null
2025-12-02	Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction	Xiang Yuan et.al.	2512.02584	link
2025-12-02	SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of Experts	Jiaqi Liu et.al.	2512.02517	link
2025-12-02	A Fully First-Order Layer for Differentiable Optimization	Zihao Zhao et.al.	2512.02494	null
2025-12-02	Quasi-steady electron-excitonic complexes coupling in a two-dimensional semiconductor	Shangkun Mo et.al.	2512.02490	null
2025-12-02	Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention	Wenyi Xiong et.al.	2512.02368	null
2025-12-02	Understanding and Harnessing Sparsity in Unified Multimodal Models	Shwai He et.al.	2512.02351	link
2025-12-02	OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning	Boyu Zhu et.al.	2512.02306	null
2025-12-01	Towards Unified Video Quality Assessment	Chen Feng et.al.	2512.02224	null
2025-12-01	ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation	Chenyang Gu et.al.	2512.02013	null
2025-12-01	Multimodal Mixture-of-Experts for ISAC in Low-Altitude Wireless Networks	Kai Zhang et.al.	2512.01750	null
2025-12-01	GRASP: Guided Residual Adapters with Sample-wise Partitioning	Felix Nützel et.al.	2512.01675	null
2025-12-01	Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery	Zhicheng Zhao et.al.	2512.01665	null
2025-12-01	Cuffless Blood Pressure Estimation from Six Wearable Sensor Modalities in Multi-Motion-State Scenarios	Yiqiao Chen et.al.	2512.01653	null
2025-12-01	Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track	Mo Chen et.al.	2512.01608	null
2025-12-01	Personalized optimization of pediatric HD-tDCS for dose consistency and target engagement	Zeming Liu et.al.	2512.01406	null
2025-12-02	Stabilizing Reinforcement Learning with LLMs: Formulation and Practices	Chujie Zheng et.al.	2512.01374	null
2025-12-01	TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking	Hanzhi Guo et.al.	2512.01329	null
2025-12-01	Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe	Yahui Liu et.al.	2512.01252	null
2025-11-30	Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios	Jianxiang Zang et.al.	2512.00920	null
2025-11-30	Elastic Mixture of Rank-Wise Experts for Knowledge Reuse in Federated Fine-Tuning	Yebo Wu et.al.	2512.00902	null
2025-11-30	Upcycled and Merged MoE Reward Model for Mitigating Reward Hacking	Lingling Fu et.al.	2512.00724	null
2025-11-29	GCMCG: A Clustering-Aware Graph Attention and Expert Fusion Network for Multi-Paradigm, Multi-task, and Cross-Subject EEG Decoding	Yiqiao Chen et.al.	2512.00574	null
2025-11-28	Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model	Junshu Tang et.al.	2511.23429	null
2025-11-28	LFM2 Technical Report	Alexander Amini et.al.	2511.23404	null
2025-11-28	Chart2Code-MoLA: Efficient Multi-Modal Code Generation via Adaptive Expert Routing	Yifei Wang et.al.	2511.23321	null
2025-11-28	Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models	Xiang Hu et.al.	2511.23319	null
2025-11-28	Multi-Modal Scene Graph with Kolmogorov-Arnold Experts for Audio-Visual Question Answering	Zijian Fu et.al.	2511.23304	null
2025-11-28	Experts are all you need: A Composable Framework for Large Language Model Inference	Shrihari Sridharan et.al.	2511.22955	null
2025-11-28	EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model	Yuhao Xu et.al.	2511.22935	null
2025-11-27	Architecture Decoupling Is Not All You Need For Unified Multimodal Model	Dian Zheng et.al.	2511.22663	null
2025-11-27	OmniInfer: System-Wide Acceleration Techniques for Optimizing LLM Serving Throughput and Latency	Jun Wang et.al.	2511.22481	null
2025-11-27	Foundation Model for Intelligent Wireless Communications	Boxun Liu et.al.	2511.22222	null
2025-11-27	MoE3D: Mixture of Experts meets Multi-Modal 3D Understanding	Yu Li et.al.	2511.22103	null
2025-11-27	Convergence Dynamics of Over-Parameterized Score Matching for a Single Gaussian	Yiran Zhang et.al.	2511.22069	null
2025-11-26	Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models	Naifu Zhang et.al.	2511.21663	null
2025-11-26	Continual Error Correction on Low-Resource Devices	Kirill Paramonov et.al.	2511.21652	null
2025-11-27	Qwen3-VL Technical Report	Shuai Bai et.al.	2511.21631	null
2025-11-26	Enhanced Landmark Detection Model in Pelvic Fluoroscopy using 2D/3D Registration Loss	Chou Mo et.al.	2511.21575	null
2025-11-26	Scaling limits of critical FK-decorated random planar maps with $q=4$	William Da Silva et.al.	2511.21480	null
2025-11-26	Study of the reactions $\bar{n} p \to 2π^{+}π^{-}$, $2π^{+}π^{-}π^{0}$, and $2π^{+}π^{-}2π^{0}$ using $J/ψ\to p π^{-}\bar{n}$	BESIII Collaboration et.al.	2511.21462	null
2025-11-26	MemFine: Memory-Aware Fine-Grained Scheduling for MoE Training	Lu Zhao et.al.	2511.21431	null
2025-11-26	Do Reasoning Vision-Language Models Inversely Scale in Test-Time Compute? A Distractor-centric Empirical Analysis	Jiyun Bae et.al.	2511.21397	null
2025-11-26	Conditional Generative Modeling of Stochastic LTI Systems: A Behavioral Approach	Jiayun Li et.al.	2511.21219	null
2025-11-26	MLPMoE: Zero-Shot Architectural Metamorphosis of Dense LLM MLPs into Static Mixture-of-Experts	Ivan Novikov et.al.	2511.21089	null
2025-11-25	HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation	Xiang Wang et.al.	2511.20520	null
2025-11-25	Soft Adaptive Policy Optimization	Chang Gao et.al.	2511.20347	null
2025-11-25	ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories	Hai Ling et.al.	2511.20169	null
2025-11-25	Adaptive Knowledge Transfer for Cross-Disciplinary Cold-Start Knowledge Tracing	Yulong Deng et.al.	2511.20009	null
2025-11-25	SONIC: Spectral Optimization of Noise for Inpainting with Consistency	Seungyeon Baek et.al.	2511.19985	null
2025-11-25	Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models	Wentao Hu et.al.	2511.19822	null
2025-11-22	Exploiting the Experts: Unauthorized Compression in MoE-LLMs	Pinaki Prasad Guha Neogi et.al.	2511.19480	null
2025-11-22	Tracking and Segmenting Anything in Any Modality	Tianlu Zhang et.al.	2511.19475	null
2025-11-24	Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling	Long Tang et.al.	2511.19024	null
2025-11-24	OrdMoE: Preference Alignment via Hierarchical Expert Group Ranking in Multimodal Mixture-of-Experts LLMs	Yuting Gao et.al.	2511.19023	null
2025-11-24	Dynamic Mixture of Experts Against Severe Distribution Shifts	Donghu Kim et.al.	2511.18987	null
2025-11-23	HiFi-MambaV2: Hierarchical Shared-Routed MoE for High-Fidelity MRI Reconstruction	Pengcheng Fang et.al.	2511.18534	null
2025-11-23	AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixture of Expert	Yuting Gao et.al.	2511.18314	null
2025-11-22	PromptMoE: Generalizable Zero-Shot Anomaly Detection via Visually-Guided Prompt Mixtures	Yuheng Shao et.al.	2511.18116	null
2025-11-22	CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking	Hao Li et.al.	2511.17967	null
2025-11-22	Measuring the Impact of Lexical Training Data Coverage on Hallucination Detection in Large Language Models	Shuo Zhang et.al.	2511.17946	null
2025-11-22	FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning	Guoyang Xia et.al.	2511.17885	null
2025-11-22	Equivalence of Context and Parameter Updates in Modern Transformer Blocks	Adrian Goldwaser et.al.	2511.17864	null
2025-11-21	Unified Class and Domain Incremental Learning with Mixture of Experts for Indoor Localization	Akhil Singampalli et.al.	2511.17829	null
2025-11-21	Boosting Brain-inspired Path Integration Efficiency via Learning-based Replication of Continuous Attractor Neurodynamics	Zhangyu Ge et.al.	2511.17687	null
2025-11-21	Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?	Sukwon Yun et.al.	2511.17400	null
2025-11-21	MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment	Huangbiao Xu et.al.	2511.17397	link
2025-11-21	Training Foundation Models on a Full-Stack AMD Platform: Compute, Networking, and System Design	Quentin Anthony et.al.	2511.17127	null
2025-11-21	Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters	Zhan Su et.al.	2511.17044	null
2025-11-21	VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions	Qianyi Shao et.al.	2511.16998	null
2025-11-21	RadioKMoE: Knowledge-Guided Radiomap Estimation with Kolmogorov-Arnold Networks and Mixture-of-Experts	Fupei Guo et.al.	2511.16986	null
2025-11-21	MicroMoE: Fine-Grained Load Balancing for Mixture-of-Experts with Token Scheduling	Chenqi Zhao et.al.	2511.16947	null
2025-11-20	Search for the charmonium weak decay $J/ψ\to\bar{D}^0\bar{K}^{*0}+{\rm c.c.}$	BESIII Collaboration et.al.	2511.16083	null
2025-11-20	Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution	Xiao He et.al.	2511.16024	null
2025-11-19	AquaSentinel: Next-Generation AI System Integrating Sensor Networks for Urban Underground Water Pipeline Anomaly Detection via Collaborative MoE-LLM Agent Architecture	Qiming Guo et.al.	2511.15870	null
2025-11-19	MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping	Yushi Huang et.al.	2511.15690	null
2025-11-19	Search for the lepton number violating process $Ξ^- \rightarrow Σ^+ e^- e^- +c.c.$	BESIII Collaboration et.al.	2511.15394	null
2025-11-19	VIRAL: Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation	Tairan He et.al.	2511.15200	null
2025-11-19	GPU-Initiated Networking for NCCL	Khaled Hamidouche et.al.	2511.15076	null
2025-11-19	WiCo-PG: Wireless Channel Foundation Model for Pathloss Map Generation via Synesthesia of Machines	Mingran Sun et.al.	2511.15030	null
2025-11-19	WiCo-MG: Wireless Channel Foundation Model for Multipath Generation via Synesthesia of Machines	Zengrui Han et.al.	2511.15026	null
2025-11-19	Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference	Kexin Chu et.al.	2511.15015	null
2025-11-18	HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation	Lai Wei et.al.	2511.14756	null
2025-11-18	Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching	Jintao Zhang et.al.	2511.14488	null
2025-11-18	MoE-SpeQ: Speculative Quantized Decoding with Proactive Expert Prefetching and Offloading for Mixture-of-Experts	Wenfeng Wang et.al.	2511.14102	null
2025-11-18	FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration	Jingren Liu et.al.	2511.14099	link
2025-11-18	SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts	Fan Zhang et.al.	2511.14093	null
2025-11-17	MoMoE: A Mixture of Expert Agent Model for Financial Sentiment Analysis	Peng Shu et.al.	2511.13983	null
2025-11-17	InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE	Lipeng Wang et.al.	2511.13488	null
2025-11-18	YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection	Ori Meiraz et.al.	2511.13344	null
2025-11-17	Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification	Rifen Lin et.al.	2511.13150	null
2025-11-17	Self-Adaptive Graph Mixture of Models	Mohit Meena et.al.	2511.13062	link
2025-11-17	Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based Recommendation	Yu Hou et.al.	2511.12922	null
2025-11-17	Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from Drawings	Zihao Lin et.al.	2511.12880	null
2025-11-16	Connectivity-Guided Sparsification of 2-FWL GNNs: Preserving Full Expressivity with Improved Efficiency	Rongqin Chen et.al.	2511.12838	null
2025-11-16	Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data	Yunxin Li et.al.	2511.12609	null
2025-11-16	SEMC: Structure-Enhanced Mixture-of-Experts Contrastive Learning for Ultrasound Standard Plane Recognition	Qing Cai et.al.	2511.12559	null
2025-11-16	MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven Semantics	Jing Li et.al.	2511.12525	link
2025-11-16	MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding	Zhanheng Nie et.al.	2511.12449	null
2025-11-16	Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection	Xi Xiao et.al.	2511.12410	link
2025-11-15	SAC-MoE: Reinforcement Learning with Mixture-of-Experts for Control of Hybrid Dynamical Systems with Uncertainty	Leroy D’Souza et.al.	2511.12361	null
2025-11-15	AMR-MoEGA: Antimicrobial Resistance Prediction using Mixture of Experts and Genetic Algorithms	Anshul Bagaria et.al.	2511.12223	null
2025-11-15	ViTE: Virtual Graph Trajectory Expert Router for Pedestrian Trajectory Prediction	Ruochen Li et.al.	2511.12214	null
2025-11-14	FarSkip-Collective: Unhobbling Blocking Communication in Mixture of Experts Models	Yonatan Dukler et.al.	2511.11505	null
2025-11-14	Rethinking Efficient Mixture-of-Experts for Remote Sensing Modality-Missing Classification	Qinghao Gao et.al.	2511.11460	null
2025-11-14	SPOT: Single-Shot Positioning via Trainable Near-Field Rainbow Beamforming	Yeyue Cai et.al.	2511.11391	null
2025-11-14	Parameter-Efficient MoE LoRA for Few-Shot Multi-Style Editing	Cong Cao et.al.	2511.11236	null
2025-11-14	DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding	Mingwei Xing et.al.	2511.11232	null
2025-11-14	ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization	Anzhe Cheng et.al.	2511.10971	null
2025-11-14	Go-UT-Bench: A Fine-Tuning Dataset for LLM-Based Unit Test Generation in Go	Yashshi Pipalani et.al.	2511.10868	null
2025-11-13	Generalizable Slum Detection from Satellite Imagery with Mixture-of-Experts	Sumin Lee et.al.	2511.10300	null
2025-11-13	RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo	Jueun Ko et.al.	2511.10107	null
2025-11-13	BuddyMoE: Exploiting Expert Redundancy to Accelerate Memory-Constrained Mixture-of-Experts Inference	Yun Wang et.al.	2511.10054	null
2025-11-14	HI-TransPA: Hearing Impairments Translation Personal Assistant	Zhiming Ma et.al.	2511.09915	link
2025-11-13	ConSurv: Multimodal Continual Learning for Survival Analysis	Dianzhi Yu et.al.	2511.09853	null
2025-11-11	Let the Experts Speak: Improving Survival Prediction & Calibration via Mixture-of-Experts Heads	Todd Morrill et.al.	2511.09567	null
2025-11-12	SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields	Sangheon Yang et.al.	2511.09072	null
2025-11-12	UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving	Ziyi Song et.al.	2511.09013	null
2025-11-12	Selective Sinkhorn Routing for Improved Sparse Mixture of Experts	Duc Anh Nguyen et.al.	2511.08972	null
2025-11-12	Bayesian Mixture of Experts For Large Language Models	Maryam Dialameh et.al.	2511.08968	null
2025-11-12	An Improved Dual-Attention Transformer-LSTM for Small-Sample Prediction of Modal Frequency and Actual Anchor Radius in Micro Hemispherical Resonator Design	Yuyi Yao et.al.	2511.08900	null
2025-11-11	OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild	Yuncheng Guo et.al.	2511.08423	null
2025-11-11	Text-based Aerial-Ground Person Retrieval	Xinyu Zhou et.al.	2511.08369	null
2025-11-14	Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency Differencing	Junkai Lu et.al.	2511.08229	null
2025-11-13	National Institute on Aging PREPARE Challenge: Early Detection of Cognitive Impairment Using Speech – The SpeechCARE Solution	Maryam Zolnoori et.al.	2511.08132	null
2025-11-13	Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression	Cheng Yuan et.al.	2511.08066	null
2025-11-11	TouchWalker: Real-Time Avatar Locomotion from Touchscreen Finger Walking	Geuntae Park et.al.	2511.07860	null
2025-11-10	One Router to Route Them All: Homogeneous Expert Routing for Heterogeneous Graph Transformers	Georgiy Shakirov et.al.	2511.07603	null
2025-11-12	Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs	Zhongyang Li et.al.	2511.07419	null
2025-11-11	Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction	Hyeryun Park et.al.	2511.07392	null
2025-11-10	AgenticSciML: Collaborative Multi-Agent Systems for Emergent Discovery in Scientific Machine Learning	Qile Jiang et.al.	2511.07262	null
2025-11-10	Two Heads are Better than One: Distilling Large Language Model Features Into Small Models with Feature Decomposition and Mixture	Tianhao Fu et.al.	2511.07110	null
2025-11-10	CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition	Hung-Yang Sung et.al.	2511.06860	null
2025-11-10	S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous Reasoning	Jiangwen Dong et.al.	2511.06727	null
2025-11-10	Multi-Modal Continual Learning via Cross-Modality Adapters and Representation Alignment with Knowledge Preservation	Evelyn Chee et.al.	2511.06723	null
2025-11-09	Route Experts by Sequence, not by Token	Tiansheng Wen et.al.	2511.06494	null
2025-11-09	HyMoERec: Hybrid Mixture-of-Experts for Sequential Recommendation	Kunrong Li et.al.	2511.06388	null
2025-11-09	DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation	Speed Zhu et.al.	2511.06307	null
2025-11-09	A Mixture-of-Experts Framework with Log-Logistic Components for Survival Analysis on Histopathology Images	Ardhendu Sekhar et.al.	2511.06266	null
2025-11-08	MoSKA: Mixture of Shared KV Attention for Efficient Long-Sequence LLM Inference	Myunghyun Rhee et.al.	2511.06010	null
2025-11-08	DiA-gnostic VLVAE: Disentangled Alignment-Constrained Vision Language Variational AutoEncoder for Robust Radiology Reporting with Missing Modalities	Nagur Shareef Shaik et.al.	2511.05968	null
2025-11-08	MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering	Jian Zhu et.al.	2511.05876	null
2025-11-08	In-depth Analysis on Caching and Pre-fetching in Mixture of Experts Offloading	Shuning Lin et.al.	2511.05814	null
2025-11-07	Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder	Zhen Xu et.al.	2511.05745	null
2025-11-07	BrainCSD: A Hierarchical Consistency-Driven MoE Foundation Model for Unified Connectome Synthesis and Multitask Brain Trait Prediction	Xiongri Shen et.al.	2511.05630	null
2025-11-07	Quantum-Uncertainty-Governed Spin Dynamics in s-d Coupled Systems	Jie Zheng et.al.	2511.05388	null
2025-11-07	OvA-LP: A Simple and Efficient Framework for Federated Learning on Non-IID Data	Dongjin Park et.al.	2511.05028	null
2025-11-07	MoE-DP: An MoE-Enhanced Diffusion Policy for Robust Long-Horizon Robotic Manipulation with Skill Decomposition and Failure Recovery	Baiye Cheng et.al.	2511.05007	null
2025-11-06	PuzzleMoE: Efficient Compression of Large Mixture-of-Experts Models via Sparse Expert Merging and Bit-packed inference	Yushu Zhao et.al.	2511.04805	null
2025-11-06	GNN-MoE: Context-Aware Patch Routing using GNNs for Parameter-Efficient Domain Generalization	Mahmoud Soliman et.al.	2511.04008	null
2025-11-05	GMoPE: A Prompt-Expert Mixture Framework for Graph Foundation Models	Zhibin Wang et.al.	2511.03251	null
2025-11-04	From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos	Xun Wang et.al.	2511.02762	null
2025-11-04	Verifying LLM Inference to Prevent Model Weight Exfiltration	Roy Rinberg et.al.	2511.02620	null
2025-11-04	RoME: Domain-Robust Mixture-of-Experts for MILP Solution Prediction across Domains	Tianle Pu et.al.	2511.02331	null
2025-11-04	FP8-Flow-MoE: A Casting-Free FP8 Recipe without Double Quantization Error	Fengjuan Wang et.al.	2511.02302	null
2025-11-04	Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining	Costin-Andrei Oncescu et.al.	2511.02237	null
2025-11-03	Towards Efficient Federated Learning of Networked Mixture-of-Experts for Mobile Edge Computing	Song Gao et.al.	2511.01743	null
2025-11-03	HMVLM: Human Motion-Vision-Language Model via MoE LoRA	Lei Hu et.al.	2511.01463	null
2025-11-04	CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing	Yifan Zhou et.al.	2511.01197	null
2025-11-03	DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection	Guoxin Ma et.al.	2511.01192	null
2025-11-01	OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback	Kai Luo et.al.	2511.00510	null
2025-10-31	LongCat-Flash-Omni Technical Report	Meituan LongCat Team et.al.	2511.00279	link
2025-10-31	Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals	Xiangyu Fan et.al.	2510.27684	null
2025-10-31	RDMA Point-to-Point Communication for LLM Systems	Nandor Licker et.al.	2510.27656	null
2025-10-31	MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts	Jingnan Gao et.al.	2510.27234	null
2025-10-31	AFM-Net: Advanced Fusing Hierarchical CNN Visual Priors with Global Sequence Modeling for Remote Sensing Image Scene Classification	Yuanhao Tang et.al.	2510.27155	null
2025-10-30	Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement	Aaditya Shukla et.al.	2510.27051	null
2025-10-30	Mixture-of-Transformers Learn Faster: A Theoretical Study on Classification Problems	Hongbo Li et.al.	2510.27004	null
2025-10-30	MoME: Mixture of Visual Language Medical Experts for Medical Imaging Segmentation	Arghavan Rezvani et.al.	2510.26996	null
2025-10-30	ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference	Zixu Shen et.al.	2510.26730	null
2025-10-30	Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications	Chuang Zhang et.al.	2510.26628	null
2025-10-30	Asymptotic meshes from $r$-variational adaptation methods for static problems in one dimension	Darith Hun et.al.	2510.26375	null
2025-10-30	MossNet: Mixture of State-Space Experts is a Multi-Head Attention	Shikhar Tuli et.al.	2510.26182	null
2025-10-29	Dual Mixture-of-Experts Framework for Discrete-Time Survival Analysis	Hyeonjun Lee et.al.	2510.26014	null
2025-10-31	Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training	Hong Wang et.al.	2510.25803	null
2025-10-29	Revisiting scalable sequential recommendation with Multi-Embedding Approach and Mixture-of-Experts	Qiushi Pan et.al.	2510.25285	null
2025-10-29	MoEntwine: Unleashing the Potential of Wafer-scale Chips for Large-scale Expert Parallel Inference	Xinru Tang et.al.	2510.25258	null
2025-10-29	H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts	Peilin Tan et.al.	2510.25091	null
2025-10-28	Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation	Inclusion AI et.al.	2510.24821	null
2025-10-28	Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance	Yujie Wei et.al.	2510.24711	null
2025-10-28	Language-Conditioned Representations and Mixture-of-Experts Policy for Robust Multi-Task Robotic Manipulation	Xiucheng Zhang et.al.	2510.24055	null
2025-10-26	Sparsity and Superposition in Mixture of Experts	Marmik Chaudhari et.al.	2510.23671	null
2025-10-27	EMTSF: Extraordinary Mixture of SOTA Models for Time Series Forecasting	Musleh Alharthi et.al.	2510.23396	null
2025-10-27	Rethinking GSPO: The Perplexity-Entropy Equivalence	Chi Liu et.al.	2510.23142	null
2025-10-27	Knocking-Heads Attention	Zhanchao Zhou et.al.	2510.23052	null
2025-10-27	Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts	Di Zhang et.al.	2510.23027	null
2025-10-27	MoEMeta: Mixture-of-Experts Meta Learning for Few-Shot Relational Learning	Han Wu et.al.	2510.23013	null
2025-10-25	Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation	Ling-Team et.al.	2510.22115	null
2025-10-23	Addressing Corner Cases in Autonomous Driving: A World Model-based Approach with Mixture of Experts and LLMs	Haicheng Liao et.al.	2510.21867	null
2025-10-24	PINN Balls: Scaling Second-Order Methods for PINNs with Domain Decomposition and Adaptive Sampling	Andrea Bonfanti et.al.	2510.21262	null
2025-10-24	Adaptive Graph Mixture of Residual Experts: Unsupervised Learning on Diverse Graphs with Heterogeneous Specialization	Yunlong Chu et.al.	2510.21207	null
2025-10-24	Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts	Yanguang Sun et.al.	2510.21114	null
2025-10-24	MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning	Siyong Chen et.al.	2510.21093	null
2025-10-23	Bayesian Jammer Localization with a Hybrid CNN and Path-Loss Mixture of Experts	Mariona Jaramillo-Civill et.al.	2510.20666	null
2025-10-23	xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion	Quan Li et.al.	2510.20651	null
2025-10-23	Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning	Xiaohan Lan et.al.	2510.20519	null
2025-10-23	A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization	LinFeng Li et.al.	2510.20291	null
2025-10-23	AsyncHZP: Hierarchical ZeRO Parallelism with Asynchronous Scheduling for Scalable LLM Training	Huawei Bai et.al.	2510.20111	null
2025-10-22	HybridEP: Scaling Expert Parallelism to Cross-Datacenter Scenario via Hybrid Expert/Data Transmission	Weihao Yang et.al.	2510.19470	null
2025-10-22	MoE-Prism: Disentangling Monolithic Experts for Elastic MoE Services via Model-System Co-Designs	Xinfeng Xia et.al.	2510.19366	null
2025-10-22	Modeling Turn-Taking with Semantically Informed Gestures	Varsha Suresh et.al.	2510.19350	null
2025-10-23	RailS: Load Balancing for All-to-All Communication in Distributed Mixture-of-Experts Training	Heng Xu et.al.	2510.19262	null
2025-10-22	A Design Science Blueprint for an Orchestrated AI Assistant in Doctoral Supervision	Teo Susnjak et.al.	2510.19227	null
2025-10-23	MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting	In-Hwan Jin et.al.	2510.19210	null
2025-10-25	Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model	Ling Team et.al.	2510.18855	null
2025-10-21	Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework	Yujie Xing et.al.	2510.18825	null
2025-10-21	Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification	Bin Gu et.al.	2510.18533	null
2025-10-21	Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study	Gangda Deng et.al.	2510.18370	null
2025-10-21	DeepSeek-OCR: Contexts Optical Compression	Haoran Wei et.al.	2510.18234	link
2025-10-22	L-MoE: End-to-End Training of a Lightweight Mixture of Low-Rank Adaptation Experts	Shihao Ji et.al.	2510.17898	null
2025-10-20	Towards 3D Objectness Learning in an Open World	Taichi Liu et.al.	2510.17686	null
2025-10-20	Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model	Xinwei Zhang et.al.	2510.17684	null
2025-10-20	Learned Inertial Odometry for Cycling Based on Mixture of Experts Algorithm	Hao Qiao et.al.	2510.17604	null
2025-10-23	Photon radiation induced by rescattering in strong-interacting medium with a magnetic field	Yue Zhang et.al.	2510.17597	null
2025-10-20	ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts	Zheyue Tan et.al.	2510.17483	null
2025-10-19	Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures	Pingzhi Li et.al.	2510.16968	null
2025-10-19	End-to-end Listen, Look, Speak and Act	Siyin Wang et.al.	2510.16756	null
2025-10-18	NeurIPT: Foundation Model for Neural Interfaces	Zitao Fang et.al.	2510.16548	link
2025-10-18	Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts	Yongxiang Hua et.al.	2510.16448	null
2025-10-18	Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures	Minh-Khoi Nguyen-Nhat et.al.	2510.16411	null
2025-10-17	Expert Merging in Sparse Mixture of Experts with Nash Bargaining	Dung V. Nguyen et.al.	2510.16138	null
2025-10-17	Human or AI? Comparing Design Thinking Assessments by Teaching Assistants and Bots	Sumbul Khan et.al.	2510.16069	null
2025-10-17	Mixture of Experts Approaches in Dense Retrieval Tasks	Effrosyni Sokli et.al.	2510.15683	null
2025-10-17	FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification	Zhen Sun et.al.	2510.15595	null
2025-10-17	Backdoor or Manipulation? Graph Mixture of Experts Can Defend Against Various Graph Adversarial Attacks	Yuyuan Feng et.al.	2510.15333	null
2025-10-17	MTmixAtt: Integrating Mixture-of-Experts with Multi-Mix Attention for Large-Scale Recommendation	Xianyang Qi et.al.	2510.15286	null
2025-10-17	Adaptive Individual Uncertainty under Out-Of-Distribution Shift with Expert-Routed Conformal Prediction	Amitesh Badkul et.al.	2510.15233	null
2025-10-16	Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models	Guinan Su et.al.	2510.14853	null
2025-10-16	MergeMoE: Efficient Compression of MoE Models via Expert Output Merging	Ruijie Miao et.al.	2510.14436	null
2025-10-16	Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning	Weijie Shen et.al.	2510.14300	null
2025-10-16	MACE: Mixture-of-Experts Accelerated Coordinate Encoding for Large-Scale Scene Localization and Rendering	Mingkai Liu et.al.	2510.14251	null
2025-10-16	Demonstrating Exoplanet Transit Photometry from Space with a 15-mm Aperture Optical Navigation Camera on Hayabusa2	Koki Yumoto et.al.	2510.14229	null
2025-10-15	REAP the Experts: Why Pruning Prevails for One-Shot MoE compression	Mike Lasby et.al.	2510.13999	null
2025-10-15	Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module	Ruitao Feng et.al.	2510.13558	null
2025-10-15	ExpressNet-MoE: A Hybrid Deep Neural Network for Emotion Recognition	Deeptimaan Banerjee et.al.	2510.13493	null
2025-10-15	Who Speaks for the Trigger? Dynamic Expert Routing in Backdoored Mixture-of-Experts Transformers	Xin Zhao et.al.	2510.13462	null
2025-10-15	Toward Efficient Inference Attacks: Shadow Model Sharing via Mixture-of-Experts	Li Bai et.al.	2510.13451	null
2025-10-15	UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE	Zhenyu Liu et.al.	2510.13344	null
2025-10-15	GatePro: Parameter-Free Expert Selection Optimization for Mixture-of-Experts Models	Chen Zheng et.al.	2510.13079	null
2025-10-17	Scope: Selective Cross-modal Orchestration of Visual Perception Experts	Tianyu Zhang et.al.	2510.12974	null
2025-10-14	Dendrograms of Mixing Measures for Softmax-Gated Gaussian Mixture of Experts: Consistency without Model Sweeps	Do Tien Hai et.al.	2510.12744	null
2025-10-14	Proof of Cloud: Data Center Execution Assurance for Confidential VMs	Filip Rezabek et.al.	2510.12469	null
2025-10-14	MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts	Yushu Zhao et.al.	2510.12357	null
2025-10-14	DE3S: Dual-Enhanced Soft-Sparse-Shape Learning for Medical Early Time-Series Classification	Tao Xie et.al.	2510.12214	null
2025-10-13	Enhancing the Quality of 3D Lunar Maps Using JAXA’s Kaguya Imagery	Yumi Iwashita et.al.	2510.11817	null
2025-10-13	Beyond ‘Templates’: Category-Agnostic Object Pose, Size, and Shape Estimation from a Single View	Jinyu Zhang et.al.	2510.11687	null
2025-10-13	Robust Ego-Exo Correspondence with Long-Term Memory	Yijun Hu et.al.	2510.11417	null
2025-10-13	Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers	Wenhan Ma et.al.	2510.11370	null
2025-10-13	What to expect from microscopic nuclear modelling for $k_{\rm eff}$ calculations?	D. Rochman et.al.	2510.11256	null
2025-10-13	DND: Boosting Large Language Models with Dynamic Nested Depth	Tieyuan Chen et.al.	2510.11001	null
2025-10-13	MC#: Mixture Compressor for Mixture-of-Experts Large Models	Wei Huang et.al.	2510.10962	null
2025-10-12	Crisis-Aware Regime-Conditioned Diffusion with CVaR Allocation	Ali Atiah Alzahrani et.al.	2510.10807	null
2025-10-12	Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection	Shizhen Zhao et.al.	2510.10584	null
2025-10-12	Hierarchical LoRA MoE for Efficient CTR Model Scaling	Zhichen Zeng et.al.	2510.10432	null
2025-10-11	SP-MoE: Speculative Decoding and Prefetching for Accelerating MoE-based Model Inference	Liangkun Chen et.al.	2510.10302	null
2025-10-10	MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest	Xiao Yang et.al.	2510.09857	null
2025-10-10	ARROW: An Adaptive Rollout and Routing Method for Global Weather Forecasting	Jindong Tian et.al.	2510.09734	null
2025-10-10	Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation	Youwei Zheng et.al.	2510.09094	null
2025-10-09	LinearSR: Unlocking Linear Attention for Stable and Efficient Image Super-Resolution	Xiaohui Li et.al.	2510.08771	null
2025-10-13	dInfer: An Efficient Inference Framework for Diffusion Language Models	Yuxin Ma et.al.	2510.08666	null
2025-10-08	Dynamic Mixture-of-Experts for Visual Autoregressive Model	Jort Vincenti et.al.	2510.08629	null
2025-10-09	FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts	Heming Zou et.al.	2510.08396	link
2025-10-09	Mix- and MoE-DPO: A Variational Inference Approach to Direct Preference Optimization	Jason Bohne et.al.	2510.08256	null
2025-10-09	From Tokens to Layers: Redefining Stall-Free Scheduling for LLM Serving with Layered Prefill	Gunjun Lee et.al.	2510.08055	null
2025-10-09	Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training	Ruizhe Wang et.al.	2510.08008	null
2025-10-09	Multilingual Knowledge Graph Completion via Efficient Multilingual Knowledge Sharing	Cunli Mao et.al.	2510.07736	null
2025-10-09	Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision	Xiaoxu Ma et.al.	2510.07703	null
2025-10-09	LiveThinking: Enabling Real-Time Efficient Reasoning for AI-Powered Livestreaming via Reinforcement Learning	Yuhan Sun et.al.	2510.07685	null
2025-10-08	MoGU: Mixture-of-Gaussians with Uncertainty-based Gating for Time Series Forecasting	Yoli Shavit et.al.	2510.07459	null
2025-10-08	Less is More: Strategic Expert Selection Outperforms Ensemble Complexity in Traffic Forecasting	Walid Guettala et.al.	2510.07426	null
2025-10-08	Guided by the Experts: Provable Feature Learning Dynamic of Soft-Routed Mixture-of-Experts	Fangshuo Liao et.al.	2510.07205	null
2025-10-08	A Bridge from Audio to Video: Phoneme-Viseme Alignment Allows Every Face to Speak Multiple Languages	Zibo Su et.al.	2510.06612	null
2025-10-09	SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation	Shuang Cheng et.al.	2510.06303	null
2025-10-06	Reproducibility Study of “XRec: Large Language Models for Explainable Recommendation”	Ranjan Mishra et.al.	2510.06275	null
2025-10-10	Barbarians at the Gate: How AI is Upending Systems Research	Audrey Cheng et.al.	2510.06189	null
2025-10-07	CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits	Kangyu Wang et.al.	2510.06133	null
2025-10-07	Rasterized Steered Mixture of Experts for Efficient 2D Image Regression	Yi-Hsin Li et.al.	2510.05814	null
2025-10-07	Mixture of Neuron Experts	Runxi Cheng et.al.	2510.05781	link
2025-10-07	MSF-SER: Enriching Acoustic Modeling with Multi-Granularity Semantics for Speech Emotion Recognition	Haoxun Li et.al.	2510.05749	null
2025-10-07	Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting	Zhongkai Yu et.al.	2510.05497	null
2025-10-06	Stratum: System-Hardware Co-Design with Tiered Monolithic 3D-Stackable DRAM for Efficient MoE Serving	Yue Pan et.al.	2510.05245	null
2025-10-06	REN: Anatomically-Informed Mixture-of-Experts for Interstitial Lung Disease Diagnosis	Alec K. Peltekian et.al.	2510.04923	null
2025-10-06	LMM-Incentive: Large Multimodal Model-based Incentive Design for User-Generated Content in Web 3.0	Jinbo Wen et.al.	2510.04765	null
2025-10-06	Multilingual Routing in Mixture-of-Experts	Lucas Bandarkar et.al.	2510.04694	null
2025-10-06	Improving Multimodal Brain Encoding Model with Dynamic Subject-awareness Routing	Xuanhua Yin et.al.	2510.04670	null
2025-10-06	Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space	Tomas Figliolia et.al.	2510.04476	null
2025-10-05	HoRA: Cross-Head Low-Rank Adaptation with Joint Hypernetworks	Nghiem T. Diep et.al.	2510.04295	null
2025-10-05	SliceMoE: Routing Embedding Slices Instead of Tokens for Fine-Grained and Balanced Transformer Scaling	Harshil Vejendla et.al.	2510.04286	null
2025-10-05	MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition	Umberto Cappellazzo et.al.	2510.04136	null
2025-10-03	Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective	Yehuda Dar et.al.	2510.03151	null
2025-10-02	ElasticMoE: An Efficient Auto Scaling Method for Mixture-of-Experts Models	Gursimran Singh et.al.	2510.02613	null
2025-10-02	UpSafe$^\circ$C: Upcycling for Controllable Safety in Large Language Models	Yuhao Sun et.al.	2510.02194	null
2025-10-02	LadderMoE: Ladder-Side Mixture of Experts Adapters for Bronze Inscription Recognition	Rixin Zhou et.al.	2510.01651	null
2025-10-01	Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs	Leyla Mirvakhabova et.al.	2510.01185	null
2025-10-01	Learning Compact Representations of LLM Abilities via Item Response Theory	Jianhao Chen et.al.	2510.00844	null
2025-10-01	Graph Integrated Multimodal Concept Bottleneck Model	Jiakai Lin et.al.	2510.00701	null
2025-10-01	FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression	Yifei Gao et.al.	2510.00621	null
2025-10-01	Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning	Minghao Yang et.al.	2510.00570	null
2025-10-07	FlowMoE: A Scalable Pipeline Scheduling Framework for Distributed Mixture-of-Experts Training	Yunqi Gao et.al.	2510.00207	null
2025-09-30	Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization	Yaoxiang Wang et.al.	2509.26520	null
2025-09-30	Nephrobase Cell+: Multimodal Single-Cell Foundation Model for Decoding Kidney Biology	Chenyu Li et.al.	2509.26223	null
2025-09-30	Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline	Haiyang Li et.al.	2509.25991	null
2025-09-30	UniMMAD: Unified Multi-Modal and Multi-Class Anomaly Detection via MoE-Driven Feature Decompression	Yuan Zhao et.al.	2509.25934	null
2025-09-30	Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel	Chuanyang Zheng et.al.	2509.25913	null
2025-10-01	A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI	Arvind Murari Vepa et.al.	2509.25889	null
2025-09-30	Collaborative Compression for Large-Scale MoE Deployment on Edge	Yixiao Chen et.al.	2509.25689	link
2025-09-30	LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts	Yuan Zhuang et.al.	2509.25684	null
2025-09-30	Guiding Mixture-of-Experts with Temporal Multimodal Interactions	Xing Han et.al.	2509.25678	null
2025-09-29	K-Prism: A Knowledge-Guided and Prompt Integrated Universal Medical Image Segmentation Model	Bangwei Guo et.al.	2509.25594	null
2025-09-29	MAESTRO: Adaptive Sparse Attention and Robust Learning for Multimodal Dynamic Time Series	Payal Mohapatra et.al.	2509.25278	null
2025-09-29	GRACE-MoE: Grouping and Replication with Locality-Aware Routing for Efficient Distributed MoE Inference	Yu Han et.al.	2509.25041	null
2025-09-29	LEAF: A Robust Expert-Based Framework for Few-Shot Continual Event Detection	Bao-Ngoc Dao et.al.	2509.24547	null
2025-09-29	One-Prompt Strikes Back: Sparse Mixture of Experts for Prompt-based Continual Learning	Minh Le et.al.	2509.24483	null
2025-09-29	Muon: Training and Trade-offs with Latent Attention and MoE	Sushant Mehta et.al.	2509.24406	null
2025-09-29	LLaDA-MoE: A Sparse MoE Diffusion Language Model	Fengqi Zhu et.al.	2509.24389	null
2025-09-29	Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning	Zhisheng Chen et.al.	2509.24222	null
2025-09-28	HunyuanImage 3.0 Technical Report	Siyu Cao et.al.	2509.23951	null
2025-09-28	Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms	Jiahao Ying et.al.	2509.23933	link
2025-09-28	Bayesian Mixture-of-Experts: Towards Making LLMs Know What They Don’t Know	Albus Yizhuo Li et.al.	2509.23830	link
2025-09-28	A Modality-Tailored Graph Modeling Framework for Urban Region Representation via Contrastive Learning	Yaya Zhao et.al.	2509.23772	null
2025-09-28	Towards a Comprehensive Scaling Law of Mixture-of-Experts	Guoliang Zhao et.al.	2509.23678	null
2025-09-28	PreScope: Unleashing the Power of Prefetching for Resource-Constrained MoE Inference	Enda Yu et.al.	2509.23638	null
2025-09-27	Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions	Mingyi Luo et.al.	2509.23248	null
2025-09-27	MoE-PHDS: One MoE checkpoint for flexible runtime sparsity	Lauren A. Hannah et.al.	2509.23012	null
2025-09-26	Tiny-QMoE	Jack Cashman et.al.	2509.22951	null
2025-09-26	Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time	Yixuan Han et.al.	2509.22572	null
2025-09-26	Learning to Ball: Composing Policies for Long-Horizon Basketball Moves	Pei Xu et.al.	2509.22442	link
2025-09-26	Role-Aware Multi-modal federated learning system for detecting phishing webpages	Bo Wang et.al.	2509.22369	null
2025-09-26	HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space	Ke Li et.al.	2509.22299	link
2025-09-26	Unlocking the Power of Mixture-of-Experts for Task-Aware Time Series Analytics	Xingjian Wu et.al.	2509.22279	null
2025-09-26	MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning	Tao Wu et.al.	2509.21953	link
2025-09-26	Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts	Naibin Gu et.al.	2509.21892	null
2025-09-26	ChaosNexus: A Foundation Model for Universal Chaotic System Forecasting with Multi-scale Representations	Chang Liu et.al.	2509.21802	null
2025-09-26	LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE	Yu Shang et.al.	2509.21790	null
2025-09-24	MIXRAG: Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering	Lihui Liu et.al.	2509.21391	null
2025-09-25	Distributed Specialization: Rare-Token Neurons in Large Language Models	Jing Liu et.al.	2509.21163	null
2025-09-26	Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns	Xuemiao Zhang et.al.	2509.21124	null
2025-09-25	Physics Informed Neural Networks for design optimisation of diamond particle detectors for charged particle fast-tracking at high luminosity hadron colliders	Alessandro Bombini et.al.	2509.21123	null
2025-09-24	Dynamic Reasoning Chains through Depth-Specialized Mixture-of-Experts in Transformer Architectures	Sampurna Roy et.al.	2509.20577	null
2025-09-24	Developer Productivity With and Without GitHub Copilot: A Longitudinal Mixed-Methods Case Study	Viktoria Stray et.al.	2509.20353	null
2025-09-24	SHMoAReg: Spark Deformable Image Registration via Spatial Heterogeneous Mixture of Experts and Attention Heads	Yuxi Zheng et.al.	2509.20073	null
2025-09-24	Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference	Ziyi Han et.al.	2509.19781	null
2025-09-23	Human-AI Narrative Synthesis to Foster Shared Understanding in Civic Decision-Making	Cassandra Overney et.al.	2509.19643	null
2025-09-21	A Statistical Mixture-of-Experts Framework for EMG Artifact Removal in EEG: Empirical Insights and a Proof-of-Concept Application	Benjamin J. Choi et.al.	2509.19385	null
2025-09-23	DevFD: Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces	Tianshuo Zhang et.al.	2509.19230	link
2025-09-23	Frequency-Domain Decomposition and Recomposition for Robust Audio-Visual Segmentation	Yunzhe Shen et.al.	2509.18912	null
2025-09-23	LongCat-Flash-Thinking Technical Report	Meituan LongCat Team et.al.	2509.18883	null
2025-09-23	PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving	Chengran Yuan et.al.	2509.18609	null
2025-09-23	Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts	Qi Wang et.al.	2509.18542	null
2025-09-23	StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models	Haoxin Yang et.al.	2509.17993	null
2025-09-23	Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark	Siu Hang Ho et.al.	2509.17894	link
2025-09-22	Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving	Ziming Liu et.al.	2509.17863	null
2025-09-22	SSNet: Flexible and robust channel extrapolation for fluid antenna systems enabled by a self-supervised learning framework	Yuan Gao et.al.	2509.17797	null
2025-09-22	Qwen3-Omni Technical Report	Jin Xu et.al.	2509.17765	null
2025-09-22	Attention-based Mixture of Experts for Robust Speech Deepfake Detection	Viola Negroni et.al.	2509.17585	null
2025-09-22	Robust Mixture Models for Algorithmic Fairness Under Latent Heterogeneity	Siqi Li et.al.	2509.17411	link
2025-09-21	MoEs Are Stronger than You Think: Hyper-Parallel Inference Scaling with RoE	Soheil Zibakhsh et.al.	2509.17238	null
2025-09-21	A community-driven optimization framework for redrawing school attendance boundaries	Hongzhao Guan et.al.	2509.17130	link
2025-09-21	CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception	Lingzhao Kong et.al.	2509.17107	null
2025-09-21	Dynamic Expert Specialization: Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation	Junzhuo Li et.al.	2509.16882	null
2025-09-20	KungfuBot2: Learning Versatile Motion Skills for Humanoid Whole-Body Control	Jinrui Han et.al.	2509.16638	null
2025-09-19	DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning	Sikai Bai et.al.	2509.16105	null
2025-09-19	MoE-CE: Enhancing Generalization for Deep Learning based Channel Estimation via a Mixture-of-Experts Framework	Tianyu Li et.al.	2509.15964	null
2025-09-19	pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation	Tong Wang et.al.	2509.15638	null
2025-09-19	MEC-Quant: Maximum Entropy Coding for Extremely Low Bit Quantization-Aware Training	Junbiao Pang et.al.	2509.15514	null
2025-09-18	SPH-Net: A Co-Attention Hybrid Model for Accurate Stock Price Prediction	Yiyang Wu et.al.	2509.15414	null
2025-09-18	Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing	Zichen Wu et.al.	2509.15361	null
2025-09-18	Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting	Liran Nochumsohn et.al.	2509.15105	null
2025-09-18	Adaptive LoRA Experts Allocation and Selection for Federated Fine-Tuning	Lei Wang et.al.	2509.15087	null
2025-09-18	EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence	Chaoyin She et.al.	2509.14977	null
2025-09-18	FURINA: Free from Unmergeable Router via LINear Aggregation of mixed experts	Jiayi Han et.al.	2509.14900	null
2025-09-18	CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human	Nan Sun et.al.	2509.14889	null
2025-09-15	SparseDoctor: Towards Efficient Chat Doctor with Mixture of Experts Enhanced Large Language Models	Zhang Jianbin et.al.	2509.14269	null
2025-09-17	CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts	Leonard Hackel et.al.	2509.14104	null
2025-09-18	SAIL-VL2 Technical Report	Weijie Yin et.al.	2509.14033	null
2025-09-17	Mixture of Low-Rank Adapter Experts in Generalizable Audio Deepfake Detection	Janne Laakkonen et.al.	2509.13878	null
2025-09-17	Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation	Nguyen Lan Vi Vu et.al.	2509.13834	null
2025-09-18	Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers	Manan Mittal et.al.	2509.13548	null
2025-09-18	GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR	Yujie Guo et.al.	2509.13093	null
2025-09-16	Dual-Stage Reweighted MoE for Long-Tailed Egocentric Mistake Detection	Boyu Han et.al.	2509.12990	null
2025-09-16	Bridging Perception and Planning: Towards End-to-End Planning for Signal Temporal Logic Tasks	Bowen Ye et.al.	2509.12813	null
2025-09-16	MEGAN: Mixture of Experts for Robust Uncertainty Estimation in Endoscopy Videos	Damola Agbelese et.al.	2509.12772	null
2025-09-17	NavMoE: Hybrid Model- and Learning-based Traversability Estimation for Local Navigation via Mixture of Experts	Botao He et.al.	2509.12747	null
2025-09-16	AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models	Heng Zhang et.al.	2509.12715	null
2025-09-18	Ensembling Large Language Models for Code Vulnerability Detection: An Empirical Evaluation	Zhihong Sun et.al.	2509.12629	null
2025-09-15	A high fraction of close massive binary stars at low metallicity	H. Sana et.al.	2509.12488	null
2025-09-16	When MoE Meets Blockchain: A Trustworthy Distributed Framework of Large Models	Weihao Zhu et.al.	2509.12141	null
2025-09-15	Dynamic Adaptive Parsing of Temporal and Cross-Variable Patterns for Network State Classification	Yuan Gao et.al.	2509.11601	null
2025-09-15	RadioLAM: A Large AI Model for Fine-Grained 3D Radio Map Estimation	Zhiyuan Liu et.al.	2509.11571	null
2025-09-14	Knowledge-Guided Adaptive Mixture of Experts for Precipitation Prediction	Chen Jiang et.al.	2509.11459	null
2025-09-14	MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation	Syed Talal Wasim et.al.	2509.11394	null
2025-09-14	On Linear Mode Connectivity of Mixture-of-Experts Architectures	Viet-Hoang Tran et.al.	2509.11348	null
2025-09-13	Lightweight Metadata-Aware Mixture-of-Experts Masked Autoencoder for Earth Observation	Mohanad Albughdadi et.al.	2509.10919	null
2025-09-12	RefactorCoderQA: Benchmarking LLMs for Multi-Domain Coding Question Solutions in Cloud and Edge Deployment	Shadikur Rahman et.al.	2509.10436	null
2025-09-12	Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs	Yixiao Zhou et.al.	2509.10377	null
2025-09-12	Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts	Strahinja Nikolic et.al.	2509.10025	null
2025-09-11	Combining Textual and Spectral Features for Robust Classification of Pilot Communications	Abdullah All Tanvir et.al.	2509.09752	null
2025-09-11	Steering MoE LLMs via Expert (De)Activation	Mohsen Fayyaz et.al.	2509.09660	null
2025-09-11	HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing	Haochen Huang et.al.	2509.09420	null
2025-09-11	MoLEx: Mixture of LoRA Experts in Speech Self-Supervised Models for Audio Deepfake Detection	Zihan Pan et.al.	2509.09175	null
2025-09-11	Compass-v3: Scaling Domain-Specific LLMs for Multilingual E-Commerce in Southeast Asia	Sophia Maria et.al.	2509.09121	null
2025-09-10	MoWE: A Mixture of Weather Experts	Dibyajyoti Chakraborty et.al.	2509.09052	null
2025-09-15	Too Helpful, Too Harmless, Too Honest or Just Right?	Gautam Siddharth Kashyap et.al.	2509.08486	null
2025-09-10	Joint Learning using Mixture-of-Expert-Based Representation for Enhanced Speech Generation and Robust Emotion Recognition	Jing-Tong Tzeng et.al.	2509.08470	null
2025-09-10	Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism	Jiaming Yan et.al.	2509.08342	null
2025-09-09	SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery	Fengyu She et.al.	2509.08032	null
2025-09-09	One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning	Yuan Pu et.al.	2509.07945	null
2025-09-09	MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?	Songkai Ma et.al.	2509.07727	null
2025-09-09	DuoServe-MoE: Dual-Phase Expert Prefetch and Cache Scheduling for Efficient MoE LLM Inference	Yuning Zhang et.al.	2509.07379	null
2025-09-11	PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions	Yixuan Tang et.al.	2509.07370	null
2025-09-11	CAME-AB: Cross-Modality Attention with Mixture-of-Experts for Antibody Binding Site Prediction	Hongzong Li et.al.	2509.06465	null
2025-09-08	Ban&Pick: Achieving Free Performance Gains and Inference Speedup via Smarter Routing in MoE-LLMs	Yuanteng Chen et.al.	2509.06346	null
2025-09-08	MCTuner: Spatial Decomposition-Enhanced Database Tuning via LLM-Guided Exploration	Zihan Yan et.al.	2509.06298	null
2025-09-05	SpikingBrain Technical Report: Spiking Brain-inspired Large Models	Yuqi Pan et.al.	2509.05276	null
2025-09-05	Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers	Svetlana Pavlitska et.al.	2509.05086	null
2025-09-05	Phase-field and lip-field approaches for fracture with extreme mesh deformation (X-Mesh): a one-dimensional study	Nicolas Moës et.al.	2509.04971	null
2025-09-05	A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing	Chengkai Xu et.al.	2509.04853	null
2025-09-05	REMOTE: A Unified Multimodal Relation Extraction Framework with Multilevel Optimal Transport and Mixture-of-Experts	Xinkui Lin et.al.	2509.04844	null
2025-09-05	Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation	Svetlana Pavlitska et.al.	2509.04816	null
2025-09-04	Wav2DF-TSL: Two-stage Learning with Efficient Pre-training and Hierarchical Experts Fusion for Robust Audio Deepfake Detection	Yunqi Hao et.al.	2509.04161	null
2025-09-03	Hierarchical Federated Foundation Models over Wireless Networks for Multi-Modal Multi-Task Intelligence: Integration of Edge Learning with D2D/P2P-Enabled Fog Learning Architectures	Payam Abdisarabshali et.al.	2509.03695	null
2025-09-03	OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation	Han Li et.al.	2509.03498	null
2025-09-02	LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference	Krishna Teja Chitty-Venkata et.al.	2509.02753	null
2025-09-02	Acrobotics: A Generalist Approach To Quadrupedal Robots’ Parkour	Guillaume Gagné-Labelle et.al.	2509.02727	null
2025-09-02	MoPEQ: Mixture of Mixed Precision Quantized Experts	Krishna Teja Chitty-Venkata et.al.	2509.02512	null
2025-09-02	Cache Management for Mixture-of-Experts LLMs – extended version	Spyros Angelopoulos et.al.	2509.02408	null
2025-09-02	OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds	Longrong Yang et.al.	2509.02322	null
2025-09-01	Automatic Screening of Parkinson’s Disease from Visual Explorations	Maria F. Alcala-Durand et.al.	2509.01326	null
2025-09-01	LongCat-Flash Technical Report	Meituan LongCat Team et.al.	2509.01322	link
2025-09-01	SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation	Chenyang Le et.al.	2509.01200	null
2025-09-06	Joint Information Extraction Across Classical and Modern Chinese with Tea-MOELoRA	Xuemei Tang et.al.	2509.01158	null
2025-08-31	MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper	Runjia Zeng et.al.	2509.00996	link
2025-08-31	Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling	Junfeng Ran et.al.	2509.00679	null
2025-11-03	Accelerating Mixture-of-Experts Inference by Hiding Offloading Latency with Speculative Decoding	Zhibin Wang et.al.	2508.21706	null
2025-07-01	Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging	Lujun Li et.al.	2506.23266	null
2025-09-23	GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with Guided Selection Vectors	Hengyuan Zhang et.al.	2506.14646	null
2025-06-02	Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts	Xuweiyi Chen et.al.	2505.23926	null
2025-05-29	Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity	Yehui Tang et.al.	2505.21411	null
2025-05-27	FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	Hao Kang et.al.	2505.20225	link
2025-05-22	MoE-Loco: Mixture of Experts for Multitask Locomotion	Runhan Huang et.al.	2503.08564	null
2025-03-06	Convergence Rates for Softmax Gating Mixture of Experts	Huy Nguyen et.al.	2503.03213	null
2025-01-29	Mixture of Experts (MoE): A Big Data Perspective	Wensheng Gan et.al.	2501.16352	null
2024-12-02	MH-MoE: Multi-Head Mixture-of-Experts	Shaohan Huang et.al.	2411.16205	null
2024-10-24	ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference	Xin He et.al.	2410.17954	null
2024-10-11	MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts	Peng Jin et.al.	2410.07348	link
2024-05-21	Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts	Yunxin Li et.al.	2405.11273	null
2024-05-31	Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models	Xudong Lu et.al.	2402.14800	null
2024-10-29	GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts	Shirley Wu et.al.	2312.04693	null
2023-09-12	Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning	Ted Zadouri et.al.	2309.05444	null
2023-04-25	Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism	Xin Chen et.al.	2304.11414	null
2018-06-22	Mixtures of Experts Models	Isobel Claire Gormley et.al.	1806.08200	link

(<a href=#updated-on-20260429>back to top</a>)

Mamba

Publish Date	Title	Authors	PDF	Code
2026-04-28	Vision SmolMamba: Spike-Guided Token Pruning for Energy-Efficient Spiking State-Space Vision Models	Dewei Bai et.al.	2604.25570	null
2026-04-28	TopoMamba: Topology-Aware Scanning and Fusion for Segmenting Heterogeneous Medical Visual Media	Fuchen Zheng et.al.	2604.25545	null
2026-04-28	CUDA Kernel Optimization and Counter-Free Performance Analysis for Depthwise Convolution in Cloud Environments	Huriyeh Babak et.al.	2604.25422	null
2026-04-28	Biased Dreams: Limitations to Epistemic Uncertainty Quantification in Latent Space Models	Julia Berger et.al.	2604.25416	null
2026-04-28	Reconfiguring flexibility in renewable power-to-ammonia systems using molten-salt thermal energy storage in the ammonia synthesis loop: A coordinated electro-hydrogen-thermal scheduling approach	Yiwei Qiu et.al.	2604.25192	null
2026-04-26	Integrative neurocybernetic modeling in the era of large-scale neuroscience	Il Memming Park et.al.	2604.23903	null
2026-04-26	On the Generalization Properties of Selective State-Space Models for Filtering Tasks for Unknown Systems	Alex Tang et.al.	2604.23818	link
2026-04-26	BVI-Mamba: Video Enhancement Using a Visual State-Space Model for Low-Light and Underwater Environments	Guoxi Huang et.al.	2604.23655	null
2026-04-25	Anchored Variational Inference for Personalized Sequential Latent-State Models	Xingche Guo et.al.	2604.23454	null
2026-04-25	Breaking the Resource Wall: Geometry-Guided Sequence Modeling for Efficient Semantic Segmentation	Sheng-Wei Chan et.al.	2604.23399	null
2026-04-24	Short timescale variation in the submillimeter flux of Sagittarius A*	Makoto Miyoshi et.al.	2604.22144	null
2026-04-23	MambaCSP: Hybrid-Attention State Space Models for Hardware-Efficient Channel State Prediction	Aladin Djuhera et.al.	2604.21957	null
2026-04-23	On a class of constrained particle filters for continuous-discrete state space models	Utku Erdogan et.al.	2604.21538	null
2026-04-22	Data-Driven Open-Loop Simulation for Digital-Twin Operator Decision Support in Wastewater Treatment	Gary Simethy et.al.	2604.20935	null
2026-04-22	Beyond ZOH: Advanced Discretization Strategies for Vision Mamba	Fady Ibrahim et.al.	2604.20606	null
2026-04-22	An explicit operator explains end-to-end computation in the modern neural networks used for sequence and language modeling	Anif N. Shikder et.al.	2604.20595	null
2026-04-22	MambaLiteUNet: Cross-Gated Adaptive Feature Fusion for Robust Skin Lesion Segmentation	Md Maklachur Rahman et.al.	2604.20286	null
2026-04-21	Blockage-Aware and Shadowing Aware RIS Assisted Joint Communication and Positioning for Urban Non Terrestrial Networks	Muhammad Khalil et.al.	2604.19388	null
2026-04-20	A Controlled Benchmark of Visual State-Space Backbones with Domain-Shift and Boundary Analysis for Remote-Sensing Segmentation	Nichula Wasalathilaka et.al.	2604.18721	null
2026-04-21	Sessa: Selective State Space Attention	Liubomyr Horbatko et.al.	2604.18580	null
2026-04-19	DGSSM: Diffusion guided state-space models for multimodal salient object detection	Suklav Ghosh et.al.	2604.17585	null
2026-04-26	Bilinear Input Modulation for Mamba: Koopman Bilinear Forms for Memory Retention and Multiplicative Computation	Hiroki Fujii et.al.	2604.17221	null
2026-04-19	Decomposing the Depth Profile of Fine-Tuning	Jayadev Billa et.al.	2604.17177	null
2026-04-18	The Topological Trouble With Transformers	Michael C. Mozer et.al.	2604.17121	null
2026-04-18	A state-space representation of the boundary integral equation for room acoustic modelling	Randall Ali et.al.	2604.16970	null
2026-04-18	GAMMA-Net: Adaptive Long-Horizon Traffic Spatio-Temporal Forecasting Model based on Interleaved Graph Attention and Multi-Axis Mamba	Dongyi He et.al.	2604.16859	null
2026-04-17	TriTS: Time Series Forecasting from a Multimodal Perspective	Xiang Ao et.al.	2604.16748	null
2026-04-17	SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification	Enhui Chai et.al.	2604.15711	null
2026-04-17	CLIMB: Controllable Longitudinal Brain Image Generation using Mamba-based Latent Diffusion Model and Gaussian-aligned Autoencoder	Duy-Phuong Dao et.al.	2604.15611	null
2026-04-16	MambaSL: Exploring Single-Layer Mamba for Time Series Classification	Yoo-Min Jung et.al.	2604.15174	null
2026-04-16	Learning Ad Hoc Network Dynamics via Graph-Structured World Models	Can Karacelebi et.al.	2604.14811	null
2026-04-16	HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet	Badri N. Patro et.al.	2604.14724	null
2026-04-16	On the Expressive Power and Limitations of Multi-Layer SSMs	Nikola Zubić et.al.	2604.14501	null
2026-04-15	FAST: A Synergistic Framework of Attention and State-space Models for Spatiotemporal Traffic Prediction	Xinjin Li et.al.	2604.13453	null
2026-04-15	A KL Lens on Quantization: Fast, Forward-Only Sensitivity for Mixed-Precision SSM-Transformer Models	Jason Kong et.al.	2604.13440	null
2026-04-15	Event-Adaptive State Transition and Gated Fusion for RGB-Event Object Tracking	Jinlin You et.al.	2604.13426	null
2026-04-14	A Causal Framework for Evaluating Jointly Longitudinal Outcomes and Surrogate Markers: A State-Space Approach	Silvaneo V. dos Santos et.al.	2604.12882	null
2026-04-14	Hypergraph-State Collaborative Reasoning for Multi-Object Tracking	Zikai Song et.al.	2604.12665	null
2026-04-14	A Hybrid Architecture for Benign-Malignant Classification of Mammography ROIs	Mohammed Asad et.al.	2604.12437	null
2026-04-15	RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation	Guoan Xu et.al.	2604.12319	null
2026-04-14	Physics-Informed State Space Models for Reliable Solar Irradiance Forecasting in Off-Grid Systems	Mohammed Ezzaldin Babiker Abdullah et.al.	2604.11807	null
2026-04-13	Structured State-Space Regularization for Compact and Generation-Friendly Image Tokenization	Jinsung Lee et.al.	2604.11089	null
2026-04-12	COREY: A Prototype Study of Entropy-Guided Operator Fusion with Hadamard Reparameterization for Selective State Space Models	Bo Ma et.al.	2604.10597	null
2026-04-11	Dual-Branch Remote Sensing Infrared Image Super-Resolution	Xining Ge et.al.	2604.10112	null
2026-04-10	Efficient Spatial-Temporal Focal Adapter with SSM for Temporal Action Detection	Yicheng Qiu et.al.	2604.09164	null
2026-04-09	State Space Models are Effective Sign Language Learners: Exploiting Phonological Compositionality for Vocabulary-Scale Recognition	Bryan Cheng et.al.	2604.08761	null
2026-04-09	HST-HGN: Heterogeneous Spatial-Temporal Hypergraph Networks with Bidirectional State Space Models for Global Fatigue Assessment	Changdao Chen et.al.	2604.08435	null
2026-04-09	Cognitive Flexibility as a Latent Structural Operator for Bayesian State Estimation	Thanana Nuchkrua et.al.	2604.08130	null
2026-04-09	ABMAMBA: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video Captioning	Daichi Yashima et.al.	2604.08050	null
2026-04-09	Beyond Mamba: Enhancing State-space Models with Deformable Dilated Convolutions for Multi-scale Traffic Object Detection	Jun Li et.al.	2604.08038	null
2026-04-09	The Hyperscale Lottery: How State-Space Models Have Sacrificed Edge Efficiency	Robin Geens et.al.	2604.07935	null
2026-04-09	Stochastic Thermodynamics for Autoregressive Generative Models: A Non-Markovian Perspective	Takahiro Sagawa et.al.	2604.07867	null
2026-04-08	Controller Design for Structured State-space Models via Contraction Theory	Muhammad Zakwan et.al.	2604.07069	null
2026-04-08	The Mechanistic Invariance Test: Genomic Language Models Fail to Learn Positional Regulatory Logic	Bryan Cheng et.al.	2604.06549	null
2026-04-07	The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model	Hongxu Zhou et.al.	2604.05923	null
2026-04-05	The Hiremath Early Detection (HED) Score: A Measure-Theoretic Evaluation Standard for Temporal Intelligence	Prakul Sunil Hiremath et.al.	2604.04993	null
2026-04-03	Learning Nonlinear Regime Transitions via Semi-Parametric State-Space Models	Prakul Sunil Hiremath et.al.	2604.04963	null
2026-04-07	Firebolt-VL: Efficient Vision-Language Understanding with Cross-Modality Modulation	Quoc-Huy Trinh et.al.	2604.04579	null
2026-04-06	Unified Mixture Sampler for State-Space Models: Application to Stochastic Conditional Duration Models	Daichi Hiraki et.al.	2604.04517	null
2026-04-05	CAWN: Continuous Acoustic Wave Networks for Autoregressive Language Modeling	Dejan Čugalj et.al.	2604.04250	null
2026-04-04	Mambalaya: Einsum-Based Fusion Optimizations on State-Space Models	Toluwanimi O. Odemuyiwa et.al.	2604.03829	null
2026-04-04	Optimizing Neurorobot Policy under Limited Demonstration Data through Preference Regret	Viet Dung Nguyen et.al.	2604.03523	null
2026-04-03	Adversarial Robustness of Deep State Space Models for Forecasting	Sribalaji C. Anand et.al.	2604.03427	null
2026-04-03	RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection	Cheng Lu et.al.	2604.02903	null
2026-04-02	On the Geometric Structure of Layer Updates in Deep Language Models	Jun-Sik Yoo et.al.	2604.02459	null
2026-04-02	PARD-SSM: Probabilistic Cyber-Attack Regime Detection via Variational Switching State-Space Models	Prakul Sunil Hiremath et.al.	2604.02299	null
2026-04-02	Selective State-Space Models for Koopman-based Data-driven Distribution System State Estimation	Bader Alabdulrazzaq et.al.	2604.02273	null
2026-04-02	AEGIS: Adversarial Entropy-Guided Immune System – Thermodynamic State Space Models for Zero-Day Network Evasion Detection	Vickson Ferrel et.al.	2604.02149	null
2026-04-02	Thinking While Listening: Fast-Slow Recurrence for Long-Horizon Sequential Modeling	Shota Takashiro et.al.	2604.01577	null
2026-04-01	Parallelized Hierarchical Connectome: A Spatiotemporal Recurrent Framework for Spiking State-Space Models	Po-Han Chiang et.al.	2604.01295	null
2026-04-01	A Benchmark of State-Space Models vs. Transformers and BiLSTM-based Models for Historical Newspaper OCR	Merveilles Agbeti-messan et.al.	2604.00725	null
2026-04-01	MATHENA: Mamba-based Architectural Tooth Hierarchical Estimator and Holistic Evaluation Network for Anatomy	Kyeonghun Kim et.al.	2604.00537	null
2026-03-31	MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control	Sahil Kumar et.al.	2604.00292	null
2026-03-31	Compressive sensing inspired self-supervised single-pixel imaging	Jijun Lu et.al.	2603.29732	null
2026-03-31	Learning Surrogate LPV State-Space Models with Uncertainty Quantification	E. Javier Olucha et.al.	2603.29532	null
2026-03-31	HCLSM: Hierarchical Causal Latent State Machines for Object-Centric World Modeling	Jaber Jaber et.al.	2603.29090	null
2026-03-30	Bridging the Geometry Mismatch: Frequency-Aware Anisotropic Serialization for Thin-Structure SSMs	Jin Bai et.al.	2603.28503	null
2026-03-30	A Probabilistic Generative Model for Spectral Speech Enhancement	Marco Hidalgo-Araya et.al.	2603.28436	null
2026-04-01	Self-Organizing Score-based Data Assimilation	Yuma Yamaoka et.al.	2603.28048	null
2026-03-27	WiMamba: Linear-Scale Wireless Foundation Model	Tomer Raviv et.al.	2603.26367	null
2026-03-26	Accelerating Bayesian Optimization for Nonlinear State-Space System Identification with Application to Lithium-Ion Batteries	Hao Tu et.al.	2603.25840	null
2026-03-26	A Mamba-based Perceptual Loss Function for Learning-based UGC Transcoding	Zihao Qi et.al.	2603.25566	null
2026-03-26	Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification	Giampaolo Bovenzi et.al.	2603.25507	null
2026-03-26	Towards Controllable Low-Light Image Enhancement: A Continuous Multi-illumination Dataset and Efficient State Space Framework	Hongru Han et.al.	2603.25296	null
2026-03-26	Vision Hopfield Memory Networks	Jianfeng Wang et.al.	2603.25157	null
2026-03-26	RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation	Kai Zhu et.al.	2603.24295	null
2026-03-25	S$^{3}$G: Stock State Space Graph for Enhanced Stock Trend Prediction	Yao Lu et.al.	2603.24236	null
2026-03-25	State-space fading memory	Gustave Bainier et.al.	2603.23814	null
2026-03-24	The Diminishing Returns of Early-Exit Decoding in Modern LLMs	Rui Wei et.al.	2603.23701	null
2026-03-24	Markov State–Space Modeling and Channel Characterization for DNA-Based Molecular Communication	Ruifeng Zheng et.al.	2603.23394	null
2026-03-24	Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning	Konstantinos Barmpounakis et.al.	2603.23295	null
2026-03-23	Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures	Hector Borobia et.al.	2603.22473	null
2026-03-20	Graph Signal Processing Meets Mamba2: Adaptive Filter Bank via Delta Modulation	Yehjin Shin et.al.	2603.22333	null
2026-03-23	Multi-View Deformable Convolution Meets Visual Mamba for Coronary Artery Segmentation	Xiaochan Yuan et.al.	2603.21829	null
2026-03-20	MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models	Puskal Khadka et.al.	2603.20074	null
2026-03-20	Grid-following and Grid-forming Switching Control for Grid-connected Inverters Considering Small-signal Security Region	Qiping Lai et.al.	2603.19618	null
2026-03-20	ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization	Danish Gufran et.al.	2603.19594	null
2026-03-19	Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders	Shang-Jui Ray Kuo et.al.	2603.19209	null
2026-03-19	The Exponentially Weighted Signature	Alexandre Bloch et.al.	2603.19198	null
2026-03-19	LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling	Danaé Broustail et.al.	2603.19100	null
2026-03-19	DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection	Haochen Li et.al.	2603.18757	null
2026-03-19	Deceiving Flexibility: A Stealthy False Data Injection Model in Vehicle-to-Grid Coordination	Kaan T. Gun et.al.	2603.18424	null
2026-03-18	Atomic Trajectory Modeling with State Space Models for Biomolecular Dynamics	Liang Shi et.al.	2603.17633	null
2026-03-17	Koopman Lifted Finite Memory Identification via Truncated Grunwald Letnikov Kernels	Navid Mojahed et.al.	2603.16851	null
2026-03-17	SF-Mamba: Rethinking State Space Model for Vision	Masakazu Yoshimura et.al.	2603.16423	null
2026-03-17	RASLF: Representation-Aware State Space Model for Light Field Super-Resolution	Zeqiang Wei et.al.	2603.16243	null
2026-03-16	Mamba-3: Improved Sequence Modeling using State Space Principles	Aakash Lahoti et.al.	2603.15569	null
2026-03-16	DUET: Disaggregated Hybrid Mamba-Transformer LLMs with Prefill and Decode-Specific Packages	Alish Kanani et.al.	2603.15530	null
2026-03-16	AnoleVLA: Lightweight Vision-Language-Action Model with Deep State Space Models for Mobile Manipulation	Yusuke Takagi et.al.	2603.15046	null
2026-03-14	Enhancing Eye Feature Estimation from Event Data Streams through Adaptive Inference State Space Modeling	Viet Dung Nguyen et.al.	2603.14077	null
2026-03-13	State-space models through the lens of ensemble control	Ye Feng et.al.	2603.13587	null
2026-03-13	Robust Automatic Differentiation of Square-Root Kalman Filters via Gramian Differentials	Adrien Corenflos et.al.	2603.13559	null
2026-03-13	From Gradients to Riccati Geometry: Kalman World Models for Single-Pass Learning	Andrew Kiruluta et.al.	2603.13423	null
2026-03-12	SpectralGuard: Detecting Memory Collapse Attacks in State Space Models	Davi Bonetto et.al.	2603.12414	null
2026-03-12	Spatial PDE-aware Selective State-space with Nested Memory for Mobile Traffic Grid Forecasting	Zineddine Bettouche et.al.	2603.12353	null
2026-03-12	CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks	Alexandre Le Mercier et.al.	2603.12206	null
2026-03-12	SEMamba++: A General Speech Restoration Framework Leveraging Global, Local, and Periodic Spectral Patterns	Yongjoon Lee et.al.	2603.11669	null
2026-03-11	Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild	Jun Yu et.al.	2603.11306	null
2026-03-11	Single molecule localization microscopy challenge: a biologically inspired benchmark for long-sequence modeling	Fatemeh Valeh et.al.	2603.11296	null
2026-03-11	DysonNet: Constant-Time Local Updates for Neural Quantum States	Lucas Winter et.al.	2603.11189	null
2026-03-10	Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference	Cosmo Santoni et.al.	2603.09555	null
2026-03-10	Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking	Shilei Wang et.al.	2603.09287	null
2026-03-10	Progressive Split Mamba: Effective State Space Modelling for Image Restoration	Mohammed Hassanin et.al.	2603.09171	null
2026-03-10	Rotation Equivariant Mamba for Vision Tasks	Zhongchen Zhao et.al.	2603.09138	null
2026-03-10	WS-Net: Weak-Signal Representation Learning and Gated Abundance Reconstruction for Hyperspectral Unmixing via State-Space and Weak Signal Attention Fusion	Zekun Long et.al.	2603.09037	null
2026-03-09	Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models	John Cooper et.al.	2603.08859	null
2026-03-07	Permutation-Equivariant 2D State Space Models: Theory and Canonical Architecture for Multivariate Time Series	Seungwoo Jeong et.al.	2603.08753	null
2026-03-09	BuildMamba: A Visual State-Space Based Model for Multi-Task Building Segmentation and Height Estimation from Satellite Images	Sinan U. Ulu et.al.	2603.08523	null
2026-03-08	Dissecting Spectral Granger Causality through Partial Information Decomposition	Luca Faes et.al.	2603.07634	null
2026-03-07	Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving	Jiazhuo Li et.al.	2603.07264	null
2026-03-07	Inter-Image Pixel Shuffling for Multi-focus Image Fusion	Huangxing Lin et.al.	2603.07120	null
2026-03-06	Swimba: Switch Mamba Model Scales State Space Models	Zhixu Du et.al.	2603.06938	null
2026-03-06	DLRMamba: Distilling Low-Rank Mamba for Edge Multispectral Fusion Object Detection	Qianqian Zhang et.al.	2603.06920	null
2026-03-06	Latent Autoencoder Ensemble Kalman Filter for Data assimilation	Xin T. Tong et.al.	2603.06752	null
2026-03-06	MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis	Dongqing Xie et.al.	2603.06378	null
2026-03-06	Two Localization Strategies for Sequential MCMC Data Assimilation with Applications to Nonlinear Non-Gaussian Geophysical Models	Hamza Ruzayqat et.al.	2603.05817	null
2026-03-05	Warm Starting State-Space Models with Automata Learning	William Fishell et.al.	2603.05694	null
2026-03-05	Why Depth Matters in Parallelizable Sequence Models: A Lie Algebraic View	Gyuryang Heo et.al.	2603.05573	null
2026-03-05	BLINK: Behavioral Latent Modeling of NK Cell Cytotoxicity	Iman Nematollahi et.al.	2603.05110	null
2026-03-05	DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization	Xiaodong Zhu et.al.	2603.04882	null
2026-03-04	When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift	Kevin Vogt-Lowell et.al.	2603.04648	null
2026-03-04	Mask-aware inference with State-Space Models	Ignasi Mas et.al.	2603.04568	null
2026-03-04	Architectural Proprioception in State Space Models: Thermodynamic Training Induces Anticipatory Halt Detection	Jay Noon et.al.	2603.04180	null
2026-03-04	Efficient Bayesian Estimation of Dynamic Structural Equation Models via State Space Marginalization	Øystein Sørensen et.al.	2603.04003	null
2026-03-04	Harnessing Selective State Space Models to Enhance Semianalytical Design of Fabrication-Ready Multilayered Huygens’ Metasurfaces: Part II - Generative Inverse Design (MetaMamba)	Natanel Nissan et.al.	2603.03877	null
2026-03-04	Harnessing Selective State Space Models to Enhance Semianalytical Design of Fabrication-Ready Multilayered Huygens’ Metasurfaces: Part I - Field-based Semianalytical Synthesis	Sherman W. Marcus et.al.	2603.03837	null
2026-03-04	Separators in Enhancing Autoregressive Pretraining for Vision Mamba	Hanpeng Liu et.al.	2603.03806	null
2026-03-03	MaBERT: A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling	Jinwoong Kim et.al.	2603.03001	null
2026-03-03	Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures	Georgios Pantazopoulos et.al.	2603.02874	null
2026-03-02	The Expressive Limits of Diagonal SSMs for State-Tracking	Mehran Shakerinava et.al.	2603.01959	null
2026-03-02	Deep Learning for Financial Time Series: A Large-Scale Benchmark of Risk-Adjusted Performance	Adir Saly-Kaufmann et.al.	2603.01820	null
2026-03-01	Efficient Extractive Summarization with MAMBA-Transformer Hybrids for Low-Resource Scenarios	Nisrine Ait Khayi et.al.	2603.01288	null
2026-03-01	VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification	Abdellah Zakaria Sellam et.al.	2603.01174	null
2026-03-01	GRAD-Former: Gated Robust Attention-based Differential Transformer for Change Detection	Durgesh Ameta et.al.	2603.01161	null
2026-02-28	Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation	Jinhan Xu et.al.	2603.00576	null
2026-02-28	Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling	Xueyang Li et.al.	2603.00439	null
2026-02-27	BiM-GeoAttn-Net: Linear-Time Depth Modeling with Geometry-Aware Attention for 3D Aortic Dissection CTA Segmentation	Yuan Zhang et.al.	2602.23803	null
2026-02-26	SpectralMamba-UNet: Frequency-Disentangled State Space Modeling for Texture-Structure Consistent Medical Image Segmentation	Fuhao Zhang et.al.	2602.23103	null
2026-02-26	Latent Matters: Learning Deep State-Space Models	Alexej Klushyn et.al.	2602.23050	null
2026-02-26	A guided residual search for nonlinear state-space identification	Merijn Floren et.al.	2602.22964	null
2026-02-26	Interpreting and Steering State-Space Models via Activation Subspace Bottlenecks	Vamshi Sunku Mohan et.al.	2602.22719	null
2026-02-26	SPMamba-YOLO: An Underwater Object Detection Network Based on Multi-Scale Feature Enhancement and Global Context Modeling	Guanghao Liao et.al.	2602.22674	null
2026-02-25	WaveSSM: Multiscale State-Space Models for Non-stationary Signal Attention	Ruben Solozabal et.al.	2602.22266	null
2026-02-23	CrossLLM-Mamba: Multimodal State Space Fusion of LLMs for RNA Interaction Prediction	Rabeya Tus Sadia et.al.	2602.22236	null
2026-02-25	Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration	Chen Wu et.al.	2602.21917	null
2026-02-25	Mamba Meets Scheduling: Learning to Solve Flexible Job Shop Scheduling with Efficient Sequence Modeling	Zhi Cao et.al.	2602.21546	null
2026-02-25	When Learning Hurts: Fixed-Pole RNN for Real-Time Online Training	Alexander Morgan et.al.	2602.21454	null
2026-02-24	Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting	Sunki Hong et.al.	2602.21415	null
2026-02-24	HiPPO Zoo: Explicit Memory Mechanisms for Interpretable State Space Models	Jack Goffinet et.al.	2602.21340	null
2026-02-24	Scaling State-Space Models on Multiple GPUs with Tensor Parallelism	Anurag Dutt et.al.	2602.21144	null
2026-02-21	NeXt2Former-CD: Efficient Remote Sensing Change Detection with Modern Vision Architectures	Yufan Wang et.al.	2602.18717	null
2026-02-19	COMBA: Cross Batch Aggregation for Learning Large Graphs with Context Gating State Space Models	Jiajun Shen et.al.	2602.17893	null
2026-02-19	Bayesian Optimality of In-Context Learning with Selective State Spaces	Di Zhang et.al.	2602.17744	null
2026-02-18	StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation	Zeyu Ren et.al.	2602.16915	null
2026-02-16	Is Mamba Reliable for Medical Imaging?	Banafsheh Saber Latibari et.al.	2602.16723	null
2026-02-17	Tracking Time-Varying Multipath Channels for Active Sonar Applications	Ashwani Koul et.al.	2602.15555	null
2026-02-15	Chemical Language Models for Natural Products: A State-Space Model Approach	Ho-Hsuan Wang et.al.	2602.13958	null
2026-02-14	Backward Smoothing versus Fixed-Lag Smoothing in Particle Filters	Genshiro Kitagawa et.al.	2602.13635	null
2026-02-13	Federated Learning of Nonlinear Temporal Dynamics with Graph Attention-based Cross-Client Interpretability	Ayse Tursucular et.al.	2602.13485	null
2026-02-09	DriveMamba: Task-Centric Scalable State Space Model for Efficient End-to-End Autonomous Driving	Haisheng Su et.al.	2602.13301	null
2026-02-13	Efficient Plug-and-Play method for Dynamic Imaging Via Kalman Smoothing	Benjamin Hawkes et.al.	2602.13043	null
2026-02-13	A Theoretical Analysis of Mamba’s Training Dynamics: Filtering Relevant Features for Generalization in State Space Models	Mugunthan Shandirasegaran et.al.	2602.12499	null
2026-02-12	Learning to Forget Attention: Memory Consolidation for Adaptive Compute Reduction	Ibne Farabi Shihab et.al.	2602.12204	null
2026-02-12	Improved state mixing in higher-order and block diagonal linear recurrent networks	Igor Dubinin et.al.	2602.12021	null
2026-02-12	RI-Mamba: Rotation-Invariant Mamba for Robust Text-to-Shape Retrieval	Khanh Nguyen et.al.	2602.11673	null
2026-02-20	Jailbreaking Leaves a Trace: Understanding and Detecting Jailbreak Attacks from Internal Representations of Large Language Models	Sri Durga Sai Sowmya Kadali et.al.	2602.11495	null
2026-02-11	Retrieval-Aware Distillation for Transformer-SSM Hybrids	Aviv Bick et.al.	2602.11374	null
2026-02-11	LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation	Lei Yao et.al.	2602.11007	null
2026-02-11	VFGS-Net: Frequency-Guided State-Space Learning for Topology-Preserving Retinal Vessel Segmentation	Ruiqi Song et.al.	2602.10978	null
2026-02-11	Trajectory-based data-driven predictive control and the state-space predictor	Levi D. Reyes Premer et.al.	2602.10936	null
2026-02-10	Can Image Splicing and Copy-Move Forgery Be Detected by the Same Model? Forensim: An Attention-Based State-Space Approach	Soumyaroop Nandi et.al.	2602.10079	null
2026-02-10	BabyMamba-HAR: Lightweight Selective State Space Models for Efficient Human Activity Recognition on Resource Constrained Devices	Mridankan Mandal et.al.	2602.09872	null
2026-02-09	DMamba: Decomposition-enhanced Mamba for Time Series Forecasting	Ruxuan Chen et.al.	2602.09081	null
2026-02-12	MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection	Venkatraman Narayanan et.al.	2602.08126	null
2026-02-06	Behavior Score Prediction in Resting-State Functional MRI by Deep State Space Modeling	Javier Salazar Cavazos et.al.	2602.07131	null
2026-02-06	Towards Understanding What State Space Models Learn About Code	Jiali Wu et.al.	2602.06774	link
2026-02-06	Efficient Online Variational Estimation via Monte Carlo Sampling	Mathis Chagneux et.al.	2602.06579	null
2026-02-06	AS-Mamba: Asymmetric Self-Guided Mamba Decoupled Iterative Network for Metal Artifact Reduction	Bowen Ning et.al.	2602.06350	null
2026-02-05	MambaVF: State Space Model for Efficient Video Fusion	Zixiang Zhao et.al.	2602.06017	null
2026-02-05	A Decomposition-based State Space Model for Multivariate Time-Series Forecasting	Shunya Nagashima et.al.	2602.05389	null
2026-02-05	HealthMamba: An Uncertainty-aware Spatiotemporal Graph State Space Model for Effective and Reliable Healthcare Facility Visit Prediction	Dahai Yu et.al.	2602.05286	null
2026-02-04	Partial Ring Scan: Revisiting Scan Order in Vision State Space Models	Yi-Kuan Hsieh et.al.	2602.04170	null
2026-02-03	Systematic review of self-supervised foundation models for brain network representation using electroencephalography	Hannah Portmann et.al.	2602.03269	null
2026-02-03	Bayesian Methods for the Navier-Stokes Equations	Nicholas Polson et.al.	2602.02945	null
2026-02-02	A Multi-scale Linear-time Encoder for Whole-Slide Image Analysis	Jagan Mohan Reddy Dwarampudi et.al.	2602.02918	null
2026-02-01	Learnable Koopman-Enhanced Transformer-Based Time Series Forecasting with Spectral Control	Ali Forootani et.al.	2602.02592	null
2026-02-02	SMTrack: State-Aware Mamba for Efficient Temporal Modeling in Visual Tracking	Yinchao Ma et.al.	2602.01677	null
2026-02-02	ASGMamba: Adaptive Spectral Gating Mamba for Multivariate Time Series Forecasting	Qianyang Li et.al.	2602.01668	null
2026-02-02	Samba+: General and Accurate Salient Object Detection via A More Unified Mamba-based Framework	Wenzhuo Zhao et.al.	2602.01593	null
2026-02-02	HandMCM: Multi-modal Point Cloud-based Correspondence State Space Model for 3D Hand Pose Estimation	Wencan Cheng et.al.	2602.01586	null
2026-02-02	Rotation-free Online Handwritten Character Recognition Using Linear Recurrent Units	Zhe Ling et.al.	2602.01533	null
2026-02-04	BioTamperNet: Affinity-Guided State-Space Model Detecting Tampered Biomedical Images	Soumyaroop Nandi et.al.	2602.01435	null
2026-01-31	OCTOPUS: Enhancing the Spatial-Awareness of Vision SSMs with Multi-Dimensional Scans and Traversal Selection	Kunal Mahatha et.al.	2602.00904	null
2026-01-31	Cognitive-Flexible Control via Latent Model Reorganization with Predictive Safety Guarantees	Thanana Nuchkrua et.al.	2602.00812	null
2026-01-31	A Hybrid Mamba-SAM Architecture for Efficient 3D Medical Image Segmentation	Mohammadreza Gholipour Shahraki et.al.	2602.00650	null
2026-01-31	AIRE-Prune: Asymptotic Impulse-Response Energy for State Pruning in State Space Models	Apurba Prasad Padhy et.al.	2602.00534	null
2026-01-30	GaussianOcc3D: A Gaussian-Based Adaptive Multi-modal 3D Occupancy Prediction	A. Enes Doruk et.al.	2601.22729	null
2026-01-30	Learning to Defer in Non-Stationary Time Series via Switching State-Space Models	Yannis Montreuil et.al.	2601.22538	null
2026-01-30	Elastic Spectral State Space Models for Budgeted Inference	Dachuan Song et.al.	2601.22488	null
2026-01-29	Spectral Filtering for Learning Quantum Dynamics	Elad Hazan et.al.	2601.22400	null
2026-01-29	ParalESN: Enabling parallel information processing in Reservoir Computing	Matteo Pinna et.al.	2601.22296	null
2026-01-29	MAR: Efficient Large Language Models via Module-aware Architecture Refinement	Junhong Cai et.al.	2601.21503	null
2026-01-29	Towards Geometry-Aware and Motion-Guided Video Human Mesh Recovery	Hongjun Chen et.al.	2601.21376	null
2026-01-29	Model-Free Neural State Estimation in Nonlinear Dynamical Systems: A Comparative Study of Neural Architectures and Classical Filters	Zhuochen Liu et.al.	2601.21266	null
2026-01-28	CCMamba: Selective State-Space Models for Higher-Order Graph Learning on Combinatorial Complexes	Jiawen Chen et.al.	2601.20518	null
2026-01-27	QuaMo: Quaternion Motions for Vision-based 3D Human Kinematics Capture	Cuong Le et.al.	2601.19580	null
2026-01-27	Scale-Consistent State-Space Dynamics via Fractal of Stationary Transformations	Geunhyeok Yu et.al.	2601.19551	null
2026-01-27	On the Expressiveness of State Space Models via Temporal Logics	Eric Alsmann et.al.	2601.19467	null
2026-01-24	Fluxamba: Topology-Aware Anisotropic State Space Models for Geological Lineament Segmentation in Multi-Source Remote Sensing	Jin Bai et.al.	2601.17288	null
2026-01-23	From Noisy News Sentiment Scores to Interpretable Temporal Dynamics: A Bayesian State-Space Model	Ian Carbó Casals et.al.	2601.16769	null
2026-01-23	PanopMamba: Vision State Space Modeling for Nuclei Panoptic Segmentation	Ming Kang et.al.	2601.16631	null
2026-01-23	Omni-directional attention mechanism based on Mamba for speech separation	Ke Xue et.al.	2601.16603	null
2026-01-23	Variational Dimension Lifting for Robust Tracking of Nonlinear Stochastic Dynamics	Yonatan L. Ashenafi et.al.	2601.16470	null
2026-01-22	NeuroMamba: Multi-Perspective Feature Interaction with Visual Mamba for Neuron Segmentation	Liuyun Jiang et.al.	2601.15929	null
2026-01-22	Design, Modelling, and Control of Magnetic Ball Suspension System	Sampson E. Nwachukwu et.al.	2601.15622	null
2026-01-20	A Dual-Head Transformer-State-Space Architecture for Neurocircuit Mechanism Decomposition from fMRI	Cole Korponay et.al.	2601.15344	null
2026-01-21	UBATrack: Spatio-Temporal State Space Model for General Multi-Modal Tracking	Qihua Liang et.al.	2601.14799	null
2026-01-21	Training-Efficient Text-to-Music Generation with State-Space Modeling	Wei-Jaw Lee et.al.	2601.14786	null
2026-01-24	M2I2HA: Multi-modal Object Detection Based on Intra- and Inter-Modal Hypergraph Attention	Xiaofan Yang et.al.	2601.14776	null
2026-01-21	Spatially Generalizable Mobile Manipulation via Adaptive Experience Selection and Dynamic Imagination	Ping Zhong et.al.	2601.14649	link
2026-01-20	PAS-Mamba: Phase-Amplitude-Spatial State Space Model for MRI Reconstruction	Xiaoyan Kui et.al.	2601.14530	null
2026-01-20	Gaussian Based Adaptive Multi-Modal 3D Semantic Occupancy Prediction	A. Enes Doruk et.al.	2601.14448	null
2026-01-20	ASBA: A-line State Space Model and B-line Attention for Sparse Optical Doppler Tomography Reconstruction	Zhenghong Li et.al.	2601.14165	null
2026-01-20	GeoDynamics: A Geometric State-Space Neural Network for Understanding Brain Dynamics on Riemannian Manifolds	Tingting Dan et.al.	2601.13570	null
2026-01-19	On the Relation of State Space Models and Hidden Markov Models	Aydin Ghojogh et.al.	2601.13357	null
2026-01-19	ConvMambaNet: A Hybrid CNN-Mamba State Space Architecture for Accurate and Real-Time EEG Seizure Detection	Md. Nishan Khan et.al.	2601.13234	null
2026-01-19	Analysis of Long Range Dependency Understanding in State Space Models	Srividya Ravikumar et.al.	2601.13048	null
2026-01-15	Online identification of nonlinear time-varying systems with uncertain information	He Ren et.al.	2601.10379	null
2026-01-14	Parallelizable memory recurrent units	Florent De Geeter et.al.	2601.09495	null
2026-01-14	Late Breaking Results: Quamba-SE: Soft-edge Quantizer for Activations in State Space Models	Yizhi Chen et.al.	2601.09451	null
2026-01-13	SfMamba: Efficient Source-Free Domain Adaptation via Selective Scan Modeling	Xi Chen et.al.	2601.08608	null
2026-01-13	Particle Filtering for a Class of State-Space Models with Low and Degenerate Observational Noise	Abylay Zhumekenov et.al.	2601.08411	null
2026-01-12	Rescind: Countering Image Misconduct in Biomedical Publications with Vision-Language and State-Space Modeling	Soumyaroop Nandi et.al.	2601.08040	null
2026-01-12	Language markers of emotion flexibility predict depression and anxiety treatment outcomes	Benjamin Brindle et.al.	2601.07961	null
2026-01-11	Conditional Normalizing Flows for Forward and Backward Joint State and Parameter Estimation	Luke S. Lagunowich et.al.	2601.07013	null
2026-01-11	Deep Recurrent Hidden Markov Learning Framework for Multi-Stage Advanced Persistent Threat Prediction	Saleem Ishaq Tijjani et.al.	2601.06734	null
2026-01-08	Akasha 2: Hamiltonian State Space Duality and Visual-Language Joint Embedding Predictive Architecture	Yani Meziani et.al.	2601.06212	null
2026-01-02	Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs	Andrew Kiruluta et.al.	2601.06100	null
2026-01-09	Dynamic Mortality Forecasting via Mixed-Frequency State-Space Models	Runze Li et.al.	2601.05702	null
2026-01-09	DIFF-MF: A Difference-Driven Channel-Spatial State Space Model for Multi-Modal Image Fusion	Yiming Sun et.al.	2601.05538	null
2026-01-08	DB-MSMUNet: Dual Branch Multi-scale Mamba UNet for Pancreatic CT Scans Segmentation	Qiu Guan et.al.	2601.04676	null
2026-01-07	Unified and Efficient Analysis of Machining Chatter and Surface Location Error	Woraphrut Kornmaneesang et.al.	2601.03819	null
2026-01-06	Time-Aware Synthetic Control	Saeyoung Rho et.al.	2601.03099	null
2026-01-06	Fast Surrogate Models for Adaptive Aircraft Trajectory Prediction in En route Airspace	Nick Pepper et.al.	2601.03075	null
2026-01-06	XLSR-MamBo: Scaling the Hybrid Mamba-Attention Backbone for Audio Deepfake Detection	Kwok-Ho Ng et.al.	2601.02944	null
2026-01-05	AMC26: VSSEA robust position control	Emre Sariyildiz et.al.	2601.02557	null
2026-01-05	Scalable Gaussian Processes for Integrated and Overlapping Measurements Via Augmented State Space Models	Ryan A. Rubenzahl et.al.	2601.02527	null
2026-01-02	SpikySpace: A Spiking State Space Model for Energy-Efficient Time Series Forecasting	Kaiwen Tang et.al.	2601.02411	null
2026-01-05	A Mamba-Based Model for Automatic Chord Recognition	Chunyu Yuan et.al.	2601.02101	null
2026-01-06	Hidden State Poisoning Attacks against Mamba-based Language Models	Alexandre Le Mercier et.al.	2601.01972	null
2026-01-08	Reliable Grid Forecasting: State Space Models for Safety-Critical Energy Systems	Jisoo Lee et.al.	2601.01410	null
2026-01-04	LinMU: Multimodal Understanding Made Linear	Hongjie Wang et.al.	2601.01322	null
2026-01-03	MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Efficient Clinical Assistance	Hamad Khan et.al.	2601.01260	null
2026-01-03	Benchmarking the Computational and Representational Efficiency of State Space Models against Transformers on Long-Context Dyadic Sessions	Abidemi Koledoye et.al.	2601.01237	null
2026-01-03	NeuroSSM: Multiscale Differential State-Space Modeling for Context-Aware fMRI Analysis	Furkan Genç et.al.	2601.01229	null
2026-01-01	Depth-Synergized Mamba Meets Memory Experts for All-Day Image Reflection Separation	Siyan Fang et.al.	2601.00322	null
2026-01-08	Modern Neuromorphic AI: From Intra-Token to Inter-Token Processing	Osvaldo Simeone et.al.	2601.00245	null
2025-12-30	Bridging the Perception-Cognition Gap: Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis	Hao Wu et.al.	2512.24013	null
2025-12-29	MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling	Mahdi Karami et.al.	2512.23824	null
2025-12-28	Breaking the Memory Wall: Exact Analytical Differentiation via Tiled Operator-Space Evolution	Shuhuan Wang et.al.	2512.23068	null
2025-12-28	Nonlinear Dynamical Modeling of Human Intracranial Brain Activity with Flexible Inference	Kiarash Vaziri et.al.	2512.22785	null
2025-12-25	UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation	Linxuan Fan et.al.	2512.21584	null
2025-12-24	A Mechanistic Analysis of Transformers for Dynamical Systems	Gregory Duthé et.al.	2512.21113	null
2025-12-25	Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning	Mojtaba Safari et.al.	2512.19676	null
2025-12-22	Generative Krylov Subspace Representations for Scalable Quantum Eigensolvers	Changwon Lee et.al.	2512.19420	null
2025-12-22	Lag Operator SSMs: A Geometric Framework for Structured State Space Modeling	Sutashu Tomonaga et.al.	2512.18965	null
2025-12-21	State-Space Modeling of Time-Varying Spillovers on Networks	Marios Papamichalis et.al.	2512.18584	link
2025-12-19	Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers	Zeyuan Allen-Zhu et.al.	2512.17351	null
2025-12-18	KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse Signals	Shuting Zhao et.al.	2512.16791	null
2025-12-18	KOSS: Kalman-Optimal Selective State Spaces for Long-Term Sequence Modeling	Lei Wang et.al.	2512.16723	null
2025-12-18	CPMamba: Selective State Space Models for MIMO Channel Prediction in High-Mobility Environments	Sheng Luo et.al.	2512.16315	null
2025-12-17	BarcodeMamba+: Advancing State-Space Models for Fungal Biodiversity Research	Tiancheng Gao et.al.	2512.15931	null
2025-12-22	COBRA: Catastrophic Bit-flip Reliability Analysis of State-Space Models	Sanjay Das et.al.	2512.15778	null
2025-12-17	Characterizing Mamba’s Selective Memory using Auto-Encoders	Tamanna Hossain et.al.	2512.15653	null
2025-12-17	On non-stationarity of the Poisson gamma state space models	Kaoru Irie et.al.	2512.15128	null
2025-12-17	How Many Heads Make an SSM? A Unified Framework for Attention and State Space Models	Ali Ghodsi et.al.	2512.15115	null
2025-12-16	XAI-Driven Diagnosis of Generalization Failure in State-Space Cerebrovascular Segmentation Models: A Case Study on Domain Shift Between RSNA and TopCoW Datasets	Youssef Abuzeid et.al.	2512.13977	null
2025-12-15	Temporal parallelisation of continuous-time maximum-a-posteriori trajectory estimation	Hassan Razavi et.al.	2512.13319	null
2025-12-14	Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics	Jingdi Lei et.al.	2512.12602	link
2025-12-13	HydroDiffusion: Diffusion-Based Probabilistic Streamflow Forecasting with a State Space Backbone	Yihan Wang et.al.	2512.12183	null
2025-12-12	TSkel-Mamba: Temporal Dynamic Modeling via State Space Model for Human Skeleton-based Action Recognition	Yanan Liu et.al.	2512.11503	link
2025-12-11	On a class of constrained Bayesian filters and their numerical implementation in high-dimensional state-space Markov models	Utku Erdogan et.al.	2512.11012	null
2025-12-11	Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation	Yiheng Lyu et.al.	2512.10353	null
2025-12-10	Inertial Magnetic SLAM Systems Using Low-Cost Sensors	Chuan Huang et.al.	2512.10128	null
2025-12-10	Neural posterior inference with state-space models for calibrating ice sheet simulators	Bao Anh Vu et.al.	2512.09561	null
2025-12-11	StateSpace-SSL: Linear-Time Self-supervised Learning for Plant Disease Detection	Abdullah Al Mamun et.al.	2512.09492	null
2025-12-08	How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline	Chunhui Zhang et.al.	2512.07385	null
2025-12-07	Always Keep Your Promises: DynamicLRP, A Model-Agnostic Solution To Layer-Wise Relevance Propagation	Kevin Lee et.al.	2512.07010	null
2025-12-07	FGE: A Fast Free-Boundary Grad-Shafranov Evolutive Solver	Cosmas Heiß et.al.	2512.06847	null
2025-12-07	TextMamba: Scene Text Detector with Mamba	Qiyan Zhao et.al.	2512.06657	null
2025-12-06	Assessing the Information Content of Individual Spikes in Population-Level Models of Neural Spiking Activity	Azar Ghahari et.al.	2512.06280	null
2025-12-05	Speech World Model: Causal State-Action Planning with Explicit Reasoning for Speech	Xuanru Zhou et.al.	2512.05933	null
2025-12-05	World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty	Zhiting Mei et.al.	2512.05927	null
2025-12-05	Measurements of Light Nuclei (d, t, $^3$He)-$Λ$ Correlations in Au+Au Collisions at $\sqrt{s_{NN}}=3$ GeV from STAR	Xialei Jiang et.al.	2512.05885	null
2025-12-05	Vague Knowledge: Information without Transitivity and Partitions	Kerry Xiao et.al.	2512.05833	null
2025-12-05	Ferroelectricity in dipolar liquids: from an exactly solvable model in the large-dimensional limit to finite dimensions	M. G. Izzo et.al.	2512.05758	null
2025-12-05	Comparing the latent features of universal machine-learning interatomic potentials	Sofiia Chorna et.al.	2512.05717	null
2025-12-05	LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving	Yiming Shu et.al.	2512.05686	null
2025-12-05	Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation	Dhorasso Temfack et.al.	2512.05650	null
2025-12-05	DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model	Pasquale De Marinis et.al.	2512.05613	null
2025-12-05	Supervisory Measurement-Guided Noise Covariance Estimation: Discussing Forward and Reverse Differentiation	Haoying Li et.al.	2512.05604	null
2025-12-05	CureAgent: A Training-Free Executor-Analyst Framework for Clinical Reasoning	Ting-Ting Xie et.al.	2512.05576	null
2025-12-05	MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models	Chuang Yu et.al.	2512.05530	null
2025-12-05	UniFS: Unified Multi-Contrast MRI Reconstruction via Frequency-Spatial Fusion	Jialin Li et.al.	2512.05481	null
2025-12-05	TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression	Cheng-Yuan Ho et.al.	2512.05446	null
2025-12-05	BEAVER: An Efficient Deterministic LLM Verifier	Tarun Suresh et.al.	2512.05439	null
2025-12-05	Computing Supported Models via Transformation to Stable Models	Fang Li et.al.	2512.05437	null
2025-12-05	RevoNAD: Reflective Evolutionary Exploration for Neural Architecture Design	Gyusam Chang et.al.	2512.05403	null
2025-12-05	Group Orthogonal Low-Rank Adaptation for RGB-T Tracking	Zekai Shao et.al.	2512.05359	null
2025-12-04	Nested State and Degradation Estimation of a Satellite Battery with In-flight Data	Linda Bolay et.al.	2512.05255	null
2025-12-04	The deep Hilbert space of all-to-all interacting SU(3) atoms: from quantum to classical	Federico Balducci et.al.	2512.05184	null
2025-12-04	Global phase diagram of two-dimensional dirty hyperbolic Dirac liquids	Christopher A. Leong et.al.	2512.05109	null
2025-12-04	Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction	Vincent Pauline et.al.	2512.05092	null
2025-12-04	RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation	Nicolas Houdré et.al.	2512.05025	null
2025-12-04	Reflection Removal through Efficient Adaptation of Diffusion Transformers	Daniyar Zakarin et.al.	2512.05000	link
2025-12-04	PENCO: A Physics-Energy-Numerical-Consistent Operator for 3D Phase Field Modeling	Mostafa Bamdad et.al.	2512.04863	null
2025-12-04	Model-Based and Sample-Efficient AI-Assisted Math Discovery in Sphere Packing	Rasul Tutunov et.al.	2512.04829	null
2025-12-04	LaFiTe: A Generative Latent Field for 3D Native Texturing	Chia-Hao Chen et.al.	2512.04786	null
2025-12-04	Probing false vacuum decay and bubble nucleation in a Rydberg atom array	Yu-Xin Chao et.al.	2512.04637	null
2025-12-04	Temporal and Spatial Decomposition for Prospective Studies in Energy Systems under Uncertainty	Camila Martinez Parra et.al.	2512.04622	null
2025-12-04	TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification	Zishuo Wan et.al.	2512.04576	null
2025-12-04	VideoMem: Enhancing Ultra-Long Video Understanding via Adaptive Memory Management	Hongbo Jin et.al.	2512.04540	null
2025-12-04	PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement	Yu-Wei Zhan et.al.	2512.04532	null
2025-12-04	VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory	Yifei Yu et.al.	2512.04519	null
2025-12-04	BiTAgent: A Task-Aware Modular Framework for Bidirectional Coupling between Multimodal Large Language Models and World Models	Yu-Wei Zhan et.al.	2512.04513	null
2025-12-04	DeRA: Decoupled Representation Alignment for Video Tokenization	Pengbo Guo et.al.	2512.04483	null
2025-12-04	ELG $\times$ LRG distribution through dark matter halo dynamics	Ginevra Favole et.al.	2512.04362	null
2025-12-04	Distance Is All You Need: Radial Dispersion for Uncertainty Estimation in Large Language Models	Manh Nguyen et.al.	2512.04351	null
2025-12-04	Cosmological implications of Bumblebee theory on an FLRW background	Manuel Gonzalez-Espinoza et.al.	2512.04349	null
2025-12-03	Driving Beyond Privilege: Distilling Dense-Reward Knowledge into Sparse-Reward Policies	Feeza Khan Khanzada et.al.	2512.04279	null
2025-12-03	Inflation with a Growing Fifth Dimension	Rashmish K. Mishra et.al.	2512.04177	null
2025-12-03	SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL	Siyi Chen et.al.	2512.04069	null
2025-12-03	Training-Free Policy Violation Detection via Activation-Space Whitening in LLMs	Oren Rachmil et.al.	2512.03994	null
2025-12-03	Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization	Lianyu Pang et.al.	2512.03964	null
2025-12-03	Collective dynamics of trail-interacting particles	Paul Pineau et.al.	2512.03950	null
2025-12-03	Rethinking Collapse: Coupling Quantum States to Classical Bits with quasi-probabilities	Dagomir Kaszlikowski et.al.	2512.03929	null
2025-12-03	Acceleration of Parallel Tempering for Markov Chain Monte Carlo methods	Aingeru Ramos et.al.	2512.03825	null
2025-12-03	MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving	Jia Hu et.al.	2512.03795	null
2025-12-03	A comparison between initialization strategies for the infinite hidden Markov model	Federico P. Cortese et.al.	2512.03777	null
2025-12-03	Tutorial on Large Language Model-Enhanced Reinforcement Learning for Wireless Networks	Lingyi Cai et.al.	2512.03722	link
2025-12-03	Consistent Projection of Langevin Dynamics: Preserving Thermodynamics and Kinetics in Coarse-Grained Models	Vahid Nateghi et.al.	2512.03706	null
2025-12-03	State Space Models for Bioacoustics: A comparative Evaluation with Transformers	Chengyu Tang et.al.	2512.03563	null
2025-12-03	Edge bits in average symmetry protected topological mixed state	Yoshihito Kuno et.al.	2512.03530	null
2025-12-03	Seasonal trend assessment of US extreme precipitation via changepoint segmentation	Jaechoul Lee et.al.	2512.03513	null
2025-12-03	CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving	Zhijian Qiao et.al.	2512.03510	null
2025-12-03	Procedural Mistake Detection via Action Effect Modeling	Wenliang Guo et.al.	2512.03474	link
2025-12-03	DM3D: Deformable Mamba via Offset-Guided Gaussian Sequencing for Point Cloud Understanding	Bin Liu et.al.	2512.03424	null
2025-12-03	Comparative algorithm performance evaluation and prediction for the maximum clique problem using instance space analysis	Bharat Sharman et.al.	2512.03419	null
2025-12-06	UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs	Hung-Yueh Chiang et.al.	2512.03383	null
2025-12-03	Generative Refinement: A New Paradigm for Determining Single Crystal Structures Directly from HKL Data	Wen-Lin Luo et.al.	2512.03365	null
2025-12-02	Adaptive Regime-Switching Forecasts with Distribution-Free Uncertainty: Deep Switching State-Space Models Meet Conformal Prediction	Echo Diyun LU et.al.	2512.03298	null
2025-12-02	PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer	Xiaoshui Huang et.al.	2512.03111	null
2025-12-02	The Hilbert space of gauge theories: group averaging and the quantization of Jackiw-Teitelboim gravity	Elba Alonso-Monsalve et.al.	2512.03030	null
2025-12-02	Unrolled Networks are Conditional Probability Flows in MRI Reconstruction	Kehan Qi et.al.	2512.03020	null
2025-12-02	TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond	Yifei Zeng et.al.	2512.02993	link
2025-12-08	AutoNeural: Co-Designing Vision-Language Models for NPU Inference	Wei Chen et.al.	2512.02924	null
2025-12-02	Statistical-Symbolic Verification of Perception-Based Autonomous Systems using State-Dependent Conformal Prediction	Yuang Geng et.al.	2512.02893	null
2025-12-02	MICCAI STSR 2025 Challenge: Semi-Supervised Teeth and Pulp Segmentation and CBCT-IOS Registration	Yaqi Wang et.al.	2512.02867	null
2025-12-02	Tempering the Bayes Filter towards Improved Model-Based Estimation	Menno van Zutphen et.al.	2512.02823	null
2025-12-02	Invariance under Structure Translation as the Origin of Host Immune Capacity Conservation from Noether’s Theorem	Yexing Chen et.al.	2512.02730	null
2025-12-02	DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions	Yifan Zhou et.al.	2512.02727	null
2025-12-02	Graph VQ-Transformer (GVT): Fast and Accurate Molecular Generation via High-Fidelity Discrete Latents	Haozhuo Zheng et.al.	2512.02667	null
2025-12-02	Efficient Simulation of the 2D Hubbard Model via Hilbert Space-Filling Curve Mapping	Ashkan Abedi et.al.	2512.02666	null
2025-12-02	SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization	Zhengcheng Wang et.al.	2512.02631	null
2025-12-02	Excitation function of femtoscopic Lévy source parameters of pion pairs in EPOS4	Yan Huang et.al.	2512.02560	null
2025-12-02	Deep Learning-Based Joint Uplink-Downlink CSI Acquisition for Next-Generation Upper Mid-Band Systems	Xuan He et.al.	2512.02557	null
2025-12-02	CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning	Songqiao Su et.al.	2512.02551	null
2025-12-02	Detection of photon-level signals embedded in sunlight with an atomic photodetector	Laura Zarraoa et.al.	2512.02521	null
2025-12-02	ClusterStyle: Modeling Intra-Style Diversity with Prototypical Clustering for Stylized Motion Generation	Kerui Chen et.al.	2512.02453	null
2025-12-02	WSCF-MVCC: Weakly-supervised Calibration-free Multi-view Crowd Counting	Bin Li et.al.	2512.02359	null
2025-12-02	Enhancing Cross Domain SAR Oil Spill Segmentation via Morphological Region Perturbation and Synthetic Label-to-SAR Generation	Andre Juarez et.al.	2512.02290	null
2025-12-01	High-Precision Simulations of the Parity Conserving Directed Percolation Universality Class in 1+1 Dimensions	Peter Grassberger et.al.	2512.02241	null
2025-12-01	TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models	Zhiheng Liu et.al.	2512.02014	null
2025-12-01	Low-Rank Prehab: Preparing Neural Networks for SVD Compression	Haoran Qin et.al.	2512.01980	link
2025-12-01	Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design	Danny Reidenbach et.al.	2512.01976	null
2025-12-01	Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies	Bailiang Jian et.al.	2512.01913	null
2025-12-01	Delays in Spiking Neural Networks: A State Space Model Approach	Sanja Karilanova et.al.	2512.01906	null
2025-12-01	Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos	Xavier Thomas et.al.	2512.01803	null
2025-12-01	Quantum dynamics of monitored free fermions	Igor Poboiko et.al.	2512.01772	null
2025-12-01	Mofasa: A Step Change in Metal-Organic Framework Generation	Vaidotas Simkus et.al.	2512.01756	null
2025-12-01	ViT$^3$: Unlocking Test-Time Training in Vision	Dongchen Han et.al.	2512.01643	null
2025-12-01	Improved Disease Outbreak Detection from Out-of-sequence measurements Using Markov-switching Fixed-lag Particle Filters	Conor Rosato et.al.	2512.01639	null
2025-12-01	Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval	Xin Wang et.al.	2512.01636	null
2025-12-01	Parallel Delayed Memory Units for Enhanced Temporal Modeling in Biomedical and Bioacoustic Signal Analysis	Pengfei Sun et.al.	2512.01626	null
2025-12-01	Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation	Thao Thi Phuong Dao et.al.	2512.01589	null
2025-12-01	Real-Space Spectral Approach to Orbital Magnetization	Kevin J. U. Vidarte et.al.	2512.01575	null
2025-12-01	Q2D2: A Geometry-Aware Audio Codec Leveraging Two-Dimensional Quantization	Tal Shuster et.al.	2512.01537	null
2025-12-01	Multi-Path Collaborative Reasoning via Reinforcement Learning	Jindi Lv et.al.	2512.01485	link
2025-12-01	Language-Guided Open-World Anomaly Segmentation	Klara Reichard et.al.	2512.01427	null
2025-12-01	Fourier Neural Operators Explained: A Practical Perspective	Valentin Duruisseaux et.al.	2512.01421	null
2025-12-01	PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications	Yunze Liu et.al.	2512.01383	null
2025-12-01	InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision	Chenting Wang et.al.	2512.01342	null
2025-12-01	Gaussian Process State-Space Modeling and Particle Filtering for Time Series Decomposition and Nonlinear Signal Extraction	Genshiro Kitagawa et.al.	2512.01162	null
2025-11-30	Upper Approximation Bounds for Neural Oscillators	Zifeng Huang et.al.	2512.01015	null
2025-11-29	A State-Space Approach to Modeling Tire Degradation in Formula 1 Racing	Cole Cappello et.al.	2512.00640	null
2025-11-29	DPNet: Doppler LiDAR Motion Planning for Highly-Dynamic Environments	Wei Zuo et.al.	2512.00375	null
2025-11-28	ReactionMamba: Generating Short & Long Human Reaction Sequences	Hajra Anwar Beg et.al.	2512.00208	null
2025-11-28	Wilson loops, symmetries, and selective bulk-boundary correspondence in higher-order topological insulators	Suman Aich et.al.	2511.23471	null
2025-11-28	Visual Generation Tuning	Jiahao Guo et.al.	2511.23469	null
2025-11-28	SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments	Xinyi Li et.al.	2511.23465	null
2025-11-28	Kinetic Mixing and the Phantom Illusion: Axion-Dilaton Quintessence in Light of DESI DR2	Michael W. Toomey et.al.	2511.23463	null
2025-11-28	DisMo: Disentangled Motion Representations for Open-World Motion Transfer	Thomas Ressler-Antal et.al.	2511.23428	null
2025-11-28	Hilbert space fragmentation in driven-dephasing Rydberg atom array	Tianyi Yan et.al.	2511.23395	null
2025-11-28	Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon	Isabel Whiteley Tscherniak et.al.	2511.23384	null
2025-11-28	Functional Program Synthesis with Higher-Order Functions and Recursion Schemes	Matheus Campos Fernandes et.al.	2511.23354	null
2025-11-28	Data-driven Reachability Verification with Probabilistic Guarantees under Koopman Spectral Uncertainty	Jianqiang Ding et.al.	2511.23322	null
2025-11-28	Magnetic Dipole Portal Vector Dark Matter at Fixed-Targets	Avik Banerjee et.al.	2511.23259	null
2025-11-28	SDE-Attention: Latent Attention in SDE-RNNs for Irregularly Sampled Time Series with Missing Data	Yuting Fang et.al.	2511.23238	null
2025-11-28	Incorporating Ephemeral Traffic Waves in A Data-Driven Framework for Microsimulation in CARLA	Alex Richardson et.al.	2511.23236	null
2025-11-28	Constraining the Inert Doublet Model at the LHC	Jayita Lahiri et.al.	2511.23133	null
2025-11-28	Einstein’s 1935 Letters to Schrödinger and Popper and the Boundaries of the PBR $ψ$-Epistemic Framework	Galina Weinstein et.al.	2511.23125	null
2025-11-28	Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM	Mengjie Liu et.al.	2511.23119	null
2025-11-28	Time Extrapolation with Graph Convolutional Autoencoder and Tensor Train Decomposition	Yuanhong Chen et.al.	2511.23037	null
2025-11-28	Joint Bayesian Inference of Parameter and Discretization Error Uncertainties in ODE Models	Shoji Toyota et.al.	2511.23010	null
2025-11-28	SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving	Wonjeong Ryu et.al.	2511.22865	null
2025-11-28	TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE	Jiawen Wei et.al.	2511.22853	null
2025-11-28	PerfMamba: Performance Analysis and Pruning of Selective State Space Models	Abdullah Al Asif et.al.	2511.22849	null
2025-11-26	TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos	Seungjae Lee et.al.	2511.21690	null
2025-11-26	G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning	Wenbo Hu et.al.	2511.21688	null
2025-11-26	Visualizing LLM Latent Space Geometry Through Dimensionality Reduction	Alex Ning et.al.	2511.21594	null
2025-11-26	Machine Learning Approaches to Clinical Risk Prediction: Multi-Scale Temporal Alignment in Electronic Health Records	Wei-Chen Chang et.al.	2511.21561	null
2025-11-26	MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors	Thai-Khanh Nguyen et.al.	2511.21550	null
2025-11-26	Simulations of high-energy neutrino emissions from blazars with the LeHa-Paris code	Francesco Carenini et.al.	2511.21532	null
2025-11-26	Sector theory of Levin-Wen models I: Classification of Anyon Sectors	Alex Bols et.al.	2511.21521	null
2025-11-26	Nested ensemble Kalman filter for static parameter inference in nonlinear state-space models	Andrew Golightly et.al.	2511.21497	null
2025-11-26	Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning	Taehoon Kim et.al.	2511.21490	null
2025-11-26	SONAR: Spectral-Contrastive Audio Residuals for Generalizable Deepfake Detection	Ido Nitzan Hidekel et.al.	2511.21325	null
2025-11-26	PathMamba: A Hybrid Mamba-Transformer for Topologically Coherent Road Segmentation in Satellite Imagery	Jules Decaestecker et.al.	2511.21298	null
2025-11-26	Exploring muonphilic dark matter with the $Z_2$-even mediator at muon colliders	Wanyun Chen et.al.	2511.21290	null
2025-11-26	Floquet thermalization by power-law induced permutation symmetry breaking	Manju C et.al.	2511.21284	null
2025-11-26	I-GLIDE: Input Groups for Latent Health Indicators in Degradation Estimation	Lucas Thil et.al.	2511.21208	null
2025-11-26	Vortex-Enhanced Zitterbewegung in Relativistic Electron Wave Packets	Zhongze Guo et.al.	2511.21142	null
2025-11-26	Referring Video Object Segmentation with Cross-Modality Proxy Queries	Baoli Sun et.al.	2511.21139	null
2025-11-26	DeepRFTv2: Kernel-level Learning for Image Deblurring	Xintian Mao et.al.	2511.21132	null
2025-11-26	OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection	Chujie Wang et.al.	2511.21064	null
2025-11-26	Gated KalmaNet: A Fading Memory Layer Through Test-Time Ridge Regression	Liangzu Peng et.al.	2511.21016	null
2025-11-26	SpaceX: Exploring metrics with the SPACE model for developer productivity	Sanchit Kaul et.al.	2511.20955	null
2025-11-25	DINO-Tok: Adapting DINO for Visual Tokenizers	Mingkai Jia et.al.	2511.20565	link
2025-11-25	From Features to States: Data-Driven Selection of Measured State Variables via RFE-DMDc	Haoyu Wang et.al.	2511.20552	null
2025-11-25	Physically Interpretable Interatomic Potentials via Symbolic Regression and Reinforcement Learning	Bilvin Varughese et.al.	2511.20506	null
2025-11-25	Generative Modeling with Manifold Percolation	Rui Tong et.al.	2511.20503	null
2025-11-25	Universe of Thoughts: Enabling Creative Reasoning with Large Language Models	Yuto Suzuki et.al.	2511.20471	null
2025-11-25	Advances and Challenges in Solar Flare Prediction: A Review	Mingfu Shao et.al.	2511.20465	null
2025-11-25	STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow	Jiatao Gu et.al.	2511.20462	null
2025-11-25	Towards Trustworthy Wi-Fi Sensing: Systematic Evaluation of Deep Learning Model Robustness to Adversarial Attacks	Shreevanth Krishnaa Gopalakrishnan et.al.	2511.20456	null
2025-11-25	Adaptive Meshing for CPA Lyapunov Function Synthesis	Amy K. Strong et.al.	2511.20443	null
2025-11-25	The effect of sound speed on the gravitational wave spectrum of first order phase transitions in the early universe	Mika Mäki et.al.	2511.20436	null
2025-11-25	BRIC: Bridging Kinematic Plans and Physical Control at Test Time	Dohun Lim et.al.	2511.20431	null
2025-11-25	Proximity driven photon-tunneling in chiral quantum hybrid systems	Aryan Pratap Srivastava et.al.	2511.20357	null
2025-11-25	Active Inference in Discrete State Spaces from First Principles	Patrick Kenny et.al.	2511.20321	null
2025-11-25	Improving Language Agents through BREW	Shashank Kirtania et.al.	2511.20297	null
2025-11-25	DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion	Yinghui Li et.al.	2511.20278	link
2025-11-25	PromptMoG: Enhancing Diversity in Long-Prompt Image Generation via Prompt Embedding Mixture-of-Gaussian Sampling	Bo-Kai Ruan et.al.	2511.20251	link
2025-11-25	POMDP-Based Routing for DTNs with Partial Knowledge and Dependent Failures	Gregory F. Stock et.al.	2511.20241	null
2025-11-25	Communication-Efficient Learning for Satellite Constellations	Ruxandra-Stefania Tudose et.al.	2511.20220	null
2025-11-25	Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis	Mohammad Mahdi et.al.	2511.20186	null
2025-11-25	Alzheimers Disease Progression Prediction Based on Manifold Mapping of Irregularly Sampled Longitudinal Data	Xin Hong et.al.	2511.20154	null
2025-11-24	Cloud4D	Jacob Lin et.al.	2511.19431	null
2025-11-24	Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection	Zixuan Wang et.al.	2511.19306	null
2025-11-24	Innovative Modular Design and Kinematic Approach based on Screw Theory for Triple Scissors Links Deployable Space Antenna Mechanism	Mamoon Aamir et.al.	2511.19287	null
2025-11-24	What is the signature of a trion in photoemission?	Jinyuan Wu et.al.	2511.19280	null
2025-11-24	Solar-GECO: Perovskite Solar Cell Property Prediction with Geometric-Aware Co-Attention	Lucas Li et.al.	2511.19263	null
2025-11-24	LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models	Shuai Wang et.al.	2511.19261	null
2025-11-24	Learning Plug-and-play Memory for Guiding Video Diffusion Models	Selena Song et.al.	2511.19229	link
2025-11-24	Reference-Free Sampling-Based Model Predictive Control	Fabian Schramm et.al.	2511.19204	null
2025-11-24	Information Physics of Intelligence: Unifying Logical Depth and Entropy under Thermodynamic Constraints	Jianfeng Xu et.al.	2511.19156	null
2025-11-24	Fast-Converging and Asymptotic-Preserving DSMC	Bin Hu et.al.	2511.19061	null
2025-11-24	Latent-Space Non-Linear Model Predictive Control for Partially-Observable Systems	Luigi Marra et.al.	2511.19056	null
2025-11-24	Multigrid with Linear Storage Complexity	Daniel Bauer et.al.	2511.19036	null
2025-11-24	Web of Non-invertible Dualities for (2+1) Dimensional Models with Subsystem Symmetries	Avijit Maity et.al.	2511.18969	null
2025-11-24	BSN-V: The First Detailed Light Curve Modeling of Eight Totally Eclipsing Contact Binary Stars Using Ground-Based and TESS Observations	Atila Poro et.al.	2511.18909	null
2025-11-24	MFmamba: A Multi-function Network for Panchromatic Image Resolution Restoration Based on State-Space Model	Qian Jiang et.al.	2511.18888	null
2025-11-24	KernelBand: Boosting LLM-based Kernel Optimization with a Hierarchical and Hardware-aware Multi-armed Bandit	Dezhi Ran et.al.	2511.18868	null
2025-11-24	SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation	Nimeshika Udayangani et.al.	2511.18816	null
2025-11-24	ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection	Ruize Ma et.al.	2511.18780	null
2025-11-24	SAOT: An Enhanced Locality-Aware Spectral Transformer for Solving PDEs	Chenhong Zhou et.al.	2511.18777	link
2025-11-24	Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers	Yiqing Shi et.al.	2511.18673	null
2025-11-21	Counterfactual World Models via Digital Twin-conditioned Video Diffusion	Yiqing Shen et.al.	2511.17481	null
2025-11-21	Moving superfluids in the rotating universe	Jose Beltrán Jiménez et.al.	2511.17472	null
2025-11-21	SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding	Nikolay Nikolov et.al.	2511.17411	null
2025-11-21	Selective Rotary Position Embedding	Sajad Movahedi et.al.	2511.17388	null
2025-11-21	ReBaPL: Repulsive Bayesian Prompt Learning	Yassir Bendou et.al.	2511.17339	null
2025-11-21	Parameter Inference from Final-State Entanglement in Higgs Decays	Jia Liu et.al.	2511.17321	null
2025-11-21	SpatialGeo: Boosting Spatial Reasoning in Multimodal LLMs via Geometry-Semantics Fusion	Jiajie Guo et.al.	2511.17308	null
2025-11-21	SAVeD: Semantic Aware Version Discovery	Artem Frenk et.al.	2511.17298	null
2025-11-21	PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention	Yipeng Chen et.al.	2511.17185	null
2025-11-21	On the Predictive Skill of Artificial Intelligence-based Weather Models for Extreme Events using Uncertainty Quantification	Rodrigo Almeida et.al.	2511.17176	null
2025-11-21	Dark Matter Admixed White Dwarfs: A Single-Fluid Approach	Rajasmita Sahoo et.al.	2511.17120	null
2025-11-21	RL-AD-Net: Reinforcement Learning Guided Adaptive Displacement in Latent Space for Refined Point Cloud Completion	Bhanu Pratap Paregi et.al.	2511.17054	null
2025-11-21	Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters	Zhan Su et.al.	2511.17044	null
2025-11-21	CLLMRec: LLM-powered Cognitive-Aware Concept Recommendation via Semantic Alignment and Prerequisite Knowledge Distillation	Xiangrui Xiong et.al.	2511.17041	null
2025-11-21	Generative MIMO Beam Map Construction for Location Recovery and Beam Tracking	Wangqian Chen et.al.	2511.17007	null
2025-11-21	FLUID: Training-Free Face De-identification via Latent Identity Substitution	Jinhyeong Park et.al.	2511.17005	null
2025-11-21	Stable Offline Hand-Eye Calibration for any Robot with Just One Mark	Sicheng Xie et.al.	2511.17001	null
2025-11-21	The Finer the Better: Towards Granular-aware Open-set Domain Generalization	Yunyun Wang et.al.	2511.16979	null
2025-11-21	Flow-Guided Implicit Neural Representation for Motion-Aware Dynamic MRI Reconstruction	Baoqing Li et.al.	2511.16948	null
2025-11-21	Improving Latent Reasoning in LLMs via Soft Concept Mixing	Kang Wang et.al.	2511.16885	null
2025-11-20	Dataset Distillation for Pre-Trained Self-Supervised Vision Models	George Cazenavette et.al.	2511.16674	null
2025-11-20	Strained hyperbolic Dirac fermions: Zero modes, flat bands, and competing orders	Christopher A. Leong et.al.	2511.16667	null
2025-11-20	Time dependent loss reweighting for flow matching and diffusion models is theoretically justified	Lukas Billera et.al.	2511.16599	null
2025-11-20	TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding	Boshen Xu et.al.	2511.16595	null
2025-11-20	Comment on: “Scaling and Universality at Noisy Quench Dynamical Quantum Phase Transitions”	J. Sirker et.al.	2511.16509	null
2025-11-20	Order-by-disorder from Schwinger bosons in a frustrated honeycomb ferromagnet	Arnaud Ralko et.al.	2511.16429	null
2025-11-20	Search for Higgsinos in final states with low-momentum lepton-track pairs at 13 TeV	CMS Collaboration et.al.	2511.16394	null
2025-11-20	Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning	Mohammad Areeb Qazi et.al.	2511.16333	null
2025-11-20	SeSE: A Structural Information-Guided Uncertainty Quantification Framework for Hallucination Detection in LLMs	Xingtao Zhao et.al.	2511.16275	null
2025-11-20	SwiTrack: Tri-State Switch for Cross-Modal Object Tracking	Boyue Xu et.al.	2511.16227	null
2025-11-20	CausalMamba: Interpretable State Space Modeling for Temporal Rumor Causality	Xiaotong Zhan et.al.	2511.16191	null
2025-11-20	Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion	Lirui Zhang et.al.	2511.16161	null
2025-11-20	How Noise Benefits AI-generated Image Detection	Jiazhen Yan et.al.	2511.16136	null
2025-11-20	Decoupling Complexity from Scale in Latent Diffusion Model	Tianxiong Zhong et.al.	2511.16117	null
2025-11-20	Parallelizable Complex Neural Dynamics Models for PMSM Temperature Estimation with Hardware Acceleration	Xinyuan Liao et.al.	2511.16093	null
2025-11-20	A Hybrid Proactive And Predictive Framework For Edge Cloud Resource Management	Hrikshesh Kumar et.al.	2511.16075	null
2025-11-20	High-Throughput Exploration of Refractory High-Entropy Alloys for Strength and Plasticity	Stephen A. Giles et.al.	2511.16057	null
2025-11-20	Exploiting Inter-Sample Information for Long-tailed Out-of-Distribution Detection	Nimeshika Udayangani et.al.	2511.16015	null
2025-11-20	Synergizing Deconfounding and Temporal Generalization For Time-series Counterfactual Outcome Estimation	Yiling Liu et.al.	2511.16006	null
2025-11-19	Breaking the Bottleneck with DiffuApriel: High-Throughput Diffusion LMs with Mamba Backbone	Vaibhav Singh et.al.	2511.15927	null
2025-11-19	From Qubits to Couplings: A Hybrid Quantum Machine Learning Framework for LHC Physics	Marwan Ait Haddou et.al.	2511.15672	null
2025-11-19	SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models	Senyu Fei et.al.	2511.15605	null
2025-11-19	Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi	Kayo Tei et.al.	2511.15581	null
2025-11-19	Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA	Yukun Du et.al.	2511.15551	null
2025-11-19	Partial-Wave Unitarity Bounds on Higher-Dimensional Operators from 2-to-$N$ Scattering	Céline Degrande et.al.	2511.15524	null
2025-11-19	Probing the disk-jet coupling in M87	Ainara Saiz-Pérez et.al.	2511.15482	null
2025-11-19	Robust H-infinity control and worst-case search in constrained parametric space	Ervan Kassarian et.al.	2511.15480	null
2025-11-19	Proximal Approximate Inference in State-Space Models	Hany Abdulsamad et.al.	2511.15409	null
2025-11-19	C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models	Nayoung Oh et.al.	2511.15333	null
2025-11-19	SkyEgg: Joint Implementation Selection and Scheduling for Hardware Synthesis using E-graphs	Youwei Xiao et.al.	2511.15323	null
2025-11-19	Tensor-network approach to quantum optical state evolution beyond the Fock basis	Nikolay Kapridov et.al.	2511.15295	null
2025-11-19	Reinforcement Learning in Queue-Reactive Models: Application to Optimal Execution	Tomas Espana et.al.	2511.15262	null
2025-11-19	PLATONT: Learning a Platonic Representation for Unified Network Tomography	Chengze Du et.al.	2511.15251	null
2025-11-19	Modelling and Model-Checking a ROS2 Multi-Robot System using Timed Rebeca	Hiep Hong Trinh et.al.	2511.15227	null
2025-11-19	Well-posedness and time-asymptotic of Boltzmann equations for monatomic and polyatomic mixtures	Ricardo Alonso et.al.	2511.15185	null
2025-11-19	Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance	Songze Li et.al.	2511.15164	null
2025-11-19	Robust outlier-adjusted mean-shift estimation of state-space models	Rajan Shankar et.al.	2511.15155	null
2025-11-19	TiCAL: Typicality-Based Consistency-Aware Learning for Multimodal Emotion Recognition	Wen Yin et.al.	2511.15085	null
2025-11-19	Fourier-KAN-Mamba: A Novel State-Space Equation Approach for Time-Series Anomaly Detection	Xiancheng Wang et.al.	2511.15083	null
2025-11-19	MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation	Shengjing Tian et.al.	2511.15077	null
2025-11-18	From Random Determinants to the Ground State	Hao Zhang et.al.	2511.14734	null
2025-11-18	Charged Higgs bosons associated with neutral gauge bosons at future multi-TeV muon colliders	Khiem Hong Phan et.al.	2511.14525	null
2025-11-18	Neural Networks-Enabled Channel Reconstruction for Fluid Antenna Systems: A Data-Driven Approach	Haoyu Liang et.al.	2511.14520	null
2025-11-18	D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images	Taifour Yousra Nabila et.al.	2511.14518	null
2025-11-18	Full Atom Peptide Design via Riemannian Euclidean Bayesian Flow Networks	Hao Qian et.al.	2511.14516	null
2025-11-18	Parameter Aware Mamba Model for Multi-task Dense Prediction	Xinzhuo Yu et.al.	2511.14503	null
2025-11-18	An introduction to Coupling	Artur O. Lopes et.al.	2511.14489	null
2025-11-18	Towards a Comprehensive Theory of Reservoir Computing	Denis Kleyko et.al.	2511.14484	null
2025-11-18	Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation	Aditi Agarwal et.al.	2511.14481	null
2025-11-18	Hölder regularity in bang-bang type affine optimal control problems	Alberto Domínguez Corella et.al.	2511.14459	null
2025-11-18	H-LDM: Hierarchical Latent Diffusion Models for Controllable and Interpretable PCG Synthesis from Clinical Metadata	Chenyang Xu et.al.	2511.14312	null
2025-11-18	Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation	Weimin Bai et.al.	2511.14271	null
2025-11-18	Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction	Juncheng Hu et.al.	2511.14237	null
2025-11-18	EBind: a practical approach to space binding	Jim Broadbent et.al.	2511.14229	null
2025-11-18	InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior	Weimin Bai et.al.	2511.14208	null
2025-11-18	FreeMusco: Motion-Free Learning of Latent Control for Morphology-Adaptive Locomotion in Musculoskeletal Characters	Minkwan Kim et.al.	2511.14205	null
2025-11-18	Learning Representation and Synergy Invariances: A Provable Framework for Generalized Multimodal Face Anti-Spoofing	Xun Lin et.al.	2511.14157	null
2025-11-18	State-Space Representation of INGARCH Models and Their Application in Insurance	Jae Youn Ahn et.al.	2511.14091	null
2025-11-18	Cosmological dynamics of interacting dark matter-dark energy in generalized Rastall gravity	Manuel Gonzalez-Espinoza et.al.	2511.14089	null
2025-11-18	Enhancing Non-classical Properties of Entangled Coherent States via Post-Selected von Neumann Measurements	Janarbek Yuanbek et.al.	2511.14079	null
2025-11-17	Open-shell frozen natural orbital approach for quantum eigensolvers	Angela F. Harper et.al.	2511.13677	null
2025-11-17	Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?	Chunqiu Steven Xia et.al.	2511.13646	null
2025-11-17	Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification	Linhan Zhou et.al.	2511.13575	null
2025-11-17	Coclique level structure for stochastic chemical reaction networks	Simone Bruno et.al.	2511.13569	null
2025-11-17	A Quantum Tensor Network-Based Viewpoint for Modeling and Analysis of Time Series Data	Pragatheeswaran Vipulananthan et.al.	2511.13514	null
2025-11-17	Naga: Vedic Encoding for Deep State Space Models	Melanie Schaller et.al.	2511.13510	null
2025-11-17	Explainable RL Policies by Distilling to Locally-Specialized Linear Policies with Voronoi State Partitioning	Senne Deproost et.al.	2511.13322	null
2025-11-17	Voltage-Based Unsupervised Learning Framework for Bridge Damage Detection in Simultaneous Energy Harvesting and Sensing Systems	S. Yao et.al.	2511.13291	null
2025-11-17	Spectroscopic signatures of emergent elementary excitations in a kinetically constrained long-range interacting two-dimensional spin system	Tobias Kaltenmark et.al.	2511.13279	null
2025-11-17	MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI	Malek Al Abed et.al.	2511.13232	null
2025-11-17	3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale	Yijia Fan et.al.	2511.13211	null
2025-11-17	Modeling group heterogeneity in spatio-temporal data via physics-informed semiparametric regression	Marco F. De Sanctis et.al.	2511.13203	null
2025-11-17	Video Spatial Reasoning with Object-Centric 3D Rollout	Haoran Tang et.al.	2511.13190	null
2025-11-17	Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework	Diego Ortego et.al.	2511.13189	null
2025-11-17	WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection	Longhui Zheng et.al.	2511.13138	null
2025-11-17	Departures: Distributional Transport for Single-Cell Perturbation Prediction with Neural Schrödinger Bridges	Changxi Chi et.al.	2511.13124	null
2025-11-17	Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining	Zhaocheng Yu et.al.	2511.13113	null
2025-11-17	DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection	Jiazhen Yan et.al.	2511.13108	null
2025-11-17	Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact	Satyanarayan Pati et.al.	2511.13057	null
2025-11-17	Monocular 3D Lane Detection via Structure Uncertainty-Aware Network with Curve-Point Queries	Ruixin Liu et.al.	2511.13055	null
2025-11-14	Coherent-state path integrals in quantum thermodynamics	Luca Salasnich et.al.	2511.11547	null
2025-11-14	Bridging Hidden States in Vision-Language Models	Benjamin Fein-Ashley et.al.	2511.11526	null
2025-11-14	Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective	Nhat Chung et.al.	2511.11478	null
2025-11-14	Unsupervised Motion-Compensated Decomposition for Cardiac MRI Reconstruction via Neural Representation	Xuanyu Tian et.al.	2511.11436	null
2025-11-14	Lorentz Transformation in Quantum Mechanics	Marcello Baldo et.al.	2511.11342	null
2025-11-14	BOA Constrictor: A Mamba-based lossless compressor for High Energy Physics data	Akshat Gupta et.al.	2511.11337	null
2025-11-14	RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms	Yitian Kou et.al.	2511.11323	null
2025-11-14	Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs	Jitesh Chavan et.al.	2511.11243	null
2025-11-14	Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation	Quoc-Huy Trinh et.al.	2511.11177	null
2025-11-14	Non-Gaussianity-induced enhanced target-finding dynamics of confined colloids	Guirec de Tournemire et.al.	2511.11117	link
2025-11-14	CPT symmetry in the mirror universe	Natalia Gorobey et.al.	2511.11109	null
2025-11-14	On the accuracy of the model predictive control method	Georgi Angelov et.al.	2511.11098	null
2025-11-14	A Space-Time Transformer for Precipitation Forecasting	Levi Harris et.al.	2511.11090	null
2025-11-14	Evaluating Latent Generative Paradigms for High-Fidelity 3D Shape Completion from a Single Depth Image	Matthias Humt et.al.	2511.11074	null
2025-11-14	Autonomous motion in changing environment, fibrations and reaction mechanisms	Michael Farber et.al.	2511.11042	null
2025-11-14	Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation	Zhiwei Zhang et.al.	2511.11011	null
2025-11-14	ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization	Anzhe Cheng et.al.	2511.10971	null
2025-11-14	Subgrid Stress Modelling with Multi-dimensional State Space Sequence Models	Andy Wu et.al.	2511.10910	null
2025-11-14	Tracking EEG Thalamic and Cortical Focal Brain Activity using Standardized Kalman Filtering with Kinematics Modeling	Veikka Piispa et.al.	2511.10877	null
2025-11-13	Adaptive Digital Twin of Sheet Metal Forming via Proper Orthogonal Decomposition-Based Koopman Operator with Model Predictive Control	Yi-Ping Chen et.al.	2511.10852	null
2025-11-13	Impacts of Decoder Latency on Utility-Scale Quantum Computer Architectures	Abdullah Khalid et.al.	2511.10633	null
2025-11-13	OmniVGGT: Omni-Modality Driven Visual Geometry Grounded	Haosong Peng et.al.	2511.10560	link
2025-11-13	Friction terms in multi-fluid description of heavy-ion collisions	Clemens Werthmann et.al.	2511.10487	null
2025-11-13	From Local Nonclassicality to Entanglement: A Convexity Law for Single-Excitation Dynamics	Atta ur Rahman et.al.	2511.10470	null
2025-11-13	Continuous Branching Processes with Settlement in Cancer Metastasis: Stochastic Modelling and the Feller Property	Ivan Biočić et.al.	2511.10456	null
2025-11-13	Chromatic Zeros on the Limit $G^{(p,\ell)}_\infty$ of the Family $G^{(p,\ell)}_m$ of Hierarchical Graphs	Shu-Chiuan Chang et.al.	2511.10405	null
2025-11-13	FOUND: Fourier-based von Mises Distribution for Robust Single Domain Generalization in Object Detection	Mengzhu Wang et.al.	2511.10352	null
2025-11-13	Out-of-Context Misinformation Detection via Variational Domain-Invariant Learning with Test-Time Training	Xi Yang et.al.	2511.10213	null
2025-11-13	Scalable data-driven modeling of microstructure evolution by learning local dependency and spatiotemporal translation invariance rules in phase field simulation	Zishuo Lan et.al.	2511.10171	link
2025-11-13	RI-Loss: A Learnable Residual-Informed Loss for Time Series Forecasting	Jieting Wang et.al.	2511.10130	null
2025-11-13	Geometric foundations of thermodynamics in the quantum regime	Álvaro Tejero et.al.	2511.10125	null
2025-11-13	T2IBias: Uncovering Societal Bias Encoded in the Latent Space of Text-to-Image Generative Models	Abu Sufian et.al.	2511.10089	null
2025-11-13	Efficient Thought Space Exploration through Strategic Intervention	Ziheng Li et.al.	2511.10038	null
2025-11-13	The Age-Structured Chemostat with Substrate Dynamics as a Control System	Iasson Karafyllis et.al.	2511.09963	null
2025-11-13	A Universal Block Error Rate Bound for Fluid Antenna Systems	Zhentian Zhang et.al.	2511.09929	null
2025-11-13	Boosting In-Silicon Directed Evolution with Fine-Tuned Protein Language Model and Tree Search	Yaodong Yang et.al.	2511.09900	null
2025-11-13	Interaction-induced Dimension Reduction for Bound States in Microwave-Shielded Ultracold Molecules	Haitian Wang et.al.	2511.09856	null
2025-11-12	Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models	Konstantinos M. Dafnis et.al.	2511.09809	null
2025-11-12	A Robust Task-Level Control Architecture for Learned Dynamical Systems	Eshika Pathak et.al.	2511.09790	null
2025-11-12	Ksurf-Drone: Attention Kalman Filter for Contextual Bandit Optimization in Cloud Resource Allocation	Michael Dang’ana et.al.	2511.09766	null
2025-11-12	CloudMamba: Grouped Selective State Spaces for Point Cloud Analysis	Kanglin Qu et.al.	2511.07823	link
2025-11-10	On the Redundant Distributed Observability of Mixed Traffic Transportation Systems	M. Doostmohammadian et.al.	2511.06950	null
2025-11-10	Dual Mamba for Node-Specific Representation Learning: Tackling Over-Smoothing with Selective State Space Modeling	Xin He et.al.	2511.06756	null
2025-11-08	L2T-Hyena: Enhancing State-Space Models with an Adaptive Learn-to-Teach Framework	Fatemeh Sobati et.al.	2511.05926	null
2025-11-07	Sequential Markov chain Monte Carlo for Filtering of State-Space Models with Low or Degenerate Observation Noise	Abylay Zhumekenov et.al.	2511.04975	null
2025-11-06	Generative Bayesian Filtering and Parameter Learning	Edoardo Marcelli et.al.	2511.04552	null
2025-11-06	Online Bayesian Experimental Design for Partially Observed Dynamical Systems	Sara Pérez-Vieites et.al.	2511.04403	null
2025-11-05	FAPEX: Fractional Amplitude-Phase Expressor for Robust Cross-Subject Seizure Prediction	Ruizhe Zheng et.al.	2511.03263	null
2025-11-04	Apriel-H1: Towards Efficient Enterprise Reasoning Models	Oleksiy Ostapenko et.al.	2511.02651	null
2025-11-10	MM-UNet: Morph Mamba U-shaped Convolutional Networks for Retinal Vessel Segmentation	Jiawen Liu et.al.	2511.02193	null
2025-11-03	MVSMamba: Multi-View Stereo with State Space Model	Jianfei Jiang et.al.	2511.01315	null
2025-10-31	MambaNetLK: Enhancing Colonoscopy Point Cloud Registration with Mamba	Linzhe Jiang et.al.	2511.00260	null
2025-10-31	Context-Gated Cross-Modal Perception with Visual Mamba for PET-CT Lung Tumor Segmentation	Elena Mulero Ayllón et.al.	2510.27508	null
2025-10-31	Versatile and Efficient Medical Image Super-Resolution Via Frequency-Gated Mamba	Wenfeng Huang et.al.	2510.27296	null
2025-10-31	Higher-order Linear Attention	Yifan Zhang et.al.	2510.27258	null
2025-10-30	Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling	Hyunji Lee et.al.	2510.26912	null
2025-11-04	PyDPF: A Python Package for Differentiable Particle Filtering	John-Joseph Brady et.al.	2510.25693	null
2025-10-21	Stable-by-Design Neural Network-Based LPV State-Space Models for System Identification	Ahmet Eren Sertbaş et.al.	2510.24757	null
2025-10-28	DeshadowMamba: Deshadowing as 1D Sequential Similarity	Zhaotong Yang et.al.	2510.24260	null
2025-10-27	Deep Active Inference with Diffusion Policy and Multiple Timescale World Model for Real-World Exploration and Navigation	Riko Yokozawa et.al.	2510.23258	null
2025-10-30	Hankel Singular Value Regularization for Highly Compressible State Space Models	Paul Schwerdtner et.al.	2510.22951	null
2025-10-27	GTR-Mamba: Geometry-to-Tangent Routing for Hyperbolic POI Recommendation	Zhuoxuan Li et.al.	2510.22942	null
2025-10-26	Beyond Semantics: How Temporal Biases Shape Retrieval in Transformer and State-Space Models	Anooshka Bajaj et.al.	2510.22752	null
2025-10-26	Scalable Neural Decoders for Practical Real-Time Quantum Error Correction	Changwon Lee et.al.	2510.22724	null
2025-10-24	Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging	Ying Xue et.al.	2510.21654	null
2025-11-03	ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models	Federico Danieli et.al.	2510.21450	null
2025-10-23	LLM-Integrated Bayesian State Space Models for Multimodal Time-Series Forecasting	Sungjun Cho et.al.	2510.20952	null
2025-10-22	PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation	Zhuoyang Xie et.al.	2510.19475	null
2025-10-23	Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge	Penghao Wang et.al.	2510.19266	null
2025-10-21	Δt-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction	Zhengbo Zhou et.al.	2510.19003	null
2025-10-23	MLMA: Towards Multilingual ASR With Mamba-based Architectures	Mohamed Nabih Ali et.al.	2510.18684	link
2025-10-15	DMTrack: Deformable State-Space Modeling for UAV Multi-Object Tracking with Kalman Fusion and Uncertainty-Aware Association	Zenghuang Fu et.al.	2510.17860	null
2025-10-20	S4ECG: Exploring the impact of long-range interactions for arrhythmia prediction	Tiezhi Wang et.al.	2510.17406	null
2025-10-20	CausalMamba: Scalable Conditional State Space Models for Neural Causal Inference	Sangyoon Bae et.al.	2510.17318	null
2025-10-20	Understanding and Improving Length Generalization in Hierarchical Sparse Attention Models	Jiaqi Leng et.al.	2510.17196	null
2025-10-19	Schrödinger Bridge Mamba for One-Step Speech Enhancement	Jing Yang et.al.	2510.16834	null
2025-10-17	VM-BeautyNet: A Synergistic Ensemble of Vision Transformer and Mamba for Facial Beauty Prediction	Djamel Eddine Boukhari et.al.	2510.16220	null
2025-10-17	StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales	Nyle Siddiqui et.al.	2510.16209	null
2025-10-17	Recursive Inference for Heterogeneous Multi-Output GP State-Space Models with Arbitrary Moment Matching	Tengjie Zheng et.al.	2510.15390	null
2025-10-17	Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding	Shuntaro Suzuki et.al.	2510.15371	null
2025-10-16	To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models	Eran Malach et.al.	2510.14826	null
2025-10-16	State-Space Models for Tabular Prior-Data Fitted Networks	Felix Koch et.al.	2510.14573	null
2025-10-16	A Deep State-Space Model Compression Method using Upper Bound on Output Error	Hiroki Sakamoto et.al.	2510.14542	null
2025-10-16	SHaRe-SSM: An Oscillatory Spiking Neural Network for Target Variable Modeling in Long Sequences	Kartikay Agrawal et.al.	2510.14386	null
2025-10-16	DRBD-Mamba for Robust and Efficient Brain Tumor Segmentation with Analytical Insights	Danish Ali et.al.	2510.14383	null
2025-10-15	Context-Selective State Space Models: Feedback is All You Need	Riccardo Zattra et.al.	2510.14027	null
2025-10-16	The Mechanistic Emergence of Symbol Grounding in Language Models	Shuyu Wu et.al.	2510.13796	null
2025-10-14	One Dimensional CNN ECG Mamba for Multilabel Abnormality Classification in 12 Lead ECG	Huawei Jiang et.al.	2510.13046	null
2025-10-14	State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding	Jiahuan Zhou et.al.	2510.12160	link
2025-10-14	Chimera: State Space Models Beyond Sequences	Aakash Lahoti et.al.	2510.12111	link
2025-10-13	Argus: JAX state-space filtering for gravitational wave detection with a pulsar timing array	Tom Kimpson et.al.	2510.11077	null
2025-10-13	High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation	Runyang Feng et.al.	2510.11017	null
2025-10-16	MSF-Mamba: Motion-aware State Fusion Mamba for Efficient Micro-Gesture Recognition	Deng Li et.al.	2510.10478	null
2025-10-10	Design Principles for Sequence Models via Coefficient Dynamics	Jerome Sieber et.al.	2510.09389	null
2025-10-10	Task-Level Insights from Eigenvalues across Sequence Models	Rahel Rickenbach et.al.	2510.09379	null
2025-10-10	Minkowski-MambaNet: A Point Cloud Framework with Selective State Space Models for Forest Biomass Quantification	Jinxiang Tu et.al.	2510.09367	null
2025-10-10	MambaH-Fit: Rethinking Hyper-surface Fitting-based Point Cloud Normal Estimation via State Space Modelling	Weijia Wang et.al.	2510.09088	null
2025-10-13	Revisiting Node Affinity Prediction in Temporal Graphs	Krishna Sri Ipsit Mantri et.al.	2510.06940	null
2025-10-08	DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining	Zhiliang Zhu et.al.	2510.06746	null
2025-10-08	A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures	Nhat M. Hoang et.al.	2510.06640	null
2025-10-09	Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection?	Sri Durga Sai Sowmya Kadali et.al.	2510.06594	null
2025-10-09	High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training	Zhuoyi Huang et.al.	2510.05492	null
2025-10-06	The End of Transformers? On Challenging Attention and the Rise of Sub-Quadratic Architectures	Alexander M. Fichtl et.al.	2510.05364	null
2025-10-06	Rivaling Transformers: Multi-Scale Structured State-Space Mixtures for Agentic 6G O-RAN	Farhad Rezazadeh et.al.	2510.05255	null
2025-10-06	On Structured State-Space Duality	Jerry Yao-Chieh Hu et.al.	2510.04944	null
2025-10-06	MCMC for State Space models	Paul Fearnhead et.al.	2510.04932	null
2025-10-06	Hybrid Architectures for Language Models: Systematic Analysis and Design Insights	Sangmin Bae et.al.	2510.04800	null
2025-10-06	Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba	Baher Mohammad et.al.	2510.04738	null
2025-10-05	Wave-PDE Nets: Trainable Wave-Equation Layers as an Alternative to Attention	Harshil Vejendla et.al.	2510.04304	null
2025-10-09	The Curious Case of In-Training Compression of State Space Models	Makram Chahine et.al.	2510.02823	null
2025-10-02	Accurate linear modeling of EEG-based cortical activity during a passive motor task with input: a sub-space identification approach	Sanna Bakels et.al.	2510.02596	null
2025-10-02	Bridging the Prediction Error Method and Subspace Identification: A Weighted Null Space Fitting Method	Jiabao He et.al.	2510.02529	null
2025-10-01	Linear RNNs for autoregressive generation of long music samples	Konrad Szewczyk et.al.	2510.02401	null
2025-09-30	Dynamic Modeling and Control System Analysis for Continuous-Disc Filters in Pulp Mill Operations	Jose M. Campos-Salazar et.al.	2510.02385	null
2025-10-02	Knots and variance ordering of sequential Monte Carlo algorithms	Joshua J Bon et.al.	2510.01901	null
2025-10-01	Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model	Hyun-kyu Ko et.al.	2510.00862	link
2025-10-01	Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models	JingChuan Guan et.al.	2510.00563	null
2025-09-30	PRISM: Progressive Rain removal with Integrated State-space Modeling	Pengze Xue et.al.	2509.26413	null
2025-09-30	Neural Network State-Space Estimators	Minxing Sun et.al.	2509.25959	null
2025-09-30	Bringing Emerging Architectures to Sequence Labeling in NLP	Ana Ezquerro et.al.	2509.25918	null
2025-09-29	Benchmarking ECG Foundational Models: A Reality Check Across Clinical Tasks	M A Al-Masud et.al.	2509.25095	link
2025-09-29	DyMoDreamer: World Modeling with Dynamic Modulation	Boxuan Zhang et.al.	2509.24804	link
2025-09-29	Q-Net: Transferable Queue Length Estimation via Kalman-based Neural Networks	Ting Gao et.al.	2509.24725	null
2025-09-29	Wavelet-Assisted Mamba for Satellite-Derived Sea Surface Temperature Super-Resolution	Wankun Chen et.al.	2509.24334	null
2025-09-29	Similarity-Aware Selective State-Space Modeling for Semantic Correspondence	Seungwook Kim et.al.	2509.24318	link
2025-09-28	HyMaTE: A Hybrid Mamba and Transformer Model for EHR Representation Learning	Md Mozaharul Mottalib et.al.	2509.24118	link
2025-09-28	Hazy Pedestrian Trajectory Prediction via Physical Priors and Graph-Mamba	Jian Chen et.al.	2509.24020	null
2025-09-28	Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression	Jiarui Jiang et.al.	2509.23779	link
2025-10-04	EfficientMIL: Efficient Linear-Complexity MIL Method for WSI Classification	Chengying She et.al.	2509.23640	link
2025-09-26	TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses	Sahar Dastani et.al.	2509.22813	link
2025-09-26	StateX: Enhancing RNN Recall via Post-training State Expansion	Xingyu Shen et.al.	2509.22630	null
2025-09-26	Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models	Aleksandar Terzić et.al.	2509.22284	null
2025-09-25	MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation	Xinyu Liu et.al.	2509.21265	null
2025-09-26	Aligning Inductive Bias for Data-Efficient Generalization in State Space Models	Qiyu Chen et.al.	2509.20789	null
2025-09-24	SpecMamba: Accelerating Mamba Inference on FPGA with Speculative Decoding	Linfeng Zhong et.al.	2509.19873	null
2025-09-24	RoboSSM: Scalable In-context Imitation Learning via State-Space Models	Youngju Yoo et.al.	2509.19658	null
2025-09-23	Mamba Modulation: On the Length Generalization of Mamba	Peng Lu et.al.	2509.19633	null
2025-09-23	Tractable Approximation of Labeled Multi-Object Posterior Densities	Thi Hong Thai Nguyen et.al.	2509.18780	null
2025-09-23	An overview of neural architectures for self-supervised audio representation learning from masked spectrograms	Sarthak Yadav et.al.	2509.18691	null
2025-09-23	LEAF-Mamba: Local Emphatic and Adaptive Fusion State Space Model for RGB-D Salient Object Detection	Lanhu Wu et.al.	2509.18683	null
2025-09-23	LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA	Zeyi Kang et.al.	2509.18576	null
2025-09-22	Bayesian Nonhomogeneous hidden Markov models to leverage routine in physical activity monitoring with informative wear time	Beatrice Cantoni et.al.	2509.17806	null
2025-09-22	DA-Mamba: Dialogue-aware selective state-space model for multimodal engagement estimation	Shenwei Kang et.al.	2509.17711	null
2025-09-22	Achilles’ Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data	Tianyi Chen et.al.	2509.17514	null
2025-09-21	SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction	Djamel Eddine Boukhari et.al.	2509.17172	null
2025-09-21	Communication over LQG Control Systems: A Convex Optimization Approach to Capacity	Aharon Rips et.al.	2509.17002	null
2025-09-19	Estimating Clinical Lab Test Result Trajectories from PPG using Physiological Foundation Model and Patient-Aware State Space Model – a UNIPHY+ Approach	Minxiao Wang et.al.	2509.16345	null
2025-09-19	Mamba-2 audio captioning: design space exploration and analysis	Taehan Lee et.al.	2509.15680	null
2025-09-19	De-crackling Virtual Analog Controls with Asymptotically Stable Recurrent Neural Networks	Valtteri Kallinen et.al.	2509.15622	null
2025-09-19	DC-Mamba: Bi-temporal deformable alignment and scale-sparse enhancement for remote sensing change detection	Min Sun et.al.	2509.15563	null
2025-09-17	Classification Filtering	Ilker Bayram et.al.	2509.13975	null
2025-09-17	Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models	Motonari Kambara et.al.	2509.13839	null
2025-09-17	CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling	Hanfang Liang et.al.	2509.13784	null
2025-09-17	State Space Models over Directed Graphs	Junzhi She et.al.	2509.13735	null
2025-09-16	Multivariate Low-Rank State-Space Model with SPDE Approach for High-Dimensional Data	Jacopo Rodeschini et.al.	2509.12825	null
2025-09-15	U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT	Zhi Qin Tan et.al.	2509.12069	null
2025-09-15	AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective	Yuchen Deng et.al.	2509.12052	null
2025-09-15	Joint-octamamba: an octa joint segmentation network based on feature enhanced mamba	Chuang Liu et.al.	2509.11649	null
2025-09-14	MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation	Syed Talal Wasim et.al.	2509.11394	null
2025-09-14	MEMBOT: Memory-Based Robot in Intermittent POMDP	Youzhi Liang et.al.	2509.11225	null
2025-09-12	FLARE-SSM: Deep State Space Models with Influence-Balanced Loss for 72-Hour Solar Flare Prediction	Yusuke Takagi et.al.	2509.09988	null
2025-09-12	MAESTRO: Multi-modal Adaptive Estimation for Temporal Respiratory Disease Outbreak	Hong Liu et.al.	2509.08578	null
2025-09-10	First-order State Space Model for Lightweight Image Super-resolution	Yujie Zhu et.al.	2509.08458	null
2025-09-09	A kernel-based approach to physics-informed nonlinear system identification	Cesare Donati et.al.	2509.07634	null
2025-09-07	Recursive State Inference for Linear PASFA	Vishal Rishi et.al.	2509.07028	null
2025-09-06	Hyperbolic Large Language Models	Sarang Patil et.al.	2509.05757	null
2025-09-05	A Bayesian Gaussian Process Dynamic Factor Model	Tony Chernis et.al.	2509.04928	null
2025-09-05	CD-Mamba: Cloud detection with long-range spatial dependency modeling	Tianxiang Xue et.al.	2509.04729	link
2025-09-04	VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation	Mustafa Munir et.al.	2509.04669	null
2025-09-04	Echo State Networks as State-Space Models: A Systems Perspective	Pradeep Singh et.al.	2509.04422	null
2025-09-04	Rethinking the long-range dependency in Mamba/SSM and transformer models	Cong Ma et.al.	2509.04226	null
2025-09-03	Time-Scaling State-Space Models for Dense Video Captioning	AJ Piergiovanni et.al.	2509.03426	null
2025-09-03	S2M2ECG: Spatio-temporal bi-directional State Space Model Enabled Multi-branch Mamba for ECG	Huaicheng Zhang et.al.	2509.03066	null
2025-09-02	Mentality: A Mamba-based Approach towards Foundation Models for EEG	Saarang Panchavati et.al.	2509.02746	null
2025-09-02	ESTM: An Enhanced Dual-Branch Spectral-Temporal Mamba for Anomalous Sound Detection	Chengyuan Ma et.al.	2509.02471	null
2025-09-02	AudioRWKV: Efficient and Stable Bidirectional RWKV for Audio Pattern Recognition	Jiayu Xiong et.al.	2509.02167	null
2025-09-01	A Mathematical Model of Hybrid Microgrid With Pole Placement Controller Using State Feedback For Stability Improvement	Yangyadatta Tripathy et.al.	2509.01749	null
2025-09-01	Mamba-CNN: A Hybrid Architecture for Efficient and Accurate Facial Beauty Prediction	Djamel Eddine Boukhari et.al.	2509.01431	null
2025-09-01	StoxLSTM: A Stochastic Extended Long Short-Term Memory Network for Time Series Forecasting	Zihao Wang et.al.	2509.01187	null
2025-09-01	SpectMamba: Integrating Frequency and State Space Models for Enhanced Medical Image Detection	Yao Wang et.al.	2509.01080	null
2025-08-31	Prospects of Imitating Trading Agents in the Stock Market	Mateusz Wilinski et.al.	2509.00982	null
2025-08-31	CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification	Qingyu Wang et.al.	2509.00677	null
2025-08-31	MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation	Aviral Chharia et.al.	2509.00649	link
2025-08-30	COMET: A Framework for Modeling Compound Operation Dataflows with Explicit Collectives	Shubham Negi et.al.	2509.00599	null
2025-08-30	SemaMIL: Semantic Reordering with Retrieval-Guided State Space Modeling for Whole Slide Image Classification	Lubin Gan et.al.	2509.00442	null
2025-08-29	Quantum-Optimized Selective State Space Model for Efficient Time Series Prediction	Stefan-Alexandru Jura et.al.	2509.00259	null
2025-07-24	PointLAMA: Latent Attention meets Mamba for Efficient Point Cloud Pretraining	Xuanyu Lin et.al.	2507.17296	null
2025-06-17	MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration	Bingxi Liu et.al.	2506.13183	null
2025-05-20	Mamba-Adaptor: State Space Model Adaptor for Visual Recognition	Fei Xie et.al.	2505.12685	null
2025-03-18	TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba	Jiaxu Liu et.al.	2503.13004	null
2025-03-21	MambaTron: Efficient Cross-Modal Point Cloud Enhancement using Aggregate Selective State Space Modeling	Sai Tarun Inaganti et.al.	2501.16384	null
2025-02-27	Spatial-Mamba: Effective Visual State Space Models via Structure-aware State Fusion	Chaodong Xiao et.al.	2410.15091	link
2024-07-18	Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model	Tao Wang et.al.	2407.12319	null
2025-01-14	Pamba: Enhancing Global Interaction in Point Clouds via State Space Model	Zhuoyuan Li et.al.	2406.17442	null
2024-06-11	PointABM: Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis	Jia-wei Chen et.al.	2406.06069	null
2024-06-18	PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis	Zicheng Wang et.al.	2405.15463	link
2024-05-08	Vision Mamba: A Comprehensive Survey and Taxonomy	Xiao Liu et.al.	2405.04404	link
2024-11-12	Visual Mamba: A Survey and New Outlooks	Rui Xu et.al.	2404.18861	link
2024-04-29	A Survey on Visual Mamba	Hanwei Zhang et.al.	2404.15956	null
2025-01-10	3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering	Qingyuan Zhou et.al.	2404.05522	link
2024-03-19	Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy	Jiuming Liu et.al.	2403.06467	null
2024-06-25	MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection	Tianxiang Chen et.al.	2403.02148	link
2024-10-14	Point Cloud Mamba: Point Cloud Learning via State Space Model	Tao Zhang et.al.	2403.00762	link
2024-11-26	PointMamba: A Simple State Space Model for Point Cloud Analysis	Dingkang Liang et.al.	2402.10739	link
2024-11-15	Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model	Lianghui Zhu et.al.	2401.09417	link

(<a href=#updated-on-20260429>back to top</a>)

HSI Classification

Publish Date	Title	Authors	PDF	Code
2026-04-26	A Synergistic CNN-Transformer Network with Pooling Attention Fusion for Hyperspectral Image Classification	Peng Chen et.al.	2604.23622	null
2026-04-22	ALAS: Adaptive Long-Horizon Action Synthesis via Async-pathway Stream Disentanglement	Yutong Shen et.al.	2604.20721	null
2026-04-22	Semi-Supervised Flow Matching for Mosaiced and Panchromatic Fusion Imaging	Peiming Luo et.al.	2604.20128	null
2026-04-21	GRAFT: Geometric Refinement and Fitting Transformer for Human Scene Reconstruction	Pradyumna YM et.al.	2604.19624	null
2026-04-21	ExplainS2A: Explainable Spectral-Spatial Duality Model for Fast Transforming Sentinel-2 Image to AVIRIS-Level Hyperspectral Image	Chia-Hsiang Lin et.al.	2604.19007	null
2026-04-20	ConvVitMamba: Efficient Multiscale Convolution, Transformer, and Mamba-Based Sequence modelling for Hyperspectral Image Classification	Mohammed Q. Alkhatib et.al.	2604.18856	null
2026-04-19	HyKey: Hyperspectral Keypoint Detection and Matching in Minimally Invasive Surgery	Alexander Saikia et.al.	2604.17446	null
2026-04-17	From Articles to Canopies: Knowledge-Driven Pseudo-Labelling for Tree Species Classification using LLM Experts	Michał Romaszewski et.al.	2604.16115	null
2026-04-17	SSFT: A Lightweight Spectral-Spatial Fusion Transformer for Generic Hyperspectral Classification	Alexander Musiat et.al.	2604.15828	null
2026-04-14	Deep Spatially-Regularized and Superpixel-Based Diffusion Learning for Unsupervised Hyperspectral Image Clustering	Vutichart Buranasiri et.al.	2604.13307	null
2026-04-14	Spatial-Spectral Adaptive Fidelity and Noise Prior Reduction Guided Hyperspectral Image Denoising	Xuelin Xie et.al.	2604.12600	null
2026-04-10	Unmixing-Guided Spatial-Spectral Mamba with Clustering Tokens for Hyperspectral Image Classification	Yimin Zhu et.al.	2604.09948	null
2026-04-10	HM-Bench: A Comprehensive Benchmark for Multimodal Large Language Models in Hyperspectral Remote Sensing	Xinyu Zhang et.al.	2604.08884	null
2026-04-09	Optical spin defect pairs in cubic boron nitride	Josiah E. Hsi et.al.	2604.08737	null
2026-04-09	Preventing Overfitting in Deep Image Prior for Hyperspectral Image Denoising	Panagiotis Gkotsis et.al.	2604.08272	null
2026-04-14	MVOS_HSI: A Python Library for Preprocessing Agricultural Crop Hyperspectral Data	Rishik Aggarwal et.al.	2604.07656	null
2026-04-08	Accelerating 4D Hyperspectral Imaging through Physics-Informed Neural Representation and Adaptive Sampling	Chi-Jui Ho et.al.	2604.06561	null
2026-04-07	ASSR-Net: Anisotropic Structure-Aware and Spectrally Recalibrated Network for Hyperspectral Image Fusion	Qiya Song et.al.	2604.05742	null
2026-04-06	InfBaGel: Human-Object-Scene Interaction Generation with Dynamic Perception and Iterative Refinement	Yude Zou et.al.	2604.04843	null
2026-04-02	Cosine-Normalized Attention for Hyperspectral Image Classification	Muhammad Ahmad et.al.	2604.01763	null
2026-03-27	Learnable Quantum Efficiency Filters for Urban Hyperspectral Segmentation	Imad Ali Shah et.al.	2603.26528	null
2026-03-27	HyVIC: A Metric-Driven Spatio-Spectral Hyperspectral Image Compression Architecture Based on Variational Autoencoders	Martin Hermann Paul Fuchs et.al.	2603.26468	null
2026-03-26	Narrowband searches for continuous gravitational waves from known pulsars in the first two parts of the fourth LIGO–Virgo–KAGRA observing run	The LIGO Scientific Collaboration et.al.	2603.25938	null
2026-04-02	Searches for Continuous Gravitational Waves from Supernova Remnants in the first part of the LIGO-Virgo-KAGRA Fourth Observing run	The LIGO Scientific Collaboration et.al.	2603.25808	null
2026-03-26	Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case	Koldo Basterretxea et.al.	2603.25510	null
2026-03-26	Underdetermined Blind Source Separation via Weighted Simplex Shrinkage Regularization and Quantum Deep Image Prior	Chia-Hsiang Lin et.al.	2603.25384	null
2026-03-25	Connecting Meteorite Spectra to Lunar Surface Composition Using Hyperspectral Imaging and Machine Learning	Fatemeh Fazel Hesar et.al.	2603.24323	null
2026-03-25	LGEST: Dynamic Spatial-Spectral Expert Routing for Hyperspectral Image Classification	Jiawen Wen et.al.	2603.24045	null
2026-03-23	A Latent Representation Learning Framework for Hyperspectral Image Emulation in Remote Sensing	Chedly Ben Azizi et.al.	2603.21911	null
2026-03-23	Hyperspectral imaging solutions for brain tissue metabolic and haemodynamic monitoring: an updated perspective	Luca Giannoni et.al.	2603.21732	null
2026-04-01	Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability	Jiahui Song et.al.	2603.21510	null
2026-03-19	HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting	Songfeng Zhu et.al.	2603.20292	null
2026-03-19	GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants	The LIGO Scientific Collaboration et.al.	2603.19021	null
2026-03-19	GWTC-4.0: Tests of General Relativity. II. Parameterized Tests	The LIGO Scientific Collaboration et.al.	2603.19020	null
2026-03-19	GWTC-4.0: Tests of General Relativity. I. Overview and General Tests	The LIGO Scientific Collaboration et.al.	2603.19019	null
2026-03-17	Spectral Property-Driven Data Augmentation for Hyperspectral Single-Source Domain Generalization	Taiqin Chen et.al.	2603.16662	null
2026-03-17	3D Fourier-based Global Feature Extraction for Hyperspectral Image Classification	Muhammad Ahmad et.al.	2603.16426	null
2026-03-16	HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions	Yukang Cao et.al.	2603.15612	null
2026-03-15	All-sky Searches for Continuous Gravitational Waves from Isolated Neutron Stars in the Data from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run	The LIGO Scientific Collaboration et.al.	2603.14168	null
2026-03-14	Bidirectional Cross-Attention Fusion of High-Res RGB and Low-Res HSI for Multimodal Automated Waste Sorting	Jonas V. Funk et.al.	2603.13941	null
2026-03-12	Blind Hyperspectral and Multispectral Images Fusion: A Unified Tensor Fusion Framework from Coupled Inverse Problem Perspective	Ying Gao et.al.	2603.11530	null
2026-03-09	Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning	Yingkai Zhang et.al.	2603.07918	null
2026-03-24	Spectral Gaps and Spatial Priors: Studying Hyperspectral Downstream Adaptation Using TerraMind	Julia Anna Leonardi et.al.	2603.06690	null
2026-03-03	Unmixing microinfrared spectroscopic images of cross-sections of historical oil paintings	Shivam Pande et.al.	2603.06673	null
2026-03-05	Towards 3D Scene Understanding of Gas Plumes in LWIR Hyperspectral Images Using Neural Radiance Fields	Scout Jarman et.al.	2603.05473	null
2026-03-05	A Benchmark Study of Neural Network Compression Methods for Hyperspectral Image Classification	Sai Shi et.al.	2603.04720	null
2026-03-03	mHC-HSI: Clustering-Guided Hyper-Connection Mamba for Hyperspectral Image Classification	Yimin Zhu et.al.	2603.03418	null
2026-03-02	RoboGPU: Accelerating GPU Collision Detection for Robotics	Lufei Liu et.al.	2603.01517	null
2026-03-01	VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification	Abdellah Zakaria Sellam et.al.	2603.01174	null
2026-02-18	HS-3D-NeRF: 3D Surface and Hyperspectral Reconstruction From Stationary Hyperspectral Images Using Multi-Channel NeRFs	Kibon Ku et.al.	2602.16950	null
2026-02-11	Benchmarking Deep Learning and Statistical Target Detection Methods for PFM-1 Landmine Detection in UAV Hyperspectral Imagery	Sagar Lekhak et.al.	2602.10434	null
2026-02-04	DMS2F-HAD: A Dual-branch Mamba-based Spatial-Spectral Fusion Network for Hyperspectral Anomaly Detection	Aayushma Pant et.al.	2602.04102	null
2026-02-02	DSXFormer: Dual-Pooling Spectral Squeeze-Expansion and Dynamic Context Attention Transformer for Hyperspectral Image Classification	Farhan Ullah et.al.	2602.01906	null
2026-01-31	HSI-VAR: Rethinking Hyperspectral Restoration through Spatial-Spectral Visual Autoregression	Xiangming Wang et.al.	2602.00749	null
2026-01-31	From Prompt to Graph: Comparing LLM-Based Information Extraction Strategies in Domain-Specific Ontology Development	Xuan Liu et.al.	2602.00699	null
2026-01-31	HSSDCT: Factorized Spatial-Spectral Correlation for Hyperspectral Image Fusion	Chia-Ming Lee et.al.	2602.00490	null
2026-01-30	Cross-Domain Few-Shot Learning for Hyperspectral Image Classification Based on Mixup Foundation Model	Naeem Paeedeh et.al.	2601.22581	null
2026-01-29	SR$^{2}$-Net: A General Plug-and-Play Model for Spectral Refinement in Hyperspectral Image Super-Resolution	Ji-Xuan He et.al.	2601.21338	null
2026-01-27	Dynamic Worlds, Dynamic Humans: Generating Virtual Human-Scene Interaction Motion in Dynamic Scenes	Yin Wang et.al.	2601.19484	null
2026-01-26	AI-enabled Satellite Edge Computing: A Single-Pixel Feature based Shallow Classification Model for Hyperspectral Imaging	Li Fang et.al.	2601.18560	null
2026-01-26	Cross-Domain Transfer with Self-Supervised Spectral-Spatial Modeling for Hyperspectral Image Classification	Jianshu Chao et.al.	2601.18088	null
2026-01-26	Semi-Supervised Hyperspectral Image Classification with Edge-Aware Superpixel Label Propagation and Adaptive Pseudo-Labeling	Yunfei Qiu et.al.	2601.18049	null
2026-01-24	HyDeMiC: A Deep Learning-based Mineral Classifier using Hyperspectral Data	M. L. Mamud et.al.	2601.17352	null
2026-01-22	Clustering-Guided Spatial-Spectral Mamba for Hyperspectral Image Classification	Zack Dewis et.al.	2601.16098	null
2026-01-22	Multimodal Imaging System Combining Hyperspectral and Laser Speckle Imaging for In Vivo Hemodynamic and Metabolic Monitoring	Junda Wang et.al.	2601.15947	null
2026-01-22	White-Box mHC: Electromagnetic Spectrum-Aware and Interpretable Stream Interactions for Hyperspectral Image Classification	Yimin Zhu et.al.	2601.15757	null
2026-01-20	SHARE: A Fully Unsupervised Framework for Single Hyperspectral Image Restoration	Jiangwei Xie et.al.	2601.13987	null
2026-01-18	Utilizing the Score of Data Distribution for Hyperspectral Anomaly Detection	Jiahui Sheng et.al.	2601.12379	null
2026-01-18	Turbo-GoDec: Exploiting the Cluster Sparsity Prior for Hyperspectral Anomaly Detection	Jiahui Sheng et.al.	2601.12337	null
2026-01-16	Anisotropic Tensor Deconvolution of Hyperspectral Images	Xinjue Wang et.al.	2601.11694	null
2026-01-13	MMLGNet: Cross-Modal Alignment of Remote Sensing Data using CLIP	Aditya Chaudhary et.al.	2601.08420	null
2026-01-12	SDHSI-Net: Learning Better Representations for Hyperspectral Images via Self-Distillation	Prachet Dev Singh et.al.	2601.07416	null
2026-01-11	Adversarial Attacks on Medical Hyperspectral Imaging Exploiting Spectral-Spatial Dependencies and Multiscale Features	Yunrui Gu et.al.	2601.07056	null
2026-01-08	EdgeLDR: Quaternion Low-Displacement Rank Neural Networks for Edge-Efficient Deep Learning	Vladimir Frants et.al.	2601.05379	null
2026-01-07	HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object Detection	Shuyan Bai et.al.	2601.03736	null
2026-01-03	Efficient Hyperspectral Image Reconstruction Using Lightweight Separate Spectral Transformers	Jianan Li et.al.	2601.01064	null
2025-12-30	Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges	Yu-Tang Chang et.al.	2512.24172	null
2025-12-25	Degradation-Aware Metric Prompting for Hyperspectral Image Restoration	Binfeng Wang et.al.	2512.20251	null
2025-12-22	Rethinking Coupled Tensor Analysis for Hyperspectral Super-Resolution: Recoverable Modeling Under Endmember Variability	Meng Ding et.al.	2512.19489	null
2026-01-21	Constraints on gravitational waves from the 2024 Vela pulsar glitch	The LIGO Scientific Collaboration et.al.	2512.17990	null
2025-12-22	A Parametric Framework for Anticipatory Flashflood Warning: Integrating Landscape Vulnerability with Precipitation Forecasts	Xiangpeng Li et.al.	2512.17785	null
2025-12-16	Bridging the Gap Between Modern UX Design and Particle Accelerator Control Room Interfaces	Rachael Hill et.al.	2512.14872	null
2025-12-11	Perception-Inspired Color Space Design for Photo White Balance Editing	Yang Cheng et.al.	2512.09383	null
2025-12-08	Agreement Disagreement Guided Knowledge Transfer for Cross-Scene Hyperspectral Imaging	Lu Huo et.al.	2512.08990	null
2025-12-08	Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration	Lu Huo et.al.	2512.08989	null
2025-12-05	Hyperspectral Unmixing with 3D Convolutional Sparse Coding and Projected Simplex Volume Maximization	Gargi Panda et.al.	2512.05674	null
2025-12-03	Label-Efficient Hyperspectral Image Classification via Spectral FiLM Modulation of Low-Level Pretrained Diffusion Features	Yuzhen Hu et.al.	2512.03430	null
2025-12-02	PyroFocus: A Deep Learning Approach to Real-Time Wildfire Detection in Multispectral Remote Sensing Imagery	Mark Moussa et.al.	2512.03257	null
2025-11-29	UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations	Yuzhen Hu et.al.	2512.00261	null
2025-12-05	Search for planetary-mass ultra-compact binaries using data from the first part of the LIGO–Virgo–KAGRA fourth observing run	The LIGO Scientific Collaboration et.al.	2511.19911	null
2025-11-23	LRDUN: A Low-Rank Deep Unfolding Network for Efficient Spectral Compressive Imaging	He Huang et.al.	2511.18513	null
2025-11-23	Uncertainty Quantification in HSI Reconstruction using Physics-Aware Diffusion Priors and Optics-Encoded Measurements	Juan Romero et.al.	2511.18473	null
2025-11-22	Spectral Super-Resolution Neural Operator with Atmospheric Radiative Transfer Prior	Ziye Zhang et.al.	2511.17895	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	null
2025-11-21	Real Noise Decoupling for Hyperspectral Image Denoising	Yingkai Zhang et.al.	2511.17196	null
2025-12-04	All-sky search for continuous gravitational-wave signals from unknown neutron stars in binary systems in the first part of the fourth LIGO-Virgo-KAGRA observing run	The LIGO Scientific Collaboration et.al.	2511.16863	null
2025-11-20	SpectralTrain: A Universal Framework for Hyperspectral Image Classification	Meihua Zhou et.al.	2511.16084	null
2025-11-19	Hyperspectral Image Classification using Spectral-Spatial Mixer Network	Mohammed Q. Alkhatib et.al.	2511.15692	null
2025-11-24	Multimodal Optical Imaging Platform for Quantitative Burn Assessment	Nathaniel Hanson et.al.	2511.15509	null
2025-11-19	Hyperspectral Super-Resolution with Inter-Image Variability via Degradation-based Low-Rank and Residual Fusion Method	Yue Wen et.al.	2511.15052	null
2025-11-17	Human-centric Maintenance Process Through Integration of AI, Speech, and AR	Parul Khanna et.al.	2511.13918	null
2025-11-17	SpectralAdapt: Semi-Supervised Domain Adaptation with Spectral Priors for Human-Centered Hyperspectral Image Reconstruction	Yufei Wen et.al.	2511.13020	null
2025-12-19	CLAReSNet: When Convolution Meets Latent Attention for Hyperspectral Image Classification	Asmit Bandyopadhyay et.al.	2511.12346	null
2025-11-15	Multimodal RGB-HSI Feature Fusion with Patient-Aware Incremental Heuristic Meta-Learning for Oral Lesion Classification	Rupam Mukherjee et.al.	2511.12268	null
2025-11-13	Exposing DeepFakes via Hyperspectral Domain Mapping	Aditya Mehta et.al.	2511.11732	null
2025-11-13	Perceive, Act and Correct: Confidence Is Not Enough for Hyperspectral Classification	Muzhou Yang et.al.	2511.10068	null
2025-11-11	HyperScout-H: the hyperspectral imager for the ESA Hera mission	Marcel M. Popescu et.al.	2511.08047	null
2025-11-10	GEWDiff: Geometric Enhanced Wavelet-based Diffusion Model for Hyperspectral Image Super-resolution	Sirui Wang et.al.	2511.07103	link
2025-10-31	SpecAware: A Spectral-Content Aware Foundation Model for Unifying Multi-Sensor Learning in Hyperspectral Remote Sensing Mapping	Renjie Ji et.al.	2510.27219	null
2025-11-13	Direct multi-model dark-matter search with gravitational-wave interferometers using data from the first part of the fourth LIGO-Virgo-KAGRA observing run	The LIGO Scientific Collaboration et.al.	2510.27022	null
2025-10-30	GW241011 and GW241110: Exploring Binary Formation and Fundamental Physics with Asymmetric, High-Spin Black Hole Coalescence	The LIGO Scientific Collaboration et.al.	2510.26931	null
2025-11-07	Cosmological and High Energy Physics implications from gravitational-wave background searches in LIGO-Virgo-KAGRA’s O1-O4a runs	The LIGO Scientific Collaboration et.al.	2510.26848	null
2025-10-23	SpectraMorph: Structured Latent Learning for Self-Supervised Hyperspectral Super-Resolution	Ritik Shah et.al.	2510.20814	null
2025-10-20	Directional Search for Persistent Gravitational Waves: Results from the First Part of LIGO-Virgo-KAGRA’s Fourth Observing Run	The LIGO Scientific Collaboration et.al.	2510.17487	null
2025-10-18	HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications	Christopher Thirgood et.al.	2510.16664	null
2025-10-15	Near-Infrared Hyperspectral Imaging Applications in Food Analysis – Improving Algorithms and Methodologies	Ole-Christian Galbo Engstrøm et.al.	2510.13452	null
2025-10-14	Benchmarking foundation models for hyperspectral image classification: Application to cereal crop type mapping	Walid Elbarz et.al.	2510.11576	null
2025-10-13	Directly Mapping Interacting Components to Complex Systems’ Emergent Properties	Lina Yan et.al.	2510.10881	null
2025-10-10	SpectralCA: Bi-Directional Cross-Attention for Next-Generation UAV Hyperspectral Vision	D. V. Brovko et.al.	2510.09912	null
2025-10-09	Hyperspectral data augmentation with transformer-based diffusion models	Mattia Ferrari et.al.	2510.08363	null
2025-10-08	Label Semantics for Robust Hyperspectral Image Classification	Rafin Hassan et.al.	2510.07556	null
2025-10-06	In-Field Mapping of Grape Yield and Quality with Illumination-Invariant Deep Learning	Ciem Cornelissen et.al.	2510.04864	null
2025-10-02	Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction	Yi Ai et.al.	2510.01912	null
2025-10-01	Towards Adversarial Training under Hyperspectral Images	Weihua Zhang et.al.	2510.01014	null
2025-09-28	Joint Superpixel and Self-Representation Learning for Scalable Hyperspectral Image Clustering	Xianlu Li et.al.	2509.24027	null
2025-09-28	Generalized Category Discovery in Hyperspectral Images via Prototype Subspace Modeling	Xianlu Li et.al.	2509.24017	null
2025-09-20	Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment	Abhiroop Chatterjee et.al.	2509.22697	null
2025-09-25	Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models	Juana Valeria Hurtado et.al.	2509.20107	null
2025-09-21	SwarmChat: An LLM-Based, Context-Aware Multimodal Interaction System for Robotic Swarms	Ettilla Mohiuddin Eumi et.al.	2509.16920	null
2025-09-20	Spectral Compressive Imaging via Chromaticity-Intensity Decomposition	Xiaodong Wang et.al.	2509.16690	null
2025-09-16	Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation	Hugo Carlesso et.al.	2509.13229	null
2025-09-15	Progressive Flow-inspired Unfolding for Spectral Compressive Imaging	Xiaodong Wang et.al.	2509.12079	null
2025-09-19	USCTNet: A deep unfolding nuclear-norm optimization solver for physically consistent HSI reconstruction	Xiaoyang Ma et.al.	2509.10651	null
2025-09-12	Nanosculpting lateral weak link junctions in superconducting Fe(Te,Se)/Bi2Te3 with focused Si++ ions and implications on vortex pinning	Debarghya Mallick et.al.	2509.10606	null
2025-09-11	CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution	Yulin Tong et.al.	2509.09163	null
2025-09-22	HyperTTA: Test-Time Adaptation for Hyperspectral Image Classification under Distribution Shifts	Xia Yue et.al.	2509.08436	null
2025-09-09	GW250114: testing Hawking’s area law and the Kerr nature of black holes	The LIGO Scientific Collaboration et.al.	2509.08054	null
2025-09-15	Directed searches for gravitational waves from ultralight vector boson clouds around merger remnant and galactic black holes during the first part of the fourth LIGO-Virgo-KAGRA observing run	The LIGO Scientific Collaboration et.al.	2509.07352	null
2025-10-07	GWTC-4.0: Constraints on the Cosmic Expansion Rate and Modified Gravitational-wave Propagation	The LIGO Scientific Collaboration et.al.	2509.04348	null
2025-09-02	Explainability-Driven Dimensionality Reduction for Hyperspectral Imaging	Salma Haidar et.al.	2509.02340	null
2025-09-01	FantasyHSI: Video-Generation-Centric 4D Human Synthesis In Any Scene through A Graph-based Multi-Agent Framework	Lingzhou Mu et.al.	2509.01232	null
2025-08-31	CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification	Qingyu Wang et.al.	2509.00677	null
2025-08-30	Iterative Low-rank Network for Hyperspectral Image Denoising	Jin Ye et.al.	2509.00356	null
2025-08-28	Upper Limits on the Isotropic Gravitational-Wave Background from the first part of LIGO, Virgo, and KAGRA’s fourth Observing Run	The LIGO Scientific Collaboration et.al.	2508.20721	null
2025-08-27	Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities	Imad Ali Shah et.al.	2508.19905	null
2025-09-08	GWTC-4.0: Updating the Gravitational-Wave Transient Catalog with Observations from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run	The LIGO Scientific Collaboration et.al.	2508.18082	null
2025-09-03	Open Data from LIGO, Virgo, and KAGRA through the First Part of the Fourth Observing Run	The LIGO Scientific Collaboration et.al.	2508.18079	null
2025-08-25	Few-shot Unknown Class Discovery of Hyperspectral Images with Prototype Learning and Clustering	Chun Liu et.al.	2508.18075	null
2025-08-21	Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising	Jin Ye et.al.	2508.15553	link
2025-08-15	Hyperspectral vs. RGB for Pedestrian Segmentation in Urban Driving Scenes: A Comparative Study	Jiarong Li et.al.	2508.11301	null
2025-08-14	CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving	Jiarong Li et.al.	2508.10962	null
2025-08-13	Probabilistic Emissivity Retrieval from Hyperspectral Data via Physics-Guided Variational Inference	Joshua R. Tempelman et.al.	2508.08291	null
2025-08-11	Hyperspectral Imaging	Danfeng Hong et.al.	2508.08107	link
2025-08-11	DETACH: Cross-domain Learning for Long-Horizon Tasks via Mixture of Disentangled Experts	Yutong Shen et.al.	2508.07842	null
2025-08-09	TerraMAE: Learning Spatial-Spectral Representations from Hyperspectral Earth Observation Data via Adaptive Masked Autoencoders	Tanjim Bin Faruk et.al.	2508.07020	null
2025-08-05	Low-rankness and Smoothness Meet Subspace: A Unified Tensor Regularization for Hyperspectral Image Super-resolution	Jun Zhang et.al.	2508.03049	null
2025-08-02	Hyperspectral Image Recovery Constrained by Multi-Granularity Non-Local Self-Similarity Priors	Zhuoran Peng et.al.	2508.01435	null
2025-08-05	Phase-Locked SNR Band Selection for Weak Mineral Signal Detection in Hyperspectral Imagery	Judy X Yang et.al.	2508.00539	null
2025-08-01	Honey Classification using Hyperspectral Imaging and Machine Learning	Mokhtar A. Al-Awadhi et.al.	2508.00361	null
2025-07-31	SAMSA: Segment Anything Model Enhanced with Spectral Angles for Hyperspectral Interactive Medical Image Segmentation	Alfie Roddan et.al.	2507.23673	link
2025-03-28	HSLiNets: Evaluating Band Ordering Strategies in Hyperspectral and LiDAR Fusion	Judy X Yang et.al.	2503.21072	null
2025-03-11	Dynamic Cross-Modal Feature Interaction Network for Hyperspectral and LiDAR Data Classification	Junyan Lin et.al.	2503.06945	link
2024-12-04	HSLiNets: Hyperspectral Image and LiDAR Data Fusion Using Efficient Dual Non-Linear Feature Learning Networks	Judy X Yang et.al.	2412.00302	null
2024-04-09	Unsupervised Band Selection Using Fused HSI and LiDAR Attention Integrating With Autoencoder	Judy X Yang et.al.	2404.05258	null
2024-04-16	LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification	Judy X Yang et.al.	2404.03883	null
2023-04-04	Multimodal Hyperspectral Image Classification via Interconnected Fusion	Lu Huo et.al.	2304.00495	null
2023-03-24	MMFormer: Multimodal Transformer Using Multiscale Self-Attention for Remote Sensing Image Classification	Bo Zhang et.al.	2303.13101	null
2023-02-08	Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data Classification	Meng Wang et.al.	2301.03335	null
2022-11-01	Hybridization of filter and wrapper approaches for the dimensionality reduction and classification of hyperspectral images	Asma Elmaizi et.al.	2210.16496	null
2023-01-04	A CNN with Noise Inclined Module and Denoise Framework for Hyperspectral Image Classification	Zhiqiang Gong et.al.	2205.12459	null
2021-04-07	Disentangled Non-Local Network for Hyperspectral and LiDAR Data Classification	Wenxia Liu et.al.	2104.02302	null
2021-04-07	Hyperspectral and LiDAR data classification based on linear self-attention	Min Feng et.al.	2104.02301	null
2020-07-20	Advances in Deep Learning for Hyperspectral Image Analysis–Addressing Challenges Arising in Practical Imaging Scenarios	Xiong Zhou et.al.	2007.08592	null
2020-02-05	Classification of Hyperspectral and LiDAR Data Using Coupled CNNs	Renlong Hang et.al.	2002.01144	null
2019-12-09	3D CNN with Localized Residual Connections for Hyperspectral Image Classification	Shivangi Dwivedi et.al.	1912.03000	null
2019-10-30	Deep Learning for Hyperspectral Image Classification: An Overview	Shutao Li et.al.	1910.12861	null
2021-06-08	Multiscale Principle of Relevant Information for Hyperspectral Image Classification	Yantao Wei et.al.	1907.06022	null
2018-03-01	HSI-CNN: A Novel Convolution Neural Network for Hyperspectral Image	Yanan Luo et.al.	1802.10478	null
2016-06-17	Combining multiscale features for classification of hyperspectral images: a sequence based kernel approach	Yanwei Cui et.al.	1606.04985	null
2015-04-30	Robust hyperspectral image classification with rejection fields	Filipe Condessa et.al.	1504.07918	null

(<a href=#updated-on-20260429>back to top</a>)
