| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-03-31 | EarthEmbeddingExplorer: A Web Application for Cross-Modal Retrieval of Global Satellite Images | Yijie Zheng et.al. | 2603.29441 | null |
| 2026-03-31 | MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network | Guozhi Qiu et.al. | 2603.29291 | null |
| 2026-03-30 | The Problem of Dynamic Spatial Sampling and Geofence Surveillance | Marty Davidson et.al. | 2603.28958 | null |
| 2026-03-29 | RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization | Junwei Zheng et.al. | 2603.27758 | null |
| 2026-03-29 | NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries | Mahdi Erfanian et.al. | 2603.27464 | null |
| 2026-03-28 | Zero-shot Vision-Language Reranking for Cross-View Geolocalization | Yunus Talha Erzurumlu et.al. | 2603.27251 | null |
| 2026-03-27 | Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones | Moritz Nottebaum et.al. | 2603.26551 | null |
| 2026-03-27 | HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network | Mingyu Zhang et.al. | 2603.26341 | null |
| 2026-03-26 | Bayesian Deep Count Regression and Anomaly Detection: Evidence from GDELT Event Panels | Hsin-Hsiung Huang et.al. | 2603.25970 | null |
| 2026-03-26 | Few Shots Text to Image Retrieval: New Benchmarking Dataset and Optimization Methods | Ofer Idan et.al. | 2603.25891 | null |
| 2026-03-26 | Just Zoom In: Cross-View Geo-Localization via Autoregressive Zooming | Yunus Talha Erzurumlu et.al. | 2603.25686 | null |
| 2026-03-26 | On-Demand Instructional Material Providing Agent Based on MLLM for Tutoring Support | Takumi Kato et.al. | 2603.25195 | null |
| 2026-03-28 | TIGeR: A Unified Framework for Time, Images and Geo-location Retrieval | David G. Shatwell et.al. | 2603.24749 | null |
| 2026-03-25 | GeoRouter: Dynamic Paradigm Routing for Worldwide Image Geolocalization | Pengyue Jia et.al. | 2603.24376 | null |
| 2026-03-25 | Combi-CAM: A Novel Multi-Layer Approach for Explainable Image Geolocalization | David Faget et.al. | 2603.24117 | null |
| 2026-03-24 | Sparse Autoencoders for Interpretable Medical Image Representation Learning | Philipp Wesp et.al. | 2603.23794 | null |
| 2026-03-24 | ARGENT: Adaptive Hierarchical Image-Text Representations | Chuong Huynh et.al. | 2603.23311 | null |
| 2026-03-24 | Retrieval-Guided Photovoltaic Inventory Estimation from Satellite Imagery for Distribution Grid Planning | Muhao Guo et.al. | 2603.22856 | null |
| 2026-03-24 | SOUPLE: Enhancing Audio-Visual Localization and Segmentation with Learnable Prompt Contexts | Khanh Binh Nguyen et.al. | 2603.22732 | null |
| 2026-03-24 | HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment | Sangmin Jo et.al. | 2603.22721 | null |
| 2026-03-23 | GeoFlow: Real-Time Fine-Grained Cross-View Geolocalization via Iterative Flow Prediction | Ayesh Abu Lehyeh et.al. | 2603.21943 | null |
| 2026-03-23 | ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive Text-to-Image Retrieval | Zhuocheng Zhang et.al. | 2603.21886 | null |
| 2026-03-21 | SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval | Qunjie Huang et.al. | 2603.20738 | null |
| 2026-03-21 | A Multihead Continual Learning Framework for Fine-Grained Fashion Image Retrieval with Contrastive Learning and Exponential Moving Average Distillation | Ling Xiao et.al. | 2603.20648 | null |
| 2026-03-20 | IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment | Simone Magistri et.al. | 2603.19862 | null |
| 2026-03-20 | IUP-Pose: Decoupled Iterative Uncertainty Propagation for Real-time Relative Pose Regression via Implicit Dense Alignment v1 | Jun Wang et.al. | 2603.19625 | null |
| 2026-03-24 | LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment | Shuaibang Peng et.al. | 2603.19609 | null |
| 2026-03-19 | Mapping the Midweek Mountain: The New Geography of Hybrid Work | Norman Guo et.al. | 2603.18440 | null |
| 2026-03-18 | MCoT-MVS: Multi-level Vision Selection by Multi-modal Chain-of-Thought Reasoning for Composed Image Retrieval | Xuri Ge et.al. | 2603.17360 | null |
| 2026-03-17 | Visual Product Search Benchmark | Karthik Sulthanpete Govindappa et.al. | 2603.17186 | null |
| 2026-03-17 | Retrieving Counterfactuals Improves Visual In-Context Learning | Guangzhi Xiong et.al. | 2603.16737 | null |
| 2026-03-17 | HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture | Aojie Yuan et.al. | 2603.16679 | null |
| 2026-03-17 | Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty | Mangyu Kong et.al. | 2603.16538 | null |
| 2026-03-17 | Geometric Search for Hawking Radiation from Nearby Primordial Black Holes | Shuo Xiao et.al. | 2603.16508 | null |
| 2026-03-18 | VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents | Zhengbo Zhang et.al. | 2603.16289 | null |
| 2026-03-14 | Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics | Dennis Haitz et.al. | 2603.13917 | null |
| 2026-03-14 | Sky2Ground: A Benchmark for Site Modeling under Varying Altitude | Zengyan Wang et.al. | 2603.13740 | null |
| 2026-03-13 | Design and evaluation of an agentic workflow for crisis-related synthetic tweet datasets | Roben Delos Reyes et.al. | 2603.13625 | null |
| 2026-03-13 | A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks | Tangzheng Lian et.al. | 2603.12998 | null |
| 2026-03-13 | Text-Phase Synergy Network with Dual Priors for Unsupervised Cross-Domain Image Retrieval | Jing Yang et.al. | 2603.12711 | null |
| 2026-03-13 | CM-Bench: A Comprehensive Cross-Modal Feature Matching Benchmark Bridging Visible and Infrared Images | Liangzheng Sun et.al. | 2603.12690 | null |
| 2026-03-12 | Unequal changes in commuting patterns across socio-economic strata in response to pandemic restrictions | Cristiano Marinelli et.al. | 2603.11758 | null |
| 2026-03-12 | FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval | Chenchen Zhao et.al. | 2603.11520 | null |
| 2026-03-12 | Efficient Cross-View Localization in 6G Space-Air-Ground Integrated Network | Min Hao et.al. | 2603.11398 | null |
| 2026-03-11 | Imaging flat band electron hydrodynamics in biased bilayer graphene | Canxun Zhang et.al. | 2603.11175 | null |
| 2026-03-11 | Learning to Wander: Improving the Global Image Geolocation Ability of LMMs via Actionable Reasoning | Yushuo Zheng et.al. | 2603.10463 | null |
| 2026-03-10 | Composed Vision-Language Retrieval for Skin Cancer Case Search via Joint Alignment of Global and Local Representations | Yuheng Wang et.al. | 2603.09108 | null |
| 2026-03-09 | Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling | Bowen Liu et.al. | 2603.08063 | null |
| 2026-03-09 | $L^3$: Scene-agnostic Visual Localization in the Wild | Yu Zhang et.al. | 2603.07937 | null |
| 2026-03-08 | Fluctuation imaging of disorder in monolayer semiconductors | Tom T. C. Sistermans et.al. | 2603.07418 | null |
| 2026-03-08 | QdaVPR: A novel query-based domain-agnostic model for visual place recognition | Shanshan Wan et.al. | 2603.07414 | null |
| 2026-03-06 | EventGeM: Global-to-Local Feature Matching for Event-Based Visual Place Recognition | Adam D. Hines et.al. | 2603.05807 | null |
| 2026-03-06 | Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval | Donghoon Han et.al. | 2603.05781 | null |
| 2026-03-05 | Interpretable Perception and Reasoning for Audiovisual Geolocation | Yiyang Su et.al. | 2603.05708 | null |
| 2026-03-04 | PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing | Rohan Mahadev et.al. | 2603.04598 | null |
| 2026-03-04 | SSR: A Generic Framework for Text-Aided Map Compression for Localization | Mohammad Omama et.al. | 2603.04272 | null |
| 2026-03-04 | Long-Term Visual Localization in Dynamic Benthic Environments: A Dataset, Footprint-Based Ground Truth, and Visual Place Recognition Benchmark | Martin Kvisvik Larsen et.al. | 2603.04056 | null |
| 2026-03-04 | HE-VPR: Height Estimation Enabled Aerial Visual Place Recognition Against Scale Variance | Mengfan He et.al. | 2603.04050 | null |
| 2026-03-04 | DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative Negative Sampling in Composed Image Retrieval | Geon Park et.al. | 2603.04037 | null |
| 2026-03-03 | From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes | Qifan Zhang et.al. | 2603.03577 | null |
| 2026-03-03 | LOO-PIT predictive model checking | Herman Tesso et.al. | 2603.02928 | null |
| 2026-03-03 | Cross-view geo-localization, Image retrieval, Multiscale geometric modeling, Frequency domain enhancement | Hongying Zhang et.al. | 2603.02726 | null |
| 2026-03-02 | Contributions of geolocated weather and building related data for insurance assessment of flood risks | Mulah Moriah et.al. | 2603.02418 | null |
| 2026-03-02 | GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis | Srikumar Sastry et.al. | 2603.02172 | null |
| 2026-03-02 | Learning to Read Where to Look: Disease-Aware Vision-Language Pretraining for 3D CT | Simon Ging et.al. | 2603.02026 | null |
| 2026-03-02 | Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning | Haonan Jia et.al. | 2603.01696 | null |
| 2026-03-01 | MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning | Eileen Wang et.al. | 2603.01055 | null |
| 2026-02-28 | Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning | Ruoshuang Du et.al. | 2603.00511 | null |
| 2026-02-27 | Altitude-Aware Visual Place Recognition in Top-Down View | Xingyu Shao et.al. | 2602.23872 | null |
| 2026-02-26 | VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale | Sven Elflein et.al. | 2602.23361 | null |
| 2026-03-07 | WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval | Tianyue Wang et.al. | 2602.23029 | null |
| 2026-02-26 | Autoregressive Visual Decoding from EEG Signals | Sicheng Dai et.al. | 2602.22555 | null |
| 2026-02-26 | Pix2Key: Controllable Open-Vocabulary Retrieval with Semantic Decomposition and Self-Supervised Visual Dictionary Learning | Guoyizhe Wei et.al. | 2602.22510 | null |
| 2026-02-25 | Global-Aware Edge Prioritization for Pose Graph Initialization | Tong Wei et.al. | 2602.21963 | null |
| 2026-03-04 | Automatic Map Density Selection for Locally-Performant Visual Place Recognition | Somayeh Hussaini et.al. | 2602.21473 | null |
| 2026-02-24 | Seeing Through Words: Controlling Visual Retrieval Quality with Language Models | Jianglin Lu et.al. | 2602.21175 | null |
| 2026-02-24 | Long-Term Multi-Session 3D Reconstruction Under Substantial Appearance Change | Beverley Gorry et.al. | 2602.20584 | null |
| 2026-02-23 | Unlocking Multimodal Document Intelligence: From Current Triumphs to Future Frontiers of Visual Document Retrieval | Yibo Yan et.al. | 2602.19961 | null |
| 2026-02-23 | Evaluating the Impact of Data Anonymization on Image Retrieval | Marvin Chen et.al. | 2602.19641 | null |
| 2026-02-22 | Knowledge-aware Visual Question Generation for Remote Sensing Images | Siran Li et.al. | 2602.19224 | null |
| 2026-02-22 | Questions beyond Pixels: Integrating Commonsense Knowledge in Visual Question Generation for Remote Sensing | Siran Li et.al. | 2602.19217 | null |
| 2026-02-19 | VQPP: Video Query Performance Prediction Benchmark | Adrian Catalin Lutu et.al. | 2602.17814 | null |
| 2026-02-19 | Visual Model Checking: Graph-Based Inference of Visual Routines for Image Retrieval | Adrià Molina et.al. | 2602.17386 | null |
| 2026-02-18 | SCAR: Satellite Imagery-Based Calibration for Aerial Recordings | Henry Hölzemann et.al. | 2602.16349 | null |
| 2026-02-17 | Automated Assessment of Kidney Ureteroscopy Exploration for Training | Fangjie Li et.al. | 2602.15988 | null |
| 2026-02-17 | Privacy-Preserving and Secure Spectrum Sharing for Database-Driven Cognitive Radio Networks | Saleh Darzia et.al. | 2602.15705 | null |
| 2026-02-17 | GMAIL: Generative Modality Alignment for generated Image Learning | Shentong Mo et.al. | 2602.15368 | null |
| 2026-02-16 | AIC CTU@AVerImaTeC: dual-retriever RAG for image-text fact checking | Herbert Ullrich et.al. | 2602.15190 | null |
| 2026-02-16 | Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery | Chandrakanth Gudavalli et.al. | 2602.14929 | null |
| 2026-02-15 | Towards Spatial Transcriptomics-driven Pathology Foundation Models | Konstantin Hemker et.al. | 2602.14177 | null |
| 2026-02-14 | High-fidelity 3D reconstruction for planetary exploration | Alfonso Martínez-Petersen et.al. | 2602.13909 | null |
| 2026-02-14 | A Deep Convolutional Network to Extract Real-Time Landmarks for UAV Navigation | Osman Tokluoglu et.al. | 2602.13814 | null |
| 2026-02-13 | InfoCIR: Multimedia Analysis for Composed Image Retrieval | Ioannis Dravilas et.al. | 2602.13402 | null |
| 2026-02-13 | EPRBench: A High-Quality Benchmark Dataset for Event Stream Based Visual Place Recognition | Xiao Wang et.al. | 2602.12919 | null |
| 2026-02-13 | GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics | Modi Jin et.al. | 2602.12617 | null |
| 2026-02-12 | DiffPlace: Street View Generation via Place-Controllable Diffusion Model Enhancing Place Recognition | Ji Li et.al. | 2602.11875 | null |
| 2026-02-12 | Arbitrary Ratio Feature Compression via Next Token Prediction | Yufan Liu et.al. | 2602.11494 | null |
| 2026-02-11 | WHEREIS: IP Address Registration Geo-Consistency | Robert Beverly et.al. | 2602.11102 | null |
| 2026-02-11 | DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories | Chenlong Deng et.al. | 2602.10809 | null |
| 2026-02-09 | Large Language Models for Geolocation Extraction in Humanitarian Crisis Response | G. Cafferata et.al. | 2602.08872 | null |
| 2026-02-09 | OSCAR: Optimization-Steered Agentic Planning for Composed Image Retrieval | Teng Wang et.al. | 2602.08603 | null |
| 2026-02-16 | NovaMoon: A Strategic Lunar Reference Station for Positioning, Timing, and Largely Enhanced Science in the Earth-Moon System | Serena Molli et.al. | 2602.08432 | null |
| 2026-02-09 | A Sketch+Text Composed Image Retrieval Dataset for Thangka | Jinyu Xu et.al. | 2602.08411 | null |
| 2026-02-09 | UrbanGraphEmbeddings: Learning and Evaluating Spatially Grounded Multimodal Embeddings for Urban Science | Jie Zhang et.al. | 2602.08342 | null |
| 2026-02-10 | WristMIR: Coarse-to-Fine Region-Aware Retrieval of Pediatric Wrist Radiographs with Radiology Report-Driven Learning | Mert Sonmezer et.al. | 2602.07872 | null |
| 2026-02-04 | Do Vision-Language Models Respect Contextual Integrity in Location Disclosure? | Ruixin Yang et.al. | 2602.05023 | null |
| 2026-02-04 | SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation | David F. Ramirez et.al. | 2602.04712 | null |
| 2026-02-05 | SDR-CIR: Semantic Debias Retrieval Framework for Training-Free Zero-Shot Composed Image Retrieval | Yi Sun et.al. | 2602.04451 | null |
| 2026-02-04 | Quantile Transfer for Reliable Operating Point Selection in Visual Place Recognition | Dhyey Manish Rajani et.al. | 2602.04401 | null |
| 2026-02-04 | Beyond Static Cropping: Layer-Adaptive Visual Localization and Decoding Enhancement | Zipeng Zhu et.al. | 2602.04304 | null |
| 2026-02-03 | LaVPR: Benchmarking Language and Vision for Place Recognition | Ofer Idan et.al. | 2602.03253 | null |
| 2026-02-03 | ObjEmbed: Towards Universal Multimodal Object Embeddings | Shenghao Fu et.al. | 2602.01753 | link |
| 2026-02-02 | Real-Time Loop Closure Detection in Visual SLAM via NetVLAD and Faiss | Enguang Fan et.al. | 2602.01673 | null |
| 2026-02-02 | ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval | Tianyu Yang et.al. | 2602.01639 | null |
| 2026-02-01 | Interacted Planes Reveal 3D Line Mapping | Zeran Ke et.al. | 2602.01296 | null |
| 2026-02-05 | Invariance on Manifolds: Understanding Robust Visual Representations for Place Recognition | Jintao Cheng et.al. | 2602.00841 | null |
| 2026-02-03 | Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval | Tong Wang et.al. | 2602.00813 | null |
| 2026-01-31 | VVLoc: Prior-free 3-DoF Vehicle Visual Localization | Ze Huang et.al. | 2602.00810 | null |
| 2026-01-31 | Audio-to-Image Bird Species Retrieval without Audio-Image Pairs via Text Distillation | Ilyass Moummad et.al. | 2602.00681 | null |
| 2026-01-30 | HierLoc: Hyperbolic Entity Embeddings for Hierarchical Visual Geolocation | Hari Krishna Gadi et.al. | 2601.23064 | null |
| 2026-01-30 | Compact Hypercube Embeddings for Fast Text-based Wildlife Observation Retrieval | Ilyass Moummad et.al. | 2601.22783 | null |
| 2026-01-29 | Variance & Greediness: A comparative study of metric-learning losses | Donghuo Zeng et.al. | 2601.21450 | null |
| 2026-01-29 | GeoRC: A Benchmark for Geolocation Reasoning Chains | Mohit Talreja et.al. | 2601.21278 | null |
| 2026-01-28 | When Vision Meets Texts in Listwise Reranking | Hongyi Cai et.al. | 2601.20623 | null |
| 2026-01-28 | Eliminating Hallucination in Diffusion-Augmented Interactive Text-to-Image Retrieval | Zhuocheng Zhang et.al. | 2601.20391 | null |
| 2026-01-30 | VGGT-SLAM 2.0: Real-time Dense Feed-forward Scene Reconstruction | Dominic Maggio et.al. | 2601.19887 | null |
| 2026-01-27 | LocationAgent: A Hierarchical Agent for Image Geolocation via Decoupling Strategy and Evidence from Parametric Knowledge | Qiujun Li et.al. | 2601.19155 | null |
| 2026-01-27 | Pixel-Grounded Retrieval for Knowledgeable Large Multimodal Models | Jeonghwan Kim et.al. | 2601.19060 | null |
| 2026-01-25 | A Multi-Modal Fusion Platform for Joint Environment Sensing and Channel Sounding in Highly Dynamic Scenarios | Xuejian Zhang et.al. | 2601.17809 | null |
| 2026-01-23 | X-Aligner: Composed Visual Retrieval without the Bells and Whistles | Yuqian Zheng et.al. | 2601.16582 | null |
| 2026-01-22 | Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing | Tingyu Song et.al. | 2601.16125 | null |
| 2026-01-21 | Unified Multimodal and Multilingual Retrieval via Multi-Task Learning with NLU Integration | Xinyuan Zhang et.al. | 2601.14714 | null |
| 2026-01-21 | LookBench: A Live and Holistic Open Benchmark for Fashion Image Retrieval | Chao Gao et.al. | 2601.14706 | null |
| 2026-01-20 | XR: Cross-Modal Agents for Composed Image Retrieval | Zhongyu Yang et.al. | 2601.14245 | null |
| 2026-01-20 | Fine-Grained Zero-Shot Composed Image Retrieval with Complementary Visual-Semantic Integration | Yongcong Ye et.al. | 2601.14060 | null |
| 2026-01-20 | Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning | Hongbo Bai et.al. | 2601.13942 | null |
| 2026-01-19 | DC-VLAQ: Query-Residual Aggregation for Robust Visual Place Recognition | Hanyu Zhu et.al. | 2601.12729 | null |
| 2026-01-18 | Abusing the Internet of Medical Things: Evaluating Threat Models and Forensic Readiness for Multi-Vector Attacks on Connected Healthcare Devices | Isabel Straw et.al. | 2601.12593 | null |
| 2026-01-17 | SupScene: Learning Overlap-Aware Global Descriptor for Unconstrained SfM | Xulei Shi et.al. | 2601.11930 | null |
| 2026-01-22 | Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning | Haomiao Tang et.al. | 2601.11393 | null |
| 2026-01-16 | Simple Models, Rich Representations: Visual Decoding from Primate Intracortical Neural Signals | Matteo Ciferri et.al. | 2601.11108 | null |
| 2026-01-20 | Multilingual-To-Multimodal (M2M): Unlocking New Languages with Monolingual Text | Piyush Singh Pasi et.al. | 2601.10096 | null |
| 2026-01-20 | UniHash: Unifying Pointwise and Pairwise Hashing Paradigms for Seen and Unseen Category Retrieval | Xiaoxu Ma et.al. | 2601.09828 | null |
| 2026-01-14 | Hybrid guided variational autoencoder for visual place recognition | Ni Wang et.al. | 2601.09248 | null |
| 2026-01-13 | Spatial Context Improves the Integration of Text with Remote Sensing for Mapping Environmental Variables | Valerie Zermatten et.al. | 2601.08750 | null |
| 2026-01-13 | Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation | Kang Fu et.al. | 2601.08311 | null |
| 2026-01-13 | Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization | Miao Pan et.al. | 2601.06224 | link |
| 2026-01-09 | Descriptor: Multi-Regional Cloud Honeypot Dataset (MURHCAD) | Enrique Feito-Casares et.al. | 2601.05813 | null |
| 2026-01-08 | Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization | Yuxiang Ji et.al. | 2601.05432 | null |
| 2026-01-08 | Multi-task Cross-modal Learning for Chest X-ray Image Retrieval | Zhaohui Liang et.al. | 2601.05399 | null |
| 2026-01-07 | ImLoc: Revisiting Visual Localization with Image-based Representation | Xudong Jiang et.al. | 2601.04185 | null |
| 2026-01-07 | CSMCIR: CoT-Enhanced Symmetric Alignment with Memory Bank for Composed Image Retrieval | Zhipeng Qian et.al. | 2601.03728 | null |
| 2026-01-07 | BREATH-VL: Vision-Language-Guided 6-DoF Bronchoscopy Localization via Semantic-Geometric Fusion | Qingyao Tian et.al. | 2601.03713 | null |
| 2026-01-07 | HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps | Xuchang Zhong et.al. | 2601.02730 | null |
| 2026-01-06 | Loop Closure using AnyLoc Visual Place Recognition in DPV-SLAM | Wenzheng Zhang et.al. | 2601.02723 | null |
| 2026-01-07 | Comparative Analysis of Binarization Methods For Medical Image Hashing On Odir Dataset | Nedim Muzoglu et.al. | 2601.02564 | null |
| 2026-01-04 | Breadcrumbs in the Digital Forest: Tracing Criminals through Torrent Metadata with OSINT | Annelies de Jong et.al. | 2601.01492 | null |
| 2026-01-05 | Vision-Language Reasoning for Geolocalization: A Reinforcement Learning Approach | Biao Wu et.al. | 2601.00388 | null |
| 2025-12-31 | OCP-LS: An Efficient Algorithm for Visual Localization | Jindi Zhong et.al. | 2512.24552 | null |
| 2025-12-29 | Learning to Feel the Future: DreamTacVLA for Contact-Rich Manipulation | Guo Ye et.al. | 2512.23864 | null |
| 2026-01-07 | MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning | Jiawei Chen et.al. | 2512.23412 | null |
| 2025-12-29 | Anomaly Detection by Effectively Leveraging Synthetic Images | Sungho Kang et.al. | 2512.23227 | null |
| 2025-12-26 | Reloc-VGGT: Visual Re-localization with Geometry Grounded Transformer | Tianchen Deng et.al. | 2512.21883 | null |
| 2025-12-24 | Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval | Dao Sy Duy Minh et.al. | 2512.21221 | null |
| 2025-12-28 | UniPR-3D: Towards Universal Visual Place Recognition with Visual Geometry Grounded Transformer | Tianchen Deng et.al. | 2512.21078 | null |
| 2025-12-23 | Soft Filtering: Guiding Zero-shot Composed Image Retrieval with Prescriptive and Proscriptive Constraints | Youjin Jung et.al. | 2512.20781 | null |
| 2025-12-23 | Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark | Hao Guo et.al. | 2512.20174 | null |
| 2025-12-23 | Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach | Hao Li et.al. | 2512.20056 | null |
| 2025-12-22 | Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis | Argha Kamal Samanta et.al. | 2512.19663 | null |
| 2025-12-22 | Finer-Personalization Rank: Fine-Grained Retrieval Examines Identity Preservation for Personalized Generation | Connor Kilrain et.al. | 2512.19026 | null |
| 2025-12-21 | Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments | Saeideh Yousefzadeh et.al. | 2512.18613 | null |
| 2025-12-20 | Through the PRISm: Importance-Aware Scene Graphs for Image Retrieval | Dimitrios Georgoulopoulos et.al. | 2512.18407 | null |
| 2025-12-20 | GeoSense-AI: Fast Location Inference from Crisis Microblogs | Deepit Sapru et.al. | 2512.18225 | null |
| 2025-12-19 | MMLANDMARKS: a Cross-View Instance-Level Benchmark for Geo-Spatial Understanding | Oskar Kristoffersen et.al. | 2512.17492 | null |
| 2025-12-19 | Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors | Son Tung Nguyen et.al. | 2512.17226 | null |
| 2025-12-18 | The Effect of Negation on CLIP in Medical Imaging: Limitations of Contrastive Language-Image Pretraining | Jasmine Vu et.al. | 2512.17121 | null |
| 2025-12-18 | Plug to Place: Indoor Multimedia Geolocation from Electrical Sockets for Digital Investigation | Kanwal Aftab et.al. | 2512.16620 | null |
| 2025-12-18 | MACL: Multi-Label Adaptive Contrastive Learning Loss for Remote Sensing Image Retrieval | Amna Amir et.al. | 2512.16294 | null |
| 2025-12-16 | CLNet: Cross-View Correspondence Makes a Stronger Geo-Localizationer | Xianwei Cao et.al. | 2512.14560 | null |
| 2025-12-16 | Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries | Emanuele Mezzi et.al. | 2512.14102 | null |
| 2025-12-15 | Towards Test-time Efficient Visual Place Recognition via Asymmetric Query Processing | Jaeyoon Kim et.al. | 2512.13055 | null |
| 2025-12-14 | Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching | Wonseok Choi et.al. | 2512.12610 | null |
| 2025-12-11 | Beyond Pixels: A Training-Free, Text-to-Text Framework for Remote Sensing Image Retrieval | J. Xiao et.al. | 2512.10596 | null |
| 2025-12-10 | YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos | Ryan Meegan et.al. | 2512.09903 | null |
| 2025-12-09 | Adaptive Thresholding for Visual Place Recognition using Negative Gaussian Mixture Statistics | Nick Trinh et.al. | 2512.09071 | null |
| 2025-12-08 | Generalized Referring Expression Segmentation on Aerial Photos | Luís Marnoto et.al. | 2512.07338 | null |
| 2025-12-07 | Spatial Retrieval Augmented Autonomous Driving | Xiaosong Jia et.al. | 2512.06865 | null |
| 2025-12-06 | Language-driven Fine-grained Retrieval | Shijie Wang et.al. | 2512.06255 | null |
| 2025-12-05 | GuideNav: User-Informed Development of a Vision-Only Robotic Navigation Assistant For Blind Travelers | Hochul Hwang et.al. | 2512.06147 | null |
| 2025-12-05 | M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG | David Anugraha et.al. | 2512.05959 | null |
| 2025-12-05 | World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty | Zhiting Mei et.al. | 2512.05927 | link |
| 2025-12-05 | Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator | Md. Mahbub Hasan Akash et.al. | 2512.05866 | null |
| 2025-12-05 | Distilling Expert Surgical Knowledge: How to train local surgical VLMs for anatomy explanation in Complete Mesocolic Excision | Lennart Maack et.al. | 2512.05740 | null |
| 2025-12-05 | NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections | Juho Korkeala et.al. | 2512.05610 | null |
| 2025-12-05 | Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer | Rong Wang et.al. | 2512.05593 | null |
| 2025-12-05 | A Comprehensive Framework for Automated Quality Control in the Automotive Industry | Panagiota Moraiti et.al. | 2512.05579 | null |
| 2025-12-05 | MedDIFT: Multi-Scale Diffusion-Based Correspondence in 3D Medical Imaging | Xingyu Zhang et.al. | 2512.05571 | link |
| 2025-12-05 | 2K-Characters-10K-Stories: A Quality-Gated Stylized Narrative Dataset with Disentangled Control and Sequence Consistency | Xingxi Yin et.al. | 2512.05557 | null |
| 2025-12-05 | Know-Show: Benchmarking Video-Language Models on Spatio-Temporal Grounded Reasoning | Chinthani Sugandhika et.al. | 2512.05513 | null |
| 2025-12-05 | Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation | Fan Zhang et.al. | 2512.05494 | null |
| 2025-12-05 | WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field | Qi Zhu et.al. | 2512.05492 | null |
| 2025-12-05 | YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications | Yida Lin et.al. | 2512.05412 | null |
| 2025-12-05 | LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models | Qingqiao Hu et.al. | 2512.05391 | null |
| 2025-12-05 | Hypothesis-Based Particle Detection for Accurate Nanoparticle Counting and Digital Diagnostics | Neil H. Kim et.al. | 2512.05346 | null |
| 2025-12-05 | CATNUS: Coordinate-Aware Thalamic Nuclei Segmentation Using T1-Weighted MRI | Anqi Feng et.al. | 2512.05329 | null |
| 2025-12-04 | Nerves of generalized multicategories | Soichiro Fujii et.al. | 2512.05232 | null |
| 2025-12-04 | Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models | Rowan Bradbury et.al. | 2512.05198 | null |
| 2025-12-04 | ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning | Shengyuan Ding et.al. | 2512.05111 | null |
| 2025-12-04 | Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark | Haobo Yuan et.al. | 2512.05091 | null |
| 2025-12-04 | Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding | Abhigyan Bhattacharya et.al. | 2512.05039 | null |
| 2025-12-04 | Revealing stimulus-dependent dynamics through statistical complexity | Edson V. de Paula et.al. | 2512.05007 | null |
| 2025-12-04 | Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis | Supriya Bordoloi et.al. | 2512.04989 | null |
| 2025-12-04 | Rethinking the Use of Vision Transformers for AI-Generated Image Detection | NaHyeon Park et.al. | 2512.04969 | link |
| 2025-12-04 | LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging | Zhijian Shu et.al. | 2512.04939 | null |
| 2025-12-04 | You Only Train Once (YOTO): A Retraining-Free Object Detection Framework | Priyanto Hidayatullah et.al. | 2512.04888 | null |
| 2025-12-04 | Are Your Agents Upward Deceivers? | Dadi Guo et.al. | 2512.04864 | null |
| 2025-12-04 | Terahertz Fourier Ptychographic Imaging | Pitambar Mukherjee et.al. | 2512.04783 | null |
| 2025-12-04 | TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards | Mauro Martini et.al. | 2512.04772 | null |
| 2025-12-04 | MemLoRA: Distilling Expert Adapters for On-Device Memory Systems | Massimo Bini et.al. | 2512.04763 | null |
| 2025-12-04 | Spectral micro-CT for quantitative analysis of calcification in fibrocartilage | Vittoria Mazzini et.al. | 2512.04662 | null |
| 2025-12-04 | Metric dimension of Cartesian product of stars | Akbar Davoodi et.al. | 2512.04620 | null |
| 2025-12-04 | Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence | Tianyu Yuan et.al. | 2512.04619 | null |
| 2025-12-04 | Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot | Sheng Hang et.al. | 2512.04599 | null |
| 2025-12-04 | Structure-Aware Adaptive Kernel MPPCA Denoising for Diffusion MRI | Ananya Singhal et.al. | 2512.04586 | null |
| 2025-12-04 | Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation | Houzhang Fang et.al. | 2512.04581 | null |
| 2025-12-04 | Prompt2Craft: Generating Functional Craft Assemblies with LLMs | Vitor Hideyo Isume et.al. | 2512.04568 | null |
| 2025-12-04 | Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex | Zhizhen Wu et.al. | 2512.04556 | null |
| 2025-12-03 | RELIC: Interactive Video World Model with Long-Horizon Memory | Yicong Hong et.al. | 2512.04040 | null |
| 2025-12-03 | Needle beams and structured space-time wavepackets | Ruediger Grunwald et.al. | 2512.03993 | null |
| 2025-12-03 | DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment | Sheng-Hao Liao et.al. | 2512.03981 | link |
| 2025-12-03 | Dual Cross-Attention Siamese Transformer for Rectal Tumor Regrowth Assessment in Watch-and-Wait Endoscopy | Jorge Tapias Gomez et.al. | 2512.03883 | null |
| 2025-12-03 | Traffic Image Restoration under Adverse Weather via Frequency-Aware Mamba | Liwen Pan et.al. | 2512.03852 | null |
| 2025-12-03 | Algorithms for Boolean Matrix Factorization using Integer Programming and Heuristics | Christos Kolomvakis et.al. | 2512.03807 | null |
| 2025-12-04 | CaFTRA: Frequency-Domain Correlation-Aware Feedback-Free MIMO Transmission and Resource Allocation for 6G and Beyond | Bo Qian et.al. | 2512.03767 | null |
| 2025-12-03 | Revealing Nanoscale Molecular Organization in Liquid Crystals via Cryogenic Atom Probe Tomography | Kuan Meng et.al. | 2512.03734 | null |
| 2025-12-03 | DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction | Kaichen Zhang et.al. | 2512.03715 | null |
| 2025-12-03 | Structured Uncertainty Similarity Score (SUSS): Learning a Probabilistic, Interpretable, Perceptual Metric Between Images | Paula Seidler et.al. | 2512.03701 | null |
| 2025-12-03 | A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection | Shahid Ansari et.al. | 2512.03684 | null |
| 2025-12-03 | Multi-Scale Visual Prompting for Lightweight Small-Image Classification | Salim Khazem et.al. | 2512.03663 | null |
| 2025-12-03 | Evaluation of Foundational Machine Learned Interatomic Potentials for Migration Barrier Predictions | Achinthya Krishna Bheemaguli et.al. | 2512.03642 | null |
| 2025-12-03 | HBFormer: A Hybrid-Bridge Transformer for Microtumor and Miniature Organ Segmentation | Fuchen Zheng et.al. | 2512.03597 | null |
| 2025-12-03 | Global-Local Aware Scene Text Editing | Fuxiang Yang et.al. | 2512.03574 | null |
| 2025-12-03 | M3DR: Towards Universal Multilingual Multimodal Document Retrieval | Adithya S Kolavi et.al. | 2512.03514 | null |
| 2025-12-03 | Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles | Haicheng Liao et.al. | 2512.03454 | null |
| 2025-12-03 | Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation | Xieji Li et.al. | 2512.03445 | link |
| 2025-12-03 | Multimodal Reinforcement Learning with Agentic Verifier for AI Agents | Reuben Tan et.al. | 2512.03438 | null |
| 2025-12-03 | Building a Radio AGN Sample from Cosmic Morning – The Radio High-Redshift Quasar Catalog (RHzQCat): I. Catalog from SDSS Quasars and Radio Surveys at $z > 3$ | Yingkang Zhang et.al. | 2512.03415 | null |
| 2025-12-02 | MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues | Zichen Liu et.al. | 2512.03046 | link |
| 2025-12-02 | Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation | Zeqi Xiao et.al. | 2512.03040 | null |
| 2025-12-02 | Stability of knot equivalence at low regularity, and symmetric critical knots for the Möbius energy | Simon Blatt et.al. | 2512.02998 | null |
| 2025-12-02 | MIRI spectrophotometry of GN-z11: Detection and nature of an optical red continuum component | A. Crespo Gómez et.al. | 2512.02997 | null |
| 2025-12-02 | GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection | Md Sohag Mia et.al. | 2512.02991 | null |
| 2025-12-02 | LoVoRA: Text-guided and Mask-free Video Object Removal and Addition with Learnable Object-aware Localization | Zhihan Xiao et.al. | 2512.02933 | null |
| 2025-12-02 | MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding | Fan Yang et.al. | 2512.02906 | null |
| 2025-12-02 | Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models | Pierpaolo Serio et.al. | 2512.02897 | null |
| 2025-12-02 | Terahertz Emission from Spintronic Stack Nanodecorated with Drop-Cast Core-Shell Plasmonic Nanoparticles | Vittorio Cecconi et.al. | 2512.02889 | null |
| 2025-12-02 | Leveraging generative adversarial networks with spatially adaptive denormalization for multivariate stochastic seismic data inversion | Roberto Miele et.al. | 2512.02863 | null |
| 2025-12-02 | BOOM: Beyond Only One Modality KIT’s Multimodal Multilingual Lecture Companion | Sai Koneru et.al. | 2512.02817 | null |
| 2025-12-02 | Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control | Yongrui Yu et.al. | 2512.02814 | null |
| 2025-12-02 | Direct observational evidence that higher-luminosity type 1 active galactic nuclei are most commonly triggered by galaxy mergers | Yongmin Yoon et.al. | 2512.02805 | null |
| 2025-12-14 | HUD: Hierarchical Uncertainty-Aware Disambiguation Network for Composed Video Retrieval | Zhiwei Chen et.al. | 2512.02792 | null |
| 2025-12-02 | Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone | Tristan Amadei et.al. | 2512.02737 | null |
| 2025-12-02 | DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions | Yifan Zhou et.al. | 2512.02727 | null |
| 2025-12-02 | Training Data Attribution for Image Generation using Ontology-Aligned Knowledge Graphs | Theodoros Aivalis et.al. | 2512.02713 | null |
| 2025-12-02 | GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization | Zixuan Song et.al. | 2512.02697 | link |
| 2025-12-02 | ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data | Yuxing Liu et.al. | 2512.02686 | null |
| 2025-12-02 | Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation | Agathoklis Georgiou et.al. | 2512.02660 | null |
| 2025-12-01 | Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback | Aiden Yiliu Li et.al. | 2512.01979 | null |
| 2025-12-01 | SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception | Gurmeher Khurana et.al. | 2512.01908 | null |
| 2025-12-01 | KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM | Zaid Nasser et.al. | 2512.01889 | null |
| 2025-12-01 | Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval | Xin Wang et.al. | 2512.01636 | null |
| 2025-12-01 | Depth Matching Method Based on ShapeDTW for Oil-Based Mud Imager | Fengfeng Li et.al. | 2512.01611 | null |
| 2025-12-01 | Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track | Mo Chen et.al. | 2512.01608 | null |
| 2025-12-01 | Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation | Thao Thi Phuong Dao et.al. | 2512.01589 | null |
| 2025-12-01 | Near-infrared polarimetric imaging with nonlinear flat-optics | Evgenii Menshikov et.al. | 2512.01525 | null |
| 2025-12-01 | QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions | Can Polat et.al. | 2512.01519 | null |
| 2025-12-01 | Winning Solutions for the Rayan AI Contest: Compositional Retrieval, Zero-Shot Anomaly Detection, and Backdoor Detection | Ali Nafisi et.al. | 2512.01498 | null |
| 2025-12-01 | ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers | Yiyang Ma et.al. | 2512.01426 | null |
| 2025-12-01 | Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries | Tushar Pranav et.al. | 2512.01419 | null |
| 2025-12-01 | Rethinking Intracranial Aneurysm Vessel Segmentation: A Perspective from Computational Fluid Dynamics Applications | Feiyang Xiao et.al. | 2512.01319 | null |
| 2025-12-01 | DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy | Jaewoo Song et.al. | 2512.01302 | null |
| 2025-12-01 | Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI | Kamal Basha S et.al. | 2512.01291 | null |
| 2025-12-01 | Egent: An Autonomous Agent for Equivalent Width Measurement | Yuan-Sen Ting et.al. | 2512.01270 | null |
| 2025-12-01 | Social Media Data Mining of Human Behaviour during Bushfire Evacuation | Junfeng Wu et.al. | 2512.01262 | null |
| 2025-12-01 | M4-BLIP: Advancing Multi-Modal Media Manipulation Detection through Face-Enhanced Local Analysis | Hang Wu et.al. | 2512.01214 | null |
| 2025-11-30 | A sudden fine-scale bright kernel captured by Hi-C Flare during an M1.6-class solar flare’s post-maximum phase | Sanjiv K. Tiwari et.al. | 2512.01140 | null |
| 2025-11-30 | OmniFD: A Unified Model for Versatile Face Forgery Detection | Haotian Liu et.al. | 2512.01128 | link |
| 2025-11-28 | DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline | Rui Zhang et.al. | 2511.23377 | link |
| 2025-11-28 | FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting | Tianhao Xie et.al. | 2511.23292 | null |
| 2025-11-28 | Robust 3DGS-based SLAM via Adaptive Kernel Smoothing | Shouhe Zhang et.al. | 2511.23221 | null |
| 2025-11-28 | PowerCLIP: Powerset Alignment for Contrastive Pre-Training | Masaki Kawamura et.al. | 2511.23170 | null |
| 2025-11-28 | DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation | Hongfei Zhang et.al. | 2511.23127 | link |
| 2025-11-28 | DNA-Prior: Unsupervised Denoise Anything via Dual-Domain Prior | Yanqi Cheng et.al. | 2511.23124 | null |
| 2025-11-28 | Geodiffussr: Generative Terrain Texturing with Elevation Fidelity | Tai Inui et.al. | 2511.23029 | null |
| 2025-11-28 | JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization | Yunlong Lin et.al. | 2511.23002 | null |
| 2025-11-28 | Imaging propagating terahertz collective modes in two-dimensional semiconductor double layers | Andrew T. Pierce et.al. | 2511.22962 | null |
| 2025-11-28 | HMR3D: Hierarchical Multimodal Representation for 3D Scene Understanding with Large Vision-Language Model | Chen Li et.al. | 2511.22961 | null |
| 2025-11-28 | A Trainable Centrality Framework for Modern Data | Minh Duc Vu et.al. | 2511.22959 | null |
| 2025-11-28 | Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records | Shiyu Shen et.al. | 2511.22958 | null |
| 2025-11-28 | See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection | YuEun Lee et.al. | 2511.22906 | null |
| 2025-11-28 | MARVO: Marine-Adaptive Radiance-aware Visual Odometry | Sacchin Sundar et.al. | 2511.22860 | null |
| 2025-11-28 | Breaking the Visual Shortcuts in Multimodal Knowledge-Based Visual Question Answering | Dosung Lee et.al. | 2511.22843 | null |
| 2025-11-28 | Captain Safari: A World Engine | Yu-Cheng Chou et.al. | 2511.22815 | null |
| 2025-11-27 | Alzheimer’s Disease Prediction Using EffNetViTLoRA and BiLSTM with Multimodal Longitudinal MRI Data | Mahdieh Behjat Khatooni et.al. | 2511.22774 | null |
| 2025-11-27 | ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering | Alberto Compagnoni et.al. | 2511.22715 | null |
| 2025-11-27 | Test-time scaling of diffusions with flow maps | Amirmojtaba Sabour et.al. | 2511.22688 | null |
| 2025-11-27 | VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models | Silin Cheng et.al. | 2511.22664 | null |
| 2025-11-27 | GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents | Xinyu Zhang et.al. | 2511.22441 | null |
| 2025-11-27 | UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries | Hoang-Bao Le et.al. | 2511.22253 | null |
| 2025-11-26 | Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models | Naifu Zhang et.al. | 2511.21663 | null |
| 2025-11-26 | Fast 3D Ultrasound Localization Microscopy via Projection-based Processing Framework | Jingke Zhang et.al. | 2511.21647 | null |
| 2025-11-26 | Qwen3-VL Technical Report | Shuai Bai et.al. | 2511.21631 | null |
| 2025-11-26 | Scale-Agnostic Kolmogorov-Arnold Geometry in Neural Networks | Mathew Vanherreweghe et.al. | 2511.21626 | null |
| 2025-11-26 | Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy | Teng Hu et.al. | 2511.21579 | null |
| 2025-11-26 | CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation | Shizhe Sun et.al. | 2511.21503 | null |
| 2025-11-26 | Semantic-Enhanced Feature Matching with Learnable Geometric Verification for Cross-Modal Neuron Registration | Wenwei Li et.al. | 2511.21452 | null |
| 2025-11-26 | Hierarchical Besov-Laplace priors for spatially inhomogeneous binary classification | Patric Dolmeta et.al. | 2511.21441 | null |
| 2025-11-26 | FITRep: Attention-Guided Item Representation via MLLMs | Guoxiao Zhang et.al. | 2511.21389 | null |
| 2025-11-26 | Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning | Xin Gu et.al. | 2511.21375 | null |
| 2025-11-26 | The Directed Prediction Change - Efficient and Trustworthy Fidelity Assessment for Local Feature Attribution Methods | Kevin Iselborn et.al. | 2511.21363 | null |
| 2025-11-26 | HTTM: Head-wise Temporal Token Merging for Faster VGGT | Weitian Wang et.al. | 2511.21317 | null |
| 2025-11-26 | Neural NMPC through Signed Distance Field Encoding for Collision Avoidance | Martin Jacquet et.al. | 2511.21312 | null |
| 2025-11-26 | Low-dose Chemically Specific Bioimaging via Deep-UV Lensless Holographic Microscopy on a Standard Camera | Piotr Arcab et.al. | 2511.21311 | null |
| 2025-11-26 | Adaptive Lighting Control in Visible Light Systems: An Integrated Sensing, Communication, and Illumination Framework | Xinyan Xie et.al. | 2511.21271 | null |
| 2025-11-26 | Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition | Baoli Sun et.al. | 2511.21202 | null |
| 2025-11-26 | CAHS-Attack: CLIP-Aware Heuristic Search Attack Method for Stable Diffusion | Shuhan Xia et.al. | 2511.21180 | null |
| 2025-11-26 | LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs | Shichu Sun et.al. | 2511.21150 | link |
| 2025-11-26 | Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval | Anup Roy et.al. | 2511.21121 | null |
| 2025-11-26 | Scaling Foundation Models for Radar Scene Understanding | Pushkal Mishra et.al. | 2511.21105 | null |
| 2025-11-25 | Efficient Greedy Algorithms for Feature Selection in Robot Visual Localization | Vivek Pandey et.al. | 2511.20894 | null |
| 2025-11-25 | The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment | Ziheng Ouyang et.al. | 2511.20614 | null |
| 2025-11-25 | Adaptive Hopfield Network: Rethinking Similarities in Associative Memory | Shurong Wang et.al. | 2511.20609 | null |
| 2025-11-25 | New York Smells: A Large Multimodal Dataset for Olfaction | Ege Ozguroglu et.al. | 2511.20544 | null |
| 2025-11-25 | Wide Area Surface Dosimetry with Conformal Scintillator Array for External Beam Radiotherapy | Roman Vasyltsiv et.al. | 2511.20472 | null |
| 2025-11-25 | Power-Efficient Autonomous Mobile Robots | Liangkai Liu et.al. | 2511.20467 | null |
| 2025-11-25 | STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow | Jiatao Gu et.al. | 2511.20462 | null |
| 2025-11-25 | Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search | Yunqi Zhou et.al. | 2511.20460 | link |
| 2025-11-25 | A meshless data-tailored approach to compute statistics from scattered data with adaptive radial basis functions | Damien Rigutto et.al. | 2511.20449 | null |
| 2025-11-25 | A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control | Jiawei Lin et.al. | 2511.20401 | null |
| 2025-11-25 | Real-Space Imaging of Moiré-Confined Excitons in Twisted Bilayer MoS$_2$ | Laurens J. M. Westenberg et.al. | 2511.20398 | null |
| 2025-11-25 | Interactive Visualization of Proof-of-Work Consensus Protocol on Raspberry Pi | Anton Ivashkevich et.al. | 2511.20391 | null |
| 2025-11-25 | From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations | Zhiqing Guo et.al. | 2511.20359 | null |
| 2025-11-25 | 3D Motion Perception of Binocular Vision Target with PID-CNN | Shi Jiazhao et.al. | 2511.20332 | null |
| 2025-11-25 | TaCo: Capturing Spatio-Temporal Semantic Consistency in Remote Sensing Change Detection | Han Guo et.al. | 2511.20306 | null |
| 2025-11-25 | Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations | Chao Wang et.al. | 2511.20295 | null |
| 2025-11-25 | Bootstrapping Physics-Grounded Video Generation through VLM-Guided Iterative Self-Refinement | Yang Liu et.al. | 2511.20280 | null |
| 2025-11-25 | ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis | Advik Sinha et.al. | 2511.20274 | null |
| 2025-11-25 | DRL-Guided Neural Batch Sampling for Semi-Supervised Pixel-Level Anomaly Detection | Amirhossein Khadivi Noghredeh et.al. | 2511.20270 | null |
| 2025-11-25 | XiCAD: Camera Activation Detection in the Da Vinci Xi User Interface | Alexander C. Jenke et.al. | 2511.20254 | null |
| 2025-11-25 | V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs | Sen Nie et.al. | 2511.20223 | link |
| 2025-11-25 | Intelligent Image Search Algorithms Fusing Visual Large Models | Kehan Wang et.al. | 2511.19920 | null |
| 2025-11-24 | Wigner and Gabor phase-space analysis of propagators for evolution equations | Elena Cordero et.al. | 2511.19400 | null |
| 2025-11-24 | Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments | Jorge Ortigoso-Narro et.al. | 2511.19396 | null |
| 2025-11-24 | Neural Architecture Search for Quantum Autoencoders | Hibah Agha et.al. | 2511.19246 | null |
| 2025-11-24 | In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings | Teresa Guallart-Naval et.al. | 2511.19226 | null |
| 2025-11-24 | Can Modern Vision Models Understand the Difference Between an Object and a Look-alike? | Itay Cohen et.al. | 2511.19200 | null |
| 2025-11-24 | From Pixels to Posts: Retrieval-Augmented Fashion Captioning and Hashtag Generation | Moazzam Umer Gondal et.al. | 2511.19149 | null |
| 2025-11-24 | When Semantics Regulate: Rethinking Patch Shuffle and Internal Bias for Generated Image Detection with CLIP | Beilin Chu et.al. | 2511.19126 | null |
| 2025-11-24 | DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection | Hai Ci et.al. | 2511.19111 | null |
| 2025-11-24 | Graph-based 3D Human Pose Estimation using WiFi Signals | Jichao Chen et.al. | 2511.19105 | null |
| 2025-11-24 | Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach | Fan Nie et.al. | 2511.19080 | null |
| 2025-11-24 | Granular Computing-driven SAM: From Coarse-to-Fine Guidance for Prompt-Free Segmentation | Qiyang Yu et.al. | 2511.19062 | null |
| 2025-11-24 | LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space | Hai Wu et.al. | 2511.19057 | null |
| 2025-11-24 | Multi-height probing of horizontal flows in the solar photosphere | Teodor Kostić et.al. | 2511.19048 | null |
| 2025-11-24 | Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors | Haihang Wu et.al. | 2511.19031 | null |
| 2025-11-24 | Dynamic Granularity Matters: Rethinking Vision Transformers Beyond Fixed Patch Splitting | Qiyang Yu et.al. | 2511.19021 | null |
| 2025-11-24 | AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization | Christos Koutlis et.al. | 2511.18993 | null |
| 2025-11-24 | Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models | Santiago Moreno et.al. | 2511.18978 | null |
| 2025-11-24 | MagicWorld: Interactive Geometry-driven Video World Exploration | Guangyuan Li et.al. | 2511.18886 | null |
| 2025-11-24 | Personalized Federated Segmentation with Shared Feature Aggregation and Boundary-Focused Calibration | Ishmam Tashdeed et.al. | 2511.18847 | null |
| 2025-11-24 | SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation | Nimeshika Udayangani et.al. | 2511.18816 | null |
| 2025-11-23 | AIA-UltraNeRF: Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization | Shuai Zhang et.al. | 2511.18293 | null |
| 2025-11-23 | SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes | Jungho Lee et.al. | 2511.18290 | null |
| 2025-11-22 | Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models | Dachuan Zhao et.al. | 2511.18123 | null |
| 2025-11-21 | Effect of local environment on Ly$\alpha$ line profile in DESI/ODIN LAEs | Ana Sofía M. Uzsoy et.al. | 2511.17498 | null |
| 2025-11-21 | GPR-OdomNet: Difference and Similarity-Driven Odometry Estimation Network for Ground Penetrating Radar-Based Localization | Huaichao Wang et.al. | 2511.17457 | null |
| 2025-11-21 | REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing | Binger Chen et.al. | 2511.17442 | null |
| 2025-11-21 | Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers | Christopher Boland et.al. | 2511.17421 | null |
| 2025-11-21 | IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation | Yifan Li et.al. | 2511.17384 | null |
| 2025-11-21 | SVRecon: Sparse Voxel Rasterization for Surface Reconstruction | Seunghun Oh et.al. | 2511.17364 | null |
| 2025-11-21 | NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior | Dongbo Shi et.al. | 2511.17322 | null |
| 2025-11-21 | MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning | Wenrui Zhang et.al. | 2511.17300 | null |
| 2025-11-21 | Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation | Chuancheng Shi et.al. | 2511.17282 | null |
| 2025-11-21 | A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback | Bulat Khaertdinov et.al. | 2511.17255 | null |
| 2025-11-21 | Mixed Reality Scenic Live Streaming for Cultural Heritage: Visual Interactions in a Historic Landscape | Zeyu Huang et.al. | 2511.17246 | null |
| 2025-11-21 | Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers | Cris Claessens et.al. | 2511.17209 | null |
| 2025-11-21 | SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors | Kunyi Li et.al. | 2511.17207 | null |
| 2025-11-21 | Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition | Aditya Mishra et.al. | 2511.17183 | null |
| 2025-11-21 | Reflection-Based Relative Localization for Cooperative UAV Teams Using Active Markers | Tim Lakemann et.al. | 2511.17166 | null |
| 2025-11-21 | A lightweight detector for real-time detection of remote sensing images | Qianyi Wang et.al. | 2511.17147 | null |
| 2025-11-21 | Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation | Shuo Wang et.al. | 2511.17097 | null |
| 2025-11-21 | Spanning Tree Autoregressive Visual Generation | Sangkyu Lee et.al. | 2511.17089 | null |
| 2025-11-21 | ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion | Junming Liu et.al. | 2511.17068 | null |
| 2025-11-21 | Stable Offline Hand-Eye Calibration for any Robot with Just One Mark | Sicheng Xie et.al. | 2511.17001 | null |
| 2025-11-20 | Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation | Ziyu Guo et.al. | 2511.16671 | null |
| 2025-11-20 | Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems | Elias Lumer et.al. | 2511.16654 | null |
| 2025-11-20 | Measurement incompatibility in Bayesian multiparameter quantum estimation | Francesco Albarelli et.al. | 2511.16645 | null |
| 2025-11-20 | SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction | Guolin Huang et.al. | 2511.16635 | null |
| 2025-11-20 | SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking | Haofeng Liu et.al. | 2511.16618 | link |
| 2025-11-20 | POMA-3D: The Point Map Way to 3D Scene Understanding | Ye Mao et.al. | 2511.16567 | link |
| 2025-11-20 | NutriScreener: Retrieval-Augmented Multi-Pose Graph Attention Network for Malnourishment Screening | Misaal Khan et.al. | 2511.16566 | null |
| 2025-11-20 | Investigating Optical Flow Computation: From Local Methods to a Multiresolution Horn-Schunck Implementation with Bilinear Interpolation | Haytham Ziani et.al. | 2511.16535 | null |
| 2025-11-20 | Contrastive vision-language learning with paraphrasing and negation | Kwun Ho Ngan et.al. | 2511.16527 | null |
| 2025-11-20 | BoxingVI: A Multi-Modal Benchmark for Boxing Action Recognition and Localization | Rahul Kumar et.al. | 2511.16524 | null |
| 2025-11-20 | YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras | Fan Yang et.al. | 2511.16521 | null |
| 2025-11-20 | TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models | Li Zhang et.al. | 2511.16423 | null |
| 2025-11-20 | DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration | Meng-Cheng Shih et.al. | 2511.16364 | null |
| 2025-11-20 | CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering | Joni Vanherck et.al. | 2511.16349 | null |
| 2025-11-20 | Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks | Yi Ting Tsai et.al. | 2511.16341 | null |
| 2025-11-20 | Non-squeezing and other global rigidity results in locally conformal symplectic geometry | Mélanie Bertelson et.al. | 2511.16329 | null |
| 2025-11-20 | Real-Time Inference for Distributed Multimodal Systems under Communication Delay Uncertainty | Victor Croisfelt et.al. | 2511.16225 | null |
| 2025-11-20 | Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2511.16091 | null |
| 2025-11-20 | AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers | Boxun Xu et.al. | 2511.16047 | null |
| 2025-11-20 | InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer | Muyao Yuan et.al. | 2511.15967 | null |
| 2025-11-19 | GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization | Yikun Wang et.al. | 2511.15705 | link |
| 2025-11-19 | First Frame Is the Place to Go for Video Content Customization | Jingxi Chen et.al. | 2511.15700 | link |
| 2025-11-19 | Hyperspectral Image Classification using Spectral-Spatial Mixer Network | Mohammed Q. Alkhatib et.al. | 2511.15692 | link |
| 2025-11-19 | Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning | Tao Hu et.al. | 2511.15633 | null |
| 2025-11-19 | Catching the 2021 γ-ray flare in the blazar TXS 2013+370 | Giorgos Michailidis et.al. | 2511.15601 | null |
| 2025-11-19 | Multi-Text Guided Few-Shot Semantic Segmentation | Qiang Jiao et.al. | 2511.15515 | null |
| 2025-11-19 | SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome | Dabin Jeong et.al. | 2511.15464 | null |
| 2025-11-19 | HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation | Linyin Luo et.al. | 2511.15435 | null |
| 2025-11-19 | The Empowerment of Science of Science by Large Language Models: New Tools and Methods | Guoqiang Liang et.al. | 2511.15370 | null |
| 2025-11-19 | On the phase aberration estimation using common mid-angle correlations | Naiara Korta Martiartu et.al. | 2511.15336 | null |
| 2025-11-19 | C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models | Nayoung Oh et.al. | 2511.15333 | null |
| 2025-11-19 | Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval | Qing Wang et.al. | 2511.15201 | null |
| 2025-11-19 | Probing Electro-Magnetic Field Enhancement in 3D Plasmonic Nanopores Using DNA-PAINT and Nanorulers | German Lanzavecchia et.al. | 2511.15181 | null |
| 2025-11-19 | Multimodal Wireless Foundation Models | Ahmed Aboulfotouh et.al. | 2511.15162 | null |
| 2025-11-19 | Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation | Jin Wang et.al. | 2511.15118 | link |
| 2025-11-19 | BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer | Wenhan Yu et.al. | 2511.15090 | null |
| 2025-11-19 | Hyperspectral Super-Resolution with Inter-Image Variability via Degradation-based Low-Rank and Residual Fusion Method | Yue Wen et.al. | 2511.15052 | null |
| 2025-11-18 | Reconstruction of three-dimensional shapes of normal and disease-related erythrocytes from partial observations using multi-fidelity neural networks | Haizhou Wen et.al. | 2511.14962 | null |
| 2025-11-18 | FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding | Zhenshi Li et.al. | 2511.14901 | null |
| 2025-11-18 | Quantum Transport Spectroscopy of Pseudomagnetic Field in Graphene | Divya Sahani et.al. | 2511.14888 | null |
| 2025-11-18 | FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation | Yunfeng Wu et.al. | 2511.14712 | link |
| 2025-11-18 | Cell Shape Emerges from Motion | Gautham Gopinath et.al. | 2511.14707 | link |
| 2025-11-18 | Seeing Beyond the Image: ECG and Anatomical Knowledge-Guided Myocardial Scar Segmentation from Late Gadolinium-Enhanced Images | Farheen Ramzan et.al. | 2511.14702 | null |
| 2025-11-18 | Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities | Huiyan Zou et.al. | 2511.14687 | null |
| 2025-11-18 | A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases | Tao Yang et.al. | 2511.14638 | null |
| 2025-11-18 | Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains | Qingwei Ben et.al. | 2511.14625 | link |
| 2025-11-18 | Deep Learning-Based Regional White Matter Hyperintensity Mapping as a Robust Biomarker for Alzheimer’s Disease | Julia Machnio et.al. | 2511.14588 | null |
| 2025-11-18 | Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction | Jaume Ros et.al. | 2511.14544 | null |
| 2025-11-18 | D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images | Taifour Yousra Nabila et.al. | 2511.14518 | null |
| 2025-11-18 | Aerial Assistance System for Automated Firefighting during Turntable Ladder Operations | Jan Quenzel et.al. | 2511.14504 | null |
| 2025-11-18 | VENUS: A Strongly Lensed Clumpy Galaxy at $z\sim11-12$ behind the Galaxy Cluster MACS J0257.1-2325 | Minami Nakane et.al. | 2511.14483 | null |
| 2025-11-18 | Multi-network Topology Underlying Individual Language Learning Success | Peilun Song et.al. | 2511.14453 | null |
| 2025-11-18 | DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval | Zongwei Zhen et.al. | 2511.14449 | null |
| 2025-11-18 | Agentic Video Intelligence: A Flexible Framework for Advanced Video Exploration and Understanding | Hong Gao et.al. | 2511.14446 | null |
| 2025-11-18 | Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving | Kangqiao Zhao et.al. | 2511.14386 | null |
| 2025-11-18 | O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model | Rishi Gupta et.al. | 2511.14368 | null |
| 2025-11-18 | Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors | Jeryes Danial et.al. | 2511.14335 | null |
| 2025-11-18 | SAM-Fed: SAM-Guided Federated Semi-Supervised Learning for Medical Image Segmentation | Sahar Nasirihaghighi et.al. | 2511.14302 | null |
| 2025-11-18 | NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration | Luohong Wu et.al. | 2511.14286 | null |
| 2025-11-18 | Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery | Yiming Zeng et.al. | 2511.14270 | null |
| 2025-11-17 | Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images | Yinuo Xu et.al. | 2511.13586 | null |
| 2025-11-17 | Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification | Linhan Zhou et.al. | 2511.13575 | link |
| 2025-11-17 | Language-Guided Invariance Probing of Vision-Language Models | Jae Joong Lee et.al. | 2511.13494 | null |
| 2025-11-17 | Unlocking the Forgery Detection Potential of Vanilla MLLMs: A Novel Training-Free Pipeline | Rui Zuo et.al. | 2511.13442 | null |
| 2025-11-17 | Attention Grounded Enhancement for Visual Document Retrieval | Wanqing Cui et.al. | 2511.13415 | null |
| 2025-11-17 | An Unusual Velocity Field in a Sunspot Penumbra | H. Balthasar et.al. | 2511.13374 | null |
| 2025-11-17 | Stray Light Correction for the Helioseismic and Magnetic Imager | A. A. Norton et.al. | 2511.13348 | null |
| 2025-11-17 | GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models | Yushuo Zheng et.al. | 2511.13259 | null |
| 2025-11-17 | Referring Camouflaged Object Detection With Multi-Context Overlapped Windows Cross-Attention | Yu Wen et.al. | 2511.13249 | null |
| 2025-11-17 | Uncovering and Mitigating Transient Blindness in Multimodal Model Editing | Xiaoqi Han et.al. | 2511.13243 | null |
| 2025-11-17 | GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry | Chiyun Noh et.al. | 2511.13216 | null |
| 2025-11-17 | Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework | Diego Ortego et.al. | 2511.13189 | null |
| 2025-11-17 | GenTract: Generative Global Tractography | Alec Sargood et.al. | 2511.13183 | null |
| 2025-11-17 | THIR: Topological Histopathological Image Retrieval | Zahra Tabatabaei et.al. | 2511.13170 | null |
| 2025-11-17 | SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration | Haodong Wang et.al. | 2511.13168 | null |
| 2025-11-17 | MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications | Gagan Raj Gupta et.al. | 2511.13131 | null |
| 2025-11-17 | Region-Point Joint Representation for Effective Trajectory Similarity Learning | Hao Long et.al. | 2511.13125 | null |
| 2025-11-17 | Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining | Zhaocheng Yu et.al. | 2511.13113 | null |
| 2025-11-17 | uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data | Dahyun Chung et.al. | 2511.13036 | null |
| 2025-11-17 | Towards 3D Object-Centric Feature Learning for Semantic Scene Completion | Weihua Wang et.al. | 2511.13031 | null |
| 2025-11-14 | DocLens: A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding | Dawei Zhu et.al. | 2511.11552 | null |
| 2025-11-14 | STEM EBIC as a Quantitative Probe of Semiconductor Devices | Sebastian Schneider et.al. | 2511.11528 | null |
| 2025-11-14 | Bridging Hidden States in Vision-Language Models | Benjamin Fein-Ashley et.al. | 2511.11526 | null |
| 2025-11-14 | OpenUS: A Fully Open-Source Foundation Model for Ultrasound Image Analysis via Self-Adaptive Masked Contrastive Learning | Xiaoyu Zheng et.al. | 2511.11510 | link |
| 2025-11-14 | Planetary nebulae as tracers of stellar population properties: a pilot study with MUSE | Ana Inés Ennis et.al. | 2511.11479 | null |
| 2025-11-14 | Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs | Francisco Nogueira et.al. | 2511.11427 | null |
| 2025-11-14 | Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment | Lukun Wu et.al. | 2511.11422 | null |
| 2025-11-14 | Bidimensional measurements of photon statistics within a multimodal temporal framework | C. Hainaut et.al. | 2511.11403 | null |
| 2025-11-14 | GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes | Shumit A. Mitra et.al. | 2511.11401 | null |
| 2025-11-14 | Shadow-Induced Warps in Protoplanetary disks | Shangjia Zhang et.al. | 2511.11358 | null |
| 2025-11-14 | Gluing sheaves along Harder-Narasimhan strata of $\mathrm{Bun}_G$ | Jon Miles et.al. | 2511.11327 | null |
| 2025-11-14 | StochEP: Stochastic Equilibrium Propagation for Spiking Convergent Recurrent Neural Networks | Jiaqi Lin et.al. | 2511.11320 | null |
| 2025-11-14 | DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding | Tanveer Hannan et.al. | 2511.11313 | null |
| 2025-11-14 | MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising | Chenghan Fu et.al. | 2511.11305 | null |
| 2025-11-14 | Coordinative Learning with Ordinal and Relational Priors for Volumetric Medical Image Segmentation | Haoyi Wang et.al. | 2511.11276 | link |
| 2025-11-14 | 3D Stokes polarimetric imaging at nanoscales | Isael Herrera et.al. | 2511.11222 | null |
| 2025-11-14 | Positional Bias in Multimodal Embedding Models: Do They Favor the Beginning, the Middle, or the End? | Kebin Wu et.al. | 2511.11216 | null |
| 2025-11-14 | Inverse modeling of porous flow through deep neural networks: the case of coffee percolation | Antoniorenee Barletta et.al. | 2511.11194 | null |
| 2025-11-14 | CareCom: Generative Image Composition with Calibrated Reference Features | Jiaxuan Chen et.al. | 2511.11060 | null |
| 2025-11-14 | MPCGNet: A Multiscale Feature Extraction and Progressive Feature Aggregation Network Using Coupling Gates for Polyp Segmentation | Wei Wang et.al. | 2511.11032 | null |
| 2025-11-13 | Multitask GLocal OBIA-Mamba for Sentinel-2 Landcover Mapping | Zack Dewis et.al. | 2511.10604 | null |
| 2025-11-13 | Excitonic Landscapes in Monolayer Lateral Heterostructures Revealed by Unsupervised Machine Learning | Maninder Kaur et.al. | 2511.10600 | null |
| 2025-11-13 | Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering | Bavana Durgapraveen et.al. | 2511.10591 | null |
| 2025-11-13 | Tight Robustness Certification through the Convex Hull of $\ell_0$ Attacks | Yuval Shapira et.al. | 2511.10576 | null |
| 2025-11-13 | Two Americas of Well-Being: Divergent Rural-Urban Patterns of Life Satisfaction and Happiness from 2.6 B Social Media Posts | Stefano Maria Iacus et.al. | 2511.10542 | null |
| 2025-11-13 | SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation | Wei Li et.al. | 2511.10518 | null |
| 2025-11-13 | Measuring dissimilarity between convex cones by means of max-min angles | Welington de Oliveira et.al. | 2511.10483 | null |
| 2025-11-13 | Extending the Frontier of Spatially-Resolved Supermassive Black Hole Mass Measurements to at $1\lesssim z\lesssim2$: Simulations with ELT/MICADO High-Resolution Mass Models and HARMONI Integral-Field Stellar Kinematics | Dieu D. Nguyen et.al. | 2511.10427 | null |
| 2025-11-13 | Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators | Maximiliane Gruber et.al. | 2511.10424 | null |
| 2025-11-13 | MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns | Jiarui Zhang et.al. | 2511.10390 | null |
| 2025-11-13 | Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery | Prince Mensah et.al. | 2511.10387 | null |
| 2025-11-13 | DermAI: Clinical dermatology acquisition through quality-driven image collection for AI classification in mobile | Thales Bezerra et.al. | 2511.10367 | null |
| 2025-11-13 | Rethinking Visual Information Processing in Multimodal LLMs | Dongwan Kim et.al. | 2511.10301 | null |
| 2025-11-13 | H3Former: Hypergraph-based Semantic-Aware Aggregation via Hyperbolic Hierarchical Contrastive Loss for Fine-Grained Visual Classification | Yongji Zhang et.al. | 2511.10260 | null |
| 2025-11-13 | TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding | Jinxuan Li et.al. | 2511.10241 | null |
| 2025-11-13 | Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization | Ashutosh Anshul et.al. | 2511.10212 | null |
| 2025-11-13 | Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA | Yiran Zhang et.al. | 2511.10182 | null |
| 2025-11-13 | GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval | Hao Zou et.al. | 2511.10154 | null |
| 2025-11-13 | Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction | Mingda Jia et.al. | 2511.10134 | null |
| 2025-11-13 | GridPrune: From “Where to Look” to “What to Select” in Visual Token Pruning for MLLMs | Yuxiang Duan et.al. | 2511.10081 | null |
| 2025-11-10 | TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research | Han Zhang et.al. | 2511.07412 | null |
| 2025-11-10 | LeCoT: revisiting network architecture for two-view correspondence pruning | Luanyuan Dai et.al. | 2511.07078 | null |
| 2025-11-09 | DiffusionUavLoc: Visually Prompted Diffusion for Cross-View UAV Localization | Tao Liu et.al. | 2511.06422 | link |
| 2025-11-09 | ALIGN: A Vision-Language Framework for High-Accuracy Accident Location Inference through Geo-Spatial Neural Reasoning | MD Thamed Bin Zaman Chowdhury et.al. | 2511.06316 | null |
| 2025-11-08 | Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era | Feng Lu et.al. | 2511.06024 | link |
| 2025-11-08 | Hilbert-Guided Block-Sparse Local Attention | Yunge Li et.al. | 2511.05832 | null |
| 2025-11-07 | Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Laura Alejandra Encinar Gonzalez et.al. | 2511.05404 | null |
| 2025-11-07 | DAFM: Dynamic Adaptive Fusion for Multi-Model Collaboration in Composed Image Retrieval | Yawei Cai et.al. | 2511.05020 | null |
| 2025-11-06 | Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA | Itbaan Safwan et.al. | 2511.04384 | null |
| 2025-11-06 | An Efficient Algorithm for Learning-Based Visual Localization | Jindi Zhong et.al. | 2511.04232 | null |
| 2025-11-05 | The Human Flourishing Geographic Index: A County-Level Dataset for the United States, 2013–2023 | Stefano M. Iacus et.al. | 2511.03915 | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | null |
| 2025-11-04 | LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment | Rohan Wandre et.al. | 2511.02371 | null |
| 2025-11-03 | SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment | Xinyu Mao et.al. | 2511.01390 | null |
| 2025-11-02 | GeoToken: Hierarchical Geolocalization of Images via Next Token Prediction | Narges Ghasemi et.al. | 2511.01082 | null |
| 2025-11-02 | Dynamic Multi-level Weighted Alignment Network for Zero-shot Sketch-based Image Retrieval | Hanwen Su et.al. | 2511.00925 | null |
| 2025-10-31 | GEDICorrect: A Scalable Python Tool for Orbit-, Beam-, and Footprint-Level GEDI Geolocation Correction | Leonel Corado et.al. | 2511.00319 | null |
| 2025-10-31 | Approximate Diverse $k$-nearest Neighbor Search in Vector Database | Jiachen Zhao et.al. | 2510.27243 | null |
| 2025-11-03 | Evaluating Perspectival Biases in Cross-Modal Retrieval | Teerapol Saengsukhiran et.al. | 2510.26861 | null |
| 2025-10-30 | Scaling Image Geo-Localization to Continent Level | Philipp Lindenberger et.al. | 2510.26795 | null |
| 2025-10-29 | Citizen science dataset on residents’ urban heat perception in outdoor public spaces of climate-vulnerable neighborhoods | Ferran Larroya et.al. | 2510.25645 | null |
| 2025-10-29 | Instance-Level Composed Image Retrieval | Bill Psomas et.al. | 2510.25387 | null |
| 2025-10-28 | DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts | Binbin Li et.al. | 2510.24813 | null |
| 2025-10-27 | Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment | Hongyi Wang et.al. | 2510.23224 | null |
| 2025-10-26 | Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models | Yang Zhang et.al. | 2510.22868 | null |
| 2025-10-30 | Cross-view Localization and Synthesis – Datasets, Challenges and Opportunities | Ningli Xu et.al. | 2510.22736 | null |
| 2025-10-26 | STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models | Mahiro Ukai et.al. | 2510.22571 | null |
| 2025-10-25 | Cross-Platform Short-Video Diplomacy: Topic and Sentiment Analysis of China-US Relations on Douyin and TikTok | Zheng Wei et.al. | 2510.22415 | null |
| 2025-10-24 | BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models | Ziheng Zhang et.al. | 2510.20095 | null |
| 2025-10-18 | Small Language Models Offer Significant Potential for Science Community | Jian Zhang et.al. | 2510.18890 | null |
| 2025-10-21 | Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection | Ji Du et.al. | 2510.18437 | link |
| 2025-10-21 | ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization | Yuanhe Guo et.al. | 2510.18433 | null |
| 2025-10-21 | DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing | Luxuan Li et.al. | 2510.18218 | null |
| 2025-10-20 | Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition | Timur Ismagilov et.al. | 2510.17739 | null |
| 2025-10-18 | iWatchRoadv2: Pothole Detection, Geospatial Mapping, and Intelligent Road Governance | Rishi Raj Sahoo et.al. | 2510.16375 | link |
| 2025-10-16 | Acquisition of interpretable domain information during brain MR image harmonization for content-based image retrieval | Keima Abe et.al. | 2510.14535 | null |
| 2025-10-15 | Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition | Emily Miller et.al. | 2510.13464 | null |
| 2025-10-15 | Mobile Coverage Analysis using Crowdsourced Data | Timothy Wong et.al. | 2510.13459 | null |
| 2025-10-13 | Embedding the Teacher: Distilling vLLM Preferences for Scalable Image Retrieval | Eric He et.al. | 2510.12014 | null |
| 2025-10-13 | Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across Scales | Zhaofang Qian et.al. | 2510.10880 | null |
| 2025-10-08 | Population synthesis with geographic coordinates | Jacopo Lenti et.al. | 2510.09669 | null |
| 2025-10-10 | Hierarchical Scheduling for Multi-Vector Image Retrieval | Maoliang Li et.al. | 2510.08976 | null |
| 2025-10-09 | DarkHash: A Data-Free Backdoor Attack Against Deep Hashing | Ziqi Zhou et.al. | 2510.08094 | null |
| 2025-10-09 | CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning | Weihuang Lin et.al. | 2510.08003 | null |
| 2025-10-09 | Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision | Xiaoxu Ma et.al. | 2510.07703 | null |
| 2025-10-08 | Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval | Didrik Bergström et.al. | 2510.06868 | null |
| 2025-10-07 | CalibCLIP: Contextual Calibration of Dominant Semantics for Text-Driven Image Retrieval | Bin Kang et.al. | 2510.05586 | null |
| 2025-10-06 | Personalizing Retrieval using Joint Embeddings or “the Return of Fluffy” | Bruno Korbar et.al. | 2510.05411 | null |
| 2025-10-05 | Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition | Yu Kiu et.al. | 2510.04282 | null |
| 2025-10-04 | The Overlooked Value of Test-time Reference Sets in Visual Place Recognition | Mubariz Zaffar et.al. | 2510.03751 | null |
| 2025-10-03 | Team Xiaomi EV-AD VLA: Caption-Guided Retrieval System for Cross-Modal Drone Navigation – Technical Report for IROS 2025 RoboSense Challenge Track 4 | Lingfeng Zhang et.al. | 2510.02728 | null |
| 2025-10-01 | A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features | Axel Barroso-Laguna et.al. | 2510.00978 | null |
| 2025-10-01 | Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions | Thanh Nguyen Canh et.al. | 2510.00783 | null |
| 2025-09-30 | Video Object Segmentation-Aware Audio Generation | Ilpo Viertola et.al. | 2509.26604 | null |
| 2025-09-30 | SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval | Ren-Di Wu et.al. | 2509.26330 | null |
| 2025-09-30 | SETR: A Two-Stage Semantic-Enhanced Framework for Zero-Shot Composed Image Retrieval | Yuqi Xiao et.al. | 2509.26012 | null |
| 2025-09-30 | SAGE: Spatial-visual Adaptive Graph Exploration for Visual Place Recognition | Shunpeng Chen et.al. | 2509.25723 | null |
| 2025-09-29 | Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity | Tu-Hoa Pham et.al. | 2509.25520 | null |
| 2025-09-29 | Performance-Efficiency Trade-off for Fashion Image Retrieval | Julio Hurtado et.al. | 2509.24477 | null |
| 2025-09-28 | Prepare for Warp Speed: Sub-millisecond Visual Place Recognition Using Event Cameras | Vignesh Ramanathan et.al. | 2509.24094 | null |
| 2025-09-27 | Terrorism & Democracy in Burkina-Faso | P Carmel Marie Zagre et.al. | 2509.23046 | null |
| 2025-09-26 | Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation | Jinpeng Lu et.al. | 2509.22307 | null |
| 2025-09-25 | Enhancing Contrastive Learning for Geolocalization by Discovering Hard Negatives on Semivariograms | Boyi Chen et.al. | 2509.21573 | null |
| 2025-09-23 | SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment | Binod Singh et.al. | 2509.20401 | null |
| 2025-09-24 | A Versatile Foundation Model for AI-enabled Mammogram Interpretation | Fuxiang Huang et.al. | 2509.20271 | null |
| 2025-09-23 | Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions | Ioanna Ntinou et.al. | 2509.19203 | link |
| 2025-09-30 | OrthoLoC: UAV 6-DoF Localization and Calibration Using Orthographic Geodata | Oussema Dhaouadi et.al. | 2509.18350 | link |
| 2025-09-21 | Learning Attribute-Aware Hash Codes for Fine-Grained Image Retrieval via Query Optimization | Peng Wang et.al. | 2509.17049 | null |
| 2025-09-20 | PM25Vision: A Large-Scale Benchmark Dataset for Visual Estimation of Air Quality | Yang Han et.al. | 2509.16519 | null |
| 2025-09-25 | Efficient Multimodal Dataset Distillation via Generative Models | Zhenghao Zhao et.al. | 2509.15472 | link |
| 2025-09-18 | SERVAL: Surprisingly Effective Zero-Shot Visual Document Retrieval Powered by Large Vision and Language Models | Thong Nguyen et.al. | 2509.15432 | link |
| 2025-09-18 | Assessing metadata privacy in neuroimaging | Emilie Kibsgaard et.al. | 2509.15278 | null |
| 2025-09-18 | PRISM: Product Retrieval In Shopping Carts using Hybrid Matching | Arda Kabadayi et.al. | 2509.14985 | null |
| 2025-09-18 | Chain-of-Thought Re-ranking for Image Retrieval Tasks | Shangrong Wu et.al. | 2509.14746 | null |
| 2025-09-18 | DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising | Li Gao et.al. | 2509.14565 | null |
| 2025-09-18 | Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods | Adam D. Hines et.al. | 2509.14516 | link |
| 2025-09-17 | Hashing-Baseline: Rethinking Hashing in the Age of Pretrained Models | Ilyass Moummad et.al. | 2509.14427 | link |
| 2025-09-17 | CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts | Leonard Hackel et.al. | 2509.14104 | null |
| 2025-09-16 | Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization | Yujia Lin et.al. | 2509.13474 | null |
| 2025-09-18 | MapAnything: Universal Feed-Forward Metric 3D Reconstruction | Nikhil Keetha et.al. | 2509.13414 | link |
| 2025-09-17 | DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval | Zechao Liu et.al. | 2509.12824 | null |
| 2025-09-16 | Ketto and the Science of Giving: A Data-Driven Investigation of Crowdfunding for India | Karuna Chandra et.al. | 2509.12616 | null |
| 2025-09-15 | Bridging Vision Language Models and Symbolic Grounding for Video Question Answering | Haodi Ma et.al. | 2509.11862 | null |
| 2025-09-14 | UnLoc: Leveraging Depth Uncertainties for Floorplan Localization | Matthias Wüest et.al. | 2509.11301 | null |
| 2025-09-12 | A Stochastic Birth-and-Death Approach for Street Furniture Geolocation in Urban Environments | Evan Murphy et.al. | 2509.10310 | null |
| 2025-09-11 | Listening for “You”: Enhancing Speech Image Retrieval via Target Speaker Extraction | Wenhao Yang et.al. | 2509.09306 | null |
| 2025-09-09 | Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark | Yandi Yang et.al. | 2509.07362 | null |
| 2025-09-08 | Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval | Emil Demić et.al. | 2509.06566 | null |
| 2025-09-06 | Augmenting Human-Centered Racial Covenant Detection and Georeferencing with Plug-and-Play NLP Pipelines | Jiyoon Pyo et.al. | 2509.05829 | null |
| 2025-09-05 | Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots) | Emanuela Boros et.al. | 2509.04948 | null |
| 2025-09-05 | FloodVision: Urban Flood Depth Estimation Using Foundation Vision-Language Models and Domain Knowledge Graph | Zhangding Liu et.al. | 2509.04772 | null |
| 2025-09-05 | Global-to-Local or Local-to-Global? Enhancing Image Retrieval with Efficient Local Search and Effective Global Re-ranking | Dror Aiger et.al. | 2509.04351 | null |
| 2025-09-05 | GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization | Pengyue Jia et.al. | 2509.04334 | link |
| 2025-09-04 | DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval | Ruohong Yang et.al. | 2509.04193 | null |
| 2025-09-04 | A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning | Qika Lin et.al. | 2509.03906 | null |
| 2025-09-02 | Scale, Don’t Fine-tune: Guiding Multimodal LLMs for Efficient Visual Place Recognition at Test-Time | Jintao Cheng et.al. | 2509.02129 | null |
| 2025-09-02 | Ensemble-Based Event Camera Place Recognition Under Varying Illumination | Therese Joseph et.al. | 2509.01968 | null |
| 2025-09-01 | ConamArray: A 32-Element Broadband MEMS Ultrasound Transducer Array | Dennis Laurijssen et.al. | 2509.01372 | null |
| 2025-09-01 | M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision | Che Liu et.al. | 2509.01360 | null |
| 2025-09-01 | Street-Level Geolocalization Using Multimodal Large Language Models and Retrieval-Augmented Generation | Yunus Serhat Bicakci et.al. | 2509.01341 | null |
| 2025-09-01 | ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization | Thinh-Phuc Nguyen et.al. | 2509.01259 | null |
| 2025-09-03 | Multimodal Iterative RAG for Knowledge Visual Question Answering | Changin Choi et.al. | 2509.00798 | null |
| 2025-08-31 | Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification | Y Hop Nguyen et.al. | 2509.00752 | null |
| 2025-08-31 | EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions | Dinh-Khoi Vo et.al. | 2509.00751 | null |
| 2025-08-29 | Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders | Faizan Farooq Khan et.al. | 2509.00177 | null |
| 2025-08-29 | HCCM: Hierarchical Cross-Granularity Contrastive and Matching Learning for Natural Language-Guided Drones | Hao Ruan et.al. | 2508.21539 | null |
| 2025-08-27 | Disentangling Latent Embeddings with Sparse Linear Concept Subspaces (SLiCS) | Zhi Li et.al. | 2508.20322 | null |
| 2025-08-27 | Low-exposure, high-quality multimodal speckle X-ray imaging via an intrinsic gradient-flow approach | Jayvan Liu et.al. | 2508.20209 | null |
| 2025-08-27 | Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study | Max Torop et.al. | 2508.20188 | null |
| 2024-09-27 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | null |
| 2024-08-14 | Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN | Hao Li et.al. | 2408.06761 | null |
| 2024-07-25 | MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Gabriele Berton et.al. | 2406.02776 | null |
| 2024-04-02 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
| 2023-09-11 | Comparative Study of Visual SLAM-Based Mobile Robot Localization Using Fiducial Markers | Jongwon Lee et.al. | 2309.04441 | null |
| 2023-08-16 | Wide-Area Geolocalization with a Limited Field of View Camera in Challenging Urban Environments | Lena M. Downes et.al. | 2308.07432 | null |
| 2023-04-18 | CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression | Mubariz Zaffar et.al. | 2304.07426 | null |
| 2023-05-19 | Wide-Area Geolocalization with a Limited Field of View Camera | Lena M. Downes et.al. | 2209.11854 | null |
| 2022-08-09 | A Survey on Visual Map Localization Using LiDARs and Cameras | Elhousni Mahdi et.al. | 2208.03376 | null |
| 2022-07-26 | ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization | Ivan Cisneros et.al. | 2207.12317 | null |
| 2022-06-01 | Investigating the Role of Image Retrieval for Visual Localization – An exhaustive benchmark | Martin Humenberger et.al. | 2205.15761 | null |
| 2022-05-25 | VPAIR – Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments | Michael Schleiss et.al. | 2205.11567 | null |
| 2021-05-10 | Probabilistic Visual Place Recognition for Hierarchical Localization | Ming Xu et.al. | 2105.03091 | null |
| 2021-02-26 | Scene Retrieval for Contextual Visual Mapping | William H. B. Smith et.al. | 2102.12728 | null |
| 2020-12-02 | Benchmarking Image Retrieval for Visual Localization | Noé Pion et.al. | 2011.11946 | null |
| 2023-05-02 | City-Scale Visual Place Recognition with Deep Local Features Based on Multi-Scale Ordered VLAD Pooling | Duc Canh Le et.al. | 2009.09255 | null |
| 2019-04-16 | Localizing Discriminative Visual Landmarks for Place Recognition | Zhe Xin et.al. | 1904.06635 | null |
| 2018-09-18 | UAV Pose Estimation using Cross-view Geolocalization with Satellite Imagery | Akshay Shetty et.al. | 1809.05979 | null |
| 2018-05-16 | Visual Global Localization with a Hybrid WNN-CNN Approach | Avelino Forechi et.al. | 1805.03183 | link |
| 2017-04-28 | Real-Time Visual Place Recognition for Personal Localization on a Mobile Device | Michał Nowicki et.al. | 1611.02061 | null |
(<a href=#updated-on-20260404>back to top</a>)
| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-04-02 | Riemannian and Symplectic Geometry for Hierarchical Text-Driven Place Recognition | Tianyi Shang et.al. | 2604.01598 | null |
| 2026-03-16 | Voronoi-based Second-order Descriptor with Whitened Metric in LiDAR Place Recognition | Jaein Kim et.al. | 2603.14974 | null |
| 2026-03-09 | RLPR: Radar-to-LiDAR Place Recognition via Two-Stage Asymmetric Cross-Modal Alignment for Autonomous Driving | Zhangshuo Qi et.al. | 2603.07920 | null |
| 2026-03-06 | PROBE: Probabilistic Occupancy BEV Encoding with Analytical Translation Robustness for 3D Place Recognition | Jinseop Lee et.al. | 2603.05965 | null |
| 2026-01-29 | Advanced techniques and applications of LiDAR Place Recognition in Agricultural Environments: A Comprehensive Survey | Judith Vilella-Cantos et.al. | 2601.22198 | null |
| 2026-01-26 | Low Cost, High Efficiency: LiDAR Place Recognition in Vineyards with Matryoshka Representation Learning | Judith Vilella-Cantos et.al. | 2601.18714 | null |
| 2025-12-05 | NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections | Juho Korkeala et.al. | 2512.05610 | null |
| 2025-12-04 | A dynamic memory assignment strategy for dilation-based ICP algorithm on embedded GPUs | Qiong Chang et.al. | 2512.04996 | null |
| 2025-12-04 | TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards | Mauro Martini et.al. | 2512.04772 | null |
| 2025-12-03 | DM3D: Deformable Mamba via Offset-Guided Gaussian Sequencing for Point Cloud Understanding | Bin Liu et.al. | 2512.03424 | null |
| 2025-12-03 | What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models | Tianchen Deng et.al. | 2512.03422 | null |
| 2025-12-02 | GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection | Md Sohag Mia et.al. | 2512.02991 | null |
| 2025-12-02 | Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models | Pierpaolo Serio et.al. | 2512.02897 | null |
| 2025-12-01 | Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching | Yue Pan et.al. | 2512.01850 | null |
| 2025-12-01 | RoboLoc: A Benchmark Dataset for Point Place Recognition and Localization in Indoor-Outdoor Integrated Environments | Jaejin Jeon et.al. | 2512.01194 | null |
| 2025-11-30 | LAHNet: Local Attentive Hashing Network for Point Cloud Registration | Wentao Qu et.al. | 2512.00927 | null |
| 2025-11-27 | BrepGPT: Autoregressive B-rep Generation with Voronoi Half-Patch | Pu Li et.al. | 2511.22171 | null |
| 2025-11-27 | Constant-Volume Deformation Manufacturing for Material-Efficient Shaping | Lei Li et.al. | 2511.22042 | null |
| 2025-11-26 | Diagonal Scaling: A Multi-Dimensional Resource Model and Optimization Framework for Distributed Databases | Shahir Abdullah et.al. | 2511.21612 | null |
| 2025-11-26 | PFF-Net: Patch Feature Fitting for Point Cloud Normal Estimation | Qing Li et.al. | 2511.21365 | null |
| 2025-11-26 | $δ$-core subsampling, strong collapses and TDA | Elias Gabriel Minian et.al. | 2511.20954 | null |
| 2025-11-25 | Accelerating Sparse Convolutions in Voxel-Based Point Cloud Networks | Dionysios Adamopoulos et.al. | 2511.20834 | null |
| 2025-11-25 | DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion | Yinghui Li et.al. | 2511.20278 | null |
| 2025-11-25 | FLaTEC: Frequency-Disentangled Latent Triplanes for Efficient Compression of LiDAR Point Clouds | Xiaoge Zhang et.al. | 2511.20065 | null |
| 2025-11-24 | PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion | Yichen Yang et.al. | 2511.18801 | null |
| 2025-11-23 | Object-centric Task Representation and Transfer using Diffused Orientation Fields | Cem Bilaloglu et.al. | 2511.18563 | null |
| 2025-11-22 | Two-step Generalized RBF-Generated Finite Difference Method on Manifolds | Rongji Li et.al. | 2511.18049 | null |
| 2025-11-21 | RL-AD-Net: Reinforcement Learning Guided Adaptive Displacement in Latent Space for Refined Point Cloud Completion | Bhanu Pratap Paregi et.al. | 2511.17054 | null |
| 2025-11-20 | CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering | Joni Vanherck et.al. | 2511.16349 | null |
| 2025-11-20 | Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion | Lirui Zhang et.al. | 2511.16161 | null |
| 2025-11-20 | Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2511.16091 | null |
| 2025-11-19 | Atlas Gaussian processes on restricted domains and point clouds | Mu Niu et.al. | 2511.15822 | null |
| 2025-11-21 | The MeerKAT Fornax Survey VI. The collapse of the galaxy HI Mass Function in Fornax | D. Kleiner et.al. | 2511.15795 | null |
| 2025-11-19 | Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2511.15597 | null |
| 2025-11-19 | Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language | Yan Xia et.al. | 2511.15308 | null |
| 2025-11-18 | NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration | Luohong Wu et.al. | 2511.14286 | null |
| 2025-11-17 | Part-X-MLLM: Part-aware 3D Multimodal Large Language Model | Chunshi Wang et.al. | 2511.13647 | link |
| 2025-11-18 | ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes | Yixuan Yang et.al. | 2511.12977 | null |
| 2025-11-12 | Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement | Lian He et.al. | 2511.11702 | null |
| 2025-11-18 | LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning | Xinran Yang et.al. | 2511.10040 | null |
| 2025-11-13 | AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models | Xinyi Wang et.al. | 2511.10017 | link |
| 2025-11-12 | PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model | Yunqian Cheng et.al. | 2511.09724 | link |
| 2025-11-12 | IFG: Internet-Scale Guidance for Functional Grasping Generation | Ray Muxin Liu et.al. | 2511.09558 | null |
| 2025-11-12 | HOTFLoc++: End-to-End Hierarchical LiDAR Place Recognition, Re-Ranking, and 6-DoF Metric Localisation in Forests | Ethan Griffiths et.al. | 2511.09170 | null |
| 2025-11-11 | Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention Mechanisms | Jiaxun Guo et.al. | 2511.08833 | null |
| 2025-11-11 | Hierarchical Direction Perception via Atomic Dot-Product Operators for Rotation-Invariant Point Clouds Learning | Chenyu Hu et.al. | 2511.08240 | null |
| 2025-11-11 | Accurate and Efficient Surface Reconstruction from Point Clouds via Geometry-Aware Local Adaptation | Eito Ogawa et.al. | 2511.08233 | null |
| 2025-11-10 | Semi-distributed Cross-modal Air-Ground Relative Localization | Weining Lu et.al. | 2511.06749 | null |
| 2025-11-10 | PointCubeNet: 3D Part-level Reasoning with 3x3x3 Point Cloud Blocks | Da-Yeong Kim et.al. | 2511.06744 | null |
| 2025-11-07 | Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Laura Alejandra Encinar Gonzalez et.al. | 2511.05404 | null |
| 2025-11-10 | Rethinking Metrics and Diffusion Architecture for 3D Point Cloud Generation | Matteo Bastico et.al. | 2511.05308 | null |
| 2025-11-07 | Implicit reconstruction from point cloud: an adaptive level-set-based semi-Lagrangian method | Silvia Preda et.al. | 2511.05145 | null |
| 2025-11-04 | Curvature of high-dimensional data | Jiayi Chen et.al. | 2511.02873 | null |
| 2025-11-02 | GauDP: Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies | Ziye Wang et.al. | 2511.00998 | null |
| 2025-11-02 | Modeling Microenvironment Trajectories on Spatial Transcriptomics with NicheFlow | Kristiyan Sakalyan et.al. | 2511.00977 | link |
| 2025-11-04 | Towards classification-based representation learning for place recognition on LiDAR scans | Maksim Konoplia et.al. | 2511.00738 | null |
| 2025-11-01 | Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles | Hyungtae Lim et.al. | 2511.00635 | link |
| 2025-10-31 | MambaNetLK: Enhancing Colonoscopy Point Cloud Registration with Mamba | Linzhe Jiang et.al. | 2511.00260 | null |
| 2025-10-29 | Figuring Out Gas & Galaxies In Enzo (FOGGIE) XI: Circumgalactic O VI Emission Traces Clumpy Inflowing Recycled Gas | Cassandra Lochhaas et.al. | 2510.25844 | null |
| 2025-10-02 | LangGrasp: Leveraging Fine-Tuned LLMs for Language Interactive Robot Grasping with Ambiguous Instructions | Yunhan Lin et.al. | 2510.02104 | null |
| 2025-08-09 | LifelongPR: Lifelong point cloud place recognition based on sample replay and prompt learning | Xianghong Zou et.al. | 2507.10034 | null |
| 2025-08-08 | ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models | Minwoo Jung et.al. | 2505.18364 | null |
| 2025-05-26 | MinkUNeXt-SI: Improving point cloud-based place recognition including spherical coordinates and LiDAR intensity | Judith Vilella-Cantos et.al. | 2505.17591 | null |
| 2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
| 2025-08-27 | OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion | Shuhao Kang et.al. | 2504.19258 | null |
| 2025-06-19 | An Iterative Task-Driven Framework for Resilient LiDAR Place Recognition in Adverse Weather | Xiongwei Zhao et.al. | 2504.14806 | null |
| 2025-04-16 | Diffusion Based Robust LiDAR Place Recognition | Benjamin Krummenacher et.al. | 2504.12412 | null |
| 2025-10-03 | Vehicle-Scene Interaction: A Text-Driven 3D Lidar Place Recognition Method for Autonomous Driving | Tianyi Shang et.al. | 2503.18035 | null |
| 2025-10-29 | L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery | Ziwei Shi et.al. | 2503.11245 | null |
| 2025-03-21 | HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views | Ethan Griffiths et.al. | 2503.08140 | null |
| 2025-03-06 | ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images | Yanqing Shen et.al. | 2503.04475 | null |
| 2025-03-20 | CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework | Yanlong Xu et.al. | 2503.02593 | null |
| 2025-02-07 | HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer | Minwoo Jung et.al. | 2501.18943 | null |
| 2024-12-20 | SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning | Yuhao Li et.al. | 2412.15577 | null |
| 2025-04-04 | PerLA: Perceptive 3D Language Assistant | Guofeng Mei et.al. | 2411.19774 | link |
| 2024-10-10 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
| 2025-05-19 | A Deeper Look into Second-Order Feature Aggregation for LiDAR Place Recognition | Saimunur Rahman et.al. | 2409.15919 | null |
| 2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
| 2024-10-02 | Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition | Hogyun Kim et.al. | 2408.07330 | null |
| 2024-07-31 | SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition | Raktim Gautam Goswami et.al. | 2407.08260 | null |
| 2024-06-21 | Voxel-Based Point Cloud Localization for Smart Spaces Management | F. S. Mortazavi et.al. | 2406.15110 | null |
| 2024-10-09 | PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture | T. Barros et.al. | 2405.19038 | null |
| 2024-05-14 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966 | link |
| 2025-03-14 | VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition | Yun-Jin Li et.al. | 2403.14594 | null |
| 2024-08-30 | Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests | Haedam Oh et.al. | 2403.14326 | null |
| 2024-02-27 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | null |
| 2024-03-19 | HeLiPR: Heterogeneous LiDAR Dataset for inter-LiDAR Place Recognition under Spatiotemporal Variations | Minwoo Jung et.al. | 2309.14590 | null |
| 2023-08-25 | VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition | Gengxuan Tian et.al. | 2308.12870 | null |
| 2024-09-25 | SelFLoc: Selective Feature Fusion for Large-scale Point Cloud-based Place Recognition | Qibo Qiu et.al. | 2306.01205 | null |
| 2025-06-26 | BEVPlace: Learning LiDAR-based Place Recognition using Bird’s Eye View Images | Lun Luo et.al. | 2302.14325 | null |
| 2023-11-14 | Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map | Haodong Yuan et.al. | 2206.03062 | null |
| 2022-11-30 | InCloud: Incremental Learning for Point Cloud Place Recognition | Joshua Knights et.al. | 2203.00807 | link |
| 2025-06-26 | BVMatch: Lidar-based Place Recognition Using Bird’s-eye View Images | Lun Luo et.al. | 2109.00317 | null |
| 2021-04-15 | MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition | Jacek Komorowski et.al. | 2104.05327 | link |
| 2021-04-23 | Robust Place Recognition using an Imaging Lidar | Tixiao Shan et.al. | 2103.02111 | null |
| 2021-06-21 | Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning | Huan Yin et.al. | 2102.04960 | link |
| 2021-08-05 | A Registration-aided Domain Adaptation Network for 3D Point Cloud Based Place Recognition | Zhijian Qiao et.al. | 2012.05018 | null |
| 2020-08-04 | PIC-Net: Point Cloud and Image Collaboration Network for Large-Scale Place Recognition | Yuheng Lu et.al. | 2008.00658 | null |
| 2020-07-06 | LOL: Lidar-Only Odometry and Localization in 3D Point Cloud Maps | David Rozenberszki et.al. | 2007.01595 | link |

(<a href=#updated-on-20260404>back to top</a>)

| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-03-08 | TAPFormer: Robust Arbitrary Point Tracking via Transient Asynchronous Fusion of Frames and Events | Jiaxiong Liu et.al. | 2603.04989 | null |
| 2026-01-07 | SpatiaLoc: Leveraging Multi-Level Spatial Enhanced Descriptors for Cross-Modal Localization | Tianyi Shang et.al. | 2601.03579 | null |
| 2025-12-05 | Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures | Amirkia Rafiei Oskooei et.al. | 2512.05908 | null |
| 2025-12-04 | Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models | Manar Alnaasan et.al. | 2512.04425 | null |
| 2025-12-02 | GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection | Md Sohag Mia et.al. | 2512.02991 | null |
| 2025-12-02 | Reasoning-Aware Multimodal Fusion for Hateful Video Detection | Shuonan Yang et.al. | 2512.02743 | null |
| 2025-12-02 | GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization | Zixuan Song et.al. | 2512.02697 | null |
| 2025-12-01 | TBT-Former: Learning Temporal Boundary Distributions for Action Localization | Thisara Rathnayaka et.al. | 2512.01298 | null |
| 2025-11-29 | CourseTimeQA: A Lecture-Video Benchmark and a Latency-Constrained Cross-Modal Fusion Method for Timestamped QA | Vsevolod Kovalev et.al. | 2512.00360 | null |
| 2025-11-27 | MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning | Kyeongha Rho et.al. | 2512.00115 | null |
| 2025-11-28 | Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records | Shiyu Shen et.al. | 2511.22958 | null |
| 2025-11-27 | Enhanced Graph Convolutional Network with Chebyshev Spectral Graph and Graph Attention for Autism Spectrum Disorder Classification | Adnan Ferdous Ashrafi et.al. | 2511.22178 | link |
| 2025-11-28 | Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy | Teng Hu et.al. | 2511.21579 | null |
| 2025-11-26 | Semantic-Enhanced Feature Matching with Learnable Geometric Verification for Cross-Modal Neuron Registration | Wenwei Li et.al. | 2511.21452 | null |
| 2025-11-25 | Prompt-Aware Adaptive Elastic Weight Consolidation for Continual Learning in Medical Vision-Language Models | Ziyuan Gao et.al. | 2511.20732 | null |
| 2025-11-25 | ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis | Advik Sinha et.al. | 2511.20274 | null |
| 2025-11-25 | ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction | Yuanzhe Li et.al. | 2511.20020 | null |
| 2025-11-24 | Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach | Fan Nie et.al. | 2511.19080 | null |
| 2025-11-24 | AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization | Christos Koutlis et.al. | 2511.18993 | null |
| 2025-11-24 | A Theory-Inspired Framework for Few-Shot Cross-Modal Sketch Person Re-Identification | Yunpeng Gong et.al. | 2511.18677 | null |
| 2025-11-22 | CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking | Hao Li et.al. | 2511.17967 | null |
| 2025-11-21 | Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics | Wei Zhang et.al. | 2511.17685 | null |
| 2025-11-21 | Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers | Cris Claessens et.al. | 2511.17209 | null |
| 2025-11-21 | Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition | Aditya Mishra et.al. | 2511.17183 | null |
| 2025-11-19 | Multi-Text Guided Few-Shot Semantic Segmentation | Qiang Jiao et.al. | 2511.15515 | null |
| 2025-11-19 | Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language | Yan Xia et.al. | 2511.15308 | null |
| 2025-11-18 | NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration | Luohong Wu et.al. | 2511.14286 | null |
| 2025-11-18 | SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts | Fan Zhang et.al. | 2511.14093 | null |
| 2025-11-17 | Attention Grounded Enhancement for Visual Document Retrieval | Wanqing Cui et.al. | 2511.13415 | null |
| 2025-11-17 | Uncovering and Mitigating Transient Blindness in Multimodal Model Editing | Xiaoqi Han et.al. | 2511.13243 | null |
| 2025-11-17 | 3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale | Yijia Fan et.al. | 2511.13211 | null |
| 2025-11-17 | SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration | Haodong Wang et.al. | 2511.13168 | null |
| 2025-11-15 | FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention | Peng Zhang et.al. | 2511.12215 | link |
| 2025-11-15 | Calibrated Multimodal Representation Learning with Missing Modalities | Xiaohao Liu et.al. | 2511.12034 | null |
| 2025-11-14 | DEFT-LLM: Disentangled Expert Feature Tuning for Micro-Expression Recognition | Ren Zhang et.al. | 2511.10948 | null |
| 2025-11-13 | Frequency-Aware Vision-Language Multimodality Generalization Network for Remote Sensing Image Classification | Junjie Zhang et.al. | 2511.10774 | null |
| 2025-11-13 | URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding | Yongxin Shi et.al. | 2511.10552 | null |
| 2025-11-13 | Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization | Ashutosh Anshul et.al. | 2511.10212 | null |
| 2025-11-13 | HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction | Yueran Zhao et.al. | 2511.10211 | null |
| 2025-11-13 | Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction | Mingda Jia et.al. | 2511.10134 | null |
| 2025-11-14 | FreDFT: Frequency Domain Fusion Transformer for Visible-Infrared Object Detection | Wencong Wu et.al. | 2511.10046 | null |
| 2025-11-12 | BronchOpt: Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation | Hongchao Shu et.al. | 2511.09443 | null |
| 2025-11-12 | xHAP: Cross-Modal Attention for Haptic Feedback Estimation in the Tactile Internet | Georgios Kokkinis et.al. | 2511.09137 | null |
| 2025-11-11 | Multi-modal Deepfake Detection and Localization with FPN-Transformer | Chende Zheng et.al. | 2511.08031 | null |
| 2025-11-11 | Cross Modal Fine-grained Alignment via Granularity-aware and Region-uncertain Modeling | Jiale Liu et.al. | 2511.07710 | link |
| 2025-11-10 | Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding | Yuzhen Li et.al. | 2511.06908 | null |
| 2025-11-10 | Semi-distributed Cross-modal Air-Ground Relative Localization | Weining Lu et.al. | 2511.06749 | null |
| 2025-11-09 | Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation | Tzu-Jung Lin et.al. | 2511.06240 | null |
| 2025-11-04 | C3-Diff: Super-resolving Spatial Transcriptomics via Cross-modal Cross-content Contrastive Diffusion Modelling | Xiaofei Wang et.al. | 2511.05571 | null |
| 2025-11-06 | DINOv2 Driven Gait Representation Learning for Video-Based Visible-Infrared Person Re-identification | Yujie Yang et.al. | 2511.04281 | null |
| 2025-11-06 | CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation | Yuwen Tao et.al. | 2511.03992 | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | null |
| 2025-11-03 | 3EED: Ground Everything Everywhere in 3D | Rong Li et.al. | 2511.01755 | null |
| 2025-11-03 | SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment | Xinyu Mao et.al. | 2511.01390 | null |
| 2025-11-02 | Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya | Hassan Ugail et.al. | 2511.01000 | null |
| 2025-11-02 | VesSAM: Efficient Multi-Prompting for Segmenting Complex Vessel | Suzhong Fu et.al. | 2511.00981 | null |
| 2025-10-24 | A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization | LinFeng Li et.al. | 2510.20291 | null |
| 2025-10-20 | Closed-Loop Transfer for Weakly-supervised Affordance Grounding | Jiajin Tang et.al. | 2510.17384 | null |
| 2025-09-27 | AttAnchor: Guiding Cross-Modal Token Alignment in VLMs with Attention Anchors | Junyang Zhang et.al. | 2509.23109 | null |
| 2025-09-30 | InterKey: Cross-modal Intersection Keypoints for Global Localization on OpenStreetMap | Nguyen Hoang Khoi Tran et.al. | 2509.13857 | null |
| 2025-12-28 | Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval | Hao Yin et.al. | 2509.13754 | null |
| 2025-09-16 | Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization | Yujia Lin et.al. | 2509.13474 | null |
| 2025-09-12 | TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion | Xiaodong Guo et.al. | 2509.10005 | null |
| 2025-09-09 | Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark | Yandi Yang et.al. | 2509.07362 | null |
| 2025-10-10 | SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization | Hanjun Kim et.al. | 2506.15175 | null |
| 2025-08-27 | OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion | Shuhao Kang et.al. | 2504.19258 | null |
| 2024-12-19 | Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration | Ziheng Zhou et.al. | 2412.12628 | null |
| 2024-12-02 | Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures | Qiyuan Shen et.al. | 2412.01299 | null |
| 2024-11-02 | X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios | Yichen Xie et.al. | 2411.01123 | null |
| 2025-05-12 | Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization | Ling Xing et.al. | 2409.07967 | null |
| 2025-02-24 | MambaPlace: Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms | Tianyi Shang et.al. | 2408.15740 | null |
| 2024-06-26 | Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2406.17679 | null |
| 2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429 | null |
| 2024-04-27 | Instance-free Text to Point Cloud Localization with Relative Position Awareness | Lichao Wang et.al. | 2404.17845 | null |
| 2024-07-15 | SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Yang Miao et.al. | 2404.00469 | null |
| 2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | null |
| 2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
| 2023-09-20 | Sound Source Localization is All about Cross-Modal Alignment | Arda Senocak et.al. | 2309.10724 | null |
| 2023-10-17 | Counterfactual Cross-modality Reasoning for Weakly Supervised Video Moment Localization | Zezhong Lv et.al. | 2308.05648 | link |
| 2023-06-06 | Energy-Based Models for Cross-Modal Localization using Convolutional Transformers | Alan Wu et.al. | 2306.04021 | null |
| 2023-05-07 | Poses as Queries: Image-to-LiDAR Map Localization with Transformers | Jinyu Miao et.al. | 2305.04298 | null |
| 2023-03-23 | Egocentric Audio-Visual Object Localization | Chao Huang et.al. | 2303.13471 | null |
| 2023-02-20 | Champion Solution for the WSDM2023 Toloka VQA Challenge | Shengyi Gao et.al. | 2301.09045 | null |
| 2023-01-13 | Text to Point Cloud Localization with Relation-Enhanced Transformer | Guangzhi Wang et.al. | 2301.05372 | link |
| 2022-12-06 | Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds | Zhipeng Zhao et.al. | 2212.02757 | null |
| 2022-10-31 | Visual Answer Localization with Cross-modal Mutual Knowledge Transfer | Yixuan Weng et.al. | 2210.14823 | null |
| 2022-08-04 | Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos | Juncheng Li et.al. | 2208.01954 | null |
| 2023-01-18 | CSDN: Cross-modal Shape-transfer Dual-refinement Network for Point Cloud Completion | Zhe Zhu et.al. | 2208.00751 | null |
| 2022-04-06 | Text2Pos: Text-to-Point-Cloud Cross-Modal Localization | Manuel Kolmet et.al. | 2203.15125 | null |
| 2022-02-15 | Visual Sound Localization in the Wild by Cross-Modal Interference Erasing | Xian Liu et.al. | 2202.06406 | null |
| 2021-08-18 | Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences | Hyunjong Park et.al. | 2108.07422 | null |
| 2021-07-28 | Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization | Fa-Ting Hong et.al. | 2107.12589 | null |
| 2020-09-15 | RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization | Niluthpol Chowdhury Mithun et.al. | 2009.05695 | null |

(<a href=#updated-on-20260404>back to top</a>)

| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-04-02 | GEMM-GS: Accelerating 3D Gaussian Splatting on Tensor Cores with GEMM-Compatible Blending | Haomin Li et.al. | 2604.02120 | null |
| 2026-04-02 | ProDiG: Progressive Diffusion-Guided Gaussian Splatting for Aerial to Ground Reconstruction | Sirshapan Mitra et.al. | 2604.02003 | null |
| 2026-04-02 | Resonance4D: Frequency-Domain Motion Supervision for Preset-Free Physical Parameter Learning in 4D Dynamic Physical Scene Simulation | Changshe Zhang et.al. | 2604.01994 | null |
| 2026-04-02 | GS^2: Graph-based Spatial Distribution Optimization for Compact 3D Gaussian Splatting | Xianben Yang et.al. | 2604.01884 | null |
| 2026-04-02 | FaCT-GS: Fast and Scalable CT Reconstruction with Gaussian Splatting | Pawel Tomasz Pieta et.al. | 2604.01844 | null |
| 2026-04-02 | Director: Instance-aware Gaussian Splatting for Dynamic Scene Modeling and Understanding | Yuheng Jiang et.al. | 2604.01678 | null |
| 2026-04-02 | F3DGS: Federated 3D Gaussian Splatting for Decentralized Multi-Agent World Modeling | Morui Zhu et.al. | 2604.01605 | null |
| 2026-04-02 | Satellite-Free Training for Drone-View Geo-Localization | Tao Liu et.al. | 2604.01581 | null |
| 2026-04-02 | ColorGradedGaussians: Palette-Based Color Grading for 3D Gaussian Splatting via View-Space Sparse Decomposition | Cheng-Kang Ted Chao et.al. | 2604.01551 | null |
| 2026-04-01 | Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars | Derek Austin et.al. | 2604.01447 | null |
| 2026-04-01 | LESV: Language Embedded Sparse Voxel Fusion for Open-Vocabulary 3D Scene Understanding | Fusang Wang et.al. | 2604.01388 | null |
| 2026-04-01 | Neural Harmonic Textures for High-Quality Primitive Based Neural Reconstruction | Jorge Condor et.al. | 2604.01204 | null |
| 2026-04-01 | Diff3R: Feed-forward 3D Gaussian Splatting with Uncertainty-aware Differentiable Optimization | Yueh-Cheng Liu et.al. | 2604.01030 | null |
| 2026-04-01 | Autoregressive Appearance Prediction for 3D Gaussian Avatars | Michael Steiner et.al. | 2604.00928 | null |
| 2026-04-01 | Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM | Monica M. Q. Li et.al. | 2604.00804 | null |
| 2026-04-01 | DirectFisheye-GS: Enabling Native Fisheye Input in Gaussian Splatting with Cross-View Joint Optimization | Zhengxian Yang et.al. | 2604.00648 | null |
| 2026-04-01 | TRiGS: Temporal Rigid-Body Motion for Scalable 4D Gaussian Splatting | Suwoong Yeom et.al. | 2604.00538 | null |
| 2026-04-01 | RT-GS: Gaussian Splatting with Reflection and Transmittance Primitives | Kunnong Zeng et.al. | 2604.00509 | null |
| 2026-04-01 | ARGS: Auto-Regressive Gaussian Splatting via Parallel Progressive Next-Scale Prediction | Quanyuan Ruan et.al. | 2604.00494 | null |
| 2026-03-31 | GRVS: a Generalizable and Recurrent Approach to Monocular Dynamic View Synthesis | Thomas Tanay et.al. | 2603.29734 | null |
| 2026-03-31 | Adversarial Prompt Injection Attack on Multimodal Large Language Models | Meiwen Ding et.al. | 2603.29418 | null |
| 2026-03-31 | AA-Splat: Anti-Aliased Feed-forward Gaussian Splatting | Taewoo Suh et.al. | 2603.29394 | null |
| 2026-03-31 | MotionScale: Reconstructing Appearance, Geometry, and Motion of Dynamic Scenes with Scalable 4D Gaussian Splatting | Haoran Zhou et.al. | 2603.29296 | null |
| 2026-03-31 | LightHarmony3D: Harmonizing Illumination and Shadows for Object Insertion in 3D Gaussian Splatting | Tianyu Huang et.al. | 2603.29209 | null |
| 2026-03-31 | Efficient Camera Pose Augmentation for View Generalization in Robotic Policy Learning | Sen Wang et.al. | 2603.29192 | null |
| 2026-03-31 | Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting | Huaqi Tao et.al. | 2603.29185 | null |
| 2026-03-31 | LG-HCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting | Xuan Deng et.al. | 2603.28431 | null |
| 2026-03-30 | ObjectMorpher: 3D-Aware Image Editing via Deformable 3DGS Models | Yuhuan Xie et.al. | 2603.28152 | null |
| 2026-03-30 | SVGS: Single-View to 3D Object Editing via Gaussian Splatting | Pengcheng Xue et.al. | 2603.28126 | null |
| 2026-03-30 | 4DSurf: High-Fidelity Dynamic Scene Surface Reconstruction | Renjie Wu et.al. | 2603.28064 | null |
| 2026-03-30 | Physically Inspired Gaussian Splatting for HDR Novel View Synthesis | Huimin Zeng et.al. | 2603.28020 | null |
| 2026-03-29 | GS3LAM: Gaussian Semantic Splatting SLAM | Linfei Li et.al. | 2603.27781 | null |
| 2026-03-31 | SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering | Jiahao Niu et.al. | 2603.27516 | null |
| 2026-03-28 | DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification | Kenji Tojo et.al. | 2603.27151 | null |
| 2026-03-26 | arg-VU: Affordance Reasoning with Physics-Aware 3D Geometry for Visual Understanding in Robotic Surgery | Nan Xiao et.al. | 2603.26814 | null |
| 2026-03-27 | Detailed Geometry and Appearance from Opportunistic Motion | Ryosuke Hirai et.al. | 2603.26665 | null |
| 2026-03-27 | Drive-Through 3D Vehicle Exterior Reconstruction via Dynamic-Scene SfM and Distortion-Aware Gaussian Splatting | Nitin Kulkarni et.al. | 2603.26638 | null |
| 2026-03-27 | Scene Grounding In the Wild | Tamir Cohen et.al. | 2603.26584 | null |
| 2026-03-27 | GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport | Youngju Na et.al. | 2603.26181 | null |
| 2026-03-27 | R-PGA: Robust Physical Adversarial Camouflage Generation via Relightable 3D Gaussian Splatting | Tianrui Lou et.al. | 2603.26067 | null |
| 2026-03-26 | Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting | Yixing Lao et.al. | 2603.25745 | null |
| 2026-03-26 | ViewSplat: View-Adaptive Dynamic Gaussian Splatting for Feed-Forward Synthesis | Moonyeon Jeong et.al. | 2603.25265 | null |
| 2026-03-26 | AirSplat: Alignment and Rating for Robust Feed-Forward 3D Gaussian Splatting | Minh-Quan Viet Bui et.al. | 2603.25129 | null |
| 2026-03-26 | Learning Explicit Continuous Motion Representation for Dynamic Gaussian Splatting from Monocular Videos | Xuankai Zhang et.al. | 2603.25058 | null |
| 2026-03-27 | GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator | Liyuan Zhu et.al. | 2603.25053 | null |
| 2026-03-26 | MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes | Wonjoon Lee et.al. | 2603.25042 | null |
| 2026-03-26 | $π$, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation | Johnathan Tucker et.al. | 2603.25038 | null |
| 2026-03-26 | Relaxed Rigidity with Ray-based Grouping for Dynamic Gaussian Splatting | Junoh Leea et.al. | 2603.24994 | null |
| 2026-03-25 | Confidence-Based Mesh Extraction from 3D Gaussians | Lukas Radl et.al. | 2603.24725 | null |
| 2026-03-25 | Accurate Point Measurement in 3DGS – A New Alternative to Traditional Stereoscopic-View Based Measurements | Deyan Deng et.al. | 2603.24716 | null |
| 2026-03-25 | ViHOI: Human-Object Interaction Synthesis with Visual Priors | Songjin Cai et.al. | 2603.24383 | null |
| 2026-03-25 | SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision | Avigail Cohen Rimon et.al. | 2603.24036 | null |
| 2026-03-25 | FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting | Yixian Wang et.al. | 2603.23891 | null |
| 2026-03-24 | AdvSplat: Adversarial Attacks on Feed-Forward Gaussian Splatting Models | Yiran Qiao et.al. | 2603.23686 | null |
| 2026-03-26 | Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting | Peiyu Xu et.al. | 2603.23637 | null |
| 2026-03-26 | Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors | Chuanqing Zhuang et.al. | 2603.23324 | null |
| 2026-03-23 | Drop-In Perceptual Optimization for 3D Gaussian Splatting | Ezgi Ozyilkan et.al. | 2603.23297 | null |
| 2026-03-24 | GTLR-GS: Geometry-Texture Aware LiDAR-Regularized 3D Gaussian Splatting for Realistic Scene Reconstruction | Yan Fang et.al. | 2603.23192 | null |
| 2026-03-24 | PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding | Lirong Che et.al. | 2603.22796 | null |
| 2026-03-25 | Instrument-Splatting++: Towards Controllable Surgical Instrument Digital Twin Using Gaussian Splatting | Shuojue Yang et.al. | 2603.22792 | null |
| 2026-03-24 | Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis | Chamuditha Jayanga Galappaththige et.al. | 2603.22786 | null |
| 2026-03-23 | FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario | Hang Dai et.al. | 2603.22102 | null |
| 2026-03-23 | Fast undersampled dynamic MRI reconstruction using explicit representation learning with Gaussian splatting | M. L. Terpstra et.al. | 2603.21980 | null |
| 2026-03-23 | Cross-Instance Gaussian Splatting Registration via Geometry-Aware Feature-Guided Alignment | Roy Amoyal et.al. | 2603.21936 | null |
| 2026-03-23 | Camera-Agnostic Pruning of 3D Gaussian Splats via Descriptor-Based Beta Evidence | Peter Fasogbon et.al. | 2603.21933 | null |
| 2026-03-23 | RefracGS: Novel View Synthesis Through Refractive Water Surfaces with 3D Gaussian Ray Tracing | Yiming Shao et.al. | 2603.21695 | null |
| 2026-03-22 | EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization | Haolan Xu et.al. | 2603.21332 | null |
| 2026-03-25 | F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting | Injae Kim et.al. | 2603.21304 | null |
| 2026-03-22 | CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs | Shanmukha Vellamcheti et.al. | 2603.21114 | null |
| 2026-03-24 | 2Xplat: Two Experts Are Better Than One Generalist | Hwasik Jeong et.al. | 2603.21064 | null |
| 2026-03-22 | SGAD-SLAM: Splatting Gaussians at Adjusted Depth for Better Radiance Fields in RGBD SLAM | Pengchong Hu et.al. | 2603.21055 | null |
| 2026-03-21 | Fast and Robust Deformable 3D Gaussian Splatting | Han Jiao et.al. | 2603.20857 | null |
| 2026-03-21 | The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting | Ivan Desiatov et.al. | 2603.20714 | null |
| 2026-03-21 | GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction | Di Kong et.al. | 2603.20611 | null |
| 2026-03-20 | Nevis Digital Twin: Photogrammetry and Immersive Visualization of Historical Sites | Alex Apffel et.al. | 2603.20560 | null |
| 2026-03-20 | TRGS-SLAM: IMU-Aided Gaussian Splatting SLAM for Blurry, Rolling Shutter, and Noisy Thermal Images | Spencer Carmichael et.al. | 2603.20443 | null |
| 2026-03-20 | Fourier Splatting: Generalized Fourier encoded primitives for scalable radiance fields | Mihnea-Bogdan Jurca et.al. | 2603.19834 | null |
| 2026-03-20 | HUGE-Bench: A Benchmark for High-Level UAV Vision-Language-Action Tasks | Jingyu Guo et.al. | 2603.19822 | null |
| 2026-03-20 | 3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction | Takeshi Noda et.al. | 2603.19682 | null |
| 2026-03-20 | StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention | Zhongrui Yu et.al. | 2603.19552 | null |
| 2026-03-20 | Matryoshka Gaussian Splatting | Zhilin Guo et.al. | 2603.19234 | null |
| 2026-03-19 | Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting | Yiren Lu et.al. | 2603.19193 | null |
| 2026-03-19 | GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning | Yiren Lu et.al. | 2603.19137 | null |
| 2026-03-19 | GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting | Ahmed Tawfik Aboukhadra et.al. | 2603.18912 | null |
| 2026-03-19 | From ex(p) to poly: Gaussian Splatting with Polynomial Kernels | Joerg H. Mueller et.al. | 2603.18707 | null |
| 2026-03-19 | OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting | Hongjia Zhai et.al. | 2603.18510 | null |
| 2026-03-19 | Inst4DGS: Instance-Decomposed 4D Gaussian Splatting with Multi-Video Label Permutation Learning | Yonghan Lee et.al. | 2603.18402 | null |
| 2026-03-18 | Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting | Guillem Casadesus Vila et.al. | 2603.18218 | null |
| 2026-03-18 | AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors | Aymen Mir et.al. | 2603.17975 | null |
| 2026-03-18 | CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image | Yizheng Song et.al. | 2603.17779 | null |
| 2026-03-18 | ReLaGS: Relational Language Gaussian Splatting | Yaxu Xie et.al. | 2603.17605 | null |
| 2026-03-18 | UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images | Guibiao Liao et.al. | 2603.17519 | null |
| 2026-03-18 | A Tutorial on Learning-Based Radio Map Construction: Data, Paradigms, and Physics-Awareness | Xiucheng Wang et.al. | 2603.17499 | null |
| 2026-03-18 | Adaptive Anchor Policies for Efficient 4D Gaussian Streaming | Ashim Dahal et.al. | 2603.17227 | null |
| 2026-03-17 | SMAL-pets: SMAL Based Avatars of Pets from Single Image | Piotr Borycki et.al. | 2603.17131 | null |
| 2026-03-16 | KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition | Yuhan Chen et.al. | 2603.16943 | link |
| 2026-03-17 | M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM | Kerui Ren et.al. | 2603.16844 | null |
| 2026-03-17 | Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty | Mangyu Kong et.al. | 2603.16538 | null |
| 2026-03-17 | Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation | Yiming Huang et.al. | 2603.16211 | null |
| 2026-03-17 | NanoGS: Training-Free Gaussian Splat Simplification | Butian Xiong et.al. | 2603.16103 | null |
| 2026-03-16 | Feed-forward Gaussian Registration for Head Avatar Creation and Editing | Malte Prinzler et.al. | 2603.15811 | null |
| 2026-03-16 | IRIS: Intersection-aware Ray-based Implicit Editable Scenes | Grzegorz Wilczyński et.al. | 2603.15368 | null |
| 2026-03-16 | NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation | Jiahang Liu et.al. | 2603.15186 | null |
| 2026-03-16 | GeoNVS: Geometry Grounded Video Diffusion for Novel View Synthesis | Minjun Kang et.al. | 2603.14965 | null |
| 2026-03-16 | LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision | Yiming Huang et.al. | 2603.14763 | null |
| 2026-03-16 | E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction | Yunsoo Kim et.al. | 2603.14684 | null |
| 2026-03-15 | Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting | Shuai Guo et.al. | 2603.14316 | null |
| 2026-03-15 | In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels | Tomislav Medic et.al. | 2603.14309 | null |
| 2026-03-15 | 4D Synchronized Fields: Motion-Language Gaussian Splatting for Temporal Scene Understanding | Mohamed Rayan Barhdadi et.al. | 2603.14301 | null |
| 2026-03-15 | S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction | Renhe Zhang et.al. | 2603.14232 | null |
| 2026-03-14 | PhyGaP: Physically-Grounded Gaussians with Polarization Cues | Jiale Wu et.al. | 2603.14001 | null |
| 2026-03-14 | Scene Generation at Absolute Scale: Utilizing Semantic and Geometric Guidance From Text for Accurate and Interpretable 3D Indoor Scene Generation | Stefan Ainetter et.al. | 2603.13910 | null |
| 2026-03-14 | RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting | Xuezhen Wang et.al. | 2603.13783 | null |
| 2026-03-13 | NumColor: Precise Numeric Color Control in Text-to-Image Generation | Muhammad Atif Butt et.al. | 2603.13547 | null |
| 2026-03-13 | SldprtNet: A Large-Scale Multimodal Dataset for CAD Generation in Language-Driven 3D Design | Ruogu Li et.al. | 2603.13098 | null |
| 2026-03-13 | Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting | Yang Chen et.al. | 2603.12796 | null |
| 2026-03-13 | LR-SGS: Robust LiDAR-Reflectance-Guided Salient Gaussian Splatting for Self-Driving Scene Reconstruction | Ziyu Chen et.al. | 2603.12647 | null |
| 2026-03-12 | RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution | Ali Mosleh et.al. | 2603.12493 | null |
| 2026-03-12 | AstroSplat: Physics-Based Gaussian Splatting for Rendering and Reconstruction of Small Celestial Bodies | Jennifer Nolan et.al. | 2603.11969 | null |
| 2026-03-12 | Mango-GS: Enhancing Spatio-Temporal Consistency in Dynamic Scenes Reconstruction using Multi-Frame Node-Guided 4D Gaussian Splatting | Tingxuan Huang et.al. | 2603.11543 | null |
| 2026-03-12 | Mobile-GS: Real-time Gaussian Splatting for Mobile Devices | Xiaobiao Du et.al. | 2603.11531 | null |
| 2026-03-11 | InstantHDR: Single-forward Gaussian Splatting for High Dynamic Range 3D Reconstruction | Dingqiang Ye et.al. | 2603.11298 | null |
| 2026-03-11 | S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs | Yuzhou Ji et.al. | 2603.10893 | link |
| 2026-03-11 | PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction | Yufei Han et.al. | 2603.10801 | null |
| 2026-03-11 | Splat2Real: Novel-view Scaling for Physical AI with 3D Gaussian Splatting | Hansol Lim et.al. | 2603.10638 | null |
| 2026-03-11 | P-GSVC: Layered Progressive 2D Gaussian Splatting for Scalable Image and Video | Longan Wang et.al. | 2603.10551 | null |
| 2026-03-12 | SignSparK: Efficient Multilingual Sign Language Production via Sparse Keyframe Learning | Jianhe Low et.al. | 2603.10446 | null |
| 2026-03-10 | ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare | Freeman Cheng et.al. | 2603.09968 | null |
| 2026-03-10 | GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System | Zhiye Tang et.al. | 2603.09718 | null |
| 2026-03-10 | ProGS: Towards Progressive Coding for 3D Gaussian Splatting | Zhiye Tang et.al. | 2603.09703 | null |
| 2026-03-10 | VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM | Anh Thuan Tran et.al. | 2603.09673 | null |
| 2026-03-10 | DiffWind: Physics-Informed Differentiable Modeling of Wind-Driven Object Dynamics | Yuanhang Lei et.al. | 2603.09668 | null |
| 2026-03-12 | X-GS: An Extensible Open Framework for Perceiving and Thinking via 3D Gaussian Splatting | Yueen Ma et.al. | 2603.09632 | null |
| 2026-03-10 | IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework | Feiyu Wang et.al. | 2603.09312 | null |
| 2026-03-10 | DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction | Fuzhen Jiang et.al. | 2603.09291 | null |
| 2026-03-10 | Learning Convex Decomposition via Feature Fields | Yuezhi Yang et.al. | 2603.09285 | null |
| 2026-03-10 | Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists | Jiaqi Liu et.al. | 2603.09277 | null |
| 2026-03-09 | SkipGS: Post-Densification Backward Skipping for Efficient 3DGS Training | Jingxing Li et.al. | 2603.08997 | null |
| 2026-03-09 | SurgCalib: Gaussian Splatting-Based Hand-Eye Calibration for Robot-Assisted Minimally Invasive Surgery | Zijian Wu et.al. | 2603.08983 | null |
| 2026-03-09 | Where, What, Why: Toward Explainable 3D-GS Watermarking | Mingshu Cai et.al. | 2603.08809 | null |
| 2026-03-09 | ImprovedGS+: A High-Performance C++/CUDA Re-Implementation Strategy for 3D Gaussian Splatting | Jordi Muñoz Vicente et.al. | 2603.08661 | null |
| 2026-03-09 | Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction | Zhe Yang et.al. | 2603.08503 | null |
| 2026-03-09 | Improving Continual Learning for Gaussian Splatting based Environments Reconstruction on Commercial Off-the-Shelf Edge Devices | Ivan Zaino et.al. | 2603.08499 | null |
| 2026-03-09 | HDR-NSFF: High Dynamic Range Neural Scene Flow Fields | Shin Dong-Yeon et.al. | 2603.08313 | null |
| 2026-03-09 | DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving | Zhuolin He et.al. | 2603.08254 | null |
| 2026-03-08 | SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation | Zixuan Pan et.al. | 2603.07789 | null |
| 2026-03-08 | Ref-DGS: Reflective Dual Gaussian Splatting | Ningjing Fan et.al. | 2603.07664 | null |
| 2026-03-08 | Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence | Yuanyuan Gao et.al. | 2603.07660 | null |
| 2026-03-08 | EmbedTalk: Triplane-Free Talking Head Synthesis using Embedding-Driven Gaussian Deformation | Arpita Saggar et.al. | 2603.07604 | null |
| 2026-03-08 | 3DGS-HPC: Distractor-free 3D Gaussian Splatting with Hybrid Patch-wise Classification | Jiahao Chen et.al. | 2603.07587 | null |
| 2026-03-08 | ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction | Haibao Yu et.al. | 2603.07552 | null |
| 2026-03-07 | MipSLAM: Alias-Free Gaussian Splatting SLAM | Yingzhao Li et.al. | 2603.06989 | null |
| 2026-03-06 | ColonSplat: Reconstruction of Peristaltic Motion in Colonoscopy with Dynamic Gaussian Splatting | Weronika Smolak-Dyżewska et.al. | 2603.06860 | null |
| 2026-03-06 | Active View Selection with Perturbed Gaussian Ensemble for Tomographic Reconstruction | Yulun Wu et.al. | 2603.06852 | null |
| 2026-03-06 | EntON: Eigenentropy-Optimized Neighborhood Densification in 3D Gaussian Splatting | Miriam Jäger et.al. | 2603.06216 | null |
| 2026-03-06 | VG3S: Visual Geometry Grounded Gaussian Splatting for Semantic Occupancy Prediction | Xiaoyang Yan et.al. | 2603.06210 | null |
| 2026-03-06 | Transforming Omnidirectional RGB-LiDAR data into 3D Gaussian Splatting | Semin Bae et.al. | 2603.06061 | null |
| 2026-03-06 | FTSplat: Feed-forward Triangle Splatting Network | Xiong Jinlin et.al. | 2603.05932 | null |
| 2026-03-06 | CylinderSplat: 3D Gaussian Splatting with Cylindrical Triplanes for Panoramic Novel View Synthesis | Qiwei Wang et.al. | 2603.05882 | null |
| 2026-03-05 | Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups | Leif Van Holland et.al. | 2603.05507 | null |
| 2026-03-05 | SSR-GS: Separating Specular Reflection in Gaussian Splatting for Glossy Surface Reconstruction | Ningjing Fan et.al. | 2603.05152 | null |
| 2026-03-05 | GaussTwin: Unified Simulation and Correction with Gaussian Splatting for Robotic Digital Twins | Yichen Cai et.al. | 2603.05108 | null |
| 2026-03-05 | GloSplat: Joint Pose-Appearance Optimization for Faster and More Accurate 3D Reconstruction | Tianyu Xiong et.al. | 2603.04847 | null |
| 2026-03-05 | DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction | Shiyu Zhang et.al. | 2603.04770 | null |
| 2026-03-03 | VIRGi: View-dependent Instant Recoloring of 3D Gaussians Splats | Alessio Mazzucchelli et.al. | 2603.02986 | null |
| 2026-03-03 | Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting | Kaiqiang Xiong et.al. | 2603.02893 | null |
| 2026-03-04 | Generalized non-exponential Gaussian splatting | Sébastien Speierer et.al. | 2603.02887 | null |
| 2026-03-03 | Multimodal-Prior-Guided Importance Sampling for Hierarchical Gaussian Splatting in Sparse-View Novel View Synthesis | Kaiqiang Xiong et.al. | 2603.02866 | null |
| 2026-03-03 | R3GW: Relightable 3D Gaussians for Outdoor Scenes in the Wild | Margherita Lea Corona et.al. | 2603.02801 | null |
| 2026-03-03 | SemGS: Feed-Forward Semantic 3D Gaussian Splatting from Sparse Views for Generalizable Scene Understanding | Sheng Ye et.al. | 2603.02548 | null |
| 2026-03-03 | OnlineX: Unified Online 3D Reconstruction and Understanding with Active-to-Stable State Evolution | Chong Xia et.al. | 2603.02134 | null |
| 2026-03-02 | LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation | Hualiang Wei et.al. | 2603.02129 | null |
| 2026-03-02 | Sparse View Distractor-Free Gaussian Splatting | Yi Gu et.al. | 2603.01603 | null |
| 2026-03-02 | Radiometrically Consistent Gaussian Surfels for Inverse Rendering | Kyu Beom Han et.al. | 2603.01491 | null |
| 2026-03-01 | FLICKER: A Fine-Grained Contribution-Aware Accelerator for Real-Time 3D Gaussian Splatting | Wenhui Ou et.al. | 2603.01158 | null |
| 2026-03-01 | D-REX: Differentiable Real-to-Sim-to-Real Engine for Learning Dexterous Grasping | Haozhe Lou et.al. | 2603.01151 | null |
| 2026-03-03 | HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views | Jiashu Li et.al. | 2603.01099 | null |
| 2026-03-01 | Decoupling Motion and Geometry in 4D Gaussian Splatting | Yi Zhang et.al. | 2603.00952 | null |
| 2026-02-28 | TokenSplat: Token-aligned 3D Gaussian Splatting for Feed-forward Pose-free Reconstruction | Yihui Li et.al. | 2603.00697 | null |
| 2026-02-28 | Zero-Shot Robotic Manipulation via 3D Gaussian Splatting-Enhanced Multimodal Retrieval-Augmented Generation | Zilong Xie et.al. | 2603.00500 | null |
| 2026-02-28 | ArtiFixer: Enhancing and Extending 3D Reconstruction with Auto-Regressive Diffusion Models | Riccardo de Lutio et.al. | 2603.00492 | null |
| 2026-02-28 | Station2Radar: query conditioned gaussian splatting for precipitation field | Doyi Kim et.al. | 2603.00418 | null |
| 2026-02-27 | UFO-4D: Unposed Feedforward 4D Reconstruction from Two Images | Junhwa Hur et.al. | 2602.24290 | null |
| 2026-02-27 | Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives | Haoran Wang et.al. | 2602.24136 | null |
| 2026-02-27 | DiffusionHarmonizer: Bridging Neural Reconstruction and Photorealistic Simulation with Online Diffusion Enhancer | Yuxuan Zhang et.al. | 2602.24096 | null |
| 2026-02-27 | SR3R: Rethinking Super-Resolution 3D Reconstruction With Feed-Forward Gaussian Splatting | Xiang Feng et.al. | 2602.24020 | null |
| 2026-02-27 | Provable Subspace Identification of Nonlinear Multi-view CCA | Zhiwei Han et.al. | 2602.23785 | null |
| 2026-02-27 | No Calibration, No Depth, No Problem: Cross-Sensor View Synthesis with 3D Consistency | Cho-Ying Wu et.al. | 2602.23559 | null |
| 2026-02-26 | Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking | Maximilian Luz et.al. | 2602.23172 | null |
| 2026-02-26 | PackUV: Packed Gaussian UV Maps for 4D Volumetric Video | Aashish Rai et.al. | 2602.23040 | null |
| 2026-02-26 | GSTurb: Gaussian Splatting for Atmospheric Turbulence Mitigation | Hanliang Du et.al. | 2602.22800 | null |
| 2026-02-26 | Sapling-NeRF: Geo-Localised Sapling Reconstruction in Forests for Ecological Monitoring | Miguel Ángel Muñoz-Bañón et.al. | 2602.22731 | null |
| 2026-02-26 | ArtPro: Self-Supervised Articulated Object Reconstruction with Adaptive Integration of Mobility Proposals | Xuelu Li et.al. | 2602.22666 | null |
| 2026-02-26 | BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model | Yuci Han et.al. | 2602.22596 | null |
| 2026-02-26 | GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views | Tianyu Chen et.al. | 2602.22571 | null |
| 2026-02-26 | SwiftNDC: Fast Neural Depth Correction for High-Fidelity 3D Reconstruction | Kang Han et.al. | 2602.22565 | null |
| 2026-02-25 | AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction | Hanyang Liu et.al. | 2602.22376 | null |
| 2026-02-25 | Interactive Augmented Reality-enabled Outdoor Scene Visualization For Enhanced Real-time Disaster Response | Dimitrios Apostolakis et.al. | 2602.21874 | null |
| 2026-02-25 | Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping | Junmyeong Lee et.al. | 2602.21668 | null |
| 2026-02-27 | DAGS-SLAM: Dynamic-Aware 3DGS SLAM via Spatiotemporal Motion Probability and Uncertainty-Aware Scheduling | Li Zhang et.al. | 2602.21644 | null |
| 2026-02-24 | HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles | Yifan Wang et.al. | 2602.21333 | null |
| 2026-02-24 | BrepGaussian: CAD reconstruction from Multi-View Images with Gaussian Splatting | Jiaxing Yu et.al. | 2602.21105 | null |
| 2026-02-24 | Dropping Anchor and Spherical Harmonics for Sparse-view Gaussian Splatting | Shuangkang Fang et.al. | 2602.20933 | null |
| 2026-02-24 | RU4D-SLAM: Reweighting Uncertainty in Gaussian Splatting SLAM for 4D Scene Reconstruction | Yangfan Zhao et.al. | 2602.20807 | link |
| 2026-02-24 | Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization | Yangsen Chen et.al. | 2602.20718 | null |
| 2026-02-24 | Real-time Calibration-free Imaging Through Dynamic and Distinct Multimode Fibers via Spatial Harmonic Invariant Nonlinear Encoding (SHINE) | Zhiyuan Wang et.al. | 2602.20562 | null |
| 2026-02-24 | WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos | Hanhui Li et.al. | 2602.20556 | null |
| 2026-02-23 | Aesthetic Camera Viewpoint Suggestion with 3D Aesthetic Field | Sheyang Tang et.al. | 2602.20363 | null |
| 2026-02-23 | Large-scale Photorealistic Outdoor 3D Scene Reconstruction from UAV Imagery Using Gaussian Splatting Techniques | Christos Maikos et.al. | 2602.20342 | null |
| 2026-02-23 | tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction | Chen Wang et.al. | 2602.20160 | null |
| 2026-02-23 | Augmented Radiance Field: A General Framework for Enhanced Gaussian Splatting | Yixin Yang et.al. | 2602.19916 | null |
| 2026-02-23 | One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image | Pengfei Wang et.al. | 2602.19766 | null |
| 2026-02-23 | RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing | Kaifa Yang et.al. | 2602.19753 | null |
| 2026-02-22 | DefenseSplat: Enhancing the Robustness of 3D Gaussian Splatting via Frequency-Aware Filtering | Yiran Qiao et.al. | 2602.19323 | null |
| 2026-02-21 | Compact Hadamard Latent Codes for Efficient Spectral Rendering | Jiaqi Yu et.al. | 2602.18741 | null |
| 2026-02-20 | Unifying Color and Lightness Correction with View-Adaptive Curve Adjustment for Robust 3D Novel View Synthesis | Ziteng Cui et.al. | 2602.18322 | null |
| 2026-02-20 | Diff2DGS: Reliable Reconstruction of Occluded Surgical Scenes via 2D Gaussian Splatting | Tianyi Song et.al. | 2602.18314 | null |
| 2026-02-19 | 4D Monocular Surgical Reconstruction under Arbitrary Camera Motions | Jiwei Shan et.al. | 2602.17473 | null |
| 2026-02-19 | NRGS-SLAM: Monocular Non-Rigid SLAM for Endoscopy via Deformation-Aware 3D Gaussian Splatting | Jiwei Shan et.al. | 2602.17182 | null |
| 2026-02-19 | B$^3$-Seg: Camera-Free, Training-Free 3DGS Segmentation via Analytic EIG and Beta-Bernoulli Bayesian Updates | Hiromichi Kamata et.al. | 2602.17134 | null |
| 2026-02-19 | 3D Scene Rendering with Multimodal Gaussian Splatting | Chi-Shiang Gau et.al. | 2602.17124 | null |
| 2026-02-19 | i-PhysGaussian: Implicit Physical Simulation for 3D Gaussian Splatting | Yicheng Cao et.al. | 2602.17117 | null |
| 2026-02-17 | Semantic-Guided 3D Gaussian Splatting for Transient Object Removal | Aditi Prabakaran et.al. | 2602.15516 | null |
| 2026-02-17 | DAV-GSWT: Diffusion-Active-View Sampling for Data-Efficient Gaussian Splatting Wang Tiles | Rong Fu et.al. | 2602.15355 | null |
| 2026-02-16 | Time-Archival Camera Virtualization for Sports and Visual Performances | Yunxiao Zhang et.al. | 2602.15181 | null |
| 2026-02-16 | Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery | Chandrakanth Gudavalli et.al. | 2602.14929 | null |
| 2026-02-16 | Gaussian Mesh Renderer for Lightweight Differentiable Rendering | Xinpeng Liu et.al. | 2602.14493 | null |
| 2026-02-15 | Learnable Multi-level Discrete Wavelet Transforms for 3D Gaussian Splatting Frequency Modulation | Hung Nguyen et.al. | 2602.14199 | null |
| 2026-02-14 | High-fidelity 3D reconstruction for planetary exploration | Alfonso Martínez-Petersen et.al. | 2602.13909 | null |
| 2026-02-14 | Human-Aligned Evaluation of a Pixel-wise DNN Color Constancy Model | Hamed Heidari-Gorji et.al. | 2602.13887 | null |
| 2026-02-14 | Joint Orientation and Weight Optimization for Robust Watertight Surface Reconstruction via Dirichlet-Regularized Winding Fields | Jiaze Li et.al. | 2602.13801 | null |
| 2026-02-14 | Nighttime Autonomous Driving Scene Reconstruction with Physically-Based Gaussian Splatting | Tae-Kyeong Kim et.al. | 2602.13549 | null |
| 2026-02-13 | FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation | Huajian Zeng et.al. | 2602.13444 | null |
| 2026-02-13 | GSM-GS: Geometry-Constrained Single and Multi-view Gaussian Splatting for Surface Reconstruction | Xiao Ren et.al. | 2602.12796 | null |
| 2026-02-12 | LatentAM: Real-Time, Large-Scale Latent Gaussian Attention Mapping via Online Dictionary Learning | Junwoon Lee et.al. | 2602.12314 | null |
| 2026-02-12 | 3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting | Wancai Zheng et.al. | 2602.12159 | null |
| 2026-02-12 | GSO-SLAM: Bidirectionally Coupled Gaussian Splatting and Direct Visual Odometry | Jiung Yeon et.al. | 2602.11714 | null |
| 2026-02-12 | TG-Field: Geometry-Aware Radiative Gaussian Fields for Tomographic Reconstruction | Yuxiang Zhong et.al. | 2602.11705 | null |
| 2026-02-13 | Variation-aware Flexible 3D Gaussian Editing | Hao Qin et.al. | 2602.11638 | null |
| 2026-02-12 | LeafFit: Plant Assets Creation from 3D Gaussian Splatting | Chang Luo et.al. | 2602.11577 | null |
| 2026-02-14 | ReaDy-Go: Real-to-Sim Dynamic 3D Gaussian Splatting Simulation for Environment-Specific Visual Navigation with Moving Obstacles | Seungyeon Yoo et.al. | 2602.11575 | null |
| 2026-02-10 | ERGO: Excess-Risk-Guided Optimization for High-Fidelity Monocular 3D Gaussian Splatting | Zehua Ma et.al. | 2602.10278 | null |
| 2026-02-10 | XSPLAIN: XAI-enabling Splat-based Prototype Learning for Attribute-aware INterpretability | Dominik Galus et.al. | 2602.10239 | null |
| 2026-02-10 | ArtisanGS: Interactive Tools for Gaussian Splat Selection with AI and Human in the Loop | Clement Fuji Tsang et.al. | 2602.10173 | null |
| 2026-02-10 | Faster-GS: Analyzing and Improving Gaussian Splatting Optimization | Florian Hahlbohm et.al. | 2602.09999 | link |
| 2026-02-10 | CompSplat: Compression-aware 3D Gaussian Splatting for Real-world Video | Hojun Song et.al. | 2602.09816 | link |
| 2026-02-10 | SciFlow-Bench: Evaluating Structure-Aware Scientific Diagram Generation via Inverse Parsing | Tong Zhang et.al. | 2602.09809 | null |
| 2026-02-10 | Toward Fine-Grained Facial Control in 3D Talking Head Generation | Shaoyang Xie et.al. | 2602.09736 | null |
| 2026-02-10 | Stability and Concentration in Nonlinear Inverse Problems with Block-Structured Parameters: Lipschitz Geometry, Identifiability, and an Application to Gaussian Splatting | Joe-Mei Feng et.al. | 2602.09415 | null |
| 2026-02-10 | Grow with the Flow: 4D Reconstruction of Growing Plants with Gaussian Flow Fields | Weihan Luo et.al. | 2602.08958 | null |
| 2026-02-09 | Analysis of Converged 3D Gaussian Splatting Solutions: Density Effects and Prediction Limit | Zhendong Wang et.al. | 2602.08909 | null |
| 2026-02-09 | GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion | Santiago Montiel-Marín et.al. | 2602.08784 | null |
| 2026-02-09 | Rotated Lights for Consistent and Efficient 2D Gaussians Inverse Rendering | Geng Lin et.al. | 2602.08724 | null |
| 2026-02-09 | Informative Object-centric Next Best View for Object-aware 3D Gaussian Splatting in Cluttered Scenes | Seunghoon Jeong et.al. | 2602.08266 | null |
| 2026-02-08 | Recovering 3D Shapes from Ultra-Fast Motion-Blurred Images | Fei Yu et.al. | 2602.07860 | null |
| 2026-02-11 | Thermal odometry and dense mapping using learned odometry and Gaussian splatting | Tianhao Zhou et.al. | 2602.07493 | null |
| 2026-02-06 | Zero-Shot UAV Navigation in Forests via Relightable 3D Gaussian Splatting | Zinan Lv et.al. | 2602.07101 | null |
| 2026-02-06 | DynFOA: Generating First-Order Ambisonics with Conditional Diffusion for Dynamic and Acoustically Complex 360-Degree Videos | Ziyu Luo et.al. | 2602.06846 | null |
| 2026-02-06 | GaussianPOP: Principled Simplification Framework for Compact 3D Gaussian Splatting via Error Quantification | Soonbin Lee et.al. | 2602.06830 | null |
| 2026-02-06 | Uncertainty-Aware 4D Gaussian Splatting for Monocular Occluded Human Rendering | Weiquan Wang et.al. | 2602.06343 | null |
| 2026-02-05 | From Blurry to Believable: Enhancing Low-quality Talking Heads with 3D Generative Priors | Ding-Jiun Huang et.al. | 2602.06122 | null |
| 2026-02-05 | NVS-HO: A Benchmark for Novel View Synthesis of Handheld Objects | Musawar Ali et.al. | 2602.05822 | null |
| 2026-02-05 | PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction | Ju Shen et.al. | 2602.05190 | null |
| 2026-02-04 | QuantumGS: Quantum Encoding Framework for Gaussian Splatting | Grzegorz Wilczyński et.al. | 2602.05047 | null |
| 2026-02-04 | Nix and Fix: Targeting 1000x Compression of 3D Gaussian Splatting with Diffusion Models | Cem Eteke et.al. | 2602.04549 | null |
| 2026-02-04 | VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image | Teng-Fang Hsiao et.al. | 2602.04349 | null |
| 2026-02-04 | Towards Next-Generation SLAM: A Survey on 3DGS-SLAM Focusing on Performance, Robustness, and Future Directions | Li Wang et.al. | 2602.04251 | null |
| 2026-02-03 | AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting | Joanna Kaleta et.al. | 2602.04043 | null |
| 2026-02-02 | Intellectual Property Protection for 3D Gaussian Splatting Assets: A Survey | Longjie Zhao et.al. | 2602.03878 | null |
| 2026-02-01 | Split&Splat: Zero-Shot Panoptic Segmentation via Explicit Instance Modeling and 3D Gaussian Splatting | Leonardo Monchieri et.al. | 2602.03809 | null |
| 2026-02-03 | Constrained Dynamic Gaussian Splatting | Zihan Zheng et.al. | 2602.03538 | null |
| 2026-02-03 | Pi-GS: Sparse-View Gaussian Splatting with Dense π^3 Initialization | Manuel Hofer et.al. | 2602.03327 | null |
| 2026-02-03 | WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU | Yudong Han et.al. | 2602.03207 | null |
| 2026-02-05 | SharpTimeGS: Sharp and Stable Dynamic Gaussian Splatting via Lifespan Modulation | Zhanfeng Liao et.al. | 2602.02989 | null |
| 2026-02-01 | Position: 3D Gaussian Splatting Watermarking Should Be Scenario-Driven and Threat-Model Explicit | Yangfan Deng et.al. | 2602.02602 | null |
| 2026-02-02 | SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation | Mu Huang et.al. | 2602.02402 | null |
| 2026-02-02 | UrbanGS: A Scalable and Efficient Architecture for Geometrically Accurate Large-Scene Reconstruction | Changbai Li et.al. | 2602.02089 | null |
| 2026-02-03 | SurfSplat: Conquering Feedforward 2D Gaussian Splatting with Surface Continuity Priors | Bing He et.al. | 2602.02000 | null |
| 2026-02-02 | CloDS: Visual-Only Unsupervised Cloth Dynamics Learning in Unknown Conditions | Yuliang Zhan et.al. | 2602.01844 | null |
| 2026-02-02 | CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding | Yuling Shi et.al. | 2602.01785 | null |
| 2026-02-02 | FastPhysGS: Accelerating Physics-based Dynamic 3DGS Simulation via Interior Completion and Adaptive Optimization | Yikun Ma et.al. | 2602.01723 | null |
| 2026-02-02 | VRGaussianAvatar: Integrating 3D Gaussian Avatars into VR | Hail Song et.al. | 2602.01674 | null |
| 2026-02-02 | MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation | Xiaoxi Kong et.al. | 2602.01513 | null |
| 2026-02-01 | Radioactive 3D Gaussian Ray Tracing for Tomographic Reconstruction | Ling Chen et.al. | 2602.01057 | null |
| 2026-01-31 | HPC: Hierarchical Point-based Latent Representation for Streaming Dynamic Gaussian Splatting Compression | Yangzhi Ma et.al. | 2602.00671 | null |
| 2026-01-31 | Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting | Yian Zhao et.al. | 2602.00618 | null |
| 2026-01-31 | PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting | Xin Zhang et.al. | 2602.00463 | null |
| 2026-01-30 | 3DGS$^2$-TR: Scalable Second-Order Trust-Region Method for 3D Gaussian Splatting | Roger Hsiao et.al. | 2602.00395 | null |
| 2026-01-29 | Learning Physics-Grounded 4D Dynamics with Neural Gaussian Force Fields | Shiqian Li et.al. | 2602.00148 | link |
| 2026-01-30 | PaperBanana: Automating Academic Illustration for AI Scientists | Dawei Zhu et.al. | 2601.23265 | null |
| 2026-01-30 | Learning Geometrically-Grounded 3D Visual Representations for View-Generalizable Robotic Manipulation | Di Zhang et.al. | 2601.22988 | null |
| 2026-01-30 | Diachronic Stereo Matching for Multi-Date Satellite Imagery | Elías Masquil et.al. | 2601.22808 | null |
| 2026-01-30 | PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction | Changjian Jiang et.al. | 2601.22046 | null |
| 2026-01-29 | Hybrid Foveated Path Tracing with Peripheral Gaussians for Immersive Anatomy | Constantin Kleinbeck et.al. | 2601.22026 | null |
| 2026-01-28 | FreeFix: Boosting 3D Gaussian Splatting via Fine-Tuning-Free Diffusion Models | Hongyu Zhou et.al. | 2601.20857 | link |
| 2026-01-28 | GRTX: Efficient Ray Tracing for 3D Gaussian-Based Rendering | Junseo Lee et.al. | 2601.20429 | null |
| 2026-01-28 | GVGS: Gaussian Visibility-Aware Multi-View Geometry for Accurate Surface Reconstruction | Mai Su et.al. | 2601.20331 | null |
| 2026-01-27 | Graphical X Splatting (GraphiXS): A Graphical Model for 4D Gaussian Splatting under Uncertainty | Doga Yilmaz et.al. | 2601.19843 | null |
| 2026-01-27 | WaterClear-GS: Optical-Aware Gaussian Splatting for Underwater Reconstruction and Restoration | Xinrui Zhang et.al. | 2601.19753 | null |
| 2026-01-28 | Fast Converging 3D Gaussian Splatting for 1-Minute Reconstruction | Ziyu Zhang et.al. | 2601.19489 | null |
| 2026-01-27 | ClipGS-VR: Immersive and Interactive Cinematic Visualization of Volumetric Medical Data in Mobile Virtual Reality | Yuqi Tong et.al. | 2601.19310 | null |
| 2026-01-27 | TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment | Jiarun Liu et.al. | 2601.19247 | null |
| 2026-01-27 | UniMGS: Unifying Mesh and 3D Gaussian Splatting with Single-Pass Rasterization and Proxy-Based Deformation | Zeyu Xiao et.al. | 2601.19233 | null |
| 2026-01-27 | Bridging Visual and Wireless Sensing: A Unified Radiation Field for 3D Radio Map Construction | Chaozheng Wen et.al. | 2601.19216 | null |
| 2026-01-26 | Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting | Tong Shi et.al. | 2601.18633 | null |
| 2026-01-26 | ExoGS: A 4D Real-to-Sim-to-Real Framework for Scalable Manipulation Data Collection | Yiming Wang et.al. | 2601.18629 | null |
| 2026-01-26 | LoD-Structured 3D Gaussian Splatting for Streaming Video Reconstruction | Xinhui Liu et.al. | 2601.18475 | null |
| 2026-01-27 | Geometry-Grounded Gaussian Splatting | Baowen Zhang et.al. | 2601.17835 | null |
| 2026-01-25 | Advancing Structured Priors for Sparse-Voxel Surface Reconstruction | Ting-Hsun Chi et.al. | 2601.17720 | null |
| 2026-01-28 | PocketGS: On-Device Training of 3D Gaussian Splatting for High Perceptual Modeling | Wenzhi Guo et.al. | 2601.17354 | null |
| 2026-01-23 | LGDWT-GS: Local and Global Discrete Wavelet-Regularized 3D Gaussian Splatting for Sparse-View Scene Reconstruction | Shima Salehi et.al. | 2601.17185 | null |
| 2026-01-26 | A Step to Decouple Optimization in 3DGS | Renjie Ding et.al. | 2601.16736 | null |
| 2026-01-23 | ReWeaver: Towards Simulation-Ready and Topology-Accurate Garment Reconstruction | Ming Li et.al. | 2601.16672 | null |
| 2026-01-22 | EVolSplat4D: Efficient Volume-based Gaussian Splatting for 4D Urban Scene Synthesis | Sheng Miao et.al. | 2601.15951 | null |
| 2026-01-22 | ThermoSplat: Cross-Modal 3D Gaussian Splatting with Feature Modulation and Geometry Decoupling | Zhaoqi Su et.al. | 2601.15897 | null |
| 2026-01-22 | LL-GaussianImage: Efficient Image Representation for Zero-shot Low-Light Enhancement with 2D Gaussian Splatting | Yuhan Chen et.al. | 2601.15772 | null |
| 2026-01-27 | LL-GaussianMap: Zero-shot Low-Light Image Enhancement via 2D Gaussian Splatting Guided Gain Maps | Yuhan Chen et.al. | 2601.15766 | null |
| 2026-01-21 | SplatBus: A Gaussian Splatting Viewer Framework via GPU Interprocess Communication | Yinghan Xu et.al. | 2601.15431 | null |
| 2026-01-21 | LuxRemix: Lighting Decomposition and Remixing for Indoor Scenes | Ruofan Liang et.al. | 2601.15283 | null |
| 2026-01-21 | ScenDi: 3D-to-2D Scene Diffusion Cascades for Urban Generation | Hanlei Guo et.al. | 2601.15221 | null |
| 2026-01-21 | POTR: Post-Training 3DGS Compression | Bert Ramlot et.al. | 2601.14821 | null |
| 2026-01-22 | Structured Image-based Coding for Efficient Gaussian Splatting Compression | Pedro Martin et.al. | 2601.14510 | null |
| 2026-01-20 | Rig-Aware 3D Reconstruction of Vehicle Undercarriages using Gaussian Splatting | Nitin Kulkarni et.al. | 2601.14208 | null |
| 2026-01-20 | One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step Diffusion | Yitong Dong et.al. | 2601.14161 | null |
| 2026-01-20 | ParkingTwin: Training-Free Streaming 3D Reconstruction for Parking-Lot Digital Twins | Xinhao Liu et.al. | 2601.13706 | null |
| 2026-01-19 | GaussExplorer: 3D Gaussian Splatting for Embodied Exploration and Reasoning | Kim Yu-Ji et.al. | 2601.13132 | null |
| 2026-01-19 | TreeDGS: Aerial Gaussian Splatting for Distant DBH Measurement | Belal Shaheen et.al. | 2601.12823 | null |
| 2026-01-19 | CSGaussian: Progressive Rate-Distortion Compression and Segmentation for 3D Gaussian Splatting | Yu-Jen Tseng et.al. | 2601.12814 | null |
| 2026-01-19 | KaoLRM: Repurposing Pre-trained Large Reconstruction Models for Parametric 3D Face Reconstruction | Qingtian Zhu et.al. | 2601.12736 | null |
| 2026-01-17 | Active Semantic Mapping of Horticultural Environments Using Gaussian Splatting | Jose Cuaran et.al. | 2601.12122 | null |
| 2026-01-16 | studentSplat: Your Student Model Learns Single-view 3D Gaussian Splatting | Yimu Pan et.al. | 2601.11772 | null |
| 2026-01-15 | RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation | Peng Chen et.al. | 2601.10606 | null |
| 2026-01-15 | Thinking Like Van Gogh: Structure-Aware Style Transfer via Flow-Guided 3D Gaussian Splatting | Zhendong Wang et.al. | 2601.10075 | null |
| 2026-01-14 | Variable Basis Mapping for Real-Time Volumetric Visualization | Qibiao Li et.al. | 2601.09417 | null |
| 2026-01-19 | TIDI-GS: Floater Suppression in 3D Gaussian Splatting for Enhanced Indoor Scene Fidelity | Sooyeun Yang et.al. | 2601.09291 | null |
| 2026-01-14 | GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials | Bei Huang et.al. | 2601.09265 | null |
| 2026-01-14 | A$^2$TG: Adaptive Anisotropic Textured Gaussians for Efficient 3D Scene Representation | Sheng-Chi Hsu et.al. | 2601.09243 | null |
| 2026-01-12 | 3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing | Jiahua Dong et.al. | 2601.07963 | null |
| 2026-01-12 | FMAC: a Fair Fiducial Marker Accuracy Comparison Software | Guillaume J. Laurent et.al. | 2601.07723 | null |
| 2026-01-13 | ViewMorpher3D: A 3D-aware Diffusion Framework for Multi-Camera Novel View Synthesis in Autonomous Driving | Farhad G. Zanjani et.al. | 2601.07540 | null |
| 2026-01-12 | Mon3tr: Monocular 3D Telepresence with Pre-built Gaussian Avatars as Amortization | Fangyu Lin et.al. | 2601.07518 | null |
| 2026-01-12 | R3-RECON: Radiance-Field-Free Active Reconstruction via Renderability | Xiaofeng Jin et.al. | 2601.07484 | null |
| 2026-01-11 | SARA: Scene-Aware Reconstruction Accelerator | Jee Won Lee et.al. | 2601.06831 | null |
| 2026-01-10 | SRFlow: A Dataset and Regularization Model for High-Resolution Facial Optical Flow via Splatting Rasterization | JiaLin Zhang et.al. | 2601.06479 | null |
| 2026-01-09 | NAS-GS: Noise-Aware Sonar Gaussian Splatting | Shida Xu et.al. | 2601.06285 | null |
| 2026-01-08 | Akasha 2: Hamiltonian State Space Duality and Visual-Language Joint Embedding Predictive Architecture | Yani Meziani et.al. | 2601.06212 | null |
| 2026-01-09 | LayerGS: Decomposition and Inpainting of Layered 3D Human Avatars via 2D Gaussian Splatting | Yinghan Xu et.al. | 2601.05853 | link |
| 2026-01-09 | FeatureSLAM: Feature-enriched 3D gaussian splatting SLAM in real time | Christopher Thirgood et.al. | 2601.05738 | null |
| 2026-01-09 | GS-DMSR: Dynamic Sensitive Multi-scale Manifold Enhancement for Accelerated High-Quality 3D Gaussian Splatting | Nengbo Lu et.al. | 2601.05584 | link |
| 2026-01-09 | GaussianSwap: Animatable Video Face Swapping with 3D Gaussian Splatting | Xuan Cheng et.al. | 2601.05511 | null |
| 2026-01-08 | MOSAIC-GS: Monocular Scene Reconstruction via Advanced Initialization for Complex Dynamic Environments | Svitlana Morkva et.al. | 2601.05368 | null |
| 2026-01-08 | OceanSplat: Object-aware Gaussian Splatting with Trinocular View Consistency for Underwater Scene Reconstruction | Minseong Kweon et.al. | 2601.04984 | null |
| 2026-01-08 | AgentOCR: Reimagining Agent History via Optical Self-Compression | Lang Feng et.al. | 2601.04786 | null |
| 2026-01-08 | ProFuse: Efficient Cross-View Context Fusion for Open-Vocabulary 3D Gaussian Splatting | Yen-Jen Chiou et.al. | 2601.04754 | link |
| 2026-01-09 | Differential Locally Injective Grid Deformation and Optimization | Julian Knodt et.al. | 2601.04494 | null |
| 2026-01-07 | SCAR-GS: Spatial Context Attention for Residuals in Progressive Gaussian Splatting | Diego Revilla et.al. | 2601.04348 | null |
| 2026-01-07 | IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting | Wei Long et.al. | 2601.03824 | null |
| 2026-01-07 | G2P: Gaussian-to-Point Attribute Alignment for Boundary-Aware 3D Semantic Segmentation | Hojun Song et.al. | 2601.03510 | null |
| 2026-01-06 | RelightAnyone: A Generalized Relightable 3D Gaussian Head Model | Yingyan Xu et.al. | 2601.03357 | null |
| 2026-01-06 | CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature | Eldad Matmon et.al. | 2601.03319 | null |
| 2026-01-06 | A High-Fidelity Digital Twin for Robotic Manipulation Based on 3D Gaussian Splatting | Ziyang Sun et.al. | 2601.03200 | null |
| 2026-01-06 | Stroke Patches: Customizable Artistic Image Styling Using Regression | Ian Jaffray et.al. | 2601.03114 | null |
| 2026-01-06 | SA-ResGS: Self-Augmented Residual 3D Gaussian Splatting for Next Best View Selection | Kim Jun-Seong et.al. | 2601.03024 | null |
| 2026-01-06 | CAMO: Category-Agnostic 3D Motion Transfer from Monocular 2D Videos | Taeyeon Kim et.al. | 2601.02716 | null |
| 2026-01-05 | HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures | Yating Wang et.al. | 2601.02103 | null |
| 2026-01-05 | 360-GeoGS: Geometrically Consistent Feed-Forward 3D Gaussian Splatting Reconstruction for 360 Images | Jiaqi Yao et.al. | 2601.02102 | null |
| 2026-01-05 | InpaintHuman: Reconstructing Occluded Humans with Multi-Scale UV Mapping and Identity-Preserving Diffusion Inpainting | Jinlong Fan et.al. | 2601.02098 | null |
| 2026-01-05 | SketchRodGS: Sketch-based Extraction of Slender Geometries for Animating Gaussian Splatting Scenes | Haato Watanabe et.al. | 2601.02072 | null |
| 2026-01-05 | ESGaussianFace: Emotional and Stylized Audio-Driven Facial Animation via 3D Gaussian Splatting | Chuhang Ma et.al. | 2601.01847 | null |
| 2026-01-04 | Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows | Aymen Mir et.al. | 2601.01660 | null |
| 2026-01-04 | ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking | Xiaobao Wei et.al. | 2601.01386 | null |
| 2026-01-04 | ShadowGS: Shadow-Aware 3D Gaussian Splatting for Satellite Imagery | Feng Luo et.al. | 2601.00939 | null |
| 2026-01-01 | Clean-GS: Semantic Mask-Guided Pruning for 3D Gaussian Splatting | Subhankar Mishra et.al. | 2601.00913 | null |
| 2025-12-28 | RGS-SLAM: Robust Gaussian Splatting SLAM with One-Shot Dense Initialization | Wei-Tse Cheng et.al. | 2601.00705 | null |
| 2026-01-01 | SV-GS: Sparse View 4D Reconstruction with Skeleton-Driven Gaussian Splatting | Jun-Jee Chao et.al. | 2601.00285 | null |
| 2025-12-31 | PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes | Luca Collorone et.al. | 2512.24986 | null |
| 2025-12-31 | UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning | Ankit Dhiman et.al. | 2512.24763 | null |
| 2025-12-31 | Splatwizard: A Benchmark Toolkit for 3D Gaussian Splatting Compression | Xiang Liu et.al. | 2512.24742 | null |
| 2025-12-30 | Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression | Huanxiong Liang et.al. | 2512.24018 | null |
| 2025-12-30 | Improved 3D Gaussian Splatting of Unknown Spacecraft Structure Using Space Environment Illumination Knowledge | Tae Ha Park et.al. | 2512.23998 | null |
| 2025-12-29 | Contour Information Aware 2D Gaussian Splatting for Image Representation | Masaya Takabe et.al. | 2512.23255 | null |
| 2025-12-29 | GVSynergy-Det: Synergistic Gaussian-Voxel Representations for Multi-View 3D Object Detection | Yi Zhang et.al. | 2512.23176 | null |
| 2025-12-30 | Differentiable Physics-Driven Human Representation for Millimeter-Wave Based Pose Estimation | Shuntian Zheng et.al. | 2512.23054 | null |
| 2025-12-28 | Hash Grid Feature Pruning | Yangzhi Ma et.al. | 2512.22882 | null |
| 2025-12-28 | Next Best View Selections for Semantic and Dynamic 3D Gaussian Splatting | Yiqian Li et.al. | 2512.22771 | null |
| 2025-12-27 | SCPainter: A Unified Framework for Realistic 3D Asset Insertion and Novel View Synthesis | Paul Dobre et.al. | 2512.22706 | null |
| 2025-12-30 | Tracking by Predicting 3-D Gaussians Over Time | Tanish Baranwal et.al. | 2512.22489 | null |
| 2025-12-24 | AirGS: Real-Time 4D Gaussian Streaming for Free-Viewpoint Video Experiences | Zhe Wang et.al. | 2512.20943 | null |
| 2025-12-24 | Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting | Yoonwoo Jeong et.al. | 2512.20927 | null |
| 2025-12-23 | Nebula: Enable City-Scale 3D Gaussian Splatting in Virtual Reality via Collaborative Rendering and Accelerated Stereo Rasterization | He Zhu et.al. | 2512.20495 | null |
| 2025-12-23 | SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images | Linfei Li et.al. | 2512.20377 | link |
| 2025-12-23 | Enhancing annotations for 5D apple pose estimation through 3D Gaussian Splatting (3DGS) | Robert van de Ven et.al. | 2512.20148 | link |
| 2025-12-25 | Dreamcrafter: Immersive Editing of 3D Radiance Fields Through Flexible, Generative Inputs and Outputs | Cyrus Vachha et.al. | 2512.20129 | null |
| 2025-12-22 | WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion | Hanyang Kong et.al. | 2512.19678 | null |
| 2025-12-22 | 4D Gaussian Splatting as a Learned Dynamical System | Arnold Caleb Asiimwe et.al. | 2512.19648 | null |
| 2025-12-22 | GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting | Tiantian Li et.al. | 2512.19108 | null |
| 2025-12-21 | EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images | Jongmin Park et.al. | 2512.18692 | null |
| 2025-12-21 | Geometric-Photometric Event-based 3D Gaussian Ray Tracing | Kai Kohyama et.al. | 2512.18640 | null |
| 2025-12-21 | ChronoDreamer: Action-Conditioned World Model as an Online Simulator for Robotic Planning | Zhenhao Zhou et.al. | 2512.18619 | null |
| 2025-12-20 | MatSpray: Fusing 2D Material World Knowledge on 3D Geometry | Philipp Langsteiner et.al. | 2512.18314 | null |
| 2025-12-22 | Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding | Yue Li et.al. | 2512.17817 | null |
| 2025-12-19 | G3Splat: Geometrically Consistent Generalizable Gaussian Splatting | Mehdi Hosseinzadeh et.al. | 2512.17547 | link |
| 2025-12-19 | FLEG: Feed-Forward Language Embedded Gaussian Splatting from Any Views | Qijian Tian et.al. | 2512.17541 | null |
| 2025-12-19 | Voxel-GS: Quantized Scaffold Gaussian Splatting Compression with Run-Length Coding | Chunyang Fu et.al. | 2512.17528 | null |
| 2025-12-19 | Flying in Clutter on Monocular RGB by Learning in 3D Radiance Fields with Domain Adaptation | Xijie Huang et.al. | 2512.17349 | null |
| 2025-12-18 | Instant Expressive Gaussian Head Avatar via 3D-Aware Expression Distillation | Kaiwen Jiang et.al. | 2512.16893 | null |
| 2025-12-18 | SDFoam: Signed-Distance Foam for explicit surface reconstruction | Antonella Rech et.al. | 2512.16706 | null |
| 2025-12-18 | Using Gaussian Splats to Create High-Fidelity Facial Geometry and Texture | Haodi He et.al. | 2512.16397 | null |
| 2025-12-17 | Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering | Divam Gupta et.al. | 2512.15711 | null |
| 2025-12-17 | Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting | Arthur Moreau et.al. | 2512.15508 | null |
| 2025-12-19 | VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments | Yuze Wu et.al. | 2512.15258 | null |
| 2025-12-17 | MVGSR: Multi-View Consistent 3D Gaussian Super-Resolution via Epipolar Guidance | Kaizhe Zhang et.al. | 2512.15048 | null |
| 2025-12-17 | A Gaussian Parameterization for Direct Atomic Structure Identification in Electron Tomography | Nalini M. Singh et.al. | 2512.15034 | null |
| 2025-12-16 | BridgeNet: A Dataset of Graph-based Bridge Structural Models for Machine Learning Applications | Lazlo Bleker et.al. | 2512.14496 | null |
| 2025-12-16 | Broadening View Synthesis of Dynamic Scenes from Constrained Monocular Videos | Le Jiang et.al. | 2512.14406 | null |
| 2025-12-16 | HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis | Kaizhe Zhang et.al. | 2512.14352 | null |
| 2025-12-16 | Beyond a Single Light: A Large-Scale Aerial Dataset for Urban Scene Reconstruction Under Varying Illumination | Zhuoxiao Li et.al. | 2512.14200 | null |
| 2025-12-16 | Spherical Voronoi: Directional Appearance as a Differentiable Partition of the Sphere | Francesco Di Sario et.al. | 2512.14180 | null |
| 2025-12-16 | GaussianPlant: Structure-aligned Gaussian Splatting for 3D Reconstruction of Plants | Yang Yang et.al. | 2512.14087 | null |
| 2025-12-16 | ASAP-Textured Gaussians: Enhancing Textured Gaussians with Adaptive Sampling and Anisotropic Parameterization | Meng Wei et.al. | 2512.14039 | null |
| 2025-12-15 | Nexels: Neurally-Textured Surfels for Real-Time Novel View Synthesis with Sparse Geometries | Victor Rong et.al. | 2512.13796 | null |
| 2025-12-15 | Computer vision training dataset generation for robotic environments using Gaussian splatting | Patryk Niżeniec et.al. | 2512.13411 | null |
| 2025-12-15 | Light Field Based 6DoF Tracking of Previously Unobserved Objects | Nikolai Goncharov et.al. | 2512.13007 | null |
| 2025-12-15 | Qonvolution: Towards Learning High-Frequency Signals with Queried Convolution | Abhinav Kumar et.al. | 2512.12898 | null |
| 2025-12-14 | Fast 2DGS: Efficient Image Representation with Deep Gaussian Prior | Hao Wang et.al. | 2512.12774 | null |
| 2025-12-13 | Keep the Lights On, Keep the Lengths in Check: Plug-In Adversarial Detection for Time-Series LLMs in Energy Forecasting | Hua Ma et.al. | 2512.12154 | null |
| 2025-12-12 | Moment-Based 3D Gaussian Splatting: Resolving Volumetric Occlusion with Order-Independent Transmittance | Jan U. Müller et.al. | 2512.11800 | null |
| 2025-12-12 | 3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation | Zhiguo Lu et.al. | 2512.11557 | null |
| 2025-12-12 | Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video | Meng-Li Shih et.al. | 2512.11356 | null |
| 2025-12-12 | Lightweight 3D Gaussian Splatting Compression via Video Codec | Qi Yang et.al. | 2512.11186 | null |
| 2025-12-11 | GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting | Madhav Agarwal et.al. | 2512.10939 | link |
| 2025-12-11 | MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos | Kehong Gong et.al. | 2512.10881 | null |
| 2025-12-11 | DeMapGS: Simultaneous Mesh Deformation and Surface Attribute Mapping via Gaussian Splatting | Shuyi Zhou et.al. | 2512.10572 | link |
| 2025-12-11 | Neural Hamiltonian Deformation Fields for Dynamic Scene Rendering | Hai-Long Qin et.al. | 2512.10424 | null |
| 2025-12-11 | Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views | Zhankuo Xu et.al. | 2512.10369 | null |
| 2025-12-11 | Physically Aware 360$^\circ$ View Generation from a Single Image using Disentangled Scene Embeddings | Karthikeya KV et.al. | 2512.10293 | null |
| 2025-12-11 | Long-LRM++: Preserving Fine Details in Feed-Forward Wide-Coverage Reconstruction | Chen Ziwen et.al. | 2512.10267 | null |
| 2025-12-10 | TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing | Jiachen Tao et.al. | 2512.10095 | null |
| 2025-12-10 | GAINS: Gaussian-based Inverse Rendering from Sparse Multi-View Captures | Patrick Noras et.al. | 2512.09925 | null |
| 2025-12-10 | Splatent: Splatting Diffusion Latents for Novel View Synthesis | Or Hirschorn et.al. | 2512.09923 | null |
| 2025-12-10 | YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos | Ryan Meegan et.al. | 2512.09903 | null |
| 2025-12-10 | ReMoSPLAT: Reactive Mobile Manipulation Control on a Gaussian Splat | Nicolas Marticorena et.al. | 2512.09656 | null |
| 2025-12-10 | D$^2$GSLAM: 4D Dynamic Gaussian Splatting SLAM | Siting Zhu et.al. | 2512.09411 | null |
| 2025-12-11 | Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video | Seonghwa Choi et.al. | 2512.09335 | null |
| 2025-12-10 | MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification | Sangwoon Kwak et.al. | 2512.09270 | null |
| 2025-12-09 | GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars | Kelian Baert et.al. | 2512.09162 | null |
| 2025-12-09 | OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics | Jisang Yoo et.al. | 2512.08625 | null |
| 2025-12-09 | On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs | Yijia Guo et.al. | 2512.08498 | null |
| 2025-12-09 | Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform | Yuning Gong et.al. | 2512.08478 | null |
| 2025-12-09 | HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting | Chang Liu et.al. | 2512.08334 | null |
| 2025-12-09 | Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation | Srijan Dokania et.al. | 2512.08271 | null |
| 2025-12-08 | Multi-view Pyramid Transformer: Look Coarser to See Broader | Gyeongjin Kang et.al. | 2512.07806 | null |
| 2025-12-08 | Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects | Shuohan Tao et.al. | 2512.07381 | null |
| 2025-12-08 | Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting | Shilong Jin et.al. | 2512.07345 | null |
| 2025-12-08 | AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing | Ziming Hong et.al. | 2512.07247 | null |
| 2025-12-08 | STRinGS: Selective Text Refinement in Gaussian Splatting | Abhinav Raundhal et.al. | 2512.07230 | null |
| 2025-12-08 | SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting | Seokhyun Youn et.al. | 2512.07197 | null |
| 2025-12-08 | MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation | Muyu Xu et.al. | 2512.07165 | null |
| 2025-12-09 | COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision | Jaeyoon Lee et.al. | 2512.07107 | null |
| 2025-12-07 | RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting | Hoang-Nhat Tran et.al. | 2512.07052 | null |
| 2025-12-07 | MeshSplatting: Differentiable Rendering with Opaque Meshes | Jan Held et.al. | 2512.06818 | null |
| 2025-12-07 | RDSplat: Robust Watermarking Against Diffusion Editing for 3D Gaussian Splatting | Longjie Zhao et.al. | 2512.06774 | null |
| 2025-12-07 | EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy | Yumeng He et.al. | 2512.06684 | null |
| 2025-12-06 | AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars | Ramazan Fazylov et.al. | 2512.06438 | null |
| 2025-12-06 | TriaGS: Differentiable Triangulation-Guided Geometric Consistency for 3D Gaussian Splatting | Quan Tran et.al. | 2512.06269 | null |
| 2025-12-05 | Tracking-Guided 4D Generation: Foundation-Tracker Motion Priors for 3D Model Animation | Su Sun et.al. | 2512.06158 | null |
| 2025-12-05 | Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition | Anne Sielemann et.al. | 2512.05936 | null |
| 2025-12-05 | Physically-Based Simulation of Automotive LiDAR | L. Dudzik et.al. | 2512.05932 | null |
| 2025-12-05 | Edit-aware RAW Reconstruction | Abhijith Punnappurath et.al. | 2512.05859 | null |
| 2025-12-05 | 3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering | Blanca Inigo et.al. | 2512.05803 | null |
| 2025-12-05 | Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer | Rong Wang et.al. | 2512.05593 | null |
| 2025-12-05 | SCoNE: Spherical Consistent Neighborhoods Ensemble for Effective and Efficient Multi-View Anomaly Detection | Yang Xu et.al. | 2512.05540 | null |
| 2025-12-05 | TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression | Cheng-Yuan Ho et.al. | 2512.05446 | null |
| 2025-12-05 | Image Semantic Communication with Quadtree Partition-based Coding | Yinhuan Huang et.al. | 2512.05395 | null |
| 2025-12-05 | SplatPainter: Interactive Authoring of 3D Gaussians from 2D Edits via Test-Time Training | Yang Zheng et.al. | 2512.05354 | null |
| 2025-12-04 | DEAR: Dataset for Evaluating the Aesthetics of Rendering | Vsevolod Plohotnuk et.al. | 2512.05209 | null |
| 2025-12-04 | Light-X: Generative 4D Video Rendering with Camera and Illumination Control | Tianqi Liu et.al. | 2512.05115 | link |
| 2025-12-08 | Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting | Hao-Jen Chien et.al. | 2512.05113 | null |
| 2025-12-04 | NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation | Yu Zeng et.al. | 2512.05106 | null |
| 2025-12-04 | 4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer | Xianfeng Wu et.al. | 2512.05060 | null |
| 2025-12-04 | Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image | Yanran Zhang et.al. | 2512.05044 | link |
| 2025-12-04 | Reflection Removal through Efficient Adaptation of Diffusion Transformers | Daniyar Zakarin et.al. | 2512.05000 | link |
| 2025-12-04 | Federated Learning for Terahertz Wireless Communication | O. Tansel Baydas et.al. | 2512.04984 | null |
| 2025-12-04 | RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS | Chuanyu Fu et.al. | 2512.04815 | null |
| 2025-12-04 | Bridging Simulation and Reality: Cross-Domain Transfer with Semantic 2D Gaussian Splatting | Jian Tang et.al. | 2512.04731 | null |
| 2025-12-04 | Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex | Zhizhen Wu et.al. | 2512.04556 | null |
| 2025-12-04 | Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization | Hong Kuang et.al. | 2512.04542 | null |
| 2025-12-04 | Refaçade: Editing Object with Given Reference Texture | Youze Huang et.al. | 2512.04534 | null |
| 2025-12-04 | UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes | Changhe Liu et.al. | 2512.04421 | link |
| 2025-12-03 | SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting | Yonghan Lee et.al. | 2512.04315 | null |
| 2025-12-03 | Mind-to-Face: Neural-Driven Photorealistic Avatar Synthesis via EEG Decoding | Haolin Xiong et.al. | 2512.04313 | null |
| 2025-12-03 | Machine Learning Pipeline for Denoising Low Signal-To-Noise Ratio and Out-of-Distribution Transmission Electron Microscopy Datasets | Brian Lee et.al. | 2512.04045 | null |
| 2025-12-03 | RELIC: Interactive Video World Model with Long-Horizon Memory | Yicong Hong et.al. | 2512.04040 | null |
| 2025-12-03 | C3G: Learning Compact 3D Representations with 2K Gaussians | Honggyu An et.al. | 2512.04021 | null |
| 2025-12-03 | Collective dynamics of trail-interacting particles | Paul Pineau et.al. | 2512.03950 | null |
| 2025-12-03 | Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding | Haoran Zhou et.al. | 2512.03601 | null |
| 2025-12-03 | CloseUpAvatar: High-Fidelity Animatable Full-Body Avatars with Mixture of Multi-Scale Textures | David Svitov et.al. | 2512.03593 | null |
| 2025-12-03 | Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models | Shojiro Yamabe et.al. | 2512.03463 | null |
| 2025-12-03 | What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models | Tianchen Deng et.al. | 2512.03422 | null |
| 2025-12-03 | ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding | Lingjun Zhao et.al. | 2512.03370 | null |
| 2025-12-02 | Flux4D: Flow-based Unsupervised 4D Reconstruction | Jingkang Wang et.al. | 2512.03210 | null |
| 2025-12-02 | PPTArena: A Benchmark for Agentic PowerPoint Editing | Michael Ofengenden et.al. | 2512.03042 | null |
| 2025-12-02 | SurfFill: Completion of LiDAR Point Clouds via Gaussian Surfel Splatting | Svenja Strobel et.al. | 2512.03010 | null |
| 2025-12-02 | DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images | Xiaoxue Chen et.al. | 2512.03004 | link |
| 2025-12-02 | EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis | Yancheng Zhang et.al. | 2512.02932 | null |
| 2025-12-02 | Adaptive hydrogels with spatiotemporal stiffening using pH-modulating enzymes | Natascha Gray et.al. | 2512.02698 | null |
| 2025-12-02 | PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes | Derui Shan et.al. | 2512.02664 | null |
| 2025-12-02 | PoreTrack3D: A Benchmark for Dynamic 3D Gaussian Splatting in Pore-Scale Facial Trajectory Tracking | Dong Li et.al. | 2512.02648 | null |
| 2025-12-02 | Content-Aware Texturing for Gaussian Splatting | Panagiotis Papantonakis et.al. | 2512.02621 | link |
| 2025-12-02 | G-SHARP: Gaussian Surgical Hardware Accelerated Real-time Pipeline | Vishwesh Nath et.al. | 2512.02482 | null |
| 2025-12-02 | VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM | Zihan Zhu et.al. | 2512.02293 | null |
| 2025-12-01 | SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting | Pranav Asthana et.al. | 2512.02172 | null |
| 2025-12-01 | Flowchart2Mermaid: A Vision-Language Model Powered System for Converting Flowcharts into Editable Diagram Code | Pritam Deka et.al. | 2512.02170 | null |
| 2025-12-01 | ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation | Chenyang Gu et.al. | 2512.02013 | null |
| 2025-12-01 | Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights | Juanxi Tian et.al. | 2512.01816 | null |
| 2025-12-01 | IGen: Scalable Data Generation for Robot Learning from Open-World Images | Chenghao Gu et.al. | 2512.01773 | null |
| 2025-12-02 | SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge | Yumeng He et.al. | 2512.01629 | null |
| 2025-12-01 | Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry Network | Tianyu Luan et.al. | 2512.01380 | null |
| 2025-12-01 | TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking | Hanzhi Guo et.al. | 2512.01329 | null |
| 2025-12-01 | DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy | Jaewoo Song et.al. | 2512.01302 | null |
| 2025-12-01 | EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly | Xiaokun Pan et.al. | 2512.01296 | null |
| 2025-12-01 | Pay Attention Later: From Vector Space Diffusion to Linearithmic Spectral Phase-Locking | Alper Yıldırım et.al. | 2512.01208 | null |
| 2025-11-30 | LISA-3D: Lifting Language-Image Segmentation to 3D via Multi-View Consistency | Zhongbin Guo et.al. | 2512.01008 | null |
| 2025-11-30 | Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation | An Yang et.al. | 2512.00944 | null |
| 2025-11-30 | Feed-Forward 3D Gaussian Splatting Compression with Long-Context Modeling | Zhening Liu et.al. | 2512.00877 | null |
| 2025-11-30 | Smol-GS: Compact Representations for Abstract 3D Gaussian Splatting | Haishan Wang et.al. | 2512.00850 | null |
| 2025-11-30 | PolarGS: Polarimetric Cues for Ambiguity-Free Gaussian Splatting with Accurate Geometry Recovery | Bo Guo et.al. | 2512.00794 | null |
| 2025-11-30 | Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards | Qiang Lyu et.al. | 2512.00743 | null |
| 2025-11-30 | Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer | Dong In Lee et.al. | 2512.00677 | null |
| 2025-11-29 | Asset-Driven Sematic Reconstruction of Dynamic Scene with Multi-Human-Object Interactions | Sandika Biswas et.al. | 2512.00547 | null |
| 2025-11-29 | Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update | Zeyuan An et.al. | 2512.00534 | null |
| 2025-11-29 | SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level Style Control | Ji Gan et.al. | 2512.00413 | null |
| 2025-11-29 | Debate with Images: Detecting Deceptive Behaviors in Multimodal Large Language Models | Sitong Fang et.al. | 2512.00349 | null |
| 2025-11-28 | Object-Centric Data Synthesis for Category-level Object Detection | Vikhyat Agarwal et.al. | 2511.23450 | null |
| 2025-11-28 | FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting | Tianhao Xie et.al. | 2511.23292 | null |
| 2025-11-28 | Robust 3DGS-based SLAM via Adaptive Kernel Smoothing | Shouhe Zhang et.al. | 2511.23221 | null |
| 2025-11-28 | NumeriKontrol: Adding Numeric Control to Diffusion Transformers for Instruction-based Image Editing | Zhenyu Xu et.al. | 2511.23105 | null |
| 2025-11-28 | Geometry-Consistent 4D Gaussian Splatting for Sparse-Input Dynamic View Synthesis | Yiwei Li et.al. | 2511.23044 | null |
| 2025-11-28 | DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management | Casimir Feldmann et.al. | 2511.23030 | null |
| 2025-11-28 | MrGS: Multi-modal Radiance Fields with 3D Gaussian Splatting for RGB-Thermal Novel View Synthesis | Minseong Kweon et.al. | 2511.22997 | null |
| 2025-11-28 | MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation | Yuta Oshima et.al. | 2511.22989 | null |
| 2025-11-28 | Ovis-Image Technical Report | Guo-Hua Wang et.al. | 2511.22982 | null |
| 2025-11-28 | Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM | Shouhe Zhang et.al. | 2511.22968 | null |
| 2025-11-28 | DenoiseGS: Gaussian Reconstruction Model for Burst Denoising | Yongsen Cheng et.al. | 2511.22939 | null |
| 2025-11-28 | FedAU2: Attribute Unlearning for User-Level Federated Recommender Systems with Adaptive and Robust Adversarial Training | Yuyuan Li et.al. | 2511.22872 | null |
| 2025-11-28 | TokCom-UEP: Semantic Importance-Matched Unequal Error Protection for Resilient Image Transmission | Kaizheng Zhang et.al. | 2511.22859 | null |
| 2025-11-27 | GSpaRC: Gaussian Splatting for Real-time Reconstruction of RF Channels | Bhavya Sai Nukapotula et.al. | 2511.22793 | null |
| 2025-11-27 | Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction | Boyao Zhou et.al. | 2511.22704 | null |
| 2025-11-27 | Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer | Z-Image Team et.al. | 2511.22699 | null |
| 2025-11-27 | Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation | Shubhankar Borse et.al. | 2511.22690 | null |
| 2025-11-27 | Bringing Your Portrait to 3D Presence | Jiawei Zhang et.al. | 2511.22553 | null |
| 2025-11-27 | FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning | Yuan Yao et.al. | 2511.22265 | null |
| 2025-11-27 | Can Protective Watermarking Safeguard the Copyright of 3D Gaussian Splatting? | Wenkai Huang et.al. | 2511.22262 | null |
| 2025-11-26 | Resolution Where It Counts: Hash-based GPU-Accelerated 3D Reconstruction via Variance-Adaptive Voxel Grids | Lorenzo De Rebotti et.al. | 2511.21459 | link |
| 2025-11-26 | Endo-G$^{2}$T: Geometry-Guided & Temporally Aware Time-Embedded 4DGS For Endoscopic Scenes | Yangle Liu et.al. | 2511.21367 | null |
| 2025-11-26 | Unlocking Zero-shot Potential of Semi-dense Image Matching via Gaussian Splatting | Juncheng Chen et.al. | 2511.21265 | null |
| 2025-11-26 | The Spheres Dataset: Multitrack Orchestral Recordings for Music Source Separation and Information Retrieval | Jaime Garcia-Martinez et.al. | 2511.21247 | null |
| 2025-11-26 | Transformer Driven Visual Servoing and Dual Arm Impedance Control for Fabric Texture Matching | Fuyuki Tokuda et.al. | 2511.21203 | null |
| 2025-11-26 | Dual Preintegration for Relative State Estimation | Ruican Xia et.al. | 2511.21189 | null |
| 2025-11-25 | MODEST: Multi-Optics Depth-of-Field Stereo Dataset | Nisarg K. Trivedi et.al. | 2511.20853 | null |
| 2025-11-25 | Private Data Imputation | Abdelkarim Kati et.al. | 2511.20832 | null |
| 2025-11-25 | Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI | Xinhao Liu et.al. | 2511.20620 | null |
| 2025-11-25 | PhysChoreo: Physics-Controllable Video Generation with Part-Aware Semantic Grounding | Haoze Zhang et.al. | 2511.20562 | null |
| 2025-11-25 | GS-Checker: Tampering Localization for 3D Gaussian Splatting | Haoliang Han et.al. | 2511.20354 | null |
| 2025-11-25 | Material-informed Gaussian Splatting for 3D World Reconstruction in a Digital Twin | João Malheiro Silva et.al. | 2511.20348 | null |
| 2025-11-25 | Active3D: Active High-Fidelity 3D Reconstruction via Hierarchical Uncertainty Quantification | Yan Li et.al. | 2511.20050 | null |
| 2025-11-25 | Clair Obscur: an Illumination-Aware Method for Real-World Image Vectorization | Xingyue Lin et.al. | 2511.20034 | null |
| 2025-11-25 | GigaWorld-0: World Models as Data Engine to Empower Embodied AI | GigaWorld Team et.al. | 2511.19861 | null |
| 2025-11-25 | Temporal-Visual Semantic Alignment: A Unified Architecture for Transferring Spatial Priors from Vision Models to Zero-Shot Temporal Tasks | Xiangkai Ma et.al. | 2511.19856 | null |
| 2025-11-25 | STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction | Jiankuo Zhao et.al. | 2511.19854 | link |
| 2025-11-24 | ModHiFi: Identifying High Fidelity predictive components for Model Modification | Dhruva Kashyap et.al. | 2511.19566 | null |
| 2025-11-24 | Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation | Jaeyeong Kim et.al. | 2511.19542 | link |
| 2025-11-24 | LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context | Jingzhi Bao et.al. | 2511.19437 | link |
| 2025-11-24 | Efficiency vs. Fidelity: A Comparative Analysis of Diffusion Probabilistic Models and Flow Matching on Low-Resource Hardware | Srishti Gupta et.al. | 2511.19379 | null |
| 2025-11-24 | DensifyBeforehand: LiDAR-assisted Content-aware Densification for Efficient and Quality 3D Gaussian Splatting | Phurtivilai Patt et.al. | 2511.19294 | null |
| 2025-11-24 | IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes | Carl Lindström et.al. | 2511.19235 | null |
| 2025-11-24 | NVGS: Neural Visibility for Occlusion Culling in 3D Gaussian Splatting | Brent Zoomers et.al. | 2511.19202 | null |
| 2025-11-24 | AvatarBrush: Monocular Reconstruction of Gaussian Avatars with Intuitive Local Editing | Mengtian Li et.al. | 2511.19189 | null |
| 2025-11-24 | MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes | Kehua Chen et.al. | 2511.19172 | null |
| 2025-11-24 | Neural Texture Splatting: Expressive 3D Gaussian Splatting for View Synthesis, Geometry, and Dynamic Reconstruction | Yiming Wang et.al. | 2511.18873 | null |
| 2025-11-24 | NI-Tex: Non-isometric Image-based Garment Texture Generation | Hui Shan et.al. | 2511.18765 | null |
| 2025-11-24 | Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing | Xiaotong Huang et.al. | 2511.18755 | null |
| 2025-11-24 | MAGMA-Edu: Multi-Agent Generative Multimodal Framework for Text-Diagram Educational Question Generation | Zhenyu Wu et.al. | 2511.18714 | null |
| 2025-11-24 | Inverse Rendering for High-Genus Surface Meshes from Multi-View Images | Xiang Gao et.al. | 2511.18680 | null |
| 2025-11-23 | NeAR: Coupled Neural Asset-Renderer Stack | Hong Li et.al. | 2511.18600 | null |
| 2025-11-23 | PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation | Samarth Chopra et.al. | 2511.18570 | null |
| 2025-11-23 | Splatblox: Traversability-Aware Gaussian Splatting for Outdoor Robot Navigation | Samarth Chopra et.al. | 2511.18525 | null |
| 2025-11-23 | ReCoGS: Real-time ReColoring for Gaussian Splatting scenes | Lorenzo Rutayisire et.al. | 2511.18441 | null |
| 2025-11-23 | CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images | Avishka Perera et.al. | 2511.18424 | null |
| 2025-11-23 | SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation | Peter Siegel et.al. | 2511.18386 | null |
| 2025-11-23 | Synthetic Curriculum Reinforces Compositional Text-to-Image Generation | Shijian Wang et.al. | 2511.18378 | null |
| 2025-11-23 | Alias-free 4D Gaussian Splatting | Zilong Chen et.al. | 2511.18367 | null |
| 2025-11-21 | Planning with Sketch-Guided Verification for Physics-Aware Video Generation | Yidong Huang et.al. | 2511.17450 | null |
| 2025-11-21 | Refracting Reality: Generating Images with Realistic Transparent Objects | Yue Yin et.al. | 2511.17340 | null |
| 2025-11-21 | QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy | Adam Lilja et.al. | 2511.17221 | null |
| 2025-11-21 | FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception | Shubham Sonarghare et.al. | 2511.17210 | null |
| 2025-11-21 | SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors | Kunyi Li et.al. | 2511.17207 | null |
| 2025-11-21 | PEGS: Physics-Event Enhanced Large Spatiotemporal Motion Reconstruction via 3D Gaussian Splatting | Yijun Xu et.al. | 2511.17116 | null |
| 2025-11-21 | Towards Generative Design Using Optimal Transport for Shape Exploration and Solution Field Interpolation | Sergio Torregrosa et.al. | 2511.17111 | null |
| 2025-11-21 | SPAGS: Sparse-View Articulated Object Reconstruction from Single State via Planar Gaussian Splatting | Di Wu et.al. | 2511.17092 | null |
| 2025-11-21 | REArtGS++: Generalizable Articulation Reconstruction with Temporal Geometry Constraint via Planar Gaussian Splatting | Di Wu et.al. | 2511.17059 | null |
| 2025-11-21 | RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation | Wenzhuo Sun et.al. | 2511.17048 | null |
| 2025-11-21 | Parameter-Free Neural Lens Blur Rendering for High-Fidelity Composites | Lingyan Ruan et.al. | 2511.17014 | null |
| 2025-11-21 | Stable Offline Hand-Eye Calibration for any Robot with Just One Mark | Sicheng Xie et.al. | 2511.17001 | null |
| 2025-11-21 | PhysMorph-GS: Differentiable Shape Morphing via Joint Optimization of Physics and Rendering Objectives | Chang-Yong Song et.al. | 2511.16988 | null |
| 2025-11-21 | Gradient-Driven Natural Selection for Compact 3D Gaussian Splatting | Xiaobin Deng et.al. | 2511.16980 | null |
| 2025-11-21 | One Walk is All You Need: Data-Efficient 3D RF Scene Reconstruction with Human Movements | Yiheng Bian et.al. | 2511.16966 | null |
| 2025-11-21 | MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis | Di Luo et.al. | 2511.16957 | null |
| 2025-11-21 | UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation | Chi Zhang et.al. | 2511.16917 | null |
| 2025-11-20 | Vorion: A RISC-V GPU with Hardware-Accelerated 3D Gaussian Rendering and Training | Yipeng Wang et.al. | 2511.16831 | null |
| 2025-11-20 | SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG | Mengnan Jiang et.al. | 2511.16766 | null |
| 2025-11-20 | EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering | Pierrick Bournez et.al. | 2511.16542 | null |
| 2025-11-20 | Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution | Jaime Álvarez Urueña et.al. | 2511.16541 | null |
| 2025-11-20 | Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation | Zongcai Tan et.al. | 2511.16494 | null |
| 2025-11-20 | Neural Positioning Without External Reference | Till-Yannic Müller et.al. | 2511.16352 | null |
| 2025-11-20 | CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering | Joni Vanherck et.al. | 2511.16349 | null |
| 2025-11-20 | Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling | Minseok Seo et.al. | 2511.16301 | link |
| 2025-11-20 | Optimizing 3D Gaussian Splattering for Mobile GPUs | Md Musfiqur Rahman Sanim et.al. | 2511.16298 | null |
| 2025-11-20 | How Robot Dogs See the Unseeable | Oliver Bimber et.al. | 2511.16262 | null |
| 2025-11-20 | Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers | Jian Ma et.al. | 2511.16156 | link |
| 2025-11-20 | LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM | Sibaek Lee et.al. | 2511.16144 | link |
| 2025-11-20 | Clustered Error Correction with Grouped 4D Gaussian Splatting | Taeho Kang et.al. | 2511.16112 | link |
| 2025-11-20 | Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2511.16091 | null |
| 2025-11-20 | Panel-by-Panel Souls: A Performative Workflow for Expressive Faces in AI-Assisted Manga Creation | Qing Zhang et.al. | 2511.16038 | null |
| 2025-11-20 | CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis | Zijian Wu et.al. | 2511.16030 | null |
| 2025-11-19 | Think Visually, Reason Textually: Vision-Language Synergy in ARC | Beichen Zhang et.al. | 2511.15703 | null |
| 2025-11-19 | ChartEditor: A Reinforcement Learning Framework for Robust Chart Editing | Liangyu Chen et.al. | 2511.15266 | null |
| 2025-11-19 | VIRAL: Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation | Tairan He et.al. | 2511.15200 | null |
| 2025-11-19 | Gaussian Blending: Rethinking Alpha Blending in 3D Gaussian Splatting | Junseo Koo et.al. | 2511.15102 | link |
| 2025-11-19 | BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching | Yachuan Huang et.al. | 2511.15066 | null |
| 2025-11-19 | Evaluating Multimodal Large Language Models on Vertically Written Japanese Text | Keito Sasagawa et.al. | 2511.15059 | null |
| 2025-11-18 | X-WIN: Building Chest Radiograph World Model via Predictive Sensing | Zefan Yang et.al. | 2511.14918 | null |
| 2025-11-18 | Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video | Yarin Bekor et.al. | 2511.14848 | null |
| 2025-11-18 | SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction | Meiying Gu et.al. | 2511.14633 | link |
| 2025-11-18 | Interaction-Aware 4D Gaussian Splatting for Dynamic Hand-Object Interaction Reconstruction | Hao Tian et.al. | 2511.14540 | null |
| 2025-11-18 | 2D Gaussians Spatial Transport for Point-supervised Density Regression | Miao Shang et.al. | 2511.14477 | link |
| 2025-11-18 | BEDLAM2.0: Synthetic Humans and Cameras in Motion | Joachim Tesch et.al. | 2511.14394 | null |
| 2025-11-19 | Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving | Kangqiao Zhao et.al. | 2511.14386 | null |
| 2025-11-18 | IBGS: Image-Based Gaussian Splatting | Hoang Chuong Nguyen et.al. | 2511.14357 | null |
| 2025-11-18 | Silhouette-to-Contour Registration: Aligning Intraoral Scan Models with Cephalometric Radiographs | Yiyi Miao et.al. | 2511.14343 | null |
| 2025-11-18 | Dental3R: Geometry-Aware Pairing for Intraoral 3D Reconstruction from Sparse-View Photographs | Yiyi Miao et.al. | 2511.14315 | null |
| 2025-11-18 | GEN3D: Generating Domain-Free 3D Scenes from a Single Image | Yuxin Zhang et.al. | 2511.14291 | null |
| 2025-11-19 | Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery | Yiming Zeng et.al. | 2511.14270 | null |
| 2025-11-19 | RoboTidy: A 3D Gaussian Splatting Household Tidying Benchmark for Embodied Navigation and Action | Xiaoquan Sun et.al. | 2511.14161 | null |
| 2025-11-18 | iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion | Hao Wang et.al. | 2511.14149 | null |
| 2025-11-18 | Splat Regression Models | Mara Daniels et.al. | 2511.14042 | null |
| 2025-11-17 | GRLoc: Geometric Representation Regression for Visual Localization | Changyang Li et.al. | 2511.13864 | null |
| 2025-11-17 | Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting | Jiangnan Ye et.al. | 2511.13684 | null |
| 2025-11-17 | Opt3DGS: Optimizing 3D Gaussian Splatting with Adaptive Exploration and Curvature-Aware Exploitation | Ziyang Huang et.al. | 2511.13571 | null |
| 2025-11-17 | Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling | Adam Hazimeh et.al. | 2511.13478 | null |
| 2025-11-17 | SkyReels-Text: Fine-grained Font-Controllable Text Editing for Poster Design | Yunjie Yu et.al. | 2511.13285 | null |
| 2025-11-17 | SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting | Zihan Li et.al. | 2511.13278 | null |
| 2025-11-17 | SymGS: Leveraging Local Symmetries for 3D Gaussian Splatting Compression | Keshav Gupta et.al. | 2511.13264 | null |
| 2025-11-17 | Birth of a Painting: Differentiable Brushstroke Reconstruction | Ying Jiang et.al. | 2511.13191 | null |
| 2025-11-17 | Beyond Darkness: Thermal-Supervised 3D Gaussian Splatting for Low-Light Novel View Synthesis | Qingsen Ma et.al. | 2511.13011 | null |
| 2025-11-17 | TR-Gaussians: High-fidelity Real-time Rendering of Planar Transmission and Reflection with 3D Gaussian Splatting | Yong Liu et.al. | 2511.13009 | null |
| 2025-11-17 | SplatSearch: Instance Image Goal Navigation for Mobile Robots using 3D Gaussian Splatting and Diffusion Models | Siddarth Narasimhan et.al. | 2511.12972 | null |
| 2025-11-17 | GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving | Chunyong Hu et.al. | 2511.12941 | null |
| 2025-11-17 | Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration | Changhun Oh et.al. | 2511.12930 | null |
| 2025-11-17 | Redshifting the Cosmological Constant in Unimodular Gravity via Nonlinear Quantum Mechanics | David E. Kaplan et.al. | 2511.12897 | null |
| 2025-11-17 | Reconstructing 3D Scenes in Native High Dynamic Range | Kaixuan Zhang et.al. | 2511.12895 | null |
| 2025-11-16 | Which Way from B to A: The role of embedding geometry in image interpolation for Stable Diffusion | Nicholas Karris et.al. | 2511.12757 | null |
| 2025-11-15 | Changes in Real Time: Online Scene Change Detection with Multi-View Fusion | Chamuditha Jayanga Galappaththige et.al. | 2511.12370 | link |
| 2025-11-15 | LiDAR-GS++: Improving LiDAR Gaussian Reconstruction via Diffusion Priors | Qifeng Chen et.al. | 2511.12304 | link |
| 2025-11-15 | SRSplat: Feed-Forward Super-Resolution Gaussian Splatting from Sparse Multi-View Images | Xinyuan Hu et.al. | 2511.12040 | null |
| 2025-11-14 | SimTac: A Physics-Based Simulator for Vision-Based Tactile Sensing with Biomorphic Structures | Xuyang Zhang et.al. | 2511.11456 | null |
| 2025-11-14 | Robust inverse material design with physical guarantees using the Voigt-Reuss Net | Sanath Keshav et.al. | 2511.11388 | null |
| 2025-11-14 | Shadow-Induced Warps in Protoplanetary disks | Shangjia Zhang et.al. | 2511.11358 | null |
| 2025-11-14 | RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image | Hengfei Wang et.al. | 2511.11289 | null |
| 2025-11-14 | 3D Gaussian and Diffusion-Based Gaze Redirection | Abiram Panchalingam et.al. | 2511.11231 | null |
| 2025-11-14 | RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting | Ruocheng Wu et.al. | 2511.11213 | null |
| 2025-11-14 | Dynamic Gaussian Scene Reconstruction from Unsynchronized Videos | Zhixin Xu et.al. | 2511.11175 | null |
| 2025-11-14 | PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRI | Sun Jo et.al. | 2511.11048 | null |
| 2025-11-14 | Draft and Refine with Visual Experts | Sungheon Jeong et.al. | 2511.11005 | null |
| 2025-11-13 | MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns | Jiarui Zhang et.al. | 2511.10390 | null |
| 2025-11-13 | Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision | Yu Deng et.al. | 2511.10316 | null |
| 2025-11-13 | HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction | Yueran Zhao et.al. | 2511.10211 | null |
| 2025-11-13 | Competing Localizations on Disordered Non-Hermitian Random Graph Lattice | S Rahul et.al. | 2511.10156 | null |
| 2025-11-13 | AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models | Xinyi Wang et.al. | 2511.10017 | link |
| 2025-11-13 | Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching | Uday Bhaskar et.al. | 2511.09955 | null |
| 2025-11-13 | TSPE-GS: Probabilistic Depth Extraction for Semi-Transparent Surface Reconstruction via 3D Gaussian Splatting | Zhiyuan Xu et.al. | 2511.09944 | null |
| 2025-11-13 | AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting | Aymen Mir et.al. | 2511.09827 | null |
| 2025-11-12 | Traversable wormhole with double trace deformations via gravitational shear and sound channels | Fitria Khairunnisa et.al. | 2511.09815 | null |
| 2025-11-12 | A Shared-Autonomy Construction Robotic System for Overhead Works | David Minkwan Kim et.al. | 2511.09695 | null |
| 2025-11-12 | BronchOpt: Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation | Hongchao Shu et.al. | 2511.09443 | null |
| 2025-11-12 | OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS | Haiyi Li et.al. | 2511.09397 | null |
| 2025-11-12 | Computational Caustic Design for Surface Light Source | Sizhuo Zhou et.al. | 2511.09361 | null |
| 2025-11-12 | WDT-MD: Wavelet Diffusion Transformers for Microaneurysm Detection in Fundus Images | Yifei Sun et.al. | 2511.08987 | null |
| 2025-11-11 | RePose-NeRF: Robust Radiance Fields for Mesh Reconstruction under Noisy Camera Poses | Sriram Srinivasan et.al. | 2511.08545 | null |
| 2025-11-11 | 3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation | Yunhong He et.al. | 2511.08536 | null |
| 2025-11-11 | SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering | Laura Bragagnolo et.al. | 2511.08294 | null |
| 2025-11-11 | Perceptual Quality Assessment of 3D Gaussian Splatting: A Subjective Dataset and Prediction Metric | Zhaolin Wan et.al. | 2511.08032 | null |
| 2025-11-11 | UltraGS: Gaussian Splatting for Ultrasound Novel View Synthesis | Yuezhe Yang et.al. | 2511.07743 | null |
| 2025-11-10 | Accelerated, Memory-Efficient Far-Field Scattering Computation with Monte Carlo SBR | Samuel Audia et.al. | 2511.07586 | null |
| 2025-11-10 | YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting | Botao Ye et.al. | 2511.07321 | null |
| 2025-11-10 | 4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D Generation | Mengmeng Liu et.al. | 2511.07241 | null |
| 2025-11-10 | Sparse4DGS: 4D Gaussian Splatting for Sparse-Frame Dynamic Scene Reconstruction | Changyue Shi et.al. | 2511.07122 | null |
| 2025-11-10 | GFix: Perceptually Enhanced Gaussian Splatting Video Compression | Siyue Teng et.al. | 2511.06953 | null |
| 2025-11-10 | MUGSQA: Novel Multi-Uncertainty-Based Gaussian Splatting Quality Assessment Method, Dataset, and Benchmarks | Tianang Chen et.al. | 2511.06830 | null |
| 2025-11-10 | ConeGS: Error-Guided Densification Using Pixel Cones for Improved Reconstruction with Fewer Primitives | Bartłomiej Baranowski et.al. | 2511.06810 | null |
| 2025-11-10 | Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes | Meijun Guo et.al. | 2511.06765 | null |
| 2025-11-10 | Rethinking Rainy 3D Scene Reconstruction via Perspective Transforming and Brightness Tuning | Qianfeng Yang et.al. | 2511.06734 | null |
| 2025-11-10 | DIAL-GS: Dynamic Instance Aware Reconstruction for Label-free Street Scenes with 4D Gaussian Splatting | Chenpeng Su et.al. | 2511.06632 | null |
| 2025-11-09 | Inpaint360GS: Efficient Object-Aware 3D Inpainting via Gaussian Splatting for 360° Scenes | Shaoxiang Wang et.al. | 2511.06457 | null |
| 2025-11-09 | Physics-Informed Deformable Gaussian Splatting: Towards Unified Constitutive Laws for Time-Evolving Material Field | Haoqin Hong et.al. | 2511.06299 | null |
| 2025-11-08 | StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video | Zhihui Ke et.al. | 2511.06046 | null |
| 2025-11-07 | 4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos | Mengqi Guo et.al. | 2511.05229 | null |
| 2025-11-07 | Splatography: Sparse multi-view dynamic Gaussian Splatting for filmmaking challenges | Adrian Azzarelli et.al. | 2511.05152 | null |
| 2025-11-07 | Efficient representation of 3D spatial data for defense-related applications | Benjamin Kahl et.al. | 2511.05109 | null |
| 2025-11-07 | CLM: Removing the GPU Memory Barrier for 3D Gaussian Splatting | Hexu Zhao et.al. | 2511.04951 | null |
| 2025-11-07 | Channel Knowledge Map Construction: Recent Advances and Open Challenges | Zixiang Ren et.al. | 2511.04944 | null |
| 2025-11-06 | 3D Gaussian Point Encoders | Jim James et.al. | 2511.04797 | null |
| 2025-11-10 | Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions | Kaifeng Zhang et.al. | 2511.04665 | link |
| 2025-11-06 | FastGS: Training 3D Gaussian Splatting in 100 Seconds | Shiwei Ren et.al. | 2511.04283 | null |
| 2025-11-06 | CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation | Yuwen Tao et.al. | 2511.03992 | null |
| 2025-11-05 | DentalSplat: Dental Occlusion Novel View Synthesis from Sparse Intra-Oral Photographs | Yiyi Miao et.al. | 2511.03099 | null |
| 2025-11-04 | PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing | Antonio Oroz et.al. | 2511.02777 | null |
| 2025-11-04 | Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping | Jiajia Li et.al. | 2511.02207 | null |
| 2025-11-01 | 4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting | Chun-Tin Wu et.al. | 2511.00560 | null |
| 2025-10-31 | SAGS: Self-Adaptive Alias-Free Gaussian Splatting for Dynamic Surgical Endoscopic Reconstruction | Wenfeng Huang et.al. | 2510.27318 | null |
| 2025-10-31 | WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond | Zhicong Sun et.al. | 2510.27133 | null |
| 2025-10-30 | DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting | Moonsoo Jeong et.al. | 2510.26921 | null |
| 2025-10-30 | HEIR: Learning Graph-Based Motion Hierarchies | Cheng Zheng et.al. | 2510.26786 | null |
| 2025-10-30 | The Impact and Outlook of 3D Gaussian Splatting | Bernhard Kerbl et.al. | 2510.26694 | null |
| 2025-10-30 | AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM | Mirko Usuelli et.al. | 2510.26358 | link |
| 2025-10-30 | 6D Channel Knowledge Map Construction via Bidirectional Wireless Gaussian Splatting | Juncong Zhou et.al. | 2510.26166 | null |
| 2025-10-30 | JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting | Yuxuan Li et.al. | 2510.26117 | null |
| 2025-11-02 | D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction | Kejing Xia et.al. | 2510.25173 | null |
| 2025-10-29 | AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians | Xiyu Zhang et.al. | 2510.25129 | null |
| 2025-10-28 | NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation | Mingyu Jeong et.al. | 2510.24335 | null |
| 2025-10-28 | LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation | Haotian Zhou et.al. | 2510.24118 | null |
| 2025-10-28 | A Survey on Collaborative SLAM with 3D Gaussian Splatting | Phuc Nguyen Xuan et.al. | 2510.23988 | null |
| 2025-10-27 | PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors | Xirui Jin et.al. | 2510.23930 | link |
| 2025-10-27 | Explicit Memory through Online 3D Gaussian Splatting Improves Class-Agnostic Video Segmentation | Anthony Opipari et.al. | 2510.23521 | null |
| 2025-10-27 | VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting | Hoonhee Cho et.al. | 2510.23205 | null |
| 2025-10-27 | EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction | Taoyu Wu et.al. | 2510.23087 | null |
| 2025-10-27 | Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method | Bohan Li et.al. | 2510.22973 | null |
| 2025-10-27 | Gen-LangSplat: Generalized Language Gaussian Splatting with Pre-Trained Feature Compression | Pranav Saxena et.al. | 2510.22930 | null |
| 2025-10-26 | Region-Adaptive Learned Hierarchical Encoding for 3D Gaussian Splatting Data | Shashank N. Sridhara et.al. | 2510.22812 | null |
| 2025-10-26 | Edge Collaborative Gaussian Splatting with Integrated Rendering and Communication | Yujie Wan et.al. | 2510.22718 | null |
| 2025-10-28 | LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering | Wenkai Zhu et.al. | 2510.22669 | null |
| 2025-10-26 | RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience | Huilin Yin et.al. | 2510.22600 | null |
| 2025-10-26 | DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss | Jing Yang et.al. | 2510.22473 | null |
| 2025-10-25 | GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation | Phillip Mueller et.al. | 2510.22337 | null |
| 2025-10-25 | DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum | Yaokun Li et.al. | 2510.22213 | link |
| 2025-10-24 | Towards Physically Executable 3D Gaussian for Embodied Navigation | Bingchen Miao et.al. | 2510.21307 | null |
| 2025-10-23 | GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation | Guangqi Jiang et.al. | 2510.20813 | null |
| 2025-10-23 | Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking | Zixuan Wu et.al. | 2510.20335 | null |
| 2025-10-23 | COS3D: Collaborative Open-Vocabulary 3D Segmentation | Runsong Zhu et.al. | 2510.20238 | null |
| 2025-10-22 | Extreme Views: 3DGS Filter for Novel View Synthesis from Out-of-Distribution Camera Poses | Damian Bowness et.al. | 2510.20027 | null |
| 2025-10-21 | Re-Activating Frozen Primitives for 3D Gaussian Splatting | Yuxin Cheng et.al. | 2510.19653 | null |
| 2025-10-22 | VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction | Junhong Lin et.al. | 2510.19578 | null |
| 2025-10-22 | Advances in 4D Representation: Geometry, Motion, and Interaction | Mingrui Zhao et.al. | 2510.19255 | null |
| 2025-10-22 | MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting | In-Hwan Jin et.al. | 2510.19210 | null |
| 2025-10-22 | GRASPLAT: Enabling dexterous grasping through novel view synthesis | Matteo Bortolon et.al. | 2510.19200 | null |
| 2025-10-21 | Moving Light Adaptive Colonoscopy Reconstruction via Illumination-Attenuation-Aware 3D Gaussian Splatting | Hao Wang et.al. | 2510.18739 | null |
| 2025-10-21 | Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos | Jinfeng Liu et.al. | 2510.18489 | null |
| 2025-10-21 | OpenInsGaussian: Open-vocabulary Instance Gaussian Segmentation with Context-aware Cross-view Fusion | Tianyu Huang et.al. | 2510.18253 | null |
| 2025-10-20 | From Volume Rendering to 3D Gaussian Splatting: Theory and Applications | Vitor Pereira Matias et.al. | 2510.18101 | null |
| 2025-10-20 | HouseTour: A Virtual Real Estate A(I)gent | Ata Çelen et.al. | 2510.18054 | null |
| 2025-10-20 | Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats | Simeon Adebola et.al. | 2510.17783 | null |
| 2025-10-20 | Raindrop GS: A Benchmark for 3D Gaussian Splatting under Raindrop Conditions | Zhiqiang Teng et.al. | 2510.17719 | null |
| 2025-10-20 | Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS | Feng Zhou et.al. | 2510.17479 | link |
| 2025-10-20 | GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation | Ruitong Gan et.al. | 2510.17095 | null |
| 2025-10-19 | 2DGS-R: Revisiting the Normal Consistency Regularization in 2D Gaussian Splatting | Haofan Ren et.al. | 2510.16837 | null |
| 2025-10-19 | GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation | Junbo Li et.al. | 2510.16777 | null |
| 2025-10-18 | HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars | Haocheng Tang et.al. | 2510.16463 | null |
| 2025-10-18 | REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting | Changyue Shi et.al. | 2510.16410 | null |
| 2025-10-17 | Proactive Scene Decomposition and Reconstruction | Baicheng Li et.al. | 2510.16272 | null |
| 2025-10-17 | PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction | Ting-Yu Yen et.al. | 2510.15386 | null |
| 2025-10-17 | GaussGym: An open-source real-to-sim framework for learning locomotion from pixels | Alejandro Escontrela et.al. | 2510.15352 | null |
| 2025-10-16 | SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images | Jiaxin Guo et.al. | 2510.15072 | null |
| 2025-10-16 | Leveraging Learned Image Prior for 3D Gaussian Compression | Seungjoo Shin et.al. | 2510.14705 | null |
| 2025-10-16 | BalanceGS: Algorithm-System Co-design for Efficient 3D Gaussian Splatting Training on GPU | Junyi Wu et.al. | 2510.14564 | null |
| 2025-10-16 | GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering | Alexander Valverde et.al. | 2510.14270 | null |
| 2025-10-16 | Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures | Yuancheng Xu et.al. | 2510.14179 | link |
| 2025-10-17 | Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images | Emanuel Garbin et.al. | 2510.14081 | null |
| 2025-10-15 | Instant Skinned Gaussian Avatars for Web, Mobile and VR Applications | Naruya Kondo et.al. | 2510.13978 | null |
| 2025-10-15 | VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator | Hyojun Go et.al. | 2510.13454 | null |
| 2025-10-15 | Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering | Siddharth Tourani et.al. | 2510.13381 | null |
| 2025-10-15 | STT-GS: Sample-Then-Transmit Edge Gaussian Splatting with Joint Client Selection and Power Control | Zhen Li et.al. | 2510.13186 | null |
| 2025-10-14 | Uncertainty Matters in Dynamic Gaussian Splatting for Monocular 4D Reconstruction | Fengzhi Guo et.al. | 2510.12768 | null |
| 2025-10-17 | BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring | An Zhao et.al. | 2510.12493 | null |
| 2025-10-14 | Hybrid Gaussian Splatting for Novel Urban View Synthesis | Mohamed Omran et.al. | 2510.12308 | null |
| 2025-10-14 | PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes | Ying A et.al. | 2510.12282 | null |
| 2025-10-14 | UniGS: Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering | Yusen Xie et.al. | 2510.12174 | null |
| 2025-10-14 | G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior | Junfeng Ni et.al. | 2510.12099 | null |
| 2025-10-13 | GS-Verse: Mesh-based Gaussian Splatting for Physics-aware Interaction in Virtual Reality | Anastasiya Pechko et.al. | 2510.11878 | null |
| 2025-10-13 | Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams | Takuya Nakabayashi et.al. | 2510.11717 | null |
| 2025-10-13 | Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation | Maggie Wang et.al. | 2510.11689 | null |
| 2025-10-13 | VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment | Qing Li et.al. | 2510.11473 | null |
| 2025-10-13 | MaterialRefGS: Reflective Gaussian Splatting with Multi-view Consistent Material Inference | Wenyuan Zhang et.al. | 2510.11387 | null |
| 2025-10-12 | Dynamic Gaussian Splatting from Defocused and Motion-blurred Monocular Videos | Xuankai Zhang et.al. | 2510.10691 | null |
| 2025-10-12 | High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting | Haoyu Zhao et.al. | 2510.10637 | null |
| 2025-10-12 | Towards Efficient 3D Gaussian Human Avatar Compression: A Prior-Guided Framework | Shanzhi Yin et.al. | 2510.10492 | null |
| 2025-10-11 | Opacity-Gradient Driven Density Control for Compact and Efficient Few-Shot 3D Gaussian Splatting | Abdelrhman Elrawy et.al. | 2510.10257 | null |
| 2025-10-11 | Color3D: Controllable and Consistent 3D Colorization with Personalized Colorizer | Yecong Wan et.al. | 2510.10152 | null |
| 2025-10-11 | Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting | Jiahui Lu et.al. | 2510.10097 | null |
| 2025-10-11 | P-4DGS: Predictive 4D Gaussian Splatting with 90$\times$ Compression | Henan Wang et.al. | 2510.10030 | null |
| 2025-10-11 | CLoD-GS: Continuous Level-of-Detail via 3D Gaussian Splatting | Zhigang Cheng et.al. | 2510.09997 | null |
| 2025-10-11 | VG-Mapping: Variation-Aware 3D Gaussians for Online Semi-static Scene Mapping | Yicheng He et.al. | 2510.09962 | null |
| 2025-10-10 | LTGS: Long-Term Gaussian Scene Chronology From Sparse View Updates | Minkwan Kim et.al. | 2510.09881 | null |
| 2025-10-10 | Vision Language Models: A Survey of 26K Papers | Fengming Lin et.al. | 2510.09586 | null |
| 2025-10-10 | FLOWING: Implicit Neural Flows for Structure-Preserving Morphing | Arthur Bizzi et.al. | 2510.09537 | link |
| 2025-10-10 | Two-Stage Gaussian Splatting Optimization for Outdoor Scene Reconstruction | Deborah Pintani et.al. | 2510.09489 | null |
| 2025-10-10 | Visibility-Aware Densification for 3D Gaussian Splatting in Dynamic Urban Scenes | Yikang Zhang et.al. | 2510.09364 | null |
| 2025-10-09 | ReSplat: Learning Recurrent Gaussian Splats | Haofei Xu et.al. | 2510.08575 | null |
| 2025-10-09 | D$^2$GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction | Meixi Song et.al. | 2510.08566 | null |
| 2025-10-09 | Splat the Net: Radiance Fields with Splattable Neural Primitives | Xilong Zhou et.al. | 2510.08491 | null |
| 2025-10-09 | Efficient Label Refinement for Face Parsing Under Extreme Poses Using 3D Gaussian Splatting | Ankit Gahlawat et.al. | 2510.08096 | null |
| 2025-10-09 | CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving | Tianrui Zhang et.al. | 2510.07944 | null |
| 2025-10-09 | PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting | Houqiang Zhong et.al. | 2510.07830 | null |
| 2025-10-09 | DEGS: Deformable Event-based 3D Gaussian Splatting from RGB and Event Stream | Junhao He et.al. | 2510.07752 | null |
| 2025-10-09 | ComGS: Efficient 3D Object-Scene Composition via Surface Octahedral Probes | Jian Gao et.al. | 2510.07729 | null |
| 2025-10-08 | Generating Surface for Text-to-3D using 2D Gaussian Splatting | Huanning Dong et.al. | 2510.06967 | null |
| 2025-10-08 | Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity | Islomjon Shukhratov et.al. | 2510.06802 | null |
| 2025-10-08 | SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis | Jipeng Lyu et.al. | 2510.06694 | null |
| 2025-10-09 | RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction | Leshu Li et.al. | 2510.06644 | null |
| 2025-10-07 | Active Next-Best-View Optimization for Risk-Averse Path Planning | Amirhossein Mollaei Khass et.al. | 2510.06481 | null |
| 2025-10-07 | ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars | Peizhi Yan et.al. | 2510.05488 | null |
| 2025-10-06 | Provable Affine Identifiability of Nonlinear CCA under Latent Distributional Priors | Zhiwei Han et.al. | 2510.04758 | null |
| 2025-10-04 | Enhancing Foveated Rendering with Weighted Reservoir Sampling | Ville Cantory et.al. | 2510.03964 | null |
| 2025-10-04 | Optimized Minimal 4D Gaussian Splatting | Minseo Lee et.al. | 2510.03857 | null |
| 2025-10-03 | SketchPlan: Diffusion Based Drone Planning From Human Sketches | Sixten Norelius et.al. | 2510.03545 | null |
| 2025-09-30 | Universal Beta Splatting | Rong Liu et.al. | 2510.03312 | null |
| 2025-10-03 | Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields | Zhiting Mei et.al. | 2510.03104 | link |
| 2025-10-03 | GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting | Xinran Zhang et.al. | 2510.02884 | null |
| 2025-10-03 | From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting | Jianing Chen et.al. | 2510.02732 | null |
| 2025-10-03 | FSFSplatter: Build Surface and Novel Views with Sparse-Views within 3min | Yibin Zhao et.al. | 2510.02691 | null |
| 2025-10-02 | SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting | Sung-Yeon Park et.al. | 2510.02469 | null |
| 2025-10-02 | StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions | Bo-Hsu Ke et.al. | 2510.02314 | null |
| 2025-10-02 | Performance-Guided Refinement for Visual Aerial Navigation using Editable Gaussian Splatting in FalconGym 2.0 | Yan Miao et.al. | 2510.02248 | null |
| 2025-10-02 | Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects | Georgios Kouros et.al. | 2510.02069 | null |
| 2025-10-02 | GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing | Mengtian Li et.al. | 2510.02034 | null |
| 2025-10-02 | 4DGS-Craft: Consistent and Interactive 4D Gaussian Splatting Editing | Lei Liu et.al. | 2510.01991 | null |
| 2025-10-02 | ROI-GS: Interest-based Local Quality 3D Gaussian Splatting | Quoc-Anh Bui et.al. | 2510.01978 | null |
| 2025-10-02 | GreenhouseSplat: A Dataset of Photorealistic Greenhouse Simulations for Mobile Robotics | Diram Tabaa et.al. | 2510.01848 | null |
| 2025-10-02 | LOBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction | Sheng-Hsiang Hung et.al. | 2510.01767 | null |
| 2025-10-02 | MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics | Changmin Lee et.al. | 2510.01619 | link |
| 2025-10-01 | Instant4D: 4D Gaussian Splatting in Minutes | Zhanpeng Luo et.al. | 2510.01119 | link |
| 2025-09-30 | HART: Human Aligned Reconstruction Transformer | Xiyi Chen et.al. | 2509.26621 | null |
| 2025-09-30 | Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting | Hanzhou Liu et.al. | 2509.26455 | link |
| 2025-09-30 | GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts | Zhenyu Shu et.al. | 2509.26055 | null |
| 2025-09-30 | PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion | Zhiwei Zhang et.al. | 2509.26008 | null |
| 2025-09-30 | LLM-Powered Code Analysis and Optimization for Gaussian Splatting Kernels | Yi Hu et.al. | 2509.25626 | null |
| 2025-09-29 | GaussianLens: Localized High-Resolution Reconstruction via On-Demand Gaussian Densification | Yijia Weng et.al. | 2509.25603 | null |
| 2025-09-29 | Triangle Splatting+: Differentiable Rendering with Opaque Triangles | Jan Held et.al. | 2509.25122 | null |
| 2025-10-02 | GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction | Huaizhi Qu et.al. | 2509.25075 | link |
| 2025-09-29 | LVT: Large-Scale Scene Reconstruction via Local View Transformers | Tooba Imtiaz et.al. | 2509.25001 | link |
| 2025-09-29 | DWGS: Enhancing Sparse-View Gaussian Splatting with Hybrid-Loss Depth Estimation and Bidirectional Warping | Yu Ma et.al. | 2509.24893 | null |
| 2025-09-29 | ExGS: Extreme 3D Gaussian Compression with Diffusion Priors | Jiaqi Chen et.al. | 2509.24758 | null |
| 2025-10-01 | Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh | Yuanyuan Gao et.al. | 2509.24421 | null |
| 2025-09-29 | OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction | Yuhang Cao et.al. | 2509.24308 | null |
| 2025-09-28 | CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting | Dragoş-Andrei Chileban et.al. | 2509.23947 | null |
| 2025-09-28 | From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations | Javed Ahmad et.al. | 2509.23555 | null |
| 2025-09-27 | Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos | Junyi Wu et.al. | 2509.23492 | null |
| 2025-09-27 | OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting | Atakan Topaloglu et.al. | 2509.23258 | null |
| 2025-09-26 | Learning Unified Representation of 3D Gaussian Splatting | Yuelin Xin et.al. | 2509.22917 | null |
| 2025-09-26 | Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting | Yasmine Omri et.al. | 2509.22615 | null |
| 2025-09-26 | GS-2M: Gaussian Splatting for Joint Mesh Reconstruction and Material Decomposition | Dinh Minh Nguyen et.al. | 2509.22276 | null |
| 2025-09-26 | Polysemous Language Gaussian Splatting via Matching-based Mask Lifting | Jiayu Ding et.al. | 2509.22225 | null |
| 2025-09-26 | Large Material Gaussian Model for Relightable 3D Generation | Jingrui Ye et.al. | 2509.22112 | null |
| 2025-09-26 | Drag4D: Align Your Motion with Text-Driven 3D Scene Generation | Minjun Kang et.al. | 2509.21888 | null |
| 2025-09-30 | Dynamic Novel View Synthesis in High Dynamic Range | Kaixuan Zhang et.al. | 2509.21853 | null |
| 2025-09-25 | PowerGS: Display-Rendering Power Co-Optimization for Neural Rendering in Power-Constrained XR Systems | Weikai Lin et.al. | 2509.21702 | null |
| 2025-09-25 | Gaussian splatting holography | Shuhe Zhang et.al. | 2509.20774 | null |
| 2025-09-25 | FreeInsert: Personalized Object Insertion with Geometric and Style Control | Yuhong Zhang et.al. | 2509.20756 | null |
| 2025-09-23 | SeHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing | Yiyu Li et.al. | 2509.20400 | null |
| 2025-09-24 | 4D Driving Scene Generation With Stereo Forcing | Hao Lu et.al. | 2509.20251 | null |
| 2025-09-24 | GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes | Guo Chen et.al. | 2509.19937 | null |
| 2025-09-24 | Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering | Jiangxue Yu et.al. | 2509.19898 | null |
| 2025-09-24 | BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting | Yixun Zhang et.al. | 2509.19793 | null |
| 2025-09-24 | PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction | Yufei Han et.al. | 2509.19726 | null |
| 2025-09-23 | VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction | Weijie Wang et.al. | 2509.19297 | null |
| 2025-09-23 | Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation | Sherwin Bahmani et.al. | 2509.19296 | null |
| 2025-09-23 | WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction | Hung Nguyen et.al. | 2509.19073 | null |
| 2025-09-23 | Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting | Zijing Guo et.al. | 2509.18956 | null |
| 2025-09-23 | DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring | Pengteng Li et.al. | 2509.18898 | null |
| 2025-09-23 | FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation | Zhaorui Wang et.al. | 2509.18759 | null |
| 2025-09-23 | SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones | Maximilian Adang et.al. | 2509.18610 | null |
| 2025-09-23 | Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction | Xiaoting Yin et.al. | 2509.18566 | null |
| 2025-09-23 | BridgeSplat: Bidirectionally Coupled CT and Non-Rigid Gaussian Splatting for Deformable Intraoperative Surgical Navigation | Maximilian Fehrentz et.al. | 2509.18501 | null |
| 2025-09-23 | Differentiable Light Transport with Gaussian Surfels via Adapted Radiosity for Efficient Relighting and Geometry Reconstruction | Kaiwen Jiang et.al. | 2509.18497 | null |
| 2025-09-22 | GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction | Jiahe Li et.al. | 2509.18090 | null |
| 2025-09-22 | GaussianPSL: A novel framework based on Gaussian Splatting for exploring the Pareto frontier in multi-criteria optimization | Phuong Mai Dinh et.al. | 2509.17889 | null |
| 2025-09-22 | ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos | Shi Chen et.al. | 2509.17864 | null |
| 2025-09-22 | From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes | Guoxi Huang et.al. | 2509.17789 | null |
| 2025-09-22 | Neural-MMGS: Multi-modal Neural Gaussian Splats for Large-Scale Scene Reconstruction | Sitian Shen et.al. | 2509.17762 | null |
| 2025-09-23 | EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device | Gunjan Chhablani et.al. | 2509.17430 | link |
| 2025-09-22 | FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR | Junzhe Wu et.al. | 2509.17390 | null |
| 2025-09-22 | SmokeSeer: 3D Gaussian Splatting for Smoke Removal and Scene Reconstruction | Neham Jain et.al. | 2509.17329 | null |
| 2025-09-21 | SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2509.17246 | null |
| 2025-09-23 | HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis | Zipeng Wang et.al. | 2509.17083 | null |
| 2025-09-21 | Efficient 3D Scene Reconstruction and Simulation from Sparse Endoscopic Views | Zhenya Yang et.al. | 2509.17027 | null |
| 2025-09-21 | PGSTalker: Real-Time Audio-Driven Talking Head Generation via 3D Gaussian Splatting with Pixel-Aware Density Control | Tianheng Zhu et.al. | 2509.16922 | null |
| 2025-09-21 | ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM | Amanuel T. Dufera et.al. | 2509.16863 | null |
| 2025-09-20 | MedGS: Gaussian Splatting for Multi-Modal 3D Medical Imaging | Kacper Marzol et.al. | 2509.16806 | null |
| 2025-09-20 | ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting | Xiaoyang Yan et.al. | 2509.16552 | null |
| 2025-09-19 | RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars | Weiyi Xiong et.al. | 2509.16119 | null |
| 2025-09-19 | Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval | Liwei Liao et.al. | 2509.15871 | null |
| 2025-09-19 | Camera Splatting for Continuous View Optimization | Gahye Lee et.al. | 2509.15677 | null |
| 2025-09-19 | FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting | Yuwei Jia et.al. | 2509.15648 | null |
| 2025-09-19 | GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading | Donghyun Lee et.al. | 2509.15645 | null |
| 2025-09-19 | MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild | Deming Li et.al. | 2509.15548 | null |
| 2025-09-18 | Causal Reasoning Elicits Controllable 3D Scene Generation | Shen Chen et.al. | 2509.15249 | null |
| 2025-09-18 | FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction | Jinlong Fan et.al. | 2509.14739 | null |
| 2025-09-18 | RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI | Cong Tai et.al. | 2509.14687 | null |
| 2025-09-17 | Perception-Integrated Safety Critical Control via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting | Dario Tscholl et.al. | 2509.14421 | null |
| 2025-09-17 | MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping | Zhihao Cao et.al. | 2509.14191 | null |
| 2025-09-17 | Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction | Yifan Mo et.al. | 2509.13938 | null |
| 2025-09-17 | LamiGauss: Pitching Radiative Gaussian for Sparse-View X-ray Laminography Reconstruction | Chu Chen et.al. | 2509.13863 | null |
| 2025-09-16 | MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM | Yinlong Bai et.al. | 2509.13536 | null |
| 2025-09-16 | Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization | Hao Xu et.al. | 2509.13482 | link |
| 2025-09-16 | Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image | Gaofeng Liu et.al. | 2509.13013 | null |
| 2025-09-16 | Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings | Abdalla Arafa et.al. | 2509.12938 | null |
| 2025-09-16 | Effective Gaussian Management for High-fidelity Object Reconstruction | Jiateng Liu et.al. | 2509.12742 | null |
| 2025-09-15 | Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization | Mengjiao Han et.al. | 2509.12138 | null |
| 2025-09-15 | Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting | Yi-Hsin Li et.al. | 2509.11853 | null |
| 2025-09-15 | A Controllable 3D Deepfake Generation Framework with Gaussian Splatting | Wending Liu et.al. | 2509.11624 | null |
| 2025-09-14 | On the Skinning of Gaussian Avatars | Nikolaos Zioulis et.al. | 2509.11411 | null |
| 2025-09-14 | ROSGS: Relightable Outdoor Scenes With Gaussian Splatting | Lianjun Liao et.al. | 2509.11275 | null |
| 2025-09-14 | SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting | Ashkan Taghipour et.al. | 2509.11116 | null |
| 2025-09-13 | AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting | Gurutva Patle et.al. | 2509.11003 | null |
| 2025-09-13 | Every Camera Effect, Every Time, All at Once: 4D Gaussian Ray Tracing for Physics-based Camera Effect Data Generation | Yi-Ruei Liu et.al. | 2509.10759 | null |
| 2025-09-12 | T2Bs: Text-to-Character Blendshapes via Video Generation | Jiahao Luo et.al. | 2509.10678 | null |
| 2025-09-15 | On the Geometric Accuracy of Implicit and Primitive-based Representations Derived from View Rendering Constraints | Elias De Smijter et.al. | 2509.10241 | null |
| 2025-09-09 | SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting | Mahtab Dahaghin et.al. | 2509.07809 | null |
| 2025-09-09 | HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting | Yimin Pan et.al. | 2509.07774 | null |
| 2025-09-09 | DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning | Wenzhi Guo et.al. | 2509.07493 | null |
| 2025-09-09 | DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation | Ze-Xin Yin et.al. | 2509.07435 | null |
| 2025-09-07 | MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning | Jiarui Chen et.al. | 2509.07021 | null |
| 2025-09-10 | VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes | Shengkai Zhang et.al. | 2509.06685 | null |
| 2025-09-15 | Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation | Ian Page et.al. | 2509.06433 | null |
| 2025-09-08 | 3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom | Matthieu Gendrin et.al. | 2509.06400 | null |
| 2025-09-05 | Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting | Sen Wang et.al. | 2509.05515 | null |
| 2025-09-05 | Toward Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization | Mengjiao Han et.al. | 2509.05216 | null |
| 2025-09-05 | Symbolic Graphics Programming with Large Language Models | Yamei Chen et.al. | 2509.05208 | null |
| 2025-09-05 | GeoSplat: A Deep Dive into Geometry-Constrained Gaussian Splatting | Yangming Li et.al. | 2509.05075 | null |
| 2025-09-05 | CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus | Hannah Schieber et.al. | 2509.04859 | null |
| 2025-09-04 | SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer | Jimin Xu et.al. | 2509.04379 | null |
| 2025-09-03 | ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction | Sankeerth Durvasula et.al. | 2509.03775 | null |
| 2025-09-02 | Efficient Geometry Compression and Communication for 3D Gaussian Splatting Point Clouds | Liang Xie et.al. | 2509.02232 | null |
| 2025-09-02 | GRMM: Real-Time High-Fidelity Gaussian Morphable Head Model with Learned Residuals | Mohit Mendiratta et.al. | 2509.02141 | null |
| 2025-09-02 | 2D Gaussian Splatting with Semantic Alignment for Image Inpainting | Hongyu Li et.al. | 2509.01964 | null |
| 2025-09-01 | GaussianGAN: Real-Time Photorealistic controllable Human Avatars | Mohamed Ilyes Lakhal et.al. | 2509.01681 | null |
| 2025-09-01 | FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field | Fan Zhu et.al. | 2509.01547 | null |
| 2025-09-01 | Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars | Vanessa Sklyarova et.al. | 2509.01469 | link |
| 2025-08-31 | Towards Integrating Multi-Spectral Imaging with Gaussian Splatting | Josef Grün et.al. | 2509.00989 | null |
| 2025-09-03 | GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency | Joongho Jo et.al. | 2509.00911 | null |
| 2025-09-03 | UPGS: Unified Pose-aware Gaussian Splatting for Dynamic Scene Deblurring | Zhijing Wu et.al. | 2509.00831 | null |
| 2025-08-31 | SWAGSplatting: Semantic-guided Water-scene Augmented Gaussian Splatting | Zhuodong Jiang et.al. | 2509.00800 | null |
| 2025-08-31 | MarkSplatter: Generalizable Watermarking for 3D Gaussian Splatting Model via Splatter Image Structure | Xiufeng Huang et.al. | 2509.00757 | null |
| 2025-08-31 | DyPho-SLAM: Real-time Photorealistic SLAM in Dynamic Environments | Yi Liu et.al. | 2509.00741 | null |
| 2025-08-30 | AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection | Houshu He et.al. | 2509.00433 | null |
| 2025-08-29 | Complete Gaussian Splats from a Single Image with Denoising Diffusion Models | Ziwei Liao et.al. | 2508.21542 | null |
| 2025-08-12 | EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events | Siyu Chen et.al. | 2508.07003 | null |
| 2025-10-29 | GS4: Generalizable Sparse Splatting Semantic SLAM | Mingqi Jiang et.al. | 2506.06517 | null |
| 2025-06-04 | LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM | Roman Titkov et.al. | 2506.03073 | null |
| 2025-05-16 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
| 2025-11-13 | SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM | Samuel Cerezo et.al. | 2504.13713 | null |
| 2025-03-18 | DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes | Runfa Blark Li et.al. | 2503.11979 | null |
| 2025-02-24 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
| 2025-01-15 | VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes | Ke Wu et.al. | 2501.08286 | null |
| 2025-07-15 | SEGS-SLAM: Structure-enhanced 3D Gaussian Splatting SLAM with Appearance Embedding | Tianci Wen et.al. | 2501.05242 | null |
| 2024-12-05 | RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting | Zhenzhong Cao et.al. | 2412.01217 | null |
| 2024-11-20 | LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2411.12185 | null |
| 2025-08-11 | MBA-SLAM: Motion Blur Aware Gaussian Splatting SLAM | Peng Wang et.al. | 2411.08279 | null |
| 2025-03-11 | CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Dapeng Feng et.al. | 2410.00486 | null |
| 2024-10-01 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
| 2025-03-11 | Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2409.12518 | null |
| 2025-05-05 | UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM | Mostafa Mansour et.al. | 2409.00362 | null |
| 2024-04-02 | MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements | Lisong C. Sun et.al. | 2404.00923 | null |
| 2024-03-26 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
| 2024-04-09 | GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting | Chi Yan et.al. | 2311.11700 | null |
(<a href=#updated-on-20260404>back to top</a>)
| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-04-02 | Deep Neural Network Based Roadwork Detection for Autonomous Driving | Sebastian Wullrich et.al. | 2604.02282 | null |
| 2026-04-02 | LEO: Graph Attention Network based Hybrid Multi Sensor Extended Object Fusion and Tracking for Autonomous Driving Applications | Mayank Mayank et.al. | 2604.02206 | null |
| 2026-04-02 | UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving | Yongkang Li et.al. | 2604.02190 | null |
| 2026-04-02 | Causal Scene Narration with Runtime Safety Supervision for Vision-Language-Action Driving | Yun Li et.al. | 2604.01723 | null |
| 2026-04-02 | Hi-LOAM: Hierarchical Implicit Neural Fields for LiDAR Odometry and Mapping | Zhiliu Yang et.al. | 2604.01720 | null |
| 2026-04-02 | Riemannian and Symplectic Geometry for Hierarchical Text-Driven Place Recognition | Tianyi Shang et.al. | 2604.01598 | null |
| 2026-04-01 | SECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous Driving | Wenjing Wang et.al. | 2604.01337 | null |
| 2026-04-01 | Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models | Xiaosong Jia et.al. | 2604.01259 | null |
| 2026-04-01 | Simulating Realistic LiDAR Data Under Adverse Weather for Autonomous Vehicles: A Physics-Informed Learning Approach | Vivek Anand et.al. | 2604.01254 | null |
| 2026-04-01 | VRUD: A Drone Dataset for Complex Vehicle-VRU Interactions within Mixed Traffic | Ziyu Wang et.al. | 2604.01134 | null |
| 2026-04-01 | ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction | Yuheng Zhang et.al. | 2604.01081 | null |
| 2026-04-01 | DLWM: Dual Latent World Models enable Holistic Gaussian-centric Pre-training in Autonomous Driving | Yiyao Zhu et.al. | 2604.00969 | null |
| 2026-04-01 | DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale | Sicheng Zuo et.al. | 2604.00813 | null |
| 2026-04-01 | Towards Viewpoint-Robust End-to-End Autonomous Driving with 3D Foundation Model Priors | Hiroki Hashimoto et.al. | 2604.00597 | null |
| 2026-04-01 | SAR/ISAR Imaging in 6G Network | Yanmo Hu et.al. | 2604.00583 | null |
| 2026-04-01 | COTTA: Context-Aware Transfer Adaptation for Trajectory Prediction in Autonomous Driving | Seohyoung Park et.al. | 2604.00402 | null |
| 2026-04-01 | Neural Reconstruction of LiDAR Point Clouds under Jamming Attacks via Full-Waveform Representation and Simultaneous Laser Sensing | Ryo Yoshida et.al. | 2604.00371 | null |
| 2026-03-31 | Better than Average: Spatially-Aware Aggregation of Segmentation Uncertainty Improves Downstream Performance | Vanessa Emanuela Guarino et.al. | 2603.29941 | null |
| 2026-03-31 | C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving | Zhihong Cui et.al. | 2603.29908 | null |
| 2026-03-31 | SparseDriveV2: Scoring is All You Need for End-to-End Autonomous Driving | Wenchao Sun et.al. | 2603.29163 | null |
| 2026-03-30 | AutoWorld: Scaling Multi-Agent Traffic Simulation with Self-Supervised World Models | Mozhgan Pourkeshavatz et.al. | 2603.28963 | null |
| 2026-03-30 | A Semantic Observer Layer for Autonomous Vehicles: Pre-Deployment Feasibility Study of VLMs for Low-Latency Anomaly Detection | Kunal Runwal et.al. | 2603.28888 | null |
| 2026-03-30 | OccSim: Multi-kilometer Simulation with Long-horizon Occupancy World Models | Tianran Liu et.al. | 2603.28887 | null |
| 2026-03-30 | FL-PBM: Pre-Training Backdoor Mitigation for Federated Learning | Osama Wehbi et.al. | 2603.28673 | null |
| 2026-03-31 | RAD-LAD: Rule and Language Grounded Autonomous Driving in Real-Time | Anurag Ghosh et.al. | 2603.28522 | null |
| 2026-03-30 | Video Generation Models as World Models: Efficient Paradigms, Architectures and Algorithms | Muyang He et.al. | 2603.28489 | null |
| 2026-03-30 | TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation | Minh-Khoi Do et.al. | 2603.28233 | null |
| 2026-03-30 | Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal | Kazuma Ikeda et.al. | 2603.28224 | null |
| 2026-03-30 | $AutoDrive\text{-}P^3$: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning | Yuqi Ye et.al. | 2603.28116 | null |
| 2026-03-30 | To View Transform or Not to View Transform: NeRF-based Pre-training Perspective | Hyeonjun Jeong et.al. | 2603.28090 | null |
| 2026-03-30 | Effort-Based Criticality Metrics for Evaluating 3D Perception Errors in Autonomous Driving | Sharang Kaul et.al. | 2603.28029 | null |
| 2026-03-30 | Energy-Aware Imitation Learning for Steering Prediction Using Events and Frames | Hu Cao et.al. | 2603.28008 | null |
| 2026-03-30 | UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection | Hongjing Wu et.al. | 2603.27995 | null |
| 2026-03-29 | Benchmarking Multi-View BEV Object Detection with Mixed Pinhole and Fisheye Cameras | Xiangzhong Liu et.al. | 2603.27818 | null |
| 2026-03-29 | TianJi: An autonomous AI meteorologist for discovering physical mechanisms in atmospheric science | Kaikai Zhang et.al. | 2603.27738 | null |
| 2026-03-29 | Annotation-Free Detection of Drivable Areas and Curbs Leveraging LiDAR Point Cloud Maps | Fulong Ma et.al. | 2603.27553 | null |
| 2026-03-28 | HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors | Ke Li et.al. | 2603.27371 | null |
| 2026-03-28 | Guided Lensless Polarization Imaging | Noa Kraicer et.al. | 2603.27357 | null |
| 2026-03-28 | Class-Distribution Guided Active Learning for 3D Occupancy Prediction in Autonomous Driving | Wonjune Kim et.al. | 2603.27294 | null |
| 2026-03-28 | Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving | Qiqi Liu et.al. | 2603.27287 | null |
| 2026-03-28 | Robust Global-Local Behavior Arbitration via Continuous Command Fusion Under LiDAR Errors | Mohamed Elgouhary et.al. | 2603.27273 | null |
| 2026-03-28 | An Instance-Centric Panoptic Occupancy Prediction Benchmark for Autonomous Driving | Yi Feng et.al. | 2603.27238 | null |
| 2026-03-28 | RailVQA: A Benchmark and Framework for Efficient Interpretable Visual Cognition in Automatic Train Operation | Sen Zhang et.al. | 2603.27112 | null |
| 2026-03-26 | Vega: Learning to Drive with Natural Language Instructions | Sicheng Zuo et.al. | 2603.25741 | null |
| 2026-03-26 | Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving | Zehao Wang et.al. | 2603.25740 | null |
| 2026-03-26 | Can Users Specify Driving Speed? Bench2Drive-Speed: Benchmark and Baselines for Desired-Speed Conditioned Autonomous Driving | Yuqian Shao et.al. | 2603.25672 | null |
| 2026-03-26 | Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case | Koldo Basterretxea et.al. | 2603.25510 | null |
| 2026-03-26 | RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models | Yufeng Yang et.al. | 2603.25502 | null |
| 2026-03-26 | Temporally Decoupled Diffusion Planning for Autonomous Driving | Xiang Li et.al. | 2603.25462 | null |
| 2026-03-26 | Denoise and Align: Towards Source-Free UDA for Robust Panoramic Semantic Segmentation | Yaowen Chang et.al. | 2603.25131 | null |
| 2026-03-26 | Learning Rollout from Sampling: An R1-Style Tokenized Traffic Simulation Model | Ziyan Wang et.al. | 2603.24989 | null |
| 2026-03-26 | TIGFlow-GRPO: Trajectory Forecasting via Interaction-Aware Flow Matching and Reward-Driven Optimization | Xuepeng Jing et.al. | 2603.24936 | null |
| 2026-03-25 | DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving | Pengxuan Yang et.al. | 2603.24587 | null |
| 2026-03-25 | Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving | Linbo Wang et.al. | 2603.24581 | null |
| 2026-03-25 | Toward Physically Consistent Driving Video World Models under Challenging Trajectories | Jiawei Zhou et.al. | 2603.24506 | null |
| 2026-03-25 | Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification | Han Sun et.al. | 2603.24058 | null |
| 2026-03-25 | Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration | Guopeng Li et.al. | 2603.23889 | null |
| 2026-03-24 | Rectify, Don’t Regret: Avoiding Pitfalls of Differentiable Simulation in Trajectory Prediction | Harsh Yadav et.al. | 2603.23393 | null |
| 2026-03-24 | Modeling Edge-to-Cloud Offloading Workloads for Autonomous Vehicles | Longkun Li et.al. | 2603.23310 | null |
| 2026-03-25 | PoseDriver: A Unified Approach to Multi-Category Skeleton Detection for Autonomous Driving | Yasamin Borhani et.al. | 2603.23215 | null |
| 2026-03-24 | Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment | Guoyang Zhao et.al. | 2603.23034 | null |
| 2026-03-24 | Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction | Chengxin Lv et.al. | 2603.22852 | null |
| 2026-03-24 | Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems | Manognya Lokesh Reddy et.al. | 2603.22781 | null |
| 2026-03-23 | LRC-WeatherNet: LiDAR, RADAR, and Camera Fusion Network for Real-time Weather-type Classification in Autonomous Driving | Nour Alhuda Albashir et.al. | 2603.21987 | null |
| 2026-03-23 | The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation | Guannan Lai et.al. | 2603.21928 | null |
| 2026-03-23 | Disengagement Analysis and Field Tests of a Prototypical Open-Source Level 4 Autonomous Driving System | Marvin Seegert et.al. | 2603.21926 | null |
| 2026-03-23 | Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis | Kangbo Zhao et.al. | 2603.21661 | null |
| 2026-03-23 | HACMatch Semi-Supervised Rotation Regression with Hardness-Aware Curriculum Pseudo Labeling | Mei Li et.al. | 2603.21583 | null |
| 2026-03-23 | Ultrafast microwave sensing and automatic recognition of dynamic objects in open world using programmable surface plasmonic neural networks | Qian Ma et.al. | 2603.21521 | null |
| 2026-03-22 | Dynasto: Validity-Aware Dynamic-Static Parameter Optimization for Autonomous Driving Testing | Dmytro Humeniuk et.al. | 2603.21427 | null |
| 2026-03-22 | Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving | Haixi Zhang et.al. | 2603.21061 | null |
| 2026-03-22 | KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph | Ye Tian et.al. | 2603.21029 | null |
| 2026-03-21 | OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation | Aarush Aggarwal et.al. | 2603.20777 | null |
| 2026-03-24 | GHOST: Ground-projected Hypotheses from Observed Structure-from-Motion Trajectories | Tomasz Frelek et.al. | 2603.20583 | null |
| 2026-03-20 | Understanding Behavior Cloning with Action Quantization | Haoqun Cao et.al. | 2603.20538 | null |
| 2026-03-20 | Wildfire Spread Scenarios: Increasing Sample Diversity of Segmentation Diffusion Models with Training-Free Methods | Sebastian Gerard et.al. | 2603.20188 | null |
| 2026-03-20 | Uncertainty Matters: Structured Probabilistic Online Mapping for Motion Prediction in Autonomous Driving | Pritom Gogoi et.al. | 2603.20076 | null |
| 2026-03-20 | X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving | Chaoda Zheng et.al. | 2603.19979 | null |
| 2026-03-23 | 2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction | Tianbao Zhang et.al. | 2603.19964 | null |
| 2026-03-20 | LIORNet: Self-Supervised LiDAR Snow Removal Framework for Autonomous Driving under Adverse Weather Conditions | Ji-il Park et.al. | 2603.19936 | null |
| 2026-03-20 | Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them | Michael Hubbertz et.al. | 2603.19852 | null |
| 2026-03-20 | DynFlowDrive: Flow-Based Dynamic World Modeling for Autonomous Driving | Xiaolu Liu et.al. | 2603.19675 | null |
| 2026-03-20 | StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention | Zhongrui Yu et.al. | 2603.19552 | null |
| 2026-03-19 | DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding | Dong Zhuo et.al. | 2603.19219 | null |
| 2026-03-19 | Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting | Yiren Lu et.al. | 2603.19193 | null |
| 2026-03-19 | Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving | Huiwen Yan et.al. | 2603.19188 | null |
| 2026-03-19 | Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs | Gaoxiang Cao et.al. | 2603.18871 | null |
| 2026-03-19 | Student views in AI Ethics and Social Impact | Tudor-Dan Mihoc et.al. | 2603.18827 | null |
| 2026-03-19 | Benchmarking Visual Feature Representations for LiDAR-Inertial-Visual Odometry Under Challenging Conditions | Eunseon Choi et.al. | 2603.18589 | null |
| 2026-03-19 | CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention | Jiacheng Tang et.al. | 2603.18561 | null |
| 2026-03-18 | DriveVLM-RL: Neuroscience-Inspired Reinforcement Learning with Vision-Language Models for Safe and Deployable Autonomous Driving | Zilin Huang et.al. | 2603.18315 | null |
| 2026-03-18 | VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events | Mohammad Qazim Bhat et.al. | 2603.18178 | null |
| 2026-03-18 | DarkDriving: A Real-World Day and Night Aligned Dataset for Autonomous Driving in the Dark Environment | Wuqi Wang et.al. | 2603.18067 | null |
| 2026-03-18 | AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception | Jinho Park et.al. | 2603.17979 | null |
| 2026-03-18 | An HMDP-MPC Decision-making Framework with Adaptive Safety Margins and Hysteresis for Autonomous Driving | Siyuan Li et.al. | 2603.17802 | null |
| 2026-03-18 | From Virtual Environments to Real-World Trials: Emerging Trends in Autonomous Driving | A. Humnabadkar et.al. | 2603.17714 | null |
| 2026-03-18 | VectorWorld: Efficient Streaming World Model via Diffusion Flow on Vector Graphs | Chaokang Jiang et.al. | 2603.17652 | null |
| 2026-03-18 | Hierarchical Decision-Making under Uncertainty: A Hybrid MDP and Chance-Constrained MPC Approach | Siyuan Li et.al. | 2603.17634 | null |
| 2026-03-18 | Physics-informed Deep Mixture-of-Koopmans Vehicle Dynamics Model with Dual-branch Encoder for Distributed Electric-drive Trucks | Jinyu Miao et.al. | 2603.17416 | null |
| 2026-03-18 | VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm | Hongbo Lu et.al. | 2603.17382 | null |
| 2026-03-17 | Topology-Preserving Deep Joint Source-Channel Coding for Semantic Communication | Omar Erak et.al. | 2603.17126 | null |
| 2026-03-16 | Joint Optimization of Storage and Loading for High-Performance 3D Point Cloud Data Processing | Ke Wang et.al. | 2603.16945 | null |
| 2026-03-17 | CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection | Junseok Lee et.al. | 2603.16439 | null |
| 2026-03-17 | Poisoning the Pixels: Revisiting Backdoor Attacks on Semantic Segmentation | Guangsheng Zhang et.al. | 2603.16405 | null |
| 2026-03-17 | Learning Human-Object Interaction for 3D Human Pose Estimation from LiDAR Point Clouds | Daniel Sungho Jung et.al. | 2603.16343 | null |
| 2026-03-17 | DriveFix: Spatio-Temporally Coherent Driving Scene Restoration | Heyu Si et.al. | 2603.16306 | null |
| 2026-03-17 | Toward Deep Representation Learning for Event-Enhanced Visual Autonomous Perception: the eAP Dataset | Jinghang Li et.al. | 2603.16303 | null |
| 2026-03-17 | AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection | Hongwei Lin et.al. | 2603.16261 | null |
| 2026-03-17 | PanguMotion: Continuous Driving Motion Forecasting with Pangu Transformers | Quanhao Ren et.al. | 2603.16196 | null |
| 2026-03-17 | HIPO: Instruction Hierarchy via Constrained Reinforcement Learning | Keru Chen et.al. | 2603.16152 | null |
| 2026-03-17 | The Era of End-to-End Autonomy: Transitioning from Rule-Based Driving to Large Driving Models | Eduardo Nebot et.al. | 2603.16050 | null |
| 2026-03-18 | Safety Case Patterns for VLA-based driving systems: Insights from SimLingo | Gerhard Yu et.al. | 2603.16013 | null |
| 2026-03-16 | CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving | Yihong Guo et.al. | 2603.15771 | null |
| 2026-03-16 | CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving | Erick Silva et.al. | 2603.15364 | null |
| 2026-03-16 | ADV-0: Closed-Loop Min-Max Adversarial Training for Long-Tail Robustness in Autonomous Driving | Tong Nie et.al. | 2603.15221 | null |
| 2026-03-16 | What Matters for Scalable and Robust Learning in End-to-End Driving Planners? | David Holtz et.al. | 2603.15185 | null |
| 2026-03-16 | Learning from Mistakes: Post-Training for Driving VLA with Takeover Data | Yinfeng Gao et.al. | 2603.14972 | null |
| 2026-03-16 | Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation | Xingtai Gui et.al. | 2603.14948 | null |
| 2026-03-16 | FAR-Drive: Frame-AutoRegressive Video Generation in Closed-Loop Autonomous Driving | Yaoru Li et.al. | 2603.14938 | null |
| 2026-03-16 | PerlAD: Towards Enhanced Closed-loop End-to-end Autonomous Driving with Pseudo-simulation-based Reinforcement Learning | Yinfeng Gao et.al. | 2603.14908 | null |
| 2026-03-16 | AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture-of-Transformers for End-to-End Autonomous Driving | Wenhui Huang et.al. | 2603.14851 | null |
| 2026-03-16 | RadarXFormer: Robust Object Detection via Cross-Dimension Fusion of 4D Radar Spectra and Images for Autonomous Driving | Yue Sun et.al. | 2603.14822 | null |
| 2026-03-16 | LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision | Yiming Huang et.al. | 2603.14763 | null |
| 2026-03-16 | TrajMamba: An Ego-Motion-Guided Mamba Model for Pedestrian Trajectory Prediction from an Egocentric Perspective | Yusheng Peng et.al. | 2603.14739 | null |
| 2026-03-15 | Learning to Order: Task Sequencing as In-Context Optimization | Jan Kobiolka et.al. | 2603.14550 | null |
| 2026-03-15 | WorldVLM: Combining World Model Forecasting and Vision-Language Reasoning | Stefan Englmeier et.al. | 2603.14497 | null |
| 2026-03-15 | DRCC-LPVMPC: Robust Data-Driven Control for Autonomous Driving and Obstacle Avoidance | Shiming Fang et.al. | 2603.14408 | null |
| 2026-03-15 | Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces | Jiayuan Du et.al. | 2603.14354 | null |
| 2026-03-14 | Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics | Dennis Haitz et.al. | 2603.13917 | null |
| 2026-03-14 | Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving | Zhexi Lian et.al. | 2603.13842 | null |
| 2026-03-11 | FlowAD: Ego-Scene Interactive Modeling for Autonomous Driving | Mingzhe Guo et.al. | 2603.13399 | null |
| 2026-03-13 | Panoramic Multimodal Semantic Occupancy Prediction for Quadruped Robots | Guoqiang Zhao et.al. | 2603.13108 | null |
| 2026-03-13 | VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation | Juhye Park et.al. | 2603.12918 | null |
| 2026-03-16 | Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection | Kadir-Kaan Özer et.al. | 2603.12916 | null |
| 2026-03-13 | Composing Driving Worlds through Disentangled Control for Adversarial Scenario Generation | Yifan Zhan et.al. | 2603.12864 | null |
| 2026-03-13 | Improving critical buildings energy resilience via shared autonomous electric vehicles – A sequential optimization framework | Jinming Liu et.al. | 2603.12771 | null |
| 2026-03-13 | IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud Registration | Dongxu Zhang et.al. | 2603.12719 | null |
| 2026-03-13 | CarPLAN: Context-Adaptive and Robust Planning with Dynamic Scene Awareness for Autonomous Driving | Junyong Yun et.al. | 2603.12607 | null |
| 2026-03-12 | A Neuro-Symbolic Framework Combining Inductive and Deductive Reasoning for Autonomous Driving Planning | Hongyan Wei et.al. | 2603.12421 | null |
| 2026-03-12 | QUARE: Multi-Agent Negotiation for Balancing Quality Attributes in Requirements Engineering | Haowei Cheng et.al. | 2603.11890 | null |
| 2026-03-12 | R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection | Zhongyu Xia et.al. | 2603.11566 | null |
| 2026-03-12 | Risk-Controllable Multi-View Diffusion for Driving Scenario Generation | Hongyi Lin et.al. | 2603.11534 | null |
| 2026-03-12 | Zero-Shot Cross-City Generalization in End-to-End Autonomous Driving: Self-Supervised versus Supervised Representations | Fatemeh Naeinian et.al. | 2603.11417 | null |
| 2026-03-11 | DriveXQA: Cross-modal Visual Question Answering for Adverse Driving Scene Understanding | Mingzhe Tao et.al. | 2603.11380 | null |
| 2026-03-11 | Radiometric fingerprinting of object surfaces using mobile laser scanning and semantic 3D road space models | Benedikt Schwab et.al. | 2603.11252 | null |
| 2026-03-11 | A Survey of Reasoning in Autonomous Driving Systems: Open Challenges and Emerging Paradigms | Kejin Yu et.al. | 2603.11093 | null |
| 2026-03-13 | DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving | Shuyao Shang et.al. | 2603.11041 | null |
| 2026-03-11 | STADA: Specification-based Testing for Autonomous Driving Agents | Joy Saha et.al. | 2603.10940 | null |
| 2026-03-11 | Evaluating randomized smoothing as a defense against adversarial attacks in trajectory prediction | Julian F. Schumann et.al. | 2603.10821 | null |
| 2026-03-11 | Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction | Hao Zhou et.al. | 2603.10597 | null |
| 2026-03-11 | KnowDiffuser: A Knowledge-Guided Diffusion Planner with LM Reasoning and Prior-Informed Trajectory Initialization | Fan Ding et.al. | 2603.10441 | null |
| 2026-03-11 | Motion Forcing: A Decoupled Framework for Robust Video Generation in Motion Dynamics | Tianshuo Xu et.al. | 2603.10408 | null |
| 2026-03-11 | PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner | Eugene Ku et.al. | 2603.10330 | null |
| 2026-03-10 | HG-Lane: High-Fidelity Generation of Lane Scenes under Adverse Weather and Lighting Conditions without Re-annotation | Daichao Zhao et.al. | 2603.10128 | null |
| 2026-03-10 | $M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs | Kaixin Lin et.al. | 2603.09737 | null |
| 2026-03-10 | RESBev: Making BEV Perception More Robust | Lifeng Zhuo et.al. | 2603.09529 | null |
| 2026-03-10 | Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning | Chun-Peng Chang et.al. | 2603.09512 | null |
| 2026-03-10 | StyleVLA: Driving Style-Aware Vision Language Action Model for Autonomous Driving | Yuan Gao et.al. | 2603.09482 | null |
| 2026-03-10 | EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation | Jiajun Cao et.al. | 2603.09465 | null |
| 2026-03-10 | Declarative Scenario-based Testing with RoadLogic | Ezio Bartocci et.al. | 2603.09455 | null |
| 2026-03-10 | Open-World Motion Forecasting | Nicolas Schischka et.al. | 2603.09420 | null |
| 2026-03-10 | Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning | Kanishkha Jaisankar et.al. | 2603.09255 | null |
| 2026-03-09 | Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving Architectures | David Fernandez et.al. | 2603.08897 | null |
| 2026-03-09 | OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras | Yongzhi Lin et.al. | 2603.08521 | null |
| 2026-03-09 | Graph Based Semantic Encoder Decoder Framework for Task Oriented Communications in Connected Autonomous Vehicles | Soheyb Ribouh et.al. | 2603.08438 | null |
| 2026-03-09 | DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving | Zhuolin He et.al. | 2603.08254 | null |
| 2026-03-09 | ALOOD: Exploiting Language Representations for LiDAR-based Out-of-Distribution Object Detection | Michael Kösel et.al. | 2603.08180 | null |
| 2026-03-09 | SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving | Zihan You et.al. | 2603.08113 | null |
| 2026-03-09 | RLPR: Radar-to-LiDAR Place Recognition via Two-Stage Asymmetric Cross-Modal Alignment for Autonomous Driving | Zhangshuo Qi et.al. | 2603.07920 | null |
| 2026-03-09 | NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving | Ximeng Tao et.al. | 2603.07901 | null |
| 2026-03-09 | Toward Unified Multimodal Representation Learning for Autonomous Driving | Ximeng Tao et.al. | 2603.07874 | null |
| 2026-03-08 | 4DRC-OCC: Robust Semantic Occupancy Prediction Through Fusion of 4D Radar and Camera | David Ninfa et.al. | 2603.07794 | null |
| 2026-03-08 | Fast Attention-Based Simplification of LiDAR Point Clouds for Object Detection and Classification | Z. Rozsa et.al. | 2603.07593 | null |
| 2026-03-08 | ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction | Haibao Yu et.al. | 2603.07552 | null |
| 2026-03-08 | RayD3D: Distilling Depth Knowledge Along the Ray for Robust Multi-View 3D Object Detection | Rui Ding et.al. | 2603.07493 | null |
| 2026-03-07 | Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface | Balint K. Hodossy et.al. | 2603.07364 | null |
| 2026-03-07 | Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving | Jiazhuo Li et.al. | 2603.07264 | null |
| 2026-03-07 | Perception-Aware Multimodal Spatial Reasoning from Monocular Images | Yanchun Cheng et.al. | 2603.06985 | null |
| 2026-03-06 | Feasibility Restoration under Conflicting STL Specifications with Pareto-Optimal Refinement | Tianhao Wu et.al. | 2603.06947 | null |
| 2026-03-06 | VertiAdaptor: Online Kinodynamics Adaptation for Vertically Challenging Terrain | Tong Xu et.al. | 2603.06887 | null |
| 2026-03-06 | Improved Constrained Generation by Bridging Pretrained Generative Models | Xiaoxuan Liang et.al. | 2603.06742 | null |
| 2026-03-06 | BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations | Thomas Monninger et.al. | 2603.06576 | null |
| 2026-03-06 | Modeling and Measuring Redundancy in Multisource Multimodal Data for Autonomous Driving | Yuhan Zhou et.al. | 2603.06544 | null |
| 2026-03-06 | NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving | Kai Luo et.al. | 2603.06254 | null |
| 2026-03-06 | TaPD: Temporal-adaptive Progressive Distillation for Observation-Adaptive Trajectory Forecasting in Autonomous Driving | Mingyu Fan et.al. | 2603.06231 | null |
| 2026-03-06 | VG3S: Visual Geometry Grounded Gaussian Splatting for Semantic Occupancy Prediction | Xiaoyang Yan et.al. | 2603.06210 | null |
| 2026-03-06 | Transforming Omnidirectional RGB-LiDAR data into 3D Gaussian Splatting | Semin Bae et.al. | 2603.06061 | null |
| 2026-03-06 | TADPO: Reinforcement Learning Goes Off-road | Zhouchonghao Wu et.al. | 2603.05995 | null |
| 2026-03-06 | OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous Driving | Kota Shimomura et.al. | 2603.05936 | null |
| 2026-03-06 | Expert Knowledge-driven Reinforcement Learning for Autonomous Racing via Trajectory Guidance and Dynamics Constraints | Bo Leng et.al. | 2603.05842 | null |
| 2026-03-05 | Post Fusion Bird’s Eye View Feature Stabilization for Robust Multimodal 3D Detection | Trung Tien Dong et.al. | 2603.05623 | null |
| 2026-03-05 | Fusion4CA: Boosting 3D Object Detection via Comprehensive Image Exploitation | Kang Luo et.al. | 2603.05305 | null |
| 2026-03-05 | From Code to Road: A Vehicle-in-the-Loop and Digital Twin-Based Framework for Central Car Server Testing in Autonomous Driving | Chengdong Wu et.al. | 2603.05279 | null |
| 2026-03-05 | K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation | Mingxuan Mu et.al. | 2603.04868 | null |
| 2026-03-05 | On the Strengths and Weaknesses of Data for Open-set Embodied Assistance | Pradyumna Tambwekar et.al. | 2603.04819 | null |
| 2026-03-04 | Risk-Aware Rulebooks for Multi-Objective Trajectory Evaluation under Uncertainty | Tichakorn Wongpiromsarn et.al. | 2603.04603 | null |
| 2026-03-04 | PRAM-R: A Perception-Reasoning-Action-Memory Framework with LLM-Guided Modality Routing for Adaptive Autonomous Driving | Yi Zhang et.al. | 2603.04222 | null |
| 2026-03-04 | GSeg3D: A High-Precision Grid-Based Algorithm for Safety-Critical Ground Segmentation in LiDAR Point Clouds | Muhammad Haider Khan Lodhi et.al. | 2603.04208 | null |
| 2026-03-04 | SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling | Jinlong Cui et.al. | 2603.04071 | null |
| 2026-03-04 | Map-Agnostic And Interactive Safety-Critical Scenario Generation via Multi-Objective Tree Search | Wenyun Li et.al. | 2603.03978 | null |
| 2026-03-04 | Spatial Causal Prediction in Video | Yanguang Zhao et.al. | 2603.03944 | null |
| 2026-03-04 | LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving | Qihao Sun et.al. | 2603.03765 | null |
| 2026-03-03 | Analyzing the Impact of Adversarial Attacks on C-V2X-Enabled Road Safety: An Age of Information Perspective | Mahmudul Hassan Ashik et.al. | 2603.03462 | null |
| 2026-03-03 | Radar-based Pose Optimization for HD Map Generation from Noisy Multi-Drive Vehicle Fleet Data | Alexander Blumberg et.al. | 2603.03453 | null |
| 2026-03-03 | Utonia: Toward One Encoder for All Point Clouds | Yujia Zhang et.al. | 2603.03283 | null |
| 2026-03-03 | ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments | Ziyang Gong et.al. | 2603.03198 | null |
| 2026-03-03 | Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving | Tianze Zhu et.al. | 2603.02613 | null |
| 2026-03-03 | VLMFusionOcc3D: VLM Assisted Multi-Modal 3D Semantic Occupancy Prediction | A. Enes Doruk et.al. | 2603.02609 | null |
| 2026-03-03 | CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration | Huichun Liu et.al. | 2603.02560 | null |
| 2026-03-03 | AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation | Zhulin Jiang et.al. | 2603.02542 | null |
| 2026-03-03 | EIMC: Efficient Instance-aware Multi-modal Collaborative Perception | Kang Yang et.al. | 2603.02532 | null |
| 2026-03-03 | LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model | Xiangyu Li et.al. | 2603.02528 | null |
| 2026-03-03 | ModalPatch: A Plug-and-Play Module for Robust Multi-Modal 3D Object Detection under Modality Drop | Shuangzhi Li et.al. | 2603.02481 | null |
| 2026-03-02 | TruckDrive: Long-Range Autonomous Highway Driving Dataset | Filippo Ghilotti et.al. | 2603.02413 | null |
| 2026-03-02 | LAD-Drive: Bridging Language and Trajectory with Action-Aware Diffusion Transformers | Fabian Schmidt et.al. | 2603.02035 | null |
| 2026-03-02 | LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving | Yuechen Luo et.al. | 2603.01928 | null |
| 2026-03-02 | Streaming Real-Time Trajectory Prediction Using Endpoint-Aware Modeling | Alexander Prutsch et.al. | 2603.01864 | null |
| 2026-03-02 | GroupEnsemble: Efficient Uncertainty Estimation for DETR-based Object Detection | Yutong Yang et.al. | 2603.01847 | null |
| 2026-03-02 | WhisperNet: A Scalable Solution for Bandwidth-Efficient Collaboration | Gong Chen et.al. | 2603.01708 | null |
| 2026-03-02 | DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving | Enhui Ma et.al. | 2603.01637 | null |
| 2026-03-02 | Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing | Zijin Yin et.al. | 2603.01535 | null |
| 2026-03-02 | VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models | Duoxun Tang et.al. | 2603.01454 | null |
| 2026-03-02 | Unifying Language-Action Understanding and Generation for Autonomous Driving | Xinyang Wang et.al. | 2603.01441 | null |
| 2026-03-02 | Perspective-Equivariant Fine-tuning for Multispectral Demosaicing without Ground Truth | Andrew Wang et.al. | 2603.01332 | null |
| 2026-03-01 | FoSS: Modeling Long Range Dependencies and Multimodal Uncertainty in Trajectory Prediction via Fourier State Space Integration | Yizhou Huang et.al. | 2603.01284 | null |
| 2026-03-01 | Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures | Yuechen Luo et.al. | 2603.01063 | null |
| 2026-03-01 | An Open-Source Modular Benchmark for Diffusion-Based Motion Planning in Closed-Loop Autonomous Driving | Yun Li et.al. | 2603.01023 | null |
| 2026-03-01 | Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving | Xubo Zhu et.al. | 2603.01007 | null |
| 2026-03-01 | DriveCode: Domain Specific Numerical Encoding for LLM-Based Autonomous Driving | Zhiye Wang et.al. | 2603.00919 | null |
| 2026-02-28 | DRIV-EX: Counterfactual Explanations for Driving LLMs | Amaia Cardiel et.al. | 2603.00696 | null |
| 2026-02-28 | Wild-Drive: Off-Road Scene Captioning and Path Planning via Robust Multi-modal Routing and Efficient Large Language Model | Zihang Wang et.al. | 2603.00694 | null |
| 2026-02-28 | ReMoT: Reinforcement Learning with Motion Contrast Triplets | Cong Wan et.al. | 2603.00461 | null |
| 2026-02-28 | PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models | Yuanhao Su et.al. | 2603.00412 | null |
| 2026-02-27 | TSC: Topology-Conditioned Stackelberg Coordination for Multi-Agent Reinforcement Learning in Interactive Driving | Xiaotong Zhang et.al. | 2602.23896 | null |
| 2026-02-27 | SelfOccFlow: Towards end-to-end self-supervised 3D Occupancy Flow prediction | Xavier Timoneda et.al. | 2602.23894 | null |
| 2026-02-27 | Bandwidth-adaptive Cloud-Assisted 360-Degree 3D Perception for Autonomous Vehicles | Faisal Hawlader et.al. | 2602.23871 | null |
| 2026-02-27 | FPPS: An FPGA-Based Point Cloud Processing System | Xiaofeng Zhou et.al. | 2602.23787 | null |
| 2026-02-27 | CycleBEV: Regularizing View Transformation Networks via View Cycle Consistency for Bird’s-Eye-View Semantic Segmentation | Jeongbin Hong et.al. | 2602.23575 | null |
| 2026-02-26 | TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving | Tugrul Gorgulu et.al. | 2602.23499 | null |
| 2026-02-26 | Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving | Jiangxin Sun et.al. | 2602.23259 | null |
| 2026-02-27 | Towards Intelligible Human-Robot Interaction: An Active Inference Approach to Occluded Pedestrian Scenarios | Kai Chen et.al. | 2602.23109 | null |
| 2026-02-26 | Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving | Yinan Zheng et.al. | 2602.22801 | null |
| 2026-02-26 | Transformer Actor-Critic for Efficient Freshness-Aware Resource Allocation | Maryam Ansarifard et.al. | 2602.22774 | null |
| 2026-02-27 | The Swarm Intelligence Freeway-Urban Trajectories (SWIFTraj) Dataset – Part I: Dataset Description and Applications | Yu Han et.al. | 2602.22563 | null |
| 2026-02-26 | DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation | Zhechao Wang et.al. | 2602.22549 | null |
| 2026-02-25 | WeatherCity: Urban Scene Reconstruction with Controllable Multi-Weather Transformation | Wenhua Wu et.al. | 2602.22096 | null |
| 2026-02-25 | Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos | Matthew Strong et.al. | 2602.22091 | null |
| 2026-02-25 | PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning | Zekai Lin et.al. | 2602.21992 | null |
| 2026-02-25 | MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving | Lingjun Zhang et.al. | 2602.21952 | null |
| 2026-02-25 | GFPL: Generative Federated Prototype Learning for Resource-Constrained and Data-Imbalanced Vision Task | Shiwei Lu et.al. | 2602.21873 | null |
| 2026-02-25 | SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction | Haoxiang Fu et.al. | 2602.21589 | null |
| 2026-02-25 | Unified Unsupervised and Sparsely-Supervised 3D Object Detection by Semantic Pseudo-Labeling and Prototype Learning | Yushen He et.al. | 2602.21484 | null |
| 2026-02-24 | HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles | Yifan Wang et.al. | 2602.21333 | null |
| 2026-02-24 | Uncertainty-Aware Diffusion Model for Multimodal Highway Trajectory Prediction via DDIM Sampling | Marion Neumeier et.al. | 2602.21319 | null |
| 2026-02-25 | NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning | Ishaan Rawal et.al. | 2602.21172 | null |
| 2026-02-24 | UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling | Kaiyuan Tan et.al. | 2602.20943 | null |
| 2026-02-24 | VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving | Jie Wang et.al. | 2602.20794 | null |
| 2026-02-24 | GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation | Hao Zhang et.al. | 2602.20673 | null |
| 2026-02-24 | An LLM-driven Scenario Generation Pipeline Using an Extended Scenic DSL for Autonomous Driving Safety Validation | Fida Khandaker Safa et.al. | 2602.20644 | null |
| 2026-02-24 | Boosting Instance Awareness via Cross-View Correlation with 4D Radar and Camera for 3D Object Detection | Xiaokai Bai et.al. | 2602.20632 | null |
| 2026-02-24 | Efficient and Explainable End-to-End Autonomous Driving via Masked Vision-Language-Action Diffusion | Jiaru Zhang et.al. | 2602.20577 | null |
| 2026-02-24 | An interactive enhanced driving dataset for autonomous driving | Haojie Feng et.al. | 2602.20575 | null |
| 2026-02-23 | MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving | Junli Wang et.al. | 2602.20060 | null |
| 2026-02-23 | Probabilistic Photonic Computing | Frank Brückerhoff-Plückelmann et.al. | 2602.19968 | null |
| 2026-02-23 | VGGT-MPR: VGGT-Enhanced Multimodal Place Recognition in Autonomous Driving Environments | Jingyi Xu et.al. | 2602.19735 | null |
| 2026-02-22 | Safe and Interpretable Multimodal Path Planning for Multi-Agent Cooperation | Haojun Shi et.al. | 2602.19304 | null |
| 2026-02-22 | OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness | Phuc D. A. Nguyen et.al. | 2602.19035 | null |
| 2026-02-21 | Open-Vocabulary Domain Generalization in Urban-Scene Segmentation | Dong Zhao et.al. | 2602.18853 | null |
| 2026-02-21 | Driving with A Thousand Faces: A Benchmark for Closed-Loop Personalized End-to-End Autonomous Driving | Xiaoru Dong et.al. | 2602.18757 | null |
| 2026-02-20 | OODBench: Out-of-Distribution Benchmark for Large Vision-Language Models | Ling Lin et.al. | 2602.18094 | null |
| 2026-02-20 | Dynamic Deception: When Pedestrians Team Up to Fool Autonomous Cars | Masoud Jamshidiyan Tehrani et.al. | 2602.18079 | null |
| 2026-02-20 | Faster Training, Fewer Labels: Self-Supervised Pretraining for Fine-Grained BEV Segmentation | Daniel Busch et.al. | 2602.18066 | null |
| 2026-02-19 | Conditional Flow Matching for Continuous Anomaly Detection in Autonomous Driving on a Manifold-Aware Spectral Space | Antonio Guillen-Perez et.al. | 2602.17586 | null |
| 2026-02-19 | Hybrid System Planning using a Mixed-Integer ADMM Heuristic and Hybrid Zonotopes | Joshua A. Robbins et.al. | 2602.17574 | link |
| 2026-02-19 | HiMAP: History-aware Map-occupancy Prediction with Fallback | Yiming Xu et.al. | 2602.17231 | null |
| 2026-02-19 | Multi-session Localization and Mapping Exploiting Topological Information | Lorenzo Montano-Olivan et.al. | 2602.17226 | null |
| 2026-02-19 | 3D Scene Rendering with Multimodal Gaussian Splatting | Chi-Shiang Gau et.al. | 2602.17124 | null |
| 2026-02-18 | Boreas Road Trip: A Multi-Sensor Autonomous Driving Dataset on Challenging Roads | Daniil Lisus et.al. | 2602.16870 | null |
| 2026-02-18 | PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction | Bo Lang et.al. | 2602.16669 | null |
| 2026-02-18 | A Contrastive Learning Framework Empowered by Attention-based Feature Adaptation for Street-View Image Classification | Qi You et.al. | 2602.16590 | null |
| 2026-02-17 | ScenicRules: An Autonomous Driving Benchmark with Multi-Objective Specifications and Abstract Scenarios | Kevin Kai-Chun Chang et.al. | 2602.16073 | null |
| 2026-02-15 | A Comprehensive Survey on Deep Learning-Based LiDAR Super-Resolution for Autonomous Driving | June Moh Goo et.al. | 2602.15904 | null |
| 2026-02-17 | RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution | Youngwan Jin et.al. | 2602.15490 | link |
| 2026-02-16 | Near-Optimal Sample Complexity for Online Constrained MDPs | Chang Liu et.al. | 2602.15076 | null |
| 2026-02-16 | ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery | Ayush Shrivastava et.al. | 2602.14989 | link |
| 2026-02-16 | DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI | En Yu et.al. | 2602.14974 | null |
| 2026-02-16 | DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving | Chenxu Dang et.al. | 2602.14577 | null |
| 2026-02-16 | Multimodal Covariance Steering in Belief Space with Active Probing and Influence for Autonomous Driving | Devodita Chakravarty et.al. | 2602.14540 | null |
| 2026-02-15 | A Generalizable Physics-guided Causal Model for Trajectory Prediction in Autonomous Driving | Zhenyu Zong et.al. | 2602.13936 | null |
| 2026-02-14 | Privacy-Concealing Cooperative Perception for BEV Scene Segmentation | Song Wang et.al. | 2602.13555 | null |
| 2026-02-14 | Nighttime Autonomous Driving Scene Reconstruction with Physically-Based Gaussian Splatting | Tae-Kyeong Kim et.al. | 2602.13549 | null |
| 2026-02-13 | TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios | Wentao Xu et.al. | 2602.13040 | null |
| 2026-02-13 | MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting | Mohammed Amine Bencheikh Lehocine et.al. | 2602.13003 | null |
| 2026-02-13 | RoadscapesQA: A Multitask, Multimodal Dataset for Visual Question Answering on Indian Roads | Vijayasri Iyer et.al. | 2602.12877 | null |
| 2026-02-13 | DPUConfig: Optimizing ML Inference in FPGAs Using Reinforcement Learning | Alexandros Patras et.al. | 2602.12847 | null |
| 2026-02-13 | The Constant Eye: Benchmarking and Bridging Appearance Robustness in Autonomous Driving | Jiabao Wang et.al. | 2602.12563 | null |
| 2026-02-13 | Self-Supervised JEPA-based World Models for LiDAR Occupancy Completion and Forecasting | Haoran Zhu et.al. | 2602.12540 | null |
| 2026-02-12 | DiffPlace: Street View Generation via Place-Controllable Diffusion Model Enhancing Place Recognition | Ji Li et.al. | 2602.11875 | null |
| 2026-02-12 | Talk2DM: Enabling Natural Language Querying and Commonsense Reasoning for Vehicle-Road-Cloud Integrated Dynamic Maps with Large Language Models | Lu Tao et.al. | 2602.11860 | null |
| 2026-02-12 | SToRM: Supervised Token Reduction for Multi-modal LLMs toward efficient end-to-end autonomous driving | Seo Hyun Kim et.al. | 2602.11656 | null |
| 2026-02-11 | DD-MDN: Human Trajectory Forecasting with Diffusion-Based Dual Mixture Density Networks and Uncertainty Self-Calibration | Manuel Hetzel et.al. | 2602.11214 | null |
| 2026-02-11 | Interpretable Vision Transformers in Monocular Depth Estimation via SVDA | Vasileios Arampatzakis et.al. | 2602.11005 | null |
| 2026-02-11 | ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving | Jinqing Zhang et.al. | 2602.10884 | null |
| 2026-02-11 | Viewpoint Recommendation for Point Cloud Labeling through Interaction Cost Modeling | Yu Zhang et.al. | 2602.10871 | null |
| 2026-02-11 | From Steering to Pedalling: Do Autonomous Driving VLMs Generalize to Cyclist-Assistive Spatial Perception and Planning? | Krishna Kanth Nakka et.al. | 2602.10771 | null |
| 2026-02-11 | AurigaNet: A Real-Time Multi-Task Network for Enhanced Urban Driving Perception | Kiarash Ghasemzadeh et.al. | 2602.10660 | null |
| 2026-02-11 | Found-RL: foundation model-enhanced reinforcement learning for autonomous driving | Yansong Qu et.al. | 2602.10458 | null |
| 2026-02-10 | Adaptive Time Step Flow Matching for Autonomous Driving Motion Planning | Ananya Trivedi et.al. | 2602.10285 | null |
| 2026-02-10 | AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems | Ishan Sahu et.al. | 2602.10160 | null |
| 2026-02-11 | Robust Vision Systems for Connected and Autonomous Vehicles: Security Challenges and Attack Vectors | Sandeep Gupta et.al. | 2602.09740 | null |
| 2026-02-09 | Robustness Is a Function, Not a Number: A Factorized Comprehensive Study of OOD Robustness in Vision-Based Driving | Amir Mallak et.al. | 2602.09018 | null |
| 2026-02-09 | Modeling 3D Pedestrian-Vehicle Interactions for Vehicle-Conditioned Pose Forecasting | Guangxun Zhu et.al. | 2602.08962 | null |
| 2026-02-09 | Multi-Staged Framework for Safety Analysis of Offloaded Services in Distributed Intelligent Transportation Systems | Robin Dehler et.al. | 2602.08821 | null |
| 2026-02-09 | A Generic Service-Oriented Function Offloading Framework for Connected Automated Vehicles | Robin Dehler et.al. | 2602.08799 | null |
| 2026-02-09 | Overview and Comparison of AVS Point Cloud Compression Standard | Wei Gao et.al. | 2602.08613 | null |
| 2026-02-09 | Head-to-Head autonomous racing at the limits of handling in the A2RL challenge | Simon Hoffmann et.al. | 2602.08571 | null |
| 2026-02-09 | SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios | Tian Gao et.al. | 2602.08440 | null |
| 2026-02-09 | Vec-QMDP: Vectorized POMDP Planning on CPUs for Real-Time Autonomous Driving | Xuanjin Jin et.al. | 2602.08334 | null |
| 2026-02-09 | Personalized Autonomous Driving via Optimal Control with Clearance Constraints from Questionnaires | Yongjae Lim et.al. | 2602.08326 | null |
| 2026-02-09 | Generating Adversarial Events: A Motion-Aware Point Cloud Framework | Hongwei Ren et.al. | 2602.08230 | null |
| 2026-02-09 | Self-Supervised Bootstrapping of Action-Predictive Embodied Reasoning | Milan Ganai et.al. | 2602.08167 | null |
| 2026-02-08 | MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection | Venkatraman Narayanan et.al. | 2602.08126 | null |
| 2026-02-08 | ForecastOcc: Vision-based Semantic Occupancy Forecasting | Riya Mohan et.al. | 2602.08006 | null |
| 2026-02-08 | Analyzing the Impact of Simulation Fidelity on the Evaluation of Autonomous Driving Motion Control | Simon Sagmeister et.al. | 2602.07984 | null |
| 2026-02-07 | All-Optical Segmentation via Diffractive Neural Networks for Autonomous Driving | Yingjie Li et.al. | 2602.07717 | null |
| 2026-02-07 | Vision and language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning | Ross Greer et.al. | 2602.07680 | null |
| 2026-02-07 | Seeing Roads Through Words: A Language-Guided Framework for RGB-T Driving Scene Segmentation | Ruturaj Reddy et.al. | 2602.07343 | null |
| 2026-02-07 | RAPiD: Real-time Deterministic Trajectory Planning via Diffusion Behavior Priors for Safe and Efficient Autonomous Driving | Ruturaj Reddy et.al. | 2602.07339 | null |
| 2026-02-09 | Temperature Scaling Attack Disrupting Model Confidence in Federated Learning | Kichang Lee et.al. | 2602.06638 | null |
| 2026-02-06 | DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving | Feiyang Jia et.al. | 2602.06521 | null |
| 2026-02-06 | Rebenchmarking Unsupervised Monocular 3D Occupancy Prediction | Zizhan Guo et.al. | 2602.06488 | null |
| 2026-02-05 | Addressing the Waypoint-Action Gap in End-to-End Autonomous Driving via Vehicle Motion Models | Jorge Daniel Rodríguez-Vidal et.al. | 2602.06214 | null |
| 2026-02-05 | Driving with DINO: Vision Foundation Features as a Unified Bridge for Sim-to-Real Generation in Autonomous Driving | Xuyang Chen et.al. | 2602.06159 | null |
| 2026-02-05 | Thinking with Geometry: Active Geometry Integration for Spatial Reasoning | Haoyuan Li et.al. | 2602.06037 | null |
| 2026-02-05 | LSA: Localized Semantic Alignment for Enhancing Temporal Consistency in Traffic Video Generation | Mirlan Karimov et.al. | 2602.05966 | null |
| 2026-02-05 | ROMAN: Reward-Orchestrated Multi-Head Attention Network for Autonomous Driving System Testing | Jianlei Chi et.al. | 2602.05629 | null |
| 2026-02-05 | Unified Sensor Simulation for Autonomous Driving | Nikolay Patakin et.al. | 2602.05617 | null |
| 2026-02-05 | Visual Implicit Geometry Transformer for Autonomous Driving | Arsenii Shirokov et.al. | 2602.05573 | null |
| 2026-02-06 | A Comparative Study of 3D Person Detection: Sensor Modalities and Robustness in Diverse Indoor and Outdoor Environments | Malaz Tamim et.al. | 2602.05538 | null |
| 2026-02-05 | Imagine a City: CityGenAgent for Procedural 3D City Generation | Zishan Liu et.al. | 2602.05362 | null |
| 2026-02-04 | Reinforcement Learning Enhancement Using Vector Semantic Representation and Symbolic Reasoning for Human-Centered Autonomous Emergency Braking | Vinal Asodia et.al. | 2602.05079 | null |
| 2026-02-04 | Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty | Rui Liu et.al. | 2602.04763 | null |
| 2026-02-06 | DRMOT: A Dataset and Framework for RGBD Referring Multi-Object Tracking | Sijia Chen et.al. | 2602.04692 | null |
| 2026-02-04 | Safe and Stylized Trajectory Planning for Autonomous Driving via Diffusion Model | Shuo Pei et.al. | 2602.04329 | null |
| 2026-02-04 | AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models | Yuxuan Han et.al. | 2602.04256 | null |
| 2026-02-04 | Natural Language Instructions for Scene-Responsive Human-in-the-Loop Motion Planning in Autonomous Driving using Vision-Language-Action Models | Angel Martinez-Sanchez et.al. | 2602.04184 | null |
| 2026-02-04 | The Dynamics of Attention across Automated and Manual Driving Modes: A Driving Simulation Study | Yuan Cai et.al. | 2602.04164 | null |
| 2026-02-03 | Multi-Player, Multi-Strategy Quantum Game Model for Interaction-Aware Decision-Making in Autonomous Driving | Karim Essalmi et.al. | 2602.03571 | null |
| 2026-02-03 | HetroD: A High-Fidelity Drone Dataset and Benchmark for Autonomous Driving in Heterogeneous Traffic | Yu-Hsiang Chen et.al. | 2602.03447 | null |
| 2026-02-03 | PlanTRansformer: Unified Prediction and Planning with Goal-conditioned Transformer | Constantin Selzer et.al. | 2602.03376 | null |
| 2026-02-03 | Multi-Resolution Alignment for Voxel Sparsity in Camera-Based 3D Semantic Scene Completion | Zhiwen Yang et.al. | 2602.03371 | null |
| 2026-02-03 | InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation | Zhuoran Yang et.al. | 2602.03242 | null |
| 2026-02-03 | ConsisDrive: Identity-Preserving Driving World Models for Video Generation by Instance Mask | Zhuoran Yang et.al. | 2602.03213 | null |
| 2026-02-04 | A Unified Candidate Set with Scene-Adaptive Refinement via Diffusion for End-to-End Autonomous Driving | Zhengfei Wu et.al. | 2602.03112 | null |
| 2026-02-03 | JRDB-Pose3D: A Multi-person 3D Human Pose and Shape Estimation Dataset for Robotics | Sandika Biswas et.al. | 2602.03064 | null |
| 2026-02-02 | Accelerating Structured Chain-of-Thought in Autonomous Vehicles | Yi Gu et.al. | 2602.02864 | null |
| 2026-02-02 | AROLA: A Modular Layered Architecture for Scaled Autonomous Racing | Fam Shihata et.al. | 2602.02730 | null |
| 2026-02-03 | Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL | Julian Lemmel et.al. | 2602.02236 | null |
| 2026-02-02 | LiFlow: Flow Matching for 3D LiDAR Scene Completion | Andrea Matteazzi et.al. | 2602.02232 | null |
| 2026-02-02 | UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving | Guosheng Zhao et.al. | 2602.02002 | null |
| 2026-02-02 | ForSim: Stepwise Forward Simulation for Traffic Policy Fine-Tuning | Keyu Chen et.al. | 2602.01916 | null |
| 2026-02-02 | UniDWM: Towards a Unified Driving World Model via Multifaceted Representation Learning | Shuai Liu et.al. | 2602.01536 | null |
| 2026-02-01 | TF-Lane: Traffic Flow Module for Robust Lane Perception | Yihan Xie et.al. | 2602.01277 | null |
| 2026-02-01 | OASIS-DC: Generalizable Depth Completion via Output-level Alignment of Sparse-Integrated Monocular Pseudo Depth | Jaehyeon Cho et.al. | 2602.01268 | null |
| 2026-02-01 | LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions | Jingjing Wang et.al. | 2602.01118 | null |
| 2026-02-01 | HERMES: A Holistic End-to-End Risk-Aware Multimodal Embodied System with Vision-Language Models for Long-Tail Autonomous Driving | Weizhe Tang et.al. | 2602.00993 | null |
| 2026-01-31 | A Graph-based Framework for Coverage Analysis in Autonomous Driving | Thomas Muehlenstädt et.al. | 2602.00903 | null |
| 2026-01-31 | VVLoc: Prior-free 3-DoF Vehicle Visual Localization | Ze Huang et.al. | 2602.00810 | null |
| 2026-01-31 | Physics-informed Diffusion Mamba Transformer for Real-world Driving | Hang Zhou et.al. | 2602.00808 | null |
| 2026-01-31 | UniMotion: A Unified Motion Framework for Simulation, Prediction and Planning | Nan Song et.al. | 2602.00566 | null |
| 2026-01-30 | Deep Learning-Based Object Detection for Autonomous Vehicles: A Comparative Study of One-Stage and Two-Stage Detectors on Basic Traffic Objects | Bsher Karbouj et.al. | 2602.00385 | null |
| 2026-01-30 | IRL-DAL: Safe and Adaptive Trajectory Planning for Autonomous Driving via Energy-Guided Diffusion Models | Seyed Ahmad Hosseini Miangoleh et.al. | 2601.23266 | null |
| 2026-01-30 | FlowCalib: LiDAR-to-Vehicle Miscalibration Detection using Scene Flows | Ilir Tahiraj et.al. | 2601.23107 | null |
| 2026-01-30 | MTDrive: Multi-turn Interactive Reinforcement Learning for Autonomous Driving | Xidong Li et.al. | 2601.22930 | null |
| 2026-01-30 | Toward Fully Autonomous Driving: AI, Challenges, Opportunities, and Needs | Lars Ullrich et.al. | 2601.22927 | null |
| 2026-01-30 | A Serverless Edge-Native Data Processing Architecture for Autonomous Driving Training | Fabian Bally et.al. | 2601.22919 | null |
| 2026-01-30 | AutoMerge: Search-Based Model Merging Framework for Effective Model Reuse | You Lu et.al. | 2601.22748 | null |
| 2026-01-30 | GaussianOcc3D: A Gaussian-Based Adaptive Multi-modal 3D Occupancy Prediction | A. Enes Doruk et.al. | 2601.22729 | null |
| 2026-01-29 | FlexMap: Generalized HD Map Construction from Flexible Camera Configurations | Run Wang et.al. | 2601.22376 | null |
| 2026-01-29 | PoSafeNet: Safe Learning with Poset-Structured Neural Nets | Kiwan Wong et.al. | 2601.22356 | null |
| 2026-01-29 | Drive-JEPA: Video JEPA Meets Multimodal Trajectory Distillation for End-to-End Driving | Linhan Wang et.al. | 2601.22032 | null |
| 2026-01-29 | LLM-Driven Scenario-Aware Planning for Autonomous Driving | He Li et.al. | 2601.21876 | null |
| 2026-01-29 | 4D-CAAL: 4D Radar-Camera Calibration and Auto-Labeling for Autonomous Driving | Shanliang Yao et.al. | 2601.21454 | null |
| 2026-01-29 | Drive-KD: Multi-Teacher Distillation for VLMs in Autonomous Driving | Weitong Lian et.al. | 2601.21288 | null |
| 2026-01-28 | Li-ViP3D++: Query-Gated Deformable Camera-LiDAR Fusion for End-to-End Perception and Trajectory Prediction | Matej Halinkovic et.al. | 2601.20720 | null |
| 2026-01-28 | Learning Contextual Runtime Monitors for Safe AI-Based Autonomy | Alejandro Luque-Cerpa et.al. | 2601.20666 | null |
| 2026-01-28 | Unsupervised Anomaly Detection in Multi-Agent Trajectory Prediction via Transformer-Based Models | Qing Lyu et.al. | 2601.20367 | null |
| 2026-01-27 | Game-Theoretic Autonomous Driving: A Graphs of Convex Sets Approach | Nikolaj Käfer et.al. | 2601.20054 | null |
| 2026-01-27 | ScenePilot-Bench: A Large-Scale Dataset and Benchmark for Evaluation of Vision-Language Models in Autonomous Driving | Yujin Wang et.al. | 2601.19582 | null |
| 2026-01-27 | Instance-Guided Radar Depth Estimation for 3D Object Detection | Chen-Chou Lo et.al. | 2601.19314 | null |
| 2026-01-26 | Learning the Pareto Space of Multi-Objective Autonomous Driving: A Modular, Data-Driven Approach | Mohammad Elayan et.al. | 2601.18913 | null |
| 2026-01-26 | Towards Safety-Compliant Transformer Architectures for Automotive Systems | Sven Kirchner et.al. | 2601.18850 | null |
| 2026-01-25 | Masked Depth Modeling for Spatial Perception | Bin Tan et.al. | 2601.17895 | null |
| 2026-01-23 | PocketDVDNet: Realtime Video Denoising for Real Camera Noise | Crispian Morris et.al. | 2601.16780 | null |
| 2026-01-22 | DMAVA: Distributed Multi-Autonomous Vehicle Architecture Using Autoware | Zubair Islam et.al. | 2601.16336 | null |
| 2026-01-22 | EVolSplat4D: Efficient Volume-based Gaussian Splatting for 4D Urban Scene Synthesis | Sheng Miao et.al. | 2601.15951 | null |
| 2026-01-22 | DualShield: Safe Model Predictive Diffusion via Reachability Analysis for Interactive Autonomous Driving | Rui Yang et.al. | 2601.15729 | null |
| 2026-01-22 | SuperOcc: Toward Cohesive Temporal Modeling for Superquadric-based Occupancy Prediction | Zichen Yu et.al. | 2601.15644 | null |
| 2026-01-21 | SplatBus: A Gaussian Splatting Viewer Framework via GPU Interprocess Communication | Yinghan Xu et.al. | 2601.15431 | null |
| 2026-01-29 | DrivIng: A Large-Scale Multimodal Driving Dataset with Full Digital Twin Integration | Dominik Rößle et.al. | 2601.15260 | null |
| 2026-01-21 | AutoDriDM: An Explainable Benchmark for Decision-Making of Vision-Language Models in Autonomous Driving | Zecong Tang et.al. | 2601.14702 | null |
| 2026-01-20 | Vision-Based Natural Language Scene Understanding for Autonomous Driving: An Extended Dataset and a New Model for Traffic Scene Description Generation | Danial Sadrian Zadeh et.al. | 2601.14438 | null |
| 2026-01-20 | Correcting and Quantifying Systematic Errors in 3D Box Annotations for Autonomous Driving | Alexandre Justo Miro et.al. | 2601.14038 | null |
| 2026-01-20 | PAtt: A Pattern Attention Network for ETA Prediction Using Historical Speed Profiles | ByeoungDo Kim et.al. | 2601.13793 | null |
| 2026-01-19 | NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness | Ali Shafiee Sarvestani et.al. | 2601.13162 | null |
| 2026-01-19 | AsyncBEV: Cross-modal Flow Alignment in Asynchronous 3D Object Detection | Shiming Wang et.al. | 2601.12994 | null |
| 2026-01-19 | PlannerRFT: Reinforcing Diffusion Planners through Closed-Loop and Sample-Efficient Fine-Tuning | Hongchen Li et.al. | 2601.12901 | null |
| 2026-01-19 | Efficient Local-to-Global Collaborative Perception via Joint Communication and Computation Optimization | Hui Zhang et.al. | 2601.12749 | null |
| 2026-01-19 | VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy Robustness | Qimao Chen et.al. | 2601.12672 | null |
| 2026-01-18 | SGCP: A Self-Organized Game-Theoretic Framework For Collaborative Perception | Zechuan Gong et.al. | 2601.12524 | null |
| 2026-01-18 | HOT-POT: Optimal Transport for Sparse Stereo Matching | Antonin Clerc et.al. | 2601.12423 | null |
| 2026-01-17 | Neural Process-Based Reactive Controller for Autonomous Racing | Devin Hunter et.al. | 2601.12143 | null |
| 2026-01-17 | Listen, Look, Drive: Coupling Audio Instructions for User-aware VLA-based Autonomous Driving | Ziang Guo et.al. | 2601.12142 | null |
| 2026-01-17 | Kernel-Based Learning of Safety Barriers | Oliver Schön et.al. | 2601.12002 | null |
| 2026-01-17 | Beyond Target-Level: ISAC-Enabled Event-Level Sensing for Behavioral Intention Prediction | Haotian Liu et.al. | 2601.11894 | null |
| 2026-01-16 | Toward Human-Centered Human-AI Interaction: Advances in Theoretical Frameworks and Practice | Zaifeng Gao et.al. | 2601.11812 | null |
| 2026-01-16 | Cross-Domain Object Detection Using Unsupervised Image Translation | Vinicius F. Arruda et.al. | 2601.11779 | null |
| 2026-01-16 | Generative Scenario Rollouts for End-to-End Autonomous Driving | Rajeev Yasarla et.al. | 2601.11475 | null |
| 2026-01-21 | SUG-Occ: An Explicit Semantics and Uncertainty Guided Sparse Learning Framework for Real-Time 3D Occupancy Prediction | Hanlin Wu et.al. | 2601.11396 | null |
| 2026-01-15 | A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems | Yizhou Wang et.al. | 2601.10819 | null |
| 2026-01-15 | See Less, Drive Better: Generalizable End-to-End Autonomous Driving via Foundation Models Stochastic Patch Selection | Amir Mallak et.al. | 2601.10707 | null |
| 2026-01-15 | DeepUrban: Interaction-Aware Trajectory Prediction and Planning for Automated Driving by Aerial Imagery | Constantin Selzer et.al. | 2601.10554 | null |
| 2026-01-15 | BikeActions: An Open Platform and Benchmark for Cyclist-Centric VRU Action Recognition | Max A. Buettner et.al. | 2601.10521 | null |
| 2026-01-15 | SatMap: Revisiting Satellite Maps as Prior for Online HD Map Construction | Kanak Mazumder et.al. | 2601.10512 | null |
| 2026-01-15 | OT-Drive: Out-of-Distribution Off-Road Traversable Area Segmentation via Optimal Transport | Zhihua Zhao et.al. | 2601.09952 | null |
| 2026-01-14 | LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving | Carlo Sgaravatti et.al. | 2601.09812 | null |
| 2026-01-14 | MAD: Motion Appearance Decoupling for efficient Driving World Models | Ahmad Rahimi et.al. | 2601.09452 | null |
| 2026-01-14 | Data Scaling for Navigation in Unknown Environments | Lauri Suomela et.al. | 2601.09444 | null |
| 2026-01-14 | ReflexDiffusion: Reflection-Enhanced Trajectory Planning for High-lateral-acceleration Scenarios in Autonomous Driving | Xuemei Yao et.al. | 2601.09377 | null |
| 2026-01-14 | Monte-Carlo Tree Search with Neural Network Guidance for Lane-Free Autonomous Driving | Ioannis Peridis et.al. | 2601.09353 | null |
| 2026-01-13 | SoC: Semantic Orthogonal Calibration for Test-Time Prompt Tuning | Leo Fillioux et.al. | 2601.08617 | null |
| 2026-01-13 | Coverage-Guided Road Selection and Prioritization for Efficient Testing in Autonomous Driving Systems | Qurban Ali et.al. | 2601.08609 | null |
| 2026-01-14 | Large Multimodal Models for Embodied Intelligent Driving: The Next Frontier in Self-Driving? | Long Zhang et.al. | 2601.08434 | null |
| 2026-01-15 | Semantic Misalignment in Vision-Language Models under Perceptual Degradation | Guo Cheng et.al. | 2601.08355 | null |
| 2026-01-09 | An Empirical Study on Knowledge Transfer under Domain and Label Shifts in 3D LiDAR Point Clouds | Subeen Lee et.al. | 2601.07855 | null |
| 2026-01-12 | Leveraging 3D Representation Alignment and RGB Pretrained Priors for LiDAR Scene Generation | Nicolas Sereyjol-Garros et.al. | 2601.07692 | link |
| 2026-01-13 | ViewMorpher3D: A 3D-aware Diffusion Framework for Multi-Camera Novel View Synthesis in Autonomous Driving | Farhad G. Zanjani et.al. | 2601.07540 | null |
| 2026-01-12 | Task Prototype-Based Knowledge Retrieval for Multi-Task Learning from Partially Annotated Data | Youngmin Oh et.al. | 2601.07474 | null |
| 2026-01-12 | Software-Hardware Co-optimization for Modular E2E AV Paradigm: A Unified Framework of Optimization Approaches, Simulation Environment and Evaluation Metrics | Chengzhi Ji et.al. | 2601.07393 | null |
| 2026-01-12 | SC-MII: Infrastructure LiDAR-based 3D Object Detection on Edge Devices for Split Computing with Multiple Intermediate Outputs Integration | Taisuke Noguchi et.al. | 2601.07119 | null |
| 2026-01-11 | Efficient Visual Question Answering Pipeline for Autonomous Driving via Scene Region Compression | Yuliang Cai et.al. | 2601.07092 | null |
| 2026-01-11 | Conditional Normalizing Flows for Forward and Backward Joint State and Parameter Estimation | Luke S. Lagunowich et.al. | 2601.07013 | null |
| 2026-01-10 | SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning | Chenxu Dang et.al. | 2601.06474 | null |
| 2026-01-10 | WHU-PCPR: A cross-platform heterogeneous point cloud dataset for place recognition in complex urban scenes | Xianghong Zou et.al. | 2601.06442 | null |
| 2026-01-09 | Toward Safe and Responsible AI Agents: A Three-Pillar Model for Transparency, Accountability, and Trustworthiness | Edward C. Cheng et.al. | 2601.06223 | null |
| 2026-01-09 | GeoSurDepth: Spatial Geometry-Consistent Self-Supervised Depth Estimation for Surround-View Cameras | Weimin Liu et.al. | 2601.05839 | null |
| 2026-01-09 | Modular Autonomy with Conversational Interaction: An LLM-driven Framework for Decision Making in Autonomous Driving | Marvin Seegert et.al. | 2601.05806 | null |
| 2026-01-09 | Drivora: A Unified and Extensible Infrastructure for Search-based Autonomous Driving Testing | Mingfei Cheng et.al. | 2601.05685 | null |
| 2026-01-12 | SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving | Jingyu Li et.al. | 2601.05640 | null |
| 2026-01-09 | LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction | Chengen Xie et.al. | 2601.05611 | null |
| 2026-01-08 | UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition | Filippo Ghilotti et.al. | 2601.05105 | null |
| 2026-01-08 | Driving on Registers | Ellington Kirby et.al. | 2601.05083 | link |
| 2026-01-08 | SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection | Maximilian Pittner et.al. | 2601.04968 | null |
| 2026-01-08 | ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving | Chang Zhao et.al. | 2601.04714 | null |
| 2026-01-08 | The UnScripted Trip: Fostering Policy Discussion on Future Human-Vehicle Collaboration in Autonomous Driving Through Design-Oriented Methods | Xinyan Yu et.al. | 2601.04601 | null |
| 2026-01-08 | Timeliness-Oriented Scheduling and Resource Allocation in Multi-Region Collaborative Perception | Mengmeng Zhu et.al. | 2601.04542 | null |
| 2026-01-07 | UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving | Zhexiao Xiong et.al. | 2601.04453 | null |
| 2026-01-07 | 3D-Agent: Tri-Modal Multi-Agent Collaboration for Scalable 3D Object Annotation | Jusheng Zhang et.al. | 2601.04404 | null |
| 2026-01-07 | A Systematic Mapping Study on the Debugging of Autonomous Driving Systems | Nathan Shaw et.al. | 2601.04293 | null |
| 2026-01-07 | Correcting Autonomous Driving Object Detection Misclassifications with Automated Commonsense Reasoning | Keegan Kimbrell et.al. | 2601.04271 | null |
| 2026-01-07 | Towards Safe Autonomous Driving: A Real-Time Motion Planning Algorithm on Embedded Hardware | Korbinian Moller et.al. | 2601.03904 | null |
| 2026-01-07 | On the Robustness of Fairness Practices: A Causal Framework for Systematic Evaluation | Verya Monjezi et.al. | 2601.03621 | null |
| 2026-01-07 | A Vision-Language-Action Model with Visual Prompt for OFF-Road Autonomous Driving | Liangdong Zhang et.al. | 2601.03519 | null |
| 2026-01-06 | FROST-Drive: Scalable and Efficient End-to-End Driving with a Frozen Vision Encoder | Zeyu Dong et.al. | 2601.03460 | null |
| 2026-01-06 | Enhancing Safety in Automated Ports: A Virtual Reality Study of Pedestrian-Autonomous Vehicle Interactions under Time Pressure, Visual Constraints, and Varying Vehicle Size | Yuan Che et.al. | 2601.03218 | null |
| 2026-01-06 | Towards Efficient 3D Object Detection for Vehicle-Infrastructure Collaboration via Risk-Intent Selection | Li Wang et.al. | 2601.03001 | null |
| 2026-01-07 | HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps | Xuchang Zhong et.al. | 2601.02730 | null |
| 2026-01-05 | VIT-Ped: Visionary Intention Transformer for Pedestrian Behavior Analysis | Aly R. Elkammar et.al. | 2601.01989 | null |
| 2026-01-05 | Sparse Threats, Focused Defense: Criticality-Aware Robust Reinforcement Learning for Safe Autonomous Driving | Qi Wei et.al. | 2601.01800 | null |
| 2026-01-05 | AlignDrive: Aligned Lateral-Longitudinal Planning for End-to-End Autonomous Driving | Yanhao Wu et.al. | 2601.01762 | null |
| 2026-01-04 | LabelAny3D: Label Any Object 3D in the Wild | Jin Yao et.al. | 2601.01676 | null |
| 2026-01-04 | Optically Transparent Meta-Grating Embedded in Rear Windshields for Automotive Radar Detection | Sergey Geyman et.al. | 2601.01551 | null |
| 2026-01-04 | DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving | Yang Zhou et.al. | 2601.01528 | link |
| 2026-01-04 | ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking | Xiaobao Wei et.al. | 2601.01386 | null |
| 2025-12-31 | Dichotomous Diffusion Policy Optimization | Ruiming Liang et.al. | 2601.00898 | link |
| 2026-01-01 | PatchBlock: A Lightweight Defense Against Adversarial Patches for Embedded EdgeAI Devices | Nandish Chattopadhyay et.al. | 2601.00367 | null |
| 2026-01-01 | Rectifying Adversarial Examples Using Their Vulnerabilities | Fumiya Morimoto et.al. | 2601.00270 | null |
| 2025-12-31 | Semi-Supervised Diversity-Aware Domain Adaptation for 3D Object detection | Bartłomiej Olber et.al. | 2512.24922 | null |
| 2026-01-04 | LSRE: Latent Semantic Rule Encoding for Real-Time Semantic Risk Detection in Autonomous Driving | Qian Cheng et.al. | 2512.24712 | null |
| 2025-12-31 | Decentralized No-Regret Frequency-Time Scheduling for FMCW Radar Interference Avoidance | Yunian Pan et.al. | 2512.24619 | null |
| 2025-12-30 | Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning | Zhenghao “Mark” Peng et.al. | 2512.24426 | null |
| 2025-12-30 | Spatial-aware Vision Language Model for Autonomous Driving | Weijie Wei et.al. | 2512.24331 | null |
| 2025-12-30 | MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation | Fuqiang Gu et.al. | 2512.24243 | null |
| 2025-12-30 | Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes | Shuyun Wang et.al. | 2512.24227 | null |
| 2025-12-30 | Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks | Yongtao Chen et.al. | 2512.24111 | null |
| 2025-12-30 | Multi-Scenario Highway Lane-Change Intention Prediction: A Temporal Physics-Informed Multi-Modal Framework | Jiazhao Shi et.al. | 2512.24075 | null |
| 2025-12-30 | DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation | Yuang Jia et.al. | 2512.23983 | null |
| 2025-12-29 | Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception | Xiaoyu Li et.al. | 2512.23635 | null |
| 2025-12-29 | Parallelized Code Generation from Simulink Models for Event-driven and Timer-driven ROS 2 Nodes | Kenshin Obi et.al. | 2512.23605 | null |
| 2025-12-29 | A Kalman Filter-Based Disturbance Observer for Steer-by-Wire Systems | Nikolai Beving et.al. | 2512.23593 | null |
| 2025-12-29 | Unsupervised Learning for Detection of Rare Driving Scenarios | Dat Le et.al. | 2512.23585 | null |
| 2025-12-29 | Model-based Development for Autonomous Driving Software Considering Parallelization | Kenshin Obi et.al. | 2512.23575 | null |
| 2025-12-29 | Assessing behaviour coverage in a multi-agent system simulation for autonomous vehicle testing | Manuel Franco-Vivo et.al. | 2512.23445 | null |
| 2025-12-31 | DriveLaW: Unifying Planning and Video Generation in a Latent Driving World | Tianze Xia et.al. | 2512.23421 | null |
| 2025-12-29 | A Human-Oriented Cooperative Driving Approach: Integrating Driving Intention, State, and Conflict | Qin Wang et.al. | 2512.23220 | link |
| 2025-12-29 | Exploring Syn-to-Real Domain Adaptation for Military Target Detection | Jongoh Jeong et.al. | 2512.23208 | null |
| 2025-12-29 | A Weak Signal Learning Dataset and Its Baseline Method | Xianqi Liu et.al. | 2512.23160 | null |
| 2025-12-28 | Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection | Runwei Guan et.al. | 2512.22972 | null |
| 2025-12-28 | ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving | Qihang Peng et.al. | 2512.22939 | link |
| 2025-12-27 | SCPainter: A Unified Framework for Realistic 3D Asset Insertion and Novel View Synthesis | Paul Dobre et.al. | 2512.22706 | null |
| 2025-12-27 | CoDS: Collaborative Perception via Digital Semantic Communication | Jipeng Gan et.al. | 2512.22513 | null |
| 2025-12-27 | SCAFusion: A Multimodal 3D Detection Framework for Small Object Detection in Lunar Surface Exploration | Xin Chen et.al. | 2512.22503 | null |
| 2025-12-26 | Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models | Zongmin Zhang et.al. | 2512.22046 | null |
| 2025-12-26 | RT-Focuser: A Real-Time Lightweight Model for Edge-side Image Deblurring | Zhuoyu Wu et.al. | 2512.21975 | null |
| 2025-12-26 | TimeBill: Time-Budgeted Inference for Large Language Models | Qi Fan et.al. | 2512.21859 | null |
| 2025-12-26 | End-to-End 3D Spatiotemporal Perception with Multimodal Fusion and V2X Collaboration | Zhenwei Yang et.al. | 2512.21831 | null |
| 2025-12-25 | SymDrive: Realistic and Controllable Driving Simulator via Symmetric Auto-regressive Online Restoration | Zhiyuan Liu et.al. | 2512.21618 | null |
| 2025-12-24 | SparScene: Efficient Traffic Scene Representation via Sparse Graph Learning for Large-Scale Trajectory Generation | Xiaoyu Mo et.al. | 2512.21133 | null |
| 2025-12-25 | Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation | Reeshad Khan et.al. | 2512.20815 | null |
| 2025-12-23 | OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective | Markus Gross et.al. | 2512.20770 | null |
| 2025-12-23 | KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System | Zhongyu Xia et.al. | 2512.20299 | null |
| 2025-12-23 | UrbanV2X: A Multisensory Vehicle-Infrastructure Dataset for Cooperative Navigation in Urban Areas | Qijun Qin et.al. | 2512.20224 | null |
| 2025-12-23 | RESPOND: Risk-Enhanced Structured Pattern for LLM-driven Online Node-level Decision-making | Dan Chen et.al. | 2512.20179 | null |
| 2025-12-23 | LiDARDraft: Generating LiDAR Point Cloud from Versatile Inputs | Haiyun Wei et.al. | 2512.20105 | null |
| 2025-12-22 | Vehicle-centric Perception via Multimodal Structured Pre-training | Wentao Wu et.al. | 2512.19934 | null |
| 2025-12-22 | A Gauss-Newton-Induced Structure-Exploiting Algorithm for Differentiable Optimal Control | Yuankun Chen et.al. | 2512.19447 | link |
| 2025-12-22 | Are All Data Necessary? Efficient Data Pruning for Large-scale Autonomous Driving Dataset via Trajectory Entropy Maximization | Zhaoyang Liu et.al. | 2512.19270 | null |
| 2025-12-22 | AMap: Distilling Future Priors for Ahead-Aware Online HD Map Construction | Ruikai Li et.al. | 2512.19150 | null |
| 2025-12-22 | WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving | Pengxuan Yang et.al. | 2512.19133 | null |
| 2025-12-22 | VOIC: Visible-Occluded Decoupling for Monocular 3D Semantic Scene Completion | Zaidao Han et.al. | 2512.18954 | null |
| 2025-12-21 | CrashChat: A Multimodal Large Language Model for Multitask Traffic Crash Video Analysis | Kaidi Liang et.al. | 2512.18878 | null |
| 2025-12-21 | InDRiVE: Reward-Free World-Model Pretraining for Autonomous Driving via Latent Disagreement | Feeza Khan Khanzada et.al. | 2512.18850 | null |
| 2025-12-21 | Misbehavior Forecasting for Focused Autonomous Driving Systems Testing | M M Abid Naziri et.al. | 2512.18823 | null |
| 2025-12-21 | CauTraj: A Causal-Knowledge-Guided Framework for Lane-Changing Trajectory Planning of Autonomous Vehicles | Cailin Lei et.al. | 2512.18703 | null |
| 2025-12-21 | Offline Reinforcement Learning for End-to-End Autonomous Driving | Chihiro Noguchi et.al. | 2512.18662 | null |
| 2025-12-20 | Systematic Benchmarking of SUMO Against Data-Driven Traffic Simulators | Erdao Liang et.al. | 2512.18537 | null |
| 2025-12-20 | Prioritized Constraints in Optimization-Based Control | Daniel Arnström et.al. | 2512.18458 | null |
| 2025-12-20 | LLaViDA: A Large Language Vision Driving Assistant for Explicit Reasoning and Enhanced Trajectory Planning | Yudong Liu et.al. | 2512.18211 | null |
| 2025-12-19 | Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation | Shreshth Rajan et.al. | 2512.18082 | null |
| 2025-12-19 | StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection | Di Wu et.al. | 2512.17620 | null |
| 2025-12-19 | Learning Safe Autonomous Driving Policies Using Predictive Safety Representations | Mahesh Keswani et.al. | 2512.17586 | null |
| 2025-12-22 | TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data | Deqing Liu et.al. | 2512.17370 | null |
| 2025-12-18 | DVGT: Driving Visual Geometry Transformer | Sicheng Zuo et.al. | 2512.16919 | null |
| 2025-12-18 | Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future | Tianshuai Hu et.al. | 2512.16760 | link |
| 2025-12-18 | The Bi-objective Electric Autonomous Dial-a-Ride Problem | Yue Su et.al. | 2512.16605 | null |
| 2025-12-18 | Autoencoder-based Denoising Defense against Adversarial Attacks on Object Detection | Min Geun Song et.al. | 2512.16123 | null |
| 2025-12-18 | Driving in Corner Case: A Real-World Adversarial Closed-Loop Evaluation Platform for End-to-End Autonomous Driving | Jiaheng Geng et.al. | 2512.16055 | null |
| 2025-12-17 | From Words to Wavelengths: VLMs for Few-Shot Multispectral Object Detection | Manuel Nkegoum et.al. | 2512.15971 | null |
| 2025-12-17 | Human-like Working Memory from Artificial Intrinsic Plasticity Neurons | Jingli Liu et.al. | 2512.15829 | null |
| 2025-12-17 | OccSTeP: Benchmarking 4D Occupancy Spatio-Temporal Persistence | Yu Zheng et.al. | 2512.15621 | null |
| 2025-12-17 | Gaussian Process Dual MPC using Active Inference: An Autonomous Vehicle Usecase | Mohammad Mahmoudi Filabadi et.al. | 2512.15381 | null |
| 2025-12-17 | KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird’s-Eye-View Segmentation | Wenke E et.al. | 2512.15311 | null |
| 2025-12-17 | EPSM: A Novel Metric to Evaluate the Safety of Environmental Perception in Autonomous Driving | Jörg Gamerdinger et.al. | 2512.15195 | null |
| 2025-12-17 | Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network | Zhuoran Li et.al. | 2512.15109 | null |
| 2025-12-18 | LADY: Linear Attention for Autonomous Driving Efficiency without Transformers | Jihao Huang et.al. | 2512.15038 | null |
| 2025-12-16 | DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance | Shreedhar Govil et.al. | 2512.14266 | link |
| 2025-12-16 | OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving | Tao Tang et.al. | 2512.14225 | null |
| 2025-12-16 | CIS-BA: Continuous Interaction Space Based Backdoor Attack for Object Detection in the Real-World | Shuxin Zhao et.al. | 2512.14158 | null |
| 2025-12-16 | OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving | Zhenguo Zhang et.al. | 2512.14044 | null |
| 2025-12-16 | FocalComm: Hard Instance-Aware Multi-Agent Perception | Dereje Shenkut et.al. | 2512.13982 | null |
| 2025-12-15 | A Convex Obstacle Avoidance Formulation | Ricardo Tapia et.al. | 2512.13836 | null |
| 2025-12-16 | MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning | Haoyu Fu et.al. | 2512.13636 | null |
| 2025-12-15 | Post-Training and Test-Time Scaling of Generative Agent Behavior Models for Interactive Autonomous Driving | Hyunki Seong et.al. | 2512.13262 | null |
| 2025-12-16 | MMDrive: Interactive Scene Understanding Beyond Vision with Multi-representational Fusion | Minghui Hou et.al. | 2512.13177 | null |
| 2025-12-15 | Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather | Zhijian He et.al. | 2512.13107 | null |
| 2025-12-15 | Sequence of Expert: Boosting Imitation Planners for Autonomous Driving through Temporal Alternation | Xiang Li et.al. | 2512.13094 | null |
| 2025-12-15 | Machine Learning Architectures for the Estimation of Predicted Occupancy Grids in Road Traffic | Parthasarathy Nadarajan et.al. | 2512.12907 | null |
| 2025-12-14 | GradID: Adversarial Detection via Intrinsic Dimensionality of Gradients | Mohammad Mahdi Razmjoo et.al. | 2512.12827 | null |
| 2025-12-14 | DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning | Zhe Liu et.al. | 2512.12799 | null |
| 2025-12-14 | High Order Control Lyapunov Function - Control Barrier Function - Quadratic Programming Based Autonomous Driving Controller for Bicyclist Safety | Haochong Chen et.al. | 2512.12776 | null |
| 2025-12-13 | From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving | Huan Zheng et.al. | 2512.12302 | null |
| 2025-12-13 | Measuring What Matters: Scenario-Driven Evaluation for Trajectory Predictors in Autonomous Driving | Longchao Da et.al. | 2512.12211 | null |
| 2025-12-12 | A Review of Learning-Based Motion Planning: Toward a Data-Driven Optimal Control Approach | Jia Hu et.al. | 2512.11944 | null |
| 2025-12-12 | TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder | Qinghao Meng et.al. | 2512.11926 | null |
| 2025-12-12 | LUCID: Learning-Enabled Uncertainty-Aware Certification of Stochastic Dynamical Systems | Ernesto Casablanca et.al. | 2512.11750 | null |
| 2025-12-12 | Evaluating Foundation Models’ 3D Understanding Through Multi-View Correspondence Analysis | Valentina Lilova et.al. | 2512.11574 | null |
| 2025-12-12 | CarlaNCAP: A Framework for Quantifying the Safety of Vulnerable Road Users in Infrastructure-Assisted Collective Perception Using EuroNCAP Scenarios | Jörg Gamerdinger et.al. | 2512.11551 | null |
| 2025-12-12 | SATMapTR: Satellite Image Enhanced Online HD Map Construction | Bingyuan Huang et.al. | 2512.11319 | null |
| 2025-12-12 | Elevation Aware 2D/3D Co-simulation Framework for Large-scale Traffic Flow and High-fidelity Vehicle Dynamics | Chandra Raskoti et.al. | 2512.11249 | null |
| 2025-12-12 | FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model | Hongbin Lin et.al. | 2512.11226 | null |
| 2025-12-12 | Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving | Jiawei Yang et.al. | 2512.10947 | null |
| 2025-12-11 | SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving | Peizheng Li et.al. | 2512.10719 | null |
| 2025-12-11 | NaviHydra: Controllable Navigation-guided End-to-end Autonomous Driving with Hydra-distillation | Hanfeng Wu et.al. | 2512.10660 | null |
| 2025-12-11 | UACER: An Uncertainty-Aware Critic Ensemble Framework for Robust Adversarial Reinforcement Learning | Jiaxi Wu et.al. | 2512.10492 | null |
| 2025-12-11 | T-SKM-Net: Trainable Neural Network Framework for Linear Constraint Satisfaction via Sampling Kaczmarz-Motzkin Method | Haoyu Zhu et.al. | 2512.10461 | null |
| 2025-12-11 | Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method | Ge Zhang et.al. | 2512.10386 | null |
| 2025-12-11 | InfoCom: Kilobyte-Scale Communication-Efficient Collaborative Perception with Information Bottleneck | Quanmin Wei et.al. | 2512.10305 | null |
| 2025-12-11 | Latent Chain-of-Thought World Modeling for End-to-End Driving | Shuhan Tan et.al. | 2512.10226 | null |
| 2025-12-10 | UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving | Hao Lu et.al. | 2512.09864 | null |
| 2025-12-10 | COVLM-RL: Critical Object-Oriented Reasoning for Autonomous Driving Using VLM-Guided Reinforcement Learning | Lin Li et.al. | 2512.09349 | null |
| 2025-12-10 | Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving | Songhan Wu et.al. | 2512.09296 | null |
| 2025-12-09 | Understanding Mental States in Active and Autonomous Driving with EEG | Prithila Angkan et.al. | 2512.09190 | null |
| 2025-12-09 | Astra: General Interactive World Model with Autoregressive Denoising | Yixuan Zhu et.al. | 2512.08931 | null |
| 2025-12-09 | A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems | Po-An Shih et.al. | 2512.08476 | null |
| 2025-12-09 | Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection | Haowen Zheng et.al. | 2512.08247 | null |
| 2025-12-09 | Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators | Yuki Kubota et.al. | 2512.08163 | null |
| 2025-12-08 | DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving | Jialv Zou et.al. | 2512.07745 | null |
| 2025-12-08 | VP-AutoTest: A Virtual-Physical Fusion Autonomous Driving Testing Platform | Yiming Cui et.al. | 2512.07507 | null |
| 2025-12-08 | Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood | Gilhyun Nam et.al. | 2512.07390 | null |
| 2025-12-08 | Unified Camera Positional Encoding for Controlled Video Generation | Cheng Zhang et.al. | 2512.07237 | link |
| 2025-12-09 | TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning | Zebin Xing et.al. | 2512.07135 | null |
| 2025-12-08 | Mimir: Hierarchical Goal-Driven Diffusion with Uncertainty Propagation for End-to-End Autonomous Driving | Zebin Xing et.al. | 2512.07130 | link |
| 2025-12-07 | Spatial Retrieval Augmented Autonomous Driving | Xiaosong Jia et.al. | 2512.06865 | link |
| 2025-12-07 | SparseCoop: Cooperative Perception with Kinematic-Grounded Queries | Jiahao Wang et.al. | 2512.06838 | null |
| 2025-12-07 | FedDSR: Federated Deep Supervision and Regularization Towards Autonomous Driving | Wei-Bin Kou et.al. | 2512.06676 | null |
| 2025-12-07 | Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving | Wei-Bin Kou et.al. | 2512.06664 | null |
| 2025-12-06 | UncertaintyZoo: A Unified Toolkit for Quantifying Predictive Uncertainty in Deep Learning Systems | Xianzong Wu et.al. | 2512.06406 | null |
| 2025-12-06 | Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework | Xinhao Xiang et.al. | 2512.06376 | null |
| 2025-12-06 | NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks | Fangzhou Lin et.al. | 2512.06251 | null |
| 2025-12-05 | Situation-Aware Interactive MPC Switching for Autonomous Driving | Shuhao Qi et.al. | 2512.06182 | null |
| 2025-12-05 | WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving | Yifang Xu et.al. | 2512.06112 | null |
| 2025-12-05 | BeLLA: End-to-End Birds Eye View Large Language Assistant for Autonomous Driving | Karthik Mohan et.al. | 2512.06096 | null |
| 2025-12-05 | Representation Learning for Point Cloud Understanding | Siming Yan et.al. | 2512.06058 | null |
| 2025-12-05 | OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning | Xusheng Guo et.al. | 2512.05698 | null |
| 2025-12-05 | LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving | Yiming Shu et.al. | 2512.05686 | null |
| 2025-12-05 | Scenario-aware Uncertainty Quantification for Trajectory Prediction with Statistical Guarantees | Yiming Shu et.al. | 2512.05682 | null |
| 2025-12-05 | Concept-based Explainable Data Mining with VLM for 3D Detection | Mai Tsujimoto et.al. | 2512.05482 | null |
| 2025-12-05 | MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare | Zag ElSayed et.al. | 2512.05365 | null |
| 2025-12-05 | State-Conditional Adversarial Learning: An Off-Policy Visual Domain Transfer Method for End-to-End Imitation Learning | Yuxiang Liu et.al. | 2512.05335 | null |
| 2025-12-04 | WhatsCode: Large-Scale GenAI Deployment for Developer Efficiency at WhatsApp | Ke Mao et.al. | 2512.05314 | null |
| 2025-12-04 | From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model | Kevin Cannons et.al. | 2512.05277 | null |
| 2025-12-04 | Are Your Agents Upward Deceivers? | Dadi Guo et.al. | 2512.04864 | null |
| 2025-12-04 | FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis | Shijie Chen et.al. | 2512.04830 | null |
| 2025-12-04 | MT-Depth: Multi-task Instance feature analysis for the Depth Completion | Abdul Haseeb Nizamani et.al. | 2512.04734 | null |
| 2025-12-04 | E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving | Yihong Tang et.al. | 2512.04733 | null |
| 2025-12-04 | Efficient Safety Verification of Autonomous Vehicles with Neural Network Operator | Lingxiang Fan et.al. | 2512.04557 | null |
| 2025-12-04 | dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning | Yingzi Ma et.al. | 2512.04459 | null |
| 2025-12-04 | MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving | Bin Sun et.al. | 2512.04441 | null |
| 2025-12-03 | DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle | Fangyu Lei et.al. | 2512.04324 | null |
| 2025-12-03 | Driving Beyond Privilege: Distilling Dense-Reward Knowledge into Sparse-Reward Policies | Feeza Khan Khanzada et.al. | 2512.04279 | null |
| 2025-12-03 | PINN vs LSTM: A Comparative Study for Steam Temperature Control in Heat Recovery Steam Generators | Mojtaba Fanoodi et.al. | 2512.04183 | null |
| 2025-12-03 | Fast & Efficient Normalizing Flows and Applications of Image Generative Models | Sandeep Nagar et.al. | 2512.04039 | null |
| 2025-12-03 | DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation | Zexin Lin et.al. | 2512.03992 | null |
| 2025-12-03 | Classification of User Satisfaction in HRI with Social Signals in the Wild | Michael Schiffmann et.al. | 2512.03945 | null |
| 2025-12-03 | Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response | Aron Distelzweig et.al. | 2512.03936 | null |
| 2025-12-03 | Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties | Vineel Tummala et.al. | 2512.03931 | null |
| 2025-12-03 | A Modular Architecture Design for Autonomous Driving Racing in Controlled Environments | Brais Fontan-Costas et.al. | 2512.03886 | null |
| 2025-12-03 | Multi-Agent Deep Reinforcement Learning for UAV-Assisted 5G Network Slicing: A Comparative Study of MAPPO, MADDPG, and MADQN | Ghoshana Bista et.al. | 2512.03835 | null |
| 2025-12-03 | MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving | Jia Hu et.al. | 2512.03795 | null |
| 2025-12-03 | Safety Reinforced Model Predictive Control (SRMPC): Improving MPC with Reinforcement Learning for Motion Planning in Autonomous Driving | Johannes Fischer et.al. | 2512.03774 | null |
| 2025-12-03 | Context-Triggered Contingency Games for Strategic Multi-Agent Interaction | Kilian Schweppe et.al. | 2512.03639 | null |
| 2025-12-03 | Market share maximizing strategies of CAV fleet operators may cause chaos in our cities | Grzegorz Jamróz et.al. | 2512.03524 | null |
| 2025-12-03 | Left shifting analysis of Human-Autonomous Team interactions to analyse risks of autonomy in high-stakes AI systems | Ben Larwood et.al. | 2512.03519 | null |
| 2025-12-03 | CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving | Zhijian Qiao et.al. | 2512.03510 | null |
| 2025-12-03 | Double-Edge-Assisted Computation Offloading and Resource Allocation for Space-Air-Marine Integrated Networks | Zhen Wang et.al. | 2512.03487 | null |
| 2025-12-03 | Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles | Haicheng Liao et.al. | 2512.03454 | null |
| 2025-12-03 | Generalization Evaluation of Deep Stereo Matching Methods for UAV-Based Forestry Applications | Yida Lin et.al. | 2512.03427 | null |
| 2025-12-03 | NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction | Thomas Monninger et.al. | 2512.03317 | null |
| 2025-12-02 | SpatialReasoner: Active Perception for Large-Scale 3D Scene Understanding | Hongpei Zheng et.al. | 2512.03284 | null |
| 2025-12-02 | Flux4D: Flow-based Unsupervised 4D Reconstruction | Jingkang Wang et.al. | 2512.03210 | null |
| 2025-12-02 | AGENTSAFE: A Unified Framework for Ethical Assurance and Governance in Agentic AI | Rafflesia Khan et.al. | 2512.03180 | null |
| 2025-12-02 | The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models | Saeid Jamshidi et.al. | 2512.03026 | null |
| 2025-12-02 | DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images | Xiaoxue Chen et.al. | 2512.03004 | link |
| 2025-12-02 | U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences | Xiang Xu et.al. | 2512.02982 | link |
| 2025-12-02 | Lumos: Let there be Language Model System Certification | Isha Chaudhary et.al. | 2512.02966 | null |
| 2025-12-02 | EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis | Yancheng Zhang et.al. | 2512.02932 | link |
| 2025-12-02 | VLM as Strategist: Adaptive Generation of Safety-critical Testing Scenarios via Guided Diffusion | Xinzheng Wu et.al. | 2512.02844 | null |
| 2025-12-02 | CogDrive: Cognition-Driven Multimodal Prediction-Planning Fusion for Safe Autonomy | Heye Huang et.al. | 2512.02777 | null |
| 2025-12-02 | Adaptive hydrogels with spatiotemporal stiffening using pH-modulating enzymes | Natascha Gray et.al. | 2512.02698 | null |
| 2025-12-02 | ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data | Yuxing Liu et.al. | 2512.02686 | null |
| 2025-12-02 | Wi-Fi Rate Adaptation for Moving Equipment in Industrial Environments | Pietro Chiavassa et.al. | 2512.02455 | null |
| 2025-12-02 | nuScenes Revisited: Progress and Challenges in Autonomous Driving | Whye Kit Fong et.al. | 2512.02448 | null |
| 2025-12-02 | Vehicle Dynamics Embedded World Models for Autonomous Driving | Huiqian Li et.al. | 2512.02417 | null |
| 2025-12-02 | Synthetic Error Injection Fails to Elicit Self-Correction In Language Models | David X. Wu et.al. | 2512.02389 | null |
| 2025-12-02 | Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention | Wenyi Xiong et.al. | 2512.02368 | null |
| 2025-12-02 | Near-Memory Architecture for Threshold-Ordinal Surface-Based Corner Detection of Event Cameras | Hongyang Shang et.al. | 2512.02346 | null |
| 2025-12-01 | RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies | Guillermo Garcia-Cobo et.al. | 2512.01993 | null |
| 2025-12-01 | Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory | Chenyi Wang et.al. | 2512.01934 | null |
| 2025-12-01 | NeuroHJR: Hamilton-Jacobi Reachability-based Obstacle Avoidance in Complex Environments with Physics-Informed Neural Networks | Granthik Halder et.al. | 2512.01897 | null |
| 2025-12-02 | OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic | Songyan Zhang et.al. | 2512.01830 | link |
| 2025-12-01 | AgriLiRa4D: A Multi-Sensor UAV Dataset for Robust SLAM in Challenging Agricultural Fields | Zhihao Zhan et.al. | 2512.01753 | link |
| 2025-12-01 | In-context Inverse Optimality for Fair Digital Twins: A Preference-based approach | Daniele Masti et.al. | 2512.01650 | null |
| 2025-12-01 | Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track | Mo Chen et.al. | 2512.01608 | null |
| 2025-12-01 | Language-Guided Open-World Anomaly Segmentation | Klara Reichard et.al. | 2512.01427 | null |
| 2025-12-01 | Accelerating Probabilistic Response-Time Analysis: Revised Critical Instant and Optimized Convolution | Hiroto Takahashi et.al. | 2512.01381 | null |
| 2025-12-01 | SocialDriveGen: Generating Diverse Traffic Scenarios with Controllable Social Interactions | Jiaguo Tian et.al. | 2512.01363 | null |
| 2025-12-01 | OpenBox: Annotate Any Bounding Boxes in 3D | In-Jae Lee et.al. | 2512.01352 | null |
| 2025-12-01 | CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL | Shinji Mai et.al. | 2512.01311 | null |
| 2025-12-01 | RoboDriveVLM: A Novel Benchmark and Baseline towards Robust Vision-Language Models for Autonomous Driving | Dacheng Liao et.al. | 2512.01300 | null |
| 2025-12-01 | COMET: A Dual Swashplate Autonomous Coaxial Bi-copter AAV with High-Maneuverability and Long-Endurance | Shuai Wang et.al. | 2512.01246 | null |
| 2025-12-01 | RoboLoc: A Benchmark Dataset for Point Place Recognition and Localization in Indoor-Outdoor Integrated Environments | Jaejin Jeon et.al. | 2512.01194 | null |
| 2025-12-01 | DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks | Hyunjun Kim et.al. | 2512.01174 | null |
| 2025-11-30 | Semantic Communications for Vehicle-Based Mission-Critical Services: Challenges and Solutions | Hui Zhou et.al. | 2512.01102 | null |
| 2025-11-30 | SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds | Jiawei Ren et.al. | 2512.01078 | null |
| 2025-11-30 | Autonomous Grasping On Quadruped Robot With Task Level Interaction | Muhtadin et.al. | 2512.01052 | null |
| 2025-11-30 | Approximating Analytically-Intractable Likelihood Densities with Deterministic Arithmetic for Optimal Particle Filtering | Orestis Kaparounakis et.al. | 2512.01023 | null |
| 2025-11-28 | Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation | Bernhard Klein et.al. | 2511.23440 | null |
| 2025-11-28 | SimScale: Learning to Drive via Real-World Simulation at Scale | Haochen Tian et.al. | 2511.23369 | null |
| 2025-11-28 | Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach | Haruki Sakajo et.al. | 2511.23311 | null |
| 2025-11-28 | Seeing before Observable: Potential Risk Reasoning in Autonomous Driving via Vision Language Models | Jiaxin Liu et.al. | 2511.22928 | null |
| 2025-11-28 | DM$^3$T: Harmonizing Modalities via Diffusion for Multi-Object Tracking | Weiran Li et.al. | 2511.22896 | null |
| 2025-11-28 | SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving | Wonjeong Ryu et.al. | 2511.22865 | null |
| 2025-11-28 | Safe Autonomous Lane Changing: Planning with Dynamic Risk Fields and Time-Varying Convex Space Generation | Zhen Tian et.al. | 2511.22829 | null |
| 2025-11-27 | Active flow-driven DNA remodeling generates millimeter-scale mechanical oscillations | Maya Levanon et.al. | 2511.22589 | null |
| 2025-11-27 | BUDD-e: an autonomous robotic guide for visually impaired users | Jinyang Li et.al. | 2511.22541 | null |
| 2025-11-27 | CoT4AD: A Vision-Language-Action Model with Explicit Chain-of-Thought Reasoning for Autonomous Driving | Zhaohui Wang et.al. | 2511.22532 | null |
| 2025-11-27 | Motion-to-Motion Latency Measurement Framework for Connected and Autonomous Vehicle Teleoperation | François Provost et.al. | 2511.22467 | null |
| 2025-11-27 | RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding | Xiyan Liu et.al. | 2511.22466 | null |
| 2025-11-27 | Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning | Bokang Zhang et.al. | 2511.22415 | null |
| 2025-11-27 | LLM-Based Generalizable Hierarchical Task Planning and Execution for Heterogeneous Robot Teams with Event-Driven Replanning | Suraj Borate et.al. | 2511.22354 | null |
| 2025-11-27 | DriveVGGT: Visual Geometry Transformer for Autonomous Driving | Xiaosong Jia et.al. | 2511.22264 | null |
| 2025-11-27 | Co-Evolving Agents: Learning from Failures as Hard Negatives | Yeonsung Jung et.al. | 2511.22254 | null |
| 2025-11-27 | HybridWorldSim: A Scalable and Controllable High-fidelity Simulator for Autonomous Driving | Qiang Li et.al. | 2511.22187 | null |
| 2025-11-27 | MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction | Maitrayee Keskar et.al. | 2511.22181 | null |
| 2025-11-27 | SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions | Aiyinsi Zuo et.al. | 2511.22142 | null |
| 2025-11-27 | Aligning with Human Values to Enhance Interaction: An eHMI-Mediated Lane-Changing Negotiation Strategy Using Bayesian Inference | Boyao Peng et.al. | 2511.22061 | null |
| 2025-11-26 | Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving | Haohong Lin et.al. | 2511.21584 | null |
| 2025-11-26 | Improvement of Collision Avoidance in Cut-In Maneuvers Using Time-to-Collision Metrics | Jamal Raiyn et.al. | 2511.21280 | null |
| 2025-11-26 | LaGen: Towards Autoregressive LiDAR Scene Generation | Sizhuo Zhou et.al. | 2511.21256 | null |
| 2025-11-25 | Hierarchical Evaluation of Software Design Capabilities of Large Language Models of Code | Mootez Saad et.al. | 2511.20933 | null |
| 2025-11-25 | Accelerating Sparse Convolutions in Voxel-Based Point Cloud Networks | Dionysios Adamopoulos et.al. | 2511.20834 | null |
| 2025-11-25 | Learning from Risk: LLM-Guided Generation of Safety-Critical Scenarios with Prior Knowledge | Yuhang Wang et.al. | 2511.20726 | null |
| 2025-11-25 | DeeAD: Dynamic Early Exit of Vision-Language Action for Efficient Autonomous Driving | Haibo HU et.al. | 2511.20720 | null |
| 2025-11-25 | Efficient Parallel Implementation of the Pilot Assignment Problem in Massive MIMO Systems | Eman Alqudah et.al. | 2511.20511 | null |
| 2025-11-25 | AD-R1: Closed-Loop Reinforcement Learning for End-to-End Autonomous Driving with Impartial World Models | Tianyi Yan et.al. | 2511.20325 | null |
| 2025-11-25 | LLM-Driven Transient Stability Assessment: From Automated Simulation to Neural Architecture Design | Lianzhe Hu et.al. | 2511.20276 | null |
| 2025-11-25 | Map-World: Masked Action planning and Path-Integral World Model for Autonomous Driving | Bin Hu et.al. | 2511.20156 | link |
| 2025-11-25 | “Are We Done Yet?”: A Vision-Based Judge for Autonomous Task Completion of Computer Use Agents | Marta Sumyk et.al. | 2511.20067 | null |
| 2025-11-25 | DeLightMono: Enhancing Self-Supervised Monocular Depth Estimation in Endoscopy by Decoupling Uneven Illumination | Mingyang Ou et.al. | 2511.20058 | null |
| 2025-11-25 | Energy Efficient Nonlinear Microscopic Dynamical Model for Autonomous and Electric Vehicles | Yuneil Yeo et.al. | 2511.20054 | null |
| 2025-11-25 | WaymoQA: A Multi-View Visual Question Answering Dataset for Safety-Critical Reasoning in Autonomous Driving | Seungjun Yu et.al. | 2511.20022 | null |
| 2025-11-25 | Cross-Modal Semantic Communication for Heterogeneous Collaborative Perception | Mingyi Lu et.al. | 2511.20000 | null |
| 2025-11-25 | On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices | Lianming Huang et.al. | 2511.19986 | null |
| 2025-11-25 | Hierarchical Spatio-Temporal Attention Network with Adaptive Risk-Aware Decision for Forward Collision Warning in Complex Scenarios | Haoran Hu et.al. | 2511.19952 | null |
| 2025-11-25 | CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model | Dapeng Zhang et.al. | 2511.19914 | null |
| 2025-11-25 | Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving | Dapeng Zhang et.al. | 2511.19912 | null |
| 2025-11-25 | 4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models | Yiting Lu et.al. | 2511.19836 | null |
| 2025-11-24 | Normative active inference: A numerical proof of principle for a computational and economic legal analytic approach to AI governance | Axel Constant et.al. | 2511.19334 | null |
| 2025-11-24 | IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes | Carl Lindström et.al. | 2511.19235 | null |
| 2025-11-24 | Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving | Jianhua Han et.al. | 2511.19221 | null |
| 2025-11-25 | VIL2C: Value-of-Information Aware Low-Latency Communication for Multi-Agent Reinforcement Learning | Qian Zhang et.al. | 2511.19146 | null |
| 2025-11-24 | Autonomous Docking of Multi-Rotor UAVs on Blimps under the Influence of Wind Gusts | Pascal Goldschmid et.al. | 2511.19135 | null |
| 2025-11-24 | MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images | Qirui Wang et.al. | 2511.19119 | link |
| 2025-11-24 | Agent Discovery in Internet of Agents: Challenges and Solutions | Shaolong Guo et.al. | 2511.19113 | null |
| 2025-11-24 | HABIT: Human Action Benchmark for Interactive Traffic in CARLA | Mohan Ramesh et.al. | 2511.19109 | null |
| 2025-11-24 | End-to-end Autonomous Vehicle Following System using Monocular Fisheye Camera | Jiale Zhang et.al. | 2511.19011 | null |
| 2025-11-24 | SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation | Nimeshika Udayangani et.al. | 2511.18816 | link |
| 2025-11-24 | From Features to Reference Points: Lightweight and Adaptive Fusion for Cooperative Autonomous Driving | Yongqi Zhu et.al. | 2511.18757 | null |
| 2025-11-24 | Thinking Ahead: Foresight Intelligence in MLLMs and World Models | Zhantao Gong et.al. | 2511.18735 | null |
| 2025-11-24 | GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving | Lin Liu et.al. | 2511.18729 | null |
| 2025-11-24 | DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving | Hongbin Lin et.al. | 2511.18713 | link |
| 2025-11-24 | Online Learning-Enhanced Lie Algebraic MPC for Robust Trajectory Tracking of Autonomous Surface Vehicles | Yinan Dong et.al. | 2511.18683 | null |
| 2025-11-24 | Data Augmentation Strategies for Robust Lane Marking Detection | Flora Lian et.al. | 2511.18668 | null |
| 2025-11-23 | The Evaluation for Usability Methods of Unmanned Surface Vehicles: Are Current Usability Methods Viable for Unmanned Surface Vehicles? Insights from a Multiple Case Study Approach to Human-Robot Interaction | Zitian Peng et.al. | 2511.18561 | null |
| 2025-11-23 | From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence | Jian Yang et.al. | 2511.18538 | null |
| 2025-11-23 | Splatblox: Traversability-Aware Gaussian Splatting for Outdoor Robot Navigation | Samarth Chopra et.al. | 2511.18525 | null |
| 2025-11-23 | Energy-Efficient Task Computation at the Edge for Vehicular Services | Paniz Parastar et.al. | 2511.18449 | null |
| 2025-11-21 | MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments | Zhiyu Huang et.al. | 2511.17496 | null |
| 2025-11-21 | Feasibility of Embodied Dynamics Based Bayesian Learning for Continuous Pursuit Motion Control of Assistive Mobile Robots in the Built Environment | Xiaoshan Zhou et.al. | 2511.17401 | null |
| 2025-11-21 | Vector Cost Behavioral Planning for Autonomous Robotic Systems with Contemporary Validation Strategies | Benjamin R. Toaz et.al. | 2511.17375 | null |
| 2025-11-21 | FORWARD: Dataset of a forwarder operating in rough terrain | Mikael Lundbäck et.al. | 2511.17318 | null |
| 2025-11-21 | Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing | Suchetan G. Uppur et.al. | 2511.17269 | null |
| 2025-11-21 | QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy | Adam Lilja et.al. | 2511.17221 | null |
| 2025-11-21 | Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition | Aditya Mishra et.al. | 2511.17183 | null |
| 2025-11-21 | DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving | Liuhan Yin et.al. | 2511.17150 | null |
| 2025-11-21 | Sparse Reasoning is Enough: Biological-Inspired Framework for Video Anomaly Detection with Large Pre-trained Models | He Huang et.al. | 2511.17094 | null |
| 2025-11-21 | VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions | Qianyi Shao et.al. | 2511.16998 | null |
| 2025-11-21 | MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots | Junseo Kim et.al. | 2511.16949 | null |
| 2025-11-20 | AutoBackdoor: Automating Backdoor Attacks via LLM Agents | Yige Li et.al. | 2511.16709 | null |
| 2025-11-20 | MiMo-Embodied: X-Embodied Foundation Model Technical Report | Xiaoshuai Hao et.al. | 2511.16518 | null |
| 2025-11-20 | Tube-Based Model Predictive Control with Random Fourier Features for Nonlinear Systems | Ákos M. Bokor et.al. | 2511.16425 | null |
| 2025-11-20 | Flow-Aided Flight Through Dynamic Clutters From Point To Motion | Bowen Xu et.al. | 2511.16372 | null |
| 2025-11-20 | DynaMimicGen: A Data Generation Framework for Robot Learning of Dynamic Tasks | Vincenzo Pomponi et.al. | 2511.16223 | null |
| 2025-11-20 | AskDB: An LLM Agent for Natural Language Interaction with Relational Databases | Xuan-Quang Phan et.al. | 2511.16131 | null |
| 2025-11-20 | LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving | Pei Liu et.al. | 2511.16049 | null |
| 2025-11-19 | RE for AI in Practice: Managing Data Annotation Requirements for AI Autonomous Driving Systems | Hina Saeeda et.al. | 2511.15859 | null |
| 2025-11-19 | Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges | Kim N. Nolle et.al. | 2511.15652 | link |
| 2025-11-19 | Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2511.15597 | null |
| 2025-11-20 | CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking | Sifan Zhou et.al. | 2511.15580 | null |
| 2025-11-19 | Computer-Use Agents as Judges for Generative User Interface | Kevin Qinghong Lin et.al. | 2511.15567 | null |
| 2025-11-19 | Scriboora: Rethinking Human Pose Forecasting | Daniel Bermuth et.al. | 2511.15565 | null |
| 2025-11-20 | UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy | Ruoqu Chen et.al. | 2511.15550 | null |
| 2025-11-19 | Uncoordinated Cooperative OFDM Multi-Hop UAV Relay Networks Using Virtual Channels Based on All-Pass Filters | Noura Sellami et.al. | 2511.15545 | null |
| 2025-11-19 | Driving in Spikes: An Entropy-Guided Object Detector for Spike Cameras | Ziyan Liu et.al. | 2511.15459 | null |
| 2025-11-19 | WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes | Marc-Emmanuel Coupvent des Graviers et.al. | 2511.15429 | null |
| 2025-11-19 | Unveiling Inference Scaling for Difference-Aware User Modeling in LLM Personalization | Suyu Chen et.al. | 2511.15389 | link |
| 2025-11-19 | Symmetry-Breaking in Multi-Agent Navigation: Winding Number-Aware MPC with a Learned Topological Strategy | Tomoki Nakao et.al. | 2511.15239 | null |
| 2025-11-19 | Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth Estimation | Jing Cao et.al. | 2511.15167 | null |
| 2025-11-19 | SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection | Chun-Jung Lin et.al. | 2511.15153 | link |
| 2025-11-19 | Data-driven control of network systems: Accounting for communication adaptivity and security | Gang Wang et.al. | 2511.15044 | null |
| 2025-11-18 | Z-Merge: Multi-Agent Reinforcement Learning for On-Ramp Merging with Zone-Specific V2X Traffic Information | Yassine Ibork et.al. | 2511.14910 | null |
| 2025-11-18 | Attacking Autonomous Driving Agents with Adversarial Machine Learning: A Holistic Evaluation with the CARLA Leaderboard | Henry Wong et.al. | 2511.14876 | null |
| 2025-11-18 | Uncertainty-Aware Measurement of Scenario Suite Representativeness for Autonomous Systems | Robab Aghazadeh Chakherlou et.al. | 2511.14853 | null |
| 2025-11-19 | Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks | Xianhui Meng et.al. | 2511.14592 | null |
| 2025-11-18 | Enhancing End-to-End Autonomous Driving with Risk Semantic Distillation from VLM | Jack Qin et.al. | 2511.14499 | null |
| 2025-11-18 | CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring | Mingchen Zhong et.al. | 2511.14469 | null |
| 2025-11-18 | Context-aware, Ante-hoc Explanations of Driving Behaviour | Dominik Grundt et.al. | 2511.14428 | null |
| 2025-11-18 | Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition | Fabian Schmidt et.al. | 2511.14391 | null |
| 2025-11-18 | Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving | Kangqiao Zhao et.al. | 2511.14386 | null |
| 2025-11-18 | Emergent Cooperative Driving Strategies for Stop-and-Go Wave Mitigation via Multi-Agent Reinforcement Learning | Raphael Korbmacher et.al. | 2511.14378 | null |
| 2025-11-18 | PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation | Xiangyu Li et.al. | 2511.14185 | null |
| 2025-11-18 | RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment | Zeyu Cheng et.al. | 2511.14107 | null |
| 2025-11-18 | Cosmological dynamics of interacting dark matter-dark energy in generalized Rastall gravity | Manuel Gonzalez-Espinoza et.al. | 2511.14089 | null |
| 2025-11-17 | LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering | Jielin Qiu et.al. | 2511.13998 | null |
| 2025-11-17 | VLMs Guided Interpretable Decision Making for Autonomous Driving | Xin Hu et.al. | 2511.13881 | null |
| 2025-11-17 | In-memory phononic learning toward cognitive mechanical intelligence | Yuning Zhang et.al. | 2511.13543 | null |
| 2025-11-17 | An Automated Framework for Analyzing Structural Evolution in On-the-fly Non-adiabatic Molecular Dynamics Using Autoencoder and Multiple Molecular Descriptors | Hangxu Liu et.al. | 2511.13364 | null |
| 2025-11-17 | DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving | Kaiwen Cai et.al. | 2511.13309 | null |
| 2025-11-17 | DAP: A Discrete-token Autoregressive Planner for Autonomous Driving | Bowen Ye et.al. | 2511.13306 | null |
| 2025-11-17 | CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving | Enhui Ma et.al. | 2511.13297 | null |
| 2025-11-17 | GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models | Yushuo Zheng et.al. | 2511.13259 | null |
| 2025-11-17 | Event-Triggered Regulation of Mixed-Autonomy Traffic Under Varying Traffic Conditions | Yihuai Zhang et.al. | 2511.13206 | null |
| 2025-11-17 | Difficulty-Aware Label-Guided Denoising for Monocular 3D Object Detection | Soyul Lee et.al. | 2511.13195 | null |
| 2025-11-17 | Autonomous Sensing UAV for Accurate Multi-User Identification and Localization in Cellular Networks | Niccolò Paglierani et.al. | 2511.13171 | null |
| 2025-11-17 | WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection | Longhui Zheng et.al. | 2511.13138 | null |
| 2025-11-17 | Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining | Zhaocheng Yu et.al. | 2511.13113 | null |
| 2025-11-17 | ResAlignNet: A Data-Driven Approach for INS/DVL Alignment | Guy Damari et.al. | 2511.13096 | null |
| 2025-11-17 | Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving | Jiacheng Tang et.al. | 2511.13079 | link |
| 2025-11-17 | Towards 3D Object-Centric Feature Learning for Semantic Scene Completion | Weihua Wang et.al. | 2511.13031 | null |
| 2025-11-17 | T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving | Chen Ma et.al. | 2511.12956 | null |
| 2025-11-17 | GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving | Chunyong Hu et.al. | 2511.12941 | null |
| 2025-11-17 | Yanyun-3: Enabling Cross-Platform Strategy Game Operation with Vision-Language Models | Guoyan Wang et.al. | 2511.12937 | null |
| 2025-11-17 | Text2Traffic: A Text-to-Image Generation and Editing Method for Traffic Scenes | Feng Lv et.al. | 2511.12932 | null |
| 2025-11-17 | Distributed Self-allocated Time Slot Reuse: Multi-hop Communication in Rigid UAV Formations | Amelia Samandari et.al. | 2511.12888 | null |
| 2025-11-16 | Multi-Agent Reinforcement Learning for Heterogeneous Satellite Cluster Resources Optimization | Mohamad A. Hady et.al. | 2511.12792 | null |
| 2025-11-14 | Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy | Asraful Haque et.al. | 2511.11558 | null |
| 2025-11-14 | A Comparative Evaluation of Prominent Methods in Autonomous Vehicle Certification | Mustafa Erdem Kırmızıgül et.al. | 2511.11484 | null |
| 2025-11-14 | Robust and Efficient Communication in Multi-Agent Reinforcement Learning | Zejiao Liu et.al. | 2511.11393 | null |
| 2025-11-14 | Simulating an Autonomous System in CARLA using ROS 2 | Joseph Abdo et.al. | 2511.11310 | null |
| 2025-11-14 | GraphPilot: Grounded Scene Graph Conditioning for Language-Based Autonomous Driving | Fabian Schmidt et.al. | 2511.11266 | null |
| 2025-11-14 | UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios | Mohamed Amine Ferrag et.al. | 2511.11252 | null |
| 2025-11-14 | One-to-N Backdoor Attack in 3D Point Cloud via Spherical Trigger | Dongmei Shan et.al. | 2511.11210 | null |
| 2025-11-14 | CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios | Hangyu Li et.al. | 2511.11168 | null |
| 2025-11-14 | Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids | Ke Ma et.al. | 2511.11077 | null |
| 2025-11-14 | Autonomous Vehicle Path Planning by Searching With Differentiable Simulation | Asen Nachkov et.al. | 2511.11043 | null |
| 2025-11-14 | Miniature Testbed for Validating Multi-Agent Cooperative Autonomous Driving | Hyunchul Bae et.al. | 2511.11022 | null |
| 2025-11-14 | Requirements for Aligned, Dynamic Resolution of Conflicts in Operational Constraints | Steven J. Jones et.al. | 2511.10952 | null |
| 2025-11-13 | Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction | Omid Mirzaeedodangeh et.al. | 2511.10586 | null |
| 2025-11-13 | LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction | Benjamin Stoler et.al. | 2511.10411 | null |
| 2025-11-13 | nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation | Mingxing Peng et.al. | 2511.10403 | null |
| 2025-11-13 | AgentEvolver: Towards Efficient Self-Evolving Agent System | Yunpeng Zhai et.al. | 2511.10395 | null |
| 2025-11-13 | Operator Models for Continuous-Time Offline Reinforcement Learning | Nicolas Hoischen et.al. | 2511.10383 | null |
| 2025-11-13 | Physically Interpretable Multi-Degradation Image Restoration via Deep Unfolding and Explainable Convolution | Hu Gao et.al. | 2511.10166 | null |
| 2025-11-13 | Trapped by Their Own Light: Deployable and Stealth Retroreflective Patch Attacks on Traffic Sign Recognition Systems | Go Tsuruoka et.al. | 2511.10050 | null |
| 2025-11-13 | DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection | Feiyang Jia et.al. | 2511.10035 | null |
| 2025-11-13 | Efficient Verification and Falsification of ReLU Neural Barrier Certificates | Dejin Ren et.al. | 2511.10015 | null |
| 2025-11-13 | Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching | Uday Bhaskar et.al. | 2511.09955 | null |
| 2025-11-12 | Coherent Optical Quantum Computing-Aided Resource Optimization for Transportation Digital Twin Construction | Huixiang Zhang et.al. | 2511.09760 | null |
| 2025-11-12 | Baby Sophia: A Developmental Approach to Self-Exploration through Self-Touch and Hand Regard | Stelios Zarifis et.al. | 2511.09727 | null |
| 2025-11-12 | FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2511.09347 | null |
| 2025-11-12 | SimPath: Mitigating Motion Sickness in In-vehicle Infotainment Systems via Driving Condition Adaptation | Jinghao Huang et.al. | 2511.09240 | null |
| 2025-11-12 | D-AWSIM: Distributed Autonomous Driving Simulator for Dynamic Map Generation Framework | Shunsuke Ito et.al. | 2511.09080 | null |
| 2025-11-12 | Advancing Autonomous Emergency Response Systems: A Generative AI Perspective | Yousef Emami et.al. | 2511.09044 | null |
| 2025-11-12 | Argus: Resilience-Oriented Safety Assurance Framework for End-to-End ADSs | Dingji Wang et.al. | 2511.09032 | null |
| 2025-11-12 | FLAD: Federated Learning for LLM-based Autonomous Driving in Vehicle-Edge-Cloud Networks | Tianao Xiang et.al. | 2511.09025 | null |
| 2025-11-12 | UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving | Ziyi Song et.al. | 2511.09013 | null |
| 2025-11-11 | Information-Driven Fault Detection and Identification for Multi-Agent Spacecraft Systems: Collaborative On-Orbit Inspection Mission | Akshita Gupta et.al. | 2511.08752 | null |
| 2025-11-10 | PlanT 2.0: Exposing Biases and Structural Flaws in Closed-Loop Driving | Simon Gerstenecker et.al. | 2511.07292 | null |
| 2025-11-10 | MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs | Tianhao Peng et.al. | 2511.07250 | null |
| 2025-11-10 | Leveraging Text-Driven Semantic Variation for Robust OOD Segmentation | Seungheon Song et.al. | 2511.07238 | null |
| 2025-11-10 | Dynamics-Decoupled Trajectory Alignment for Sim-to-Real Transfer in Reinforcement Learning for Autonomous Driving | Thomas Steinecker et.al. | 2511.07155 | null |
| 2025-11-10 | HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving | Zhongyu Xia et.al. | 2511.07106 | null |
| 2025-11-10 | Certified L2-Norm Robustness of 3D Point Cloud Recognition in the Frequency Domain | Liang Zhou et.al. | 2511.07029 | null |
| 2025-11-10 | Relative Energy Learning for LiDAR Out-of-Distribution Detection | Zizhao Li et.al. | 2511.06720 | null |
| 2025-11-10 | Differentiable Semantic Meta-Learning Framework for Long-Tail Motion Forecasting in Autonomous Driving | Bin Rao et.al. | 2511.06649 | null |
| 2025-11-10 | DIAL-GS: Dynamic Instance Aware Reconstruction for Label-free Street Scenes with 4D Gaussian Splatting | Chenpeng Su et.al. | 2511.06632 | null |
| 2025-11-09 | A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving | Keke Long et.al. | 2511.06496 | null |
| 2025-11-09 | VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes | Zhengyu Zou et.al. | 2511.06408 | null |
| 2025-11-09 | LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation | Zijie Wang et.al. | 2511.06272 | null |
| 2025-11-09 | VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving | Ruifei Zhang et.al. | 2511.06256 | link |
| 2025-11-09 | AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving | Ruifei Zhang et.al. | 2511.06253 | link |
| 2025-11-09 | ROAR: Robust Accident Recognition and Anticipation for Autonomous Driving | Xingcheng Liu et.al. | 2511.06226 | null |
| 2025-11-08 | Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration | Umar Rashid et.al. | 2511.06087 | link |
| 2025-11-08 | Runtime Safety Monitoring of Deep Neural Networks for Perception: A Survey | Albert Schotschneider et.al. | 2511.05982 | null |
| 2025-11-08 | Polymap: generating high definition map based on rasterized polygons | Shiyu Gao et.al. | 2511.05944 | null |
| 2025-11-07 | SnowyLane: Robust Lane Detection on Snow-covered Rural Roads Using Infrastructural Elements | Jörg Gamerdinger et.al. | 2511.05108 | null |
| 2025-11-07 | J-SGFT: Joint Spatial and Graph Fourier Domain Learning for Point Cloud Attribute Deblocking | Muhammad Talha et.al. | 2511.05047 | null |
| 2025-11-07 | 4D Imaging in ISAC Systems: A Framework Based on 5G NR Downlink Signals | Haoyang Weng et.al. | 2511.04913 | null |
| 2025-11-06 | ReGen: Generative Robot Simulation via Inverse Design | Phat Nguyen et.al. | 2511.04769 | null |
| 2025-11-06 | SAFe-Copilot: Unified Shared Autonomy Framework | Phat Nguyen et.al. | 2511.04664 | null |
| 2025-11-06 | UniSplat: Unified Spatio-Temporal Fusion via 3D Latent Scaffolds for Dynamic Driving Scene Reconstruction | Chen Shi et.al. | 2511.04595 | null |
| 2025-11-06 | A Tool for Benchmarking Large Language Models’ Robustness in Assessing the Realism of Driving Scenarios | Jiahui Wu et.al. | 2511.04267 | null |
| 2025-11-06 | ScaleDL: Towards Scalable and Efficient Runtime Prediction for Distributed Deep Learning Workloads | Xiaokai Wang et.al. | 2511.04162 | null |
| 2025-11-04 | Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data | Syed Mostaquim Ali et.al. | 2511.02994 | null |
| 2025-11-04 | Keeping it Local, Tiny and Real: Automated Report Generation on Edge Computing Devices for Mechatronic-Based Cognitive Systems | Nicolas Schuler et.al. | 2511.02507 | null |
| 2025-11-04 | 3D Point Cloud Object Detection on Edge Devices for Split Computing | Taisuke Noguchi et.al. | 2511.02293 | null |
| 2025-11-04 | LLMs as Judges: Toward The Automatic Review of GSN-compliant Assurance Cases | Gerhard Yu et.al. | 2511.02203 | link |
| 2025-11-03 | UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs | Zhe Liu et.al. | 2511.01768 | null |
| 2025-11-03 | LLM-Assisted Tool for Joint Generation of Formulas and Functions in Rule-Based Verification of Map Transformations | Ruidi He et.al. | 2511.01423 | null |
| 2025-11-03 | VeriODD: From YAML to SMT-LIB – Automating Verification of Operational Design Domains | Bassel Rafie et.al. | 2511.01417 | null |
| 2025-11-03 | Risk Aware Safe Control with Cooperative Sensing for Dynamic Obstacle Avoidance | Pei Yu Chang et.al. | 2511.01403 | null |
| 2025-11-03 | Embodied Cognition Augmented End2End Autonomous Driving | Ling Niu et.al. | 2511.01334 | null |
| 2025-11-02 | Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion | Jaehyun Park et.al. | 2511.00859 | null |
| 2025-11-04 | Towards classification-based representation learning for place recognition on LiDAR scans | Maksim Konoplia et.al. | 2511.00738 | null |
| 2025-11-01 | Unveiling Uniform Shifted Power Law in Stochastic Human and Autonomous Driving Behavior | Wang Chen et.al. | 2511.00659 | link |
| 2025-11-01 | RNN-based linear parameter varying adaptive model predictive control for autonomous driving | Yassine Kebbati et.al. | 2511.00610 | null |
| 2025-10-31 | Dynamic Model Selection for Trajectory Prediction via Pairwise Ranking and Meta-Features | Lu Bowen et.al. | 2511.00126 | link |
| 2025-10-30 | Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail | NVIDIA et.al. | 2511.00088 | null |
| 2025-10-31 | Trends and Challenges in Next-Generation GNSS Interference Management | Leatile Marata et.al. | 2510.27576 | null |
| 2025-10-31 | Modified-Emergency Index (MEI): A Criticality Metric for Autonomous Driving in Lateral Conflict | Hao Cheng et.al. | 2510.27333 | link |
| 2025-10-30 | AD-SAM: Fine-Tuning the Segment Anything Vision Foundation Model for Autonomous Driving Perception | Mario Camarena et.al. | 2510.27047 | null |
| 2025-10-29 | VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes | Simon Yu et.al. | 2510.26833 | null |
| 2025-10-30 | Beyond Imitation: Constraint-Aware Trajectory Generation with Flow Matching For End-to-End Autonomous Driving | Lin Liu et.al. | 2510.26292 | null |
| 2025-10-30 | Self-localization on a 3D map by fusing global and local features from a monocular camera | Satoshi Kikuch et.al. | 2510.26170 | null |
| 2025-11-05 | WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios | Runsheng Xu et.al. | 2510.26125 | null |
| 2025-10-29 | Integrating Legal and Logical Specifications in Perception, Prediction, and Planning for Automated Driving: A Survey of Methods | Kumar Manas et.al. | 2510.25386 | null |
| 2025-10-31 | MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding | Runxi Huang et.al. | 2510.25327 | null |
| 2025-10-29 | Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision | Yuyang Xia et.al. | 2510.25205 | null |
| 2025-11-02 | D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction | Kejing Xia et.al. | 2510.25173 | null |
| 2025-10-28 | SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving | Anil Yildiz et.al. | 2510.24949 | null |
| 2025-10-28 | Delay Tolerant Control for Autonomous Driving Using CDOB | Xincheng Cao et.al. | 2510.24898 | null |
| 2025-10-28 | Learning to Drive Safely with Hybrid Options | Bram De Cooman et.al. | 2510.24674 | null |
| 2025-10-28 | Enhancing Vision-Language Models for Autonomous Driving through Task-Specific Prompting and Spatial Reasoning | Aodi Wu et.al. | 2510.24152 | null |
| 2025-10-28 | ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory Scoring | Zhenxin Li et.al. | 2510.24108 | null |
| 2025-10-28 | SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration | Jongsuk Kim et.al. | 2510.24052 | null |
| 2025-10-27 | Modeling and Scheduling of Fusion Patterns in Autonomous Driving Systems (Extended Version) | Hoora Sobhani et.al. | 2510.23895 | null |
| 2025-10-27 | VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting | Hoonhee Cho et.al. | 2510.23205 | null |
| 2025-10-27 | Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method | Bohan Li et.al. | 2510.22973 | null |
| 2025-10-26 | Uncertainty-Aware Autonomous Vehicles: Predicting the Road Ahead | Shireen Kudukkil Manchingal et.al. | 2510.22680 | null |
| 2025-10-26 | DAMap: Distance-aware MapNet for High Quality HD Map Construction | Jinpeng Dong et.al. | 2510.22675 | null |
| 2025-10-25 | 3D Roadway Scene Object Detection with LIDARs in Snowfall Conditions | Ghazal Farhani et.al. | 2510.22436 | null |
| 2025-10-25 | Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework | Amir Mohammad Khadem Hosseini et.al. | 2510.22243 | null |
| 2025-10-25 | HARMONY: Hidden Activation Representations and Model Output-Aware Uncertainty Estimation for Vision-Language Models | Erum Mushtaq et.al. | 2510.22171 | null |
| 2025-10-23 | Addressing Corner Cases in Autonomous Driving: A World Model-based Approach with Mixture of Experts and LLMs | Haicheng Liao et.al. | 2510.21867 | null |
| 2025-10-24 | Learning Neural Control Barrier Functions from Expert Demonstrations using Inverse Constraint Learning | Yuxuan Yang et.al. | 2510.21560 | null |
| 2025-10-24 | Scalpel: Automotive Deep Learning Framework Testing via Assembling Model Components | Yinglong Zou et.al. | 2510.21451 | null |
| 2025-10-24 | Track-to-Track Association for Collective Perception based on Stochastic Optimization | Laura M. Wolf et.al. | 2510.21278 | null |
| 2025-10-24 | Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study | Guanlin Wu et.al. | 2510.21160 | null |
| 2025-10-24 | Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility | Hezam Albagami et.al. | 2510.21112 | null |
| 2025-10-23 | Adversary-Aware Private Inference over Wireless Channels | Mohamed Seif et.al. | 2510.20518 | null |
| 2025-10-23 | Behavior-Aware Online Prediction of Obstacle Occupancy using Zonotopes | Alvaro Carrizosa-Rendon et.al. | 2510.20437 | null |
| 2025-10-23 | Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses | Wu Yichao et.al. | 2510.20314 | null |
| 2025-10-23 | Privacy Protection of Automotive Location Data Based on Format-Preserving Encryption of Geographical Coordinates | Haojie Ji et.al. | 2510.20300 | null |
| 2025-10-23 | Seeing the Unseen: Mask-Driven Positional Encoding and Strip-Convolution Context Modeling for Cross-View Object Geo-Localization | Shuhan Hu et.al. | 2510.20247 | null |
| 2025-10-23 | Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists | Eduardo R. Corral-Soto et.al. | 2510.20158 | null |
| 2025-10-22 | VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction | Junhong Lin et.al. | 2510.19578 | null |
| 2025-10-22 | AutoMT: A Multi-Agent LLM Framework for Automated Metamorphic Testing of Autonomous Driving Systems | Linfeng Liang et.al. | 2510.19438 | null |
| 2025-10-22 | SFGFusion: Surface Fitting Guided 3D Object Detection with 4D Radar and Camera Fusion | Xiaozhi Li et.al. | 2510.19215 | null |
| 2025-10-24 | Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks | Kai Zeng et.al. | 2510.19195 | link |
| 2025-10-21 | Robust Driving QA through Metadata-Grounded Context and Task-Specific Prompts | Seungjun Yu et.al. | 2510.19001 | null |
| 2025-10-23 | Occluded nuScenes: A Multi-Sensor Dataset for Evaluating Perception Robustness in Automated Driving | Sanjay Kumar et.al. | 2510.18552 | null |
| 2025-10-21 | MMRHP: A Miniature Mixed-Reality HIL Platform for Auditable Closed-Loop Evaluation | Mingxin Li et.al. | 2510.18371 | null |
| 2025-10-21 | ViSE: A Systematic Approach to Vision-Only Street-View Extrapolation | Kaiyuan Tan et.al. | 2510.18341 | null |
| 2025-10-24 | OmniNWM: Omniscient Driving Navigation World Models | Bohan Li et.al. | 2510.18313 | null |
| 2025-10-21 | OpenInsGaussian: Open-vocabulary Instance Gaussian Segmentation with Context-aware Cross-view Fusion | Tianyu Huang et.al. | 2510.18253 | null |
| 2025-10-21 | BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal Pretraining | Ajinkya Khoche et.al. | 2510.18244 | null |
| 2025-10-20 | SPACeR: Self-Play Anchoring with Centralized Reference Models | Wei-Jer Chang et.al. | 2510.18060 | null |
| 2025-10-20 | SAVANT: Semantic Analysis with Vision-Augmented Anomaly deTection | Roberto Brusnicki et.al. | 2510.18034 | null |
| 2025-10-20 | 4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads | Ling Liu et.al. | 2510.17664 | null |
| 2025-10-20 | Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models | Katie Luo et.al. | 2510.17274 | null |
| 2025-10-20 | Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations | Shahin Atakishiyev et.al. | 2510.17256 | null |
| 2025-10-20 | SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving | Peiru Zheng et.al. | 2510.17191 | null |
| 2025-10-21 | DiffVLA++: Bridging Cognitive Reasoning and End-to-End Driving through Metric-Guided Alignment | Yu Gao et.al. | 2510.17148 | null |
| 2025-10-20 | ProDAT: Progressive Density-Aware Tail-Drop for Point Cloud Coding | Zhe Luo et.al. | 2510.17068 | null |
| 2025-10-19 | UNDREAM: Bridging Differentiable Rendering and Photorealistic Simulation for End-to-end Adversarial Attacks | Mansi Phute et.al. | 2510.16923 | null |
| 2025-10-19 | Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry | Sara Hatami Rostami et.al. | 2510.16790 | null |
| 2025-10-19 | A Comprehensive Survey on World Models for Embodied AI | Xinqing Li et.al. | 2510.16732 | link |
| 2025-10-19 | Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models | Jianbiao Mei et.al. | 2510.16729 | null |
| 2025-10-18 | Advancing Off-Road Autonomous Driving: The Large-Scale ORAD-3D Dataset and Comprehensive Benchmarks | Chen Min et.al. | 2510.16500 | null |
| 2025-10-18 | Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance | Chien Thai et.al. | 2510.16445 | null |
| 2025-10-17 | ObjectTransforms for Uncertainty Quantification and Reduction in Vision-Based Perception for Autonomous Vehicles | Nishad Sahu et.al. | 2510.16118 | null |
| 2025-10-17 | LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal | Shr-Ruei Tsai et.al. | 2510.15868 | null |
| 2025-10-17 | Perfect Prediction or Plenty of Proposals? What Matters Most in Planning for Autonomous Driving | Aron Distelzweig et.al. | 2510.15505 | null |
| 2025-10-17 | VDRive: Leveraging Reinforced VLA and Diffusion Policy for End-to-end Autonomous Driving | Ziang Guo et.al. | 2510.15446 | null |
| 2025-10-17 | FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers | Haisheng Su et.al. | 2510.15385 | null |
| 2025-10-15 | David vs. Goliath: A comparative study of different-sized LLMs for code generation in the domain of automotive scenario generation | Philipp Bauerfeind et.al. | 2510.14115 | null |
| 2025-10-15 | Provably Invincible Adversarial Attacks on Reinforcement Learning Systems: A Rate-Distortion Information-Theoretic Approach | Ziqing Lu et.al. | 2510.13792 | null |
| 2025-10-15 | XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation | Huawei Sun et.al. | 2510.13565 | null |
| 2025-10-17 | CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation | Yushan Han et.al. | 2510.13432 | null |
| 2025-10-15 | Partitioned Scheduling for DAG Tasks Considering Probabilistic Execution Time | Fuma Omori et.al. | 2510.13279 | null |
| 2025-10-15 | SAJA: A State-Action Joint Attack Framework on Multi-Agent Deep Reinforcement Learning | Weiqi Guo et.al. | 2510.13262 | null |
| 2025-10-16 | CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation | Li Liang et.al. | 2510.13245 | null |
| 2025-10-15 | An Analytical Framework to Enhance Autonomous Vehicle Perception for Smart Cities | Jalal Khan et.al. | 2510.13230 | null |
| 2025-10-15 | Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion | Rongtao Xu et.al. | 2510.13198 | null |
| 2025-10-15 | Safe Driving in Occluded Environments | Zhuoyuan Wang et.al. | 2510.13114 | null |
| 2025-10-15 | DriveCritic: Towards Context-Aware, Human-Aligned Evaluation for Autonomous Driving with Vision-Language Models | Jingyu Song et.al. | 2510.13108 | null |
| 2025-10-15 | ADPerf: Investigating and Testing Performance in Autonomous Driving Systems | Tri Minh-Triet Pham et.al. | 2510.13078 | null |
| 2025-10-16 | SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms | Haithem Turki et.al. | 2510.12901 | null |
| 2025-10-14 | DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving | Yingyan Li et.al. | 2510.12796 | null |
| 2025-10-14 | CAMNet: Leveraging Cooperative Awareness Messages for Vehicle Trajectory Prediction | Mattia Grasselli et.al. | 2510.12703 | null |
| 2025-10-14 | CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving | Xiaoji Zheng et.al. | 2510.12560 | null |
| 2025-10-14 | Biased-Attention Guided Risk Prediction for Safe Decision-Making at Unsignalized Intersections | Chengyang Dong et.al. | 2510.12428 | null |
| 2025-10-14 | CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion | Jinzhou Lin et.al. | 2510.12362 | null |
| 2025-10-14 | PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes | Ying A et.al. | 2510.12282 | null |
| 2025-10-14 | AngularFuse: A Closer Look at Angle-based Perception for Spatial-Sensitive Multi-Modality Image Fusion | Xiaopeng Liu et.al. | 2510.12260 | null |
| 2025-10-14 | Hierarchical Reasoning with Vision-Language Models for Incident Reports from Dashcam Videos | Shingo Yokoi et.al. | 2510.12190 | null |
| 2025-10-13 | Context-Aware Model-Based Reinforcement Learning for Autonomous Racing | Emran Yasser Moustafa et.al. | 2510.11501 | null |
| 2025-10-13 | A Faster and More Reliable Middleware for Autonomous Driving Systems | Yuankai He et.al. | 2510.11448 | null |
| 2025-10-13 | Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution | Bozhou Zhang et.al. | 2510.11092 | null |
| 2025-10-13 | Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling | Tianyi Tan et.al. | 2510.11083 | link |
| 2025-10-13 | Game-Theoretic Risk-Shaped Reinforcement Learning for Safe Autonomous Driving | Dong Hu et.al. | 2510.10960 | link |
| 2025-10-13 | Neutral Agent-based Adversarial Policy Learning against Deep Reinforcement Learning in Multi-party Open Systems | Qizhou Peng et.al. | 2510.10937 | null |
| 2025-10-13 | rareboost3d: a synthetic lidar dataset with enhanced rare classes | Shutong Lin et.al. | 2510.10876 | null |
| 2025-10-12 | Stability Under Scrutiny: Benchmarking Representation Paradigms for Online HD Mapping | Hao Shan et.al. | 2510.10660 | null |
| 2025-10-12 | A Machine Learning Perspective on Automated Driving Corner Cases | Sebastian Schmidt et.al. | 2510.10653 | null |
| 2025-10-12 | Reinforcement Learning-based Dynamic Adaptation for Sampling-Based Motion Planning in Agile Autonomous Driving | Alexander Langmann et.al. | 2510.10567 | null |
| 2025-10-12 | Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving | Kanishkha Jaisankar et.al. | 2510.10503 | null |
| 2025-10-12 | Risk-Budgeted Control Framework for Balanced Performance and Safety in Autonomous Vehicles | Pei Yu Chang et.al. | 2510.10442 | null |
| 2025-10-11 | Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking | Markus Käppeler et.al. | 2510.10287 | null |
| 2025-10-11 | A Style-Based Metric for Quantifying the Synthetic-to-Real Gap in Autonomous Driving Image Datasets | Dingyi Yao et.al. | 2510.10203 | null |
| 2025-10-11 | Beyond ADE and FDE: A Comprehensive Evaluation Framework for Safety-Critical Prediction in Multi-Agent Autonomous Driving Scenarios | Feifei Liu et.al. | 2510.10086 | null |
| 2025-10-11 | Probabilistic Hyper-Graphs using Multiple Randomly Masked Autoencoders for Semi-supervised Multi-modal Multi-task Learning | Pîrvu Mihai-Cristian et.al. | 2510.10068 | null |
| 2025-10-11 | Explainable Human-in-the-Loop Segmentation via Critic Feedback Signals | Pouya Shaeri et.al. | 2510.09945 | null |
| 2025-10-10 | SpaceVista: All-Scale Visual Spatial Reasoning from mm to km | Peiwen Sun et.al. | 2510.09606 | link |
| 2025-10-10 | Autonomous Soft Robotic Guidewire Navigation via Imitation Learning | Noah Barnes et.al. | 2510.09497 | null |
| 2025-10-10 | Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation | Vijay M. Galshetwar et.al. | 2510.09228 | null |
| 2025-10-10 | Towards Safer and Understandable Driver Intention Prediction | Mukilan Karuppasamy et.al. | 2510.09200 | null |
| 2025-10-10 | TARO: Toward Semantically Rich Open-World Object Detection | Yuchen Zhang et.al. | 2510.09173 | null |
| 2025-10-10 | Robust Driving Control for Autonomous Vehicles: An Intelligent General-sum Constrained Adversarial Reinforcement Learning Approach | Junchao Fan et.al. | 2510.09041 | null |
| 2025-10-10 | Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels | Weitong Kong et.al. | 2510.09035 | null |
| 2025-10-09 | Scalable Offline Metrics for Autonomous Driving | Animikh Aich et.al. | 2510.08571 | null |
| 2025-10-09 | ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving | Zhiyu Zheng et.al. | 2510.08562 | null |
| 2025-10-09 | Approximate Domain Unlearning for Vision-Language Models | Kodai Kawamura et.al. | 2510.08132 | null |
| 2025-10-09 | LinguaSim: Interactive Multi-Vehicle Testing Scenario Generation via Natural Language Instruction Based on Large Language Models | Qingyuan Shi et.al. | 2510.08046 | null |
| 2025-10-09 | RayFusion: Ray Fusion Enhanced Collaborative Visual Perception | Shaohong Wang et.al. | 2510.08017 | link |
| 2025-10-09 | CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving | Tianrui Zhang et.al. | 2510.07944 | null |
| 2025-10-09 | MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding | Peiran Wu et.al. | 2510.07915 | null |
| 2025-10-10 | GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models | Qinghongbing Xie et.al. | 2510.07791 | null |
| 2025-10-08 | VeMo: A Lightweight Data-Driven Approach to Model Vehicle Dynamics | Girolamo Oddo et.al. | 2510.07447 | null |
| 2025-10-08 | HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving | Donald Pfaffmann et.al. | 2510.07210 | null |
| 2025-10-08 | A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model | Tony Zhang et.al. | 2510.07133 | null |
| 2025-10-08 | Learning Global Representation from Queries for Vectorized HD Map Construction | Shoumeng Qiu et.al. | 2510.06969 | null |
| 2025-10-08 | OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects | Bing Li et.al. | 2510.06952 | null |
| 2025-10-08 | DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning | Ke Guo et.al. | 2510.06913 | null |
| 2025-10-08 | Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion | Jie Luo et.al. | 2510.06687 | null |
| 2025-10-08 | AIM 2025 Challenge on Real-World RAW Image Denoising | Feiran Li et.al. | 2510.06601 | null |
| 2025-10-07 | Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models | Jiahao Wang et.al. | 2510.06209 | null |
| 2025-10-07 | From Learning to Mastery: Achieving Safe and Efficient Real-World Autonomous Driving with Human-In-The-Loop Reinforcement Learning | Li Zeqiao et.al. | 2510.06038 | null |
| 2025-10-07 | The Safety Challenge of World Models for Embodied AI Agents: A Review | Lorenzo Baraldi et.al. | 2510.05865 | null |
| 2025-10-07 | ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving | Yongxuan Lyu et.al. | 2510.05752 | null |
| 2025-10-07 | Precise and Efficient Collision Prediction under Uncertainty in Autonomous Driving | Marc Kaufeld et.al. | 2510.05729 | null |
| 2025-10-06 | Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context | Ngeyen Yinkfu et.al. | 2510.04912 | null |
| 2025-10-08 | Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction | Chi Yan et.al. | 2510.04759 | link |
| 2025-10-05 | Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction | Yuhao Luo et.al. | 2510.04365 | null |
| 2025-10-04 | From Filters to VLMs: Benchmarking Defogging Methods through Object Detection and Segmentation Performance | Ardalan Aryashad et.al. | 2510.03906 | null |
| 2025-10-04 | Referring Expression Comprehension for Small Objects | Kanoko Goto et.al. | 2510.03701 | null |
| 2025-10-04 | Safety-Oriented Dynamic Path Planning for Automated Vehicles | Mostafa Emam et.al. | 2510.03640 | null |
| 2025-10-03 | Training-Free Out-Of-Distribution Segmentation With Foundation Models | Laith Nayal et.al. | 2510.02909 | null |
| 2025-10-03 | GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting | Xinran Zhang et.al. | 2510.02884 | null |
| 2025-10-03 | Action Deviation-Aware Inference for Low-Latency Wireless Robots | Jeyoung Park et.al. | 2510.02851 | null |
| 2025-10-03 | Work Zones challenge VLM Trajectory Planning: Toward Mitigation and Robust Autonomous Driving | Yifan Liao et.al. | 2510.02803 | null |
| 2025-10-03 | High Pixel Resolution Visible to Extended Shortwave Infrared Single Pixel Imaging with a black Phosphorus-Molybdenum disulfide (bP-MoS2) photodiode | Seyed Saleh Mousavi Khaleghi et.al. | 2510.02673 | null |
| 2025-10-03 | A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios | Ruining Yang et.al. | 2510.02627 | null |
| 2025-10-02 | Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving | Cornelius Schröder et.al. | 2510.01829 | null |
| 2025-10-02 | Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving | Haibo Hu et.al. | 2510.01795 | null |
| 2025-10-02 | Predictive Preference Learning from Human Interventions | Haoyuan Cai et.al. | 2510.01545 | null |
| 2025-10-01 | Strategic Fusion of Vision Language Models: Shapley-Credited Context-Aware Dawid-Skene for Multi-Label Tasks in Autonomous Driving | Yuxiang Feng et.al. | 2510.01126 | null |
| 2025-10-01 | Datasets for Valence and Arousal Inference: A Survey | Helen Schneider et.al. | 2510.00738 | null |
| 2025-09-30 | CHAI: Command Hijacking against embodied AI | Luis Burbano et.al. | 2510.00181 | null |
| 2025-09-30 | Adaptive and Resource-efficient Agentic AI Systems for Mobile and Embedded Devices: A Survey | Sicong Liu et.al. | 2510.00078 | null |
| 2025-10-03 | Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving | Sheng Yang et.al. | 2510.00060 | null |
| 2025-09-30 | PRISM: Progressive Rain removal with Integrated State-space Modeling | Pengze Xue et.al. | 2509.26413 | null |
| 2025-09-30 | Beyond Overall Accuracy: Pose- and Occlusion-driven Fairness Analysis in Pedestrian Detection for Autonomous Driving | Mohammad Khoshkdahan et.al. | 2509.26166 | null |
| 2025-09-30 | NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving | Yuan Gao et.al. | 2509.25944 | null |
| 2025-09-30 | Preemptive Spatiotemporal Trajectory Adjustment for Heterogeneous Vehicles in Highway Merging Zones | Yuan Li et.al. | 2509.25929 | null |
| 2025-09-30 | MuSLR: Multimodal Symbolic Logical Reasoning | Jundong Xu et.al. | 2509.25851 | null |
| 2025-09-30 | Cooperative Autonomous Driving in Diverse Behavioral Traffic: A Heterogeneous Graph Reinforcement Learning Approach | Qi Liu et.al. | 2509.25751 | null |
| 2025-09-29 | Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments | Zihan Zhang et.al. | 2509.25542 | null |
| 2025-09-29 | StreamForest: Efficient Online Video Understanding with Persistent Event Memory | Xiangyu Zeng et.al. | 2509.24871 | null |
| 2025-09-29 | TACO-Net: Topological Signatures Triumph in 3D Object Classification | Anirban Ghosh et.al. | 2509.24802 | null |
| 2025-09-29 | FuncPoison: Poisoning Function Library to Hijack Multi-agent Autonomous Driving Systems | Yuzhen Long et.al. | 2509.24408 | null |
| 2025-09-29 | Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning | Korbinian Moller et.al. | 2509.24313 | null |
| 2025-09-29 | Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds | Yongqiang Wang et.al. | 2509.24273 | null |
| 2025-09-28 | Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning | Muleilan Pei et.al. | 2509.23993 | null |
| 2025-09-28 | AutoPrune: Each Complexity Deserves a Pruning Policy | Hanshi Wang et.al. | 2509.23931 | null |
| 2025-09-30 | DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation | Haibao Yu et.al. | 2509.23922 | null |
| 2025-09-28 | Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios | Jinghan Xu et.al. | 2509.23895 | null |
| 2025-09-28 | From Static to Dynamic: a Survey of Topology-Aware Perception in Autonomous Driving | Yixiao Chen et.al. | 2509.23641 | null |
| 2025-09-28 | Foundation Model-Based Adaptive Semantic Image Transmission for Dynamic Wireless Environments | Fangyu Liu et.al. | 2509.23590 | null |
| 2025-09-28 | BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving | Shu Liu et.al. | 2509.23589 | null |
| 2025-09-27 | WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving | Ziyue Zhu et.al. | 2509.23402 | link |
| 2025-09-27 | Preventing Robotic Jailbreaking via Multimodal Domain Adaptation | Francesco Marchiori et.al. | 2509.23281 | null |
| 2025-09-26 | Persistent Autoregressive Mapping with Traffic Rules for Autonomous Driving | Shiyi Liang et.al. | 2509.22756 | null |
| 2025-09-26 | Self-driving cars: Are we there yet? | Merve Atasever et.al. | 2509.22754 | null |
| 2025-09-26 | An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment | Xiaoyun Qiu et.al. | 2509.22550 | null |
| 2025-09-26 | EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model | Andrii Litvynchuk et.al. | 2509.22527 | null |
| 2025-09-29 | A Multi-Modality Evaluation of the Reality Gap in Autonomous Driving Systems | Stefano Carlo Lambertenghi et.al. | 2509.22379 | null |
| 2025-09-26 | UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data | Yujian Yuan et.al. | 2509.22262 | link |
| 2025-09-26 | An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose | Qifeng Wang et.al. | 2509.22058 | null |
| 2025-09-25 | PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines | Zhixin Zhang et.al. | 2509.21563 | null |
| 2025-09-25 | Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement | Jianbo Zhao et.al. | 2509.20938 | null |
| 2025-09-25 | MTRDrive: Memory-Tool Synergistic Reasoning for Robust Autonomous Driving in Corner Cases | Ziang Luo et.al. | 2509.20843 | null |
| 2025-09-25 | DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation | Ved Umrajkar et.al. | 2509.20792 | null |
| 2025-09-25 | MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM | Yuxuan Zhou et.al. | 2509.20757 | null |
| 2025-09-25 | Cyber Racing Coach: A Haptic Shared Control Framework for Teaching Advanced Driving Skills | Congkai Shen et.al. | 2509.20653 | null |
| 2025-09-26 | AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving | Jinhao Chai et.al. | 2509.20253 | null |
| 2025-09-24 | Universal Camouflage Attack on Vision-Language Models for Autonomous Driving | Dehong Kong et.al. | 2509.20196 | null |
| 2025-09-24 | Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving | Pengxiang Li et.al. | 2509.20109 | null |
| 2025-09-25 | Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models | Juana Valeria Hurtado et.al. | 2509.20107 | null |
| 2025-09-24 | Steerable Adversarial Scenario Generation through Test-Time Preference Alignment | Tong Nie et.al. | 2509.20102 | null |
| 2025-09-25 | OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving | Pei Liu et.al. | 2509.19973 | link |
| 2025-09-24 | BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting | Yixun Zhang et.al. | 2509.19793 | null |
| 2025-09-24 | RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving | Carlo Bosio et.al. | 2509.19789 | null |
| 2025-09-24 | EfficienT-HDR: An Efficient Transformer-Based Framework via Multi-Exposure Fusion for HDR Reconstruction | Yu-Shen Huang et.al. | 2509.19779 | null |
| 2025-09-23 | The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar | William L. Muckelroy III et.al. | 2509.19644 | null |
| 2025-09-23 | Coordinated PSO-PID based longitudinal control with LPV-MPC based lateral control for autonomous vehicles | Yassine Kebbati et.al. | 2509.19529 | link |
| 2025-09-23 | Autonomous driving using an optimized neural network based adaptive LPV-MPC controller | Yassine Kebbati et.al. | 2509.19523 | link |
| 2025-09-23 | Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation | Sherwin Bahmani et.al. | 2509.19296 | null |
| 2025-09-24 | An on-chip Pixel Processing Approach with 2.4μs latency for Asynchronous Read-out of SPAD-based dToF Flash LiDARs | Yiyang Liu et.al. | 2509.19192 | null |
| 2025-09-23 | TriFusion-AE: Language-Guided Depth and LiDAR Fusion for Robust Point Cloud Processing | Susmit Neogi et.al. | 2509.18743 | null |
| 2025-09-23 | The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving | Jay Patrikar et.al. | 2509.18626 | null |
| 2025-09-23 | MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving | Yuzhi Wu et.al. | 2509.18613 | null |
| 2025-09-23 | PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving | Chengran Yuan et.al. | 2509.18609 | null |
| 2025-09-23 | Spatial Envelope MPC: High Performance Driving without a Reference | Siyuan Yu et.al. | 2509.18506 | null |
| 2025-09-22 | AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback | Yunhao Yang et.al. | 2509.18384 | null |
| 2025-09-23 | V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts | Hsu-kuang Chiu et.al. | 2509.18053 | link |
| 2025-09-22 | DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving | Shuyao Shang et.al. | 2509.17940 | null |
| 2025-09-22 | SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model | Xiao Zhou et.al. | 2509.17850 | null |
| 2025-09-22 | RSU-Assisted Resource Allocation for Collaborative Perception | Guowei Liu et.al. | 2509.17691 | null |
| 2025-09-22 | Predicting Depth Maps from Single RGB Images and Addressing Missing Information in Depth Estimation | Mohamad Mofeed Chaar et.al. | 2509.17686 | null |
| 2025-09-22 | Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method | Gregory Schroeder et.al. | 2509.17620 | null |
| 2025-09-22 | Vision-Based Driver Drowsiness Monitoring: Comparative Analysis of YOLOv5-v11 Models | Dilshara Herath et.al. | 2509.17498 | null |
| 2025-09-22 | FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR | Junzhe Wu et.al. | 2509.17390 | null |
| 2025-09-22 | Multi-Scenario Highway Lane-Change Intention Prediction: A Physics-Informed AI Framework for Three-Class Classification | Jiazhao Shi et.al. | 2509.17354 | null |
| 2025-09-21 | Optimized adaptive MPC for lateral control of autonomous vehicles | Yassine Kebbati et.al. | 2509.17215 | null |
| 2025-09-21 | CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving | Ruiguo Zhong et.al. | 2509.17080 | null |
| 2025-09-21 | Orchestrate, Generate, Reflect: A VLM-Based Multi-Agent Collaboration Framework for Automated Driving Policy Learning | Zengqi Peng et.al. | 2509.17042 | null |
| 2025-09-21 | Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving | Xuan Chen et.al. | 2509.16950 | null |
| 2025-09-21 | End2Race: Efficient End-to-End Imitation Learning for Real-Time F1Tenth Racing | Zhijie Qiao et.al. | 2509.16894 | null |
| 2025-09-20 | Improve bounding box in Carla Simulator | Mohamad Mofeed Chaar et.al. | 2509.16773 | null |
| 2025-09-20 | Are VLMs Ready for Lane Topology Awareness in Autonomous Driving? | Xin Chen et.al. | 2509.16654 | null |
| 2025-09-20 | ADVEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents | Yichen Wang et.al. | 2509.16645 | null |
| 2025-09-20 | SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving | Haiming Zhang et.al. | 2509.16588 | null |
| 2025-09-20 | ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting | Xiaoyang Yan et.al. | 2509.16552 | null |
| 2025-09-20 | RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation | Tianyi Yan et.al. | 2509.16500 | null |
| 2025-09-19 | RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars | Weiyi Xiong et.al. | 2509.16119 | null |
| 2025-09-19 | CoPAD: Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios | Kangyu Wu et.al. | 2509.15984 | null |
| 2025-09-19 | CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine | Shiyu Fang et.al. | 2509.15968 | link |
| 2025-09-19 | RangeSAM: Leveraging Visual Foundation Models for Range-View represented LiDAR segmentation | Paul Julius Kühn et.al. | 2509.15886 | null |
| 2025-09-19 | ThermalGuardian: Temperature-Aware Testing of Automotive Deep Learning Frameworks | Yinglong Zou et.al. | 2509.15815 | null |
| 2025-09-19 | CBPNet: A Continual Backpropagation Prompt Network for Alleviating Plasticity Loss on Edge Devices | Runjie Shao et.al. | 2509.15785 | null |
| 2025-09-19 | Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution | Chang Soo Lim et.al. | 2509.15781 | null |
| 2025-09-18 | Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing | Christopher Oeltjen et.al. | 2509.15423 | null |
| 2025-09-18 | Out-of-Sight Trajectories: Tracking, Fusion, and Prediction | Haichao Zhang et.al. | 2509.15219 | null |
| 2025-09-18 | Digital Twin-based Cooperative Autonomous Driving in Smart Intersections: A Multi-Agent Reinforcement Learning Approach | Taoyuan Yu et.al. | 2509.15099 | null |
| 2025-09-18 | Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression | Xuan Deng et.al. | 2509.14591 | null |
| 2025-09-18 | DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising | Li Gao et.al. | 2509.14565 | null |
| 2025-09-17 | FlowDrive: Energy Flow Field for End-to-End Autonomous Driving | Hao Jiang et.al. | 2509.14303 | link |
| 2025-09-17 | MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping | Zhihao Cao et.al. | 2509.14191 | null |
| 2025-09-17 | BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection | Rongyu Zhang et.al. | 2509.14151 | null |
| 2025-09-17 | SEG-Parking: Towards Safe, Efficient, and Generalizable Autonomous Parking via End-to-End Offline Reinforcement Learning | Zewei Yang et.al. | 2509.13956 | null |
| 2025-09-17 | MAP: End-to-End Autonomous Driving with Map-Assisted Planning | Huilin Yin et.al. | 2509.13926 | null |
| 2025-09-17 | Ensemble of Pre-Trained Models for Long-Tailed Trajectory Prediction | Divya Thuremella et.al. | 2509.13914 | null |
| 2025-09-17 | Data-Efficient Spectral Classification of Hyperspectral Data Using MiniROCKET and HDC-MiniROCKET | Nick Theisen et.al. | 2509.13809 | null |
| 2025-09-17 | AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving | Yuechen Luo et.al. | 2509.13769 | null |
| 2025-09-17 | UM-Depth: Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry | Tae-Wook Um et.al. | 2509.13713 | null |
| 2025-09-17 | FishBEV: Distortion-Resilient Bird’s Eye View Segmentation with Surround-View Fisheye Cameras | Hang Li et.al. | 2509.13681 | null |
| 2025-09-16 | TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning | Momchil S. Tomov et.al. | 2509.13579 | null |
| 2025-09-16 | Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving | Artem Savkin et.al. | 2509.13507 | null |
| 2025-09-16 | Road Obstacle Video Segmentation | Shyam Nandan Rai et.al. | 2509.13181 | null |
| 2025-09-17 | TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving | Jiawei Wang et.al. | 2509.13164 | null |
| 2025-09-16 | An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios | Zhihao Zhang et.al. | 2509.13132 | null |
| 2025-09-16 | Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving | Ruibo Li et.al. | 2509.13116 | null |
| 2025-09-16 | 4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar | Xiao Tang et.al. | 2509.12931 | null |
| 2025-09-16 | StereoCarla: A High-Fidelity Driving Dataset for Generalizable Stereo | Xianda Guo et.al. | 2509.12683 | null |
| 2025-09-16 | Maps for Autonomous Driving: Full-process Survey and Frontiers | Pengxin Chen et.al. | 2509.12632 | null |
| 2025-09-16 | DisorientLiDAR: Physical Attacks on LiDAR-based Localization | Yizhen Lao et.al. | 2509.12595 | null |
| 2025-09-15 | Approaches to Analysis and Design of AI-Based Autonomous Vehicles | Tao Yan et.al. | 2509.12169 | null |
| 2025-09-16 | Embodied Navigation Foundation Model | Jiazhao Zhang et.al. | 2509.12129 | null |
| 2025-09-15 | Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network | Navid Hashemi et.al. | 2509.11838 | null |
| 2025-09-15 | HeLoFusion: An Efficient and Scalable Encoder for Modeling Heterogeneous and Multi-Scale Interactions in Trajectory Prediction | Bingqing Wei et.al. | 2509.11719 | null |
| 2025-09-14 | SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion | Zhiwen Yang et.al. | 2509.11171 | null |
| 2025-09-13 | Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios | Simone Mosco et.al. | 2509.10841 | null |
| 2025-09-11 | Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey | Wei Dai et.al. | 2509.10570 | null |
| 2025-09-17 | DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training | Jianxin Shi et.al. | 2509.10426 | null |
| 2025-09-12 | Multimodal SAM-adapter for Semantic Segmentation | Iacopo Curti et.al. | 2509.10408 | null |
| 2025-09-12 | CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion | Santiago Montiel-Marín et.al. | 2509.10139 | null |
| 2025-09-12 | BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals | Minsang Kong et.al. | 2509.10080 | null |
| 2025-09-11 | MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network | Ge Sun et.al. | 2509.09200 | null |
| 2025-09-10 | LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations | Payal Varshney et.al. | 2509.08422 | null |
| 2025-09-10 | Sparse BEV Fusion with Self-View Consistency for Multi-View Detection and Tracking | Keisuke Toida et.al. | 2509.08421 | null |
| 2025-09-10 | InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection | Zhongyu Xia et.al. | 2509.08374 | null |
| 2025-09-10 | Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities | Rajendramayavan Sathyam et.al. | 2509.08302 | null |
| 2025-09-10 | A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator | Elahe Delavari et.al. | 2509.08221 | null |
| 2025-09-09 | Mean Field Game-Based Interactive Trajectory Planning Using Physics-Inspired Unified Potential Fields | Zhen Tian et.al. | 2509.08147 | null |
| 2025-09-09 | TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models | Zongzheng Zhang et.al. | 2509.07962 | null |
| 2025-09-09 | Bias-Aware Machine Unlearning: Towards Fairer Vision Models via Controllable Forgetting | Sai Siddhartha Chary Aylapuram et.al. | 2509.07456 | null |
| 2025-09-09 | Attention and Risk-Aware Decision Framework for Safe Autonomous Driving | Zhen Tian et.al. | 2509.07412 | null |
| 2025-09-09 | TEGRA: A Flexible & Scalable NextGen Mobile Core | Bilal Saleem et.al. | 2509.07410 | null |
| 2025-09-08 | SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis | Zhengqing Chen et.al. | 2509.06798 | null |
| 2025-09-08 | Adaptive Evolution Factor Risk Ellipse Framework for Reliable and Safe Autonomous Driving | Fujiang Yuan et.al. | 2509.06375 | null |
| 2025-09-07 | Asymmetry Vulnerability and Physical Attacks on Online Map Construction for Autonomous Driving | Yang Lou et.al. | 2509.06071 | null |
| 2025-09-06 | Scenario-based Decision-making Using Game Theory for Interactive Autonomous Driving: A Survey | Zhihao Lin et.al. | 2509.05777 | null |
| 2025-09-06 | Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh | Ha Meem Hossain et.al. | 2509.05652 | null |
| 2025-09-06 | OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision | Ruixun Liu et.al. | 2509.05578 | null |
| 2025-09-08 | LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation | Yinglin Duan et.al. | 2509.05263 | null |
| 2025-09-05 | Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet | Mohammad Saeid et.al. | 2509.05198 | link |
| 2025-09-05 | A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing | Chengkai Xu et.al. | 2509.04853 | null |
| 2025-09-05 | Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization | Dharsan Ravindran et.al. | 2509.04735 | null |
| 2025-09-04 | Bootstrapping Reinforcement Learning with Sub-optimal Policies for Autonomous Driving | Zhihao Zhang et.al. | 2509.04712 | null |
| 2025-09-04 | Domain Adaptation for Different Sensor Configurations in 3D Object Detection | Satoshi Tanaka et.al. | 2509.04711 | null |
| 2025-09-04 | In-Context Policy Adaptation via Cross-Domain Skill Diffusion | Minjong Yoo et.al. | 2509.04535 | null |
| 2025-09-09 | One Flight Over the Gap: A Survey from Perspective to Panoramic Vision | Xin Lin et.al. | 2509.04444 | null |
| 2025-09-04 | TriLiteNet: Lightweight Model for Multi-Task Visual Perception | Quang-Huy Che et.al. | 2509.04092 | null |
| 2025-09-04 | SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation | Han Huang et.al. | 2509.03999 | null |
| 2025-09-03 | SAM-LLM: Interpretable Lane Change Trajectory Prediction via Parametric Finetuning | Zhuo Cao et.al. | 2509.03462 | null |
| 2025-09-03 | Rashomon in the Streets: Explanation Ambiguity in Scene Understanding | Helge Spieker et.al. | 2509.03169 | null |
| 2025-09-03 | Automatically Generating High-Precision Simulated Road Networking in Traffic Scenario | Liang Xie et.al. | 2509.02990 | null |
| 2025-09-03 | KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models | Yujin Wang et.al. | 2509.02966 | null |
| 2025-09-02 | Do LLM Modules Generalize? A Study on Motion Generation for Autonomous Driving | Mingyi Wang et.al. | 2509.02754 | null |
| 2025-09-02 | 2nd Place Solution for CVPR2024 E2E Challenge: End-to-End Autonomous Driving Using Vision Language Model | Zilong Guo et.al. | 2509.02659 | null |
| 2025-09-02 | Omnidirectional Spatial Modeling from Correlated Panoramas | Xinshen Zhang et.al. | 2509.02164 | null |
| 2025-09-02 | Txt2Sce: Scenario Generation for Autonomous Driving System Testing Based on Textual Reports | Pin Ji et.al. | 2509.02150 | null |
| 2025-09-02 | Curiosity-Driven Testing for Sequential Decision-Making Process | Junda He et.al. | 2509.02025 | null |
| 2025-09-02 | Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions | Beibei Zhou et.al. | 2509.02011 | null |
| 2025-09-01 | 2COOOL: 2nd Workshop on the Challenge Of Out-Of-Label Hazards in Autonomous Driving | Ali K. AlShami et.al. | 2508.21080 | null |
| 2025-10-22 | Interpretable Decision-Making for End-to-End Autonomous Driving | Mona Mirzaie et.al. | 2508.18898 | null |
| 2025-02-18 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208 | null |
| 2024-12-05 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | null |
| 2024-04-16 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
| 2023-05-30 | Selective Communication for Cooperative Perception in End-to-End Autonomous Driving | Hsu-kuang Chiu et.al. | 2305.17181 | null |
| 2023-09-14 | Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges | Yushan Han et.al. | 2301.06262 | link |
| 2022-06-28 | A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning | Balint Gyevnar et.al. | 2206.08783 | null |
| 2023-07-31 | Pushing the Limits of Learning-based Traversability Analysis for Autonomous Driving on CPU | Daniel Fusaro et.al. | 2206.03083 | null |
| 2021-11-16 | A Scenario-Based Platform for Testing Autonomous Vehicle Behavior Prediction Models in Simulation | Francis Indaheng et.al. | 2110.14870 | null |
| 2021-11-03 | Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement | Tianyu Shi et.al. | 2110.07067 | null |
| 2021-11-22 | ViSTA: a Framework for Virtual Scenario-based Testing of Autonomous Vehicles | Andrea Piazzoni et.al. | 2109.02529 | null |
| 2021-08-10 | Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge | Songyang Zhang et.al. | 2108.04230 | null |
| 2021-04-23 | Multi-task Learning with Attention for End-to-end Autonomous Driving | Keishi Ishihara et.al. | 2104.10753 | null |
| 2021-03-31 | Multi-modal Trajectory Prediction for Autonomous Driving with Semantic Map and Dynamic Graph Attention Network | Bo Dong et.al. | 2103.16273 | null |
| 2023-05-16 | Control Strategies for Autonomous Vehicles | Chinmay Vilas Samak et.al. | 2011.08729 | null |
| 2019-12-03 | Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles | Pin Wang et.al. | 1912.00074 | null |
| 2019-09-18 | A*3D Dataset: Towards Autonomous Driving in Challenging Environments | Quang-Hieu Pham et.al. | 1909.07541 | null |
| 2017-04-11 | Deep Reinforcement Learning framework for Autonomous Driving | Ahmad El Sallab et.al. | 1704.02532 | null |
(<a href=#updated-on-20260404>back to top</a>)
| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2009-10-21 | Quantum Error Correction Beyond Completely Positive Maps | A. Shabani et.al. | quant-ph/0610028 | null |
| 2026-04-02 | TOL: Textual Localization with OpenStreetMap | Youqi Liao et.al. | 2604.01644 | null |
| 2026-04-01 | Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM | Monica M. Q. Li et.al. | 2604.00804 | null |
| 2026-03-30 | Pandora: Articulated 3D Scene Graphs from Egocentric Vision | Alan Yu et.al. | 2603.28732 | null |
| 2026-03-24 | Active Robotic Perception for Disease Detection and Mapping in Apple Trees | Hayden Feddock et.al. | 2603.23112 | null |
| 2026-03-18 | Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting | Guillem Casadesus Vila et.al. | 2603.18218 | null |
| 2026-03-17 | ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery | Weiqin Jiao et.al. | 2603.16616 | null |
| 2026-03-06 | Word-Anchored Temporal Forgery Localization | Tianyi Wang et.al. | 2603.06220 | null |
| 2026-03-03 | Probabilistic Occupancy Grid for Radio-Based SLAM | Xuhong Li et.al. | 2603.03559 | null |
| 2026-03-02 | Randomized Neural Networks for Partial Differential Equation on Static and Evolving Surfaces | Jingbo Sun et.al. | 2603.01689 | null |
| 2026-03-02 | B$^2$F-Map: Crowd-sourced Mapping with Bayesian B-spline Fusion | Yiping Xie et.al. | 2603.01673 | null |
| 2026-02-25 | Tacmap: Bridging the Tactile Sim-to-Real Gap via Geometry-Consistent Penetration Depth Map | Lei Su et.al. | 2602.21625 | null |
| 2026-02-22 | Beyond Behavioural Trade-Offs: Mechanistic Tracing of Pain-Pleasure Decisions in an LLM | Francesca Bianco et.al. | 2602.19159 | null |
| 2026-03-15 | H.265/HEVC Video Steganalysis Based on CU Block Structure Gradients and IPM Mapping | Xiang Zhang et.al. | 2602.11547 | null |
| 2026-02-08 | SPOT: Spatio-Temporal Obstacle-free Trajectory Planning for UAVs in an Unknown Dynamic Environment | Astik Srivastava et.al. | 2602.01189 | null |
| 2026-02-03 | MapDream: Task-Driven Map Learning for Vision-Language Navigation | Guoxin Lian et.al. | 2602.00222 | null |
| 2026-01-18 | OpenNavMap: Structure-Free Topometric Mapping via Large-Scale Collaborative Localization | Jianhao Jiao et.al. | 2601.12291 | null |
| 2025-12-16 | ACE-SLAM: Scene Coordinate Regression for Neural Implicit Real-Time SLAM | Ignacio Alzugaray et.al. | 2512.14032 | null |
| 2025-12-05 | Categorifying isomonodromic deformations via Lie groupoids I: Logarithmic singularities | Waleed Qaisar et.al. | 2512.05966 | null |
| 2025-12-05 | AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement | Munsif Ali et.al. | 2512.05960 | null |
| 2025-12-05 | World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty | Zhiting Mei et.al. | 2512.05927 | null |
| 2025-12-05 | Invariant polynomials, gaps, and sparseness | John P. D’Angelo et.al. | 2512.05892 | null |
| 2025-12-05 | Machine-learning-enabled interpretation of tribological deformation patterns in large-scale MD data | Hendrik J. Ehrich et.al. | 2512.05818 | null |
| 2025-12-05 | Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation | Fabian Konstantinidis et.al. | 2512.05812 | null |
| 2025-12-05 | Emergence of Language in the Developing Brain | Linnea Evanson et.al. | 2512.05718 | null |
| 2025-12-05 | Physics-Informed Graph Neural Network with Frequency-Aware Learning for Optical Aberration Correction | Yong En Kok et.al. | 2512.05683 | null |
| 2025-12-05 | Scenario-aware Uncertainty Quantification for Trajectory Prediction with Statistical Guarantees | Yiming Shu et.al. | 2512.05682 | null |
| 2025-12-05 | The Power of Network Pluralism: Multi-Perspective Modeling of Heterogeneous Legal Document Networks | Titus Pünder et.al. | 2512.05679 | null |
| 2025-12-05 | Over-the-Air Semantic Alignment with Stacked Intelligent Metasurfaces | Mario Edoardo Pandolfo et.al. | 2512.05657 | null |
| 2025-12-05 | Modular Jets for Supervised Pipelines: Diagnosing Mirage vs Identifiability | Suman Sanyal et.al. | 2512.05638 | null |
| 2025-12-05 | Experts-Guided Unbalanced Optimal Transport for ISP Learning from Unpaired and/or Paired Data | Georgy Perevozchikov et.al. | 2512.05635 | null |
| 2025-12-05 | Sticky eigenstates in systems with sharply-divided phase space | Hua Yan et.al. | 2512.05627 | null |
| 2025-12-05 | Refined HLA Linkage Disequilibrium Architectures of World Populations by a Novel Allelic Correlation Measure | Fei Zhang et.al. | 2512.05573 | null |
| 2025-12-05 | Knowing Your Uncertainty – On the application of LLM in social sciences | Bolun Zhang et.al. | 2512.05461 | null |
| 2025-12-05 | On-Orbit Calibration of Danuri/PolCam. I. Geometric Calibration | Kilho Baek et.al. | 2512.05330 | null |
| 2025-12-04 | Restriction of the metaplectic representation over a $p$-adic field to an anisotropic torus | Khemais Maktouf et.al. | 2512.05317 | null |
| 2026-01-16 | Systematically Evaluating Equivalent Purpose for Digital Maps | Brandon Biggs et.al. | 2512.05310 | null |
| 2025-12-04 | Seabed-to-Sky Mapping of Maritime Environments with a Dual Orthogonal SONAR and LiDAR Sensor Suite | Christian Westerdahl et.al. | 2512.05303 | null |
| 2025-12-04 | Stable Single-Pixel Contrastive Learning for Semantic and Geometric Tasks | Leonid Pogorelyuk et.al. | 2512.04970 | null |
| 2025-12-04 | Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty | Kailiang Liu et.al. | 2512.04918 | null |
| 2025-12-04 | VNS Tokamak OpenMC-Serpent Validation for Medical Isotope Studies | Christopher Ehrich et.al. | 2512.04873 | null |
| 2025-12-04 | LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation | Huynh Trinh Ngoc et.al. | 2512.04821 | null |
| 2025-12-04 | TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards | Mauro Martini et.al. | 2512.04772 | null |
| 2025-12-04 | Spectral micro-CT for quantitative analysis of calcification in fibrocartilage | Vittoria Mazzini et.al. | 2512.04662 | null |
| 2025-12-04 | Standard audiogram classification from loudness scaling data using unsupervised, supervised, and explainable machine learning techniques | Chen Xu et.al. | 2512.04616 | null |
| 2025-12-04 | Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot | Sheng Hang et.al. | 2512.04599 | link |
| 2025-12-04 | Prompt2Craft: Generating Functional Craft Assemblies with LLMs | Vitor Hideyo Isume et.al. | 2512.04568 | null |
| 2025-12-04 | Convergence Dynamics and Scaling Laws in the Dissipative Relativistic Kicked Rotator | Daniel Borin et.al. | 2512.04471 | null |
| 2025-12-04 | MAFNet: Multi-frequency Adaptive Fusion Network for Real-time Stereo Matching | Ao Xu et.al. | 2512.04358 | null |
| 2025-12-03 | UniLight: A Unified Representation for Lighting | Zitian Zhang et.al. | 2512.04267 | null |
| 2025-12-03 | Warped & Hooked: Mapping the Magellanic Clouds in 3D using Red Clump stars | Slater J. Oden et.al. | 2512.04200 | null |
| 2025-12-03 | SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows | Qinyu Zhao et.al. | 2512.04084 | null |
| 2025-12-03 | The Loss Landscape of Powder X-Ray Diffraction-Based Structure Optimization Is Too Rough for Gradient Descent | Nofit Segal et.al. | 2512.04036 | null |
| 2025-12-03 | Learning Group Actions In Disentangled Latent Image Representations | Farhana Hossain Swarnali et.al. | 2512.04015 | null |
| 2025-12-03 | MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction | Guole Shen et.al. | 2512.03939 | null |
| 2025-12-03 | Rethinking Collapse: Coupling Quantum States to Classical Bits with quasi-probabilities | Dagomir Kaszlikowski et.al. | 2512.03929 | null |
| 2025-12-03 | Feature-aware Modulation for Learning from Temporal Tabular Data | Hao-Run Cai et.al. | 2512.03678 | null |
| 2025-12-03 | Multi-Scale Visual Prompting for Lightweight Small-Image Classification | Salim Khazem et.al. | 2512.03663 | null |
| 2025-12-03 | MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms | Jiahao Zhang et.al. | 2512.03640 | null |
| 2025-12-03 | From fractional Chern insulators to topological electronic crystals in moiré MoTe2: quantum geometry tuning via remote layer | Feng Liu et.al. | 2512.03622 | null |
| 2025-12-03 | Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding | Haoran Zhou et.al. | 2512.03601 | null |
| 2025-12-03 | Quantum Hash Function Based on Spectral Properties of Graphs and Discrete Walker Dynamics | Mohana Priya Thinesh Kumar et.al. | 2512.03581 | null |
| 2025-12-03 | GeoVideo: Introducing Geometric Regularization into Video Generation Model | Yunpeng Bai et.al. | 2512.03453 | null |
| 2025-12-03 | What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models | Tianchen Deng et.al. | 2512.03422 | null |
| 2025-12-03 | Surfel-LIO: Fast LiDAR-Inertial Odometry with Pre-computed Surfels and Hierarchical Z-order Voxel Hashing | Seungwon Choi et.al. | 2512.03397 | null |
| 2025-12-03 | New linear invariants of hypergraphs | Peter A. Brooksbank et.al. | 2512.03342 | null |
| 2025-12-03 | Epistemic Substitution: How Grokipedia’s AI-Generated Encyclopedia Restructures Authority | Aliakbar Mehdizadeh et.al. | 2512.03337 | null |
| 2025-12-03 | When does Gaussian equivalence fail and how to fix it: Non-universal behavior of random features with quadratic scaling | Garrett G. Wen et.al. | 2512.03325 | null |
| 2025-12-03 | NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction | Thomas Monninger et.al. | 2512.03317 | null |
| 2025-12-02 | Retrofitting Earth System Models with Cadence-Limited Neural Operator Updates | Aniruddha Bora et.al. | 2512.03309 | null |
| 2025-12-02 | Learning Network Sheaves for AI-native Semantic Communication | Enrico Grimaldi et.al. | 2512.03248 | null |
| 2025-12-02 | CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models | Minkyung Kwon et.al. | 2512.03045 | link |
| 2025-12-02 | U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences | Xiang Xu et.al. | 2512.02982 | link |
| 2025-12-02 | Unipotent quantum coordinate ring and minuscule prefundamental representations: twisted case | Il-Seung Jang et.al. | 2512.02946 | null |
| 2025-12-02 | MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding | Fan Yang et.al. | 2512.02906 | null |
| 2025-12-02 | FAIRY2I: Universal Extremely-Low Bit QAT framework via Widely-Linear Representation and Phase-Aware Quantization | Feiyu Wang et.al. | 2512.02901 | null |
| 2025-12-02 | Assessing the performance of correlation-based multi-fidelity neural emulators | Cristian J. Villatoro et.al. | 2512.02868 | null |
| 2025-12-02 | Revisiting Theory of Contrastive Learning for Domain Generalization | Ali Alvandi et.al. | 2512.02831 | null |
| 2025-12-02 | Implementation and Analysis of Quantum Majority Rules under Noisy Conditions | Gal Amit et.al. | 2512.02813 | null |
| 2025-12-02 | Exploring Definitions of Quality and Diversity in Sonic Measurement Spaces | Björn Þór Jónsson et.al. | 2512.02783 | null |
| 2025-12-02 | Digit-Indexed q-ary SEC-DED Codes with Near-Hamming Overhead | Jiaxu Hu et.al. | 2512.02747 | null |
| 2025-12-02 | Efficient Simulation of the 2D Hubbard Model via Hilbert Space-Filling Curve Mapping | Ashkan Abedi et.al. | 2512.02666 | null |
| 2025-12-02 | PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes | Derui Shan et.al. | 2512.02664 | null |
| 2025-12-02 | Content-Aware Texturing for Gaussian Splatting | Panagiotis Papantonakis et.al. | 2512.02621 | null |
| 2025-12-02 | Quantum LLMs Using Quantum Computing to Analyze and Process Semantic Information | Timo Aukusti Laine et.al. | 2512.02619 | null |
| 2025-12-02 | Interface Correlators in Symmetric Product Orbifolds | Sebastian Harris et.al. | 2512.02616 | null |
| 2025-12-02 | Updates on dipolar anisotropy in local measurements of the Hubble constant from Cosmicflows-4 | Vincenzo Salzano et.al. | 2512.02526 | null |
| 2025-12-02 | Individual-specific precision neuroimaging of learning-related plasticity | Simon Leipold et.al. | 2512.02503 | null |
| 2025-12-02 | Quantum-Based Self-Attention Mechanism for Hardware-Aware Differentiable Quantum Architecture Search | Yuxiang Liu et.al. | 2512.02476 | null |
| 2025-12-02 | nuScenes Revisited: Progress and Challenges in Autonomous Driving | Whye Kit Fong et.al. | 2512.02448 | null |
| 2025-12-02 | Vehicle Dynamics Embedded World Models for Autonomous Driving | Huiqian Li et.al. | 2512.02417 | null |
| 2025-12-01 | ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation | Chenyang Gu et.al. | 2512.02013 | null |
| 2025-12-01 | JWST & the Waz Arc I: Spatially Resolving the Physical Conditions within a Post-Starburst Galaxy at Redshift 5 with NIRSpec IFS | Taylor A. Hutchison et.al. | 2512.02000 | null |
| 2025-12-01 | The Lebesgue constant for uniform approximation of differential forms | Ludovico Bruni Bruno et.al. | 2512.01944 | null |
| 2025-12-01 | SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception | Gurmeher Khurana et.al. | 2512.01908 | null |
| 2025-12-01 | Decision Tree Embedding by Leaf-Means | Cencheng Shen et.al. | 2512.01819 | link |
| 2025-12-01 | Secure Over-the-Air Computation Against Multiple Eavesdroppers using Correlated Artificial Noise | David Nordlund et.al. | 2512.01778 | null |
| 2025-12-01 | DiG-Flow: Discrepancy-Guided Flow Matching for Robust VLA Models | Wanpeng Zhang et.al. | 2512.01715 | link |
| 2025-12-01 | A unified framework for geometry-independent operator learning in cardiac electrophysiology simulations | Bei Zhou et.al. | 2512.01702 | null |
| 2025-12-01 | Integrating Artificial Intelligence and Mixed Integer Linear Programming: Explainable Graph-Based Instance Space Analysis in Air Transportation | Artur Guerra Rosa et.al. | 2512.01698 | null |
| 2026-03-12 | The Spin-MInt Algorithm: an Accurate and Symplectic Propagator for the Spin-Mapping Representation of Nonadiabatic Dynamics | Lauren E. Cook et.al. | 2512.01579 | null |
| 2025-12-01 | QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions | Can Polat et.al. | 2512.01519 | null |
| 2025-12-01 | RadioPiT: Radio Map Generation with Pixel Transformer Driven by Ultra-Sparse Real-World Data | Zeyao Sun et.al. | 2512.01451 | null |
| 2025-12-01 | Consistency Flow Model Achieves One-step Denoising Error Correction Codes | Haoyu Lei et.al. | 2512.01389 | null |
| 2025-12-01 | Reversible Inversion for Training-Free Exemplar-guided Image Editing | Yuke Li et.al. | 2512.01382 | null |
| 2025-12-01 | EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly | Xiaokun Pan et.al. | 2512.01296 | link |
| 2025-12-01 | Diffusion Model in Latent Space for Medical Image Segmentation Task | Huynh Trinh Ngoc et.al. | 2512.01292 | null |
| 2025-12-01 | Knowledge Graph Augmented Large Language Models for Next-Visit Disease Prediction | Ruiyu Wang et.al. | 2512.01210 | null |
| 2025-12-01 | Pay Attention Later: From Vector Space Diffusion to Linearithmic Spectral Phase-Locking | Alper Yıldırım et.al. | 2512.01208 | null |
| 2025-11-30 | Generalized Medical Phrase Grounding | Wenjun Zhang et.al. | 2512.01085 | null |
| 2025-11-30 | Stability analysis of action potential generation using Markov models of voltage-gated sodium channel isoforms | Youssof Abdullah et.al. | 2512.01058 | null |
| 2025-11-28 | Detection of the Pairwise Kinematic Sunyaev-Zel’dovich Effect and Pairwise Velocity with DESI DR1 Galaxies and ACT DR6 and Planck CMB Data | Yulin Gong et.al. | 2511.23417 | null |
| 2025-11-28 | Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon | Isabel Whiteley Tscherniak et.al. | 2511.23384 | null |
| 2025-11-28 | DAONet-YOLOv8: An Occlusion-Aware Dual-Attention Network for Tea Leaf Pest and Disease Detection | Yefeng Wu et.al. | 2511.23222 | null |
| 2025-11-28 | Robust 3DGS-based SLAM via Adaptive Kernel Smoothing | Shouhe Zhang et.al. | 2511.23221 | null |
| 2025-11-28 | Quantum graphs in infinite-dimensions: Hilbert–Schmidts and Hilbert modules | Matthew Daws et.al. | 2511.23121 | null |
| 2025-11-28 | Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM | Shouhe Zhang et.al. | 2511.22968 | null |
| 2025-11-28 | Scalable Diffusion Transformer for Conditional 4D fMRI Synthesis | Jungwoo Seo et.al. | 2511.22870 | null |
| 2025-11-28 | CoordSpeaker: Exploiting Gesture Captioning for Coordinated Caption-Empowered Co-Speech Gesture Generation | Fengyi Fang et.al. | 2511.22863 | null |
| 2025-11-28 | Plumbings of lens spaces and crepant resolutions of compound $A_n$ singularities | Bilun Xie et.al. | 2511.22837 | null |
| 2025-11-27 | A Functional Field Theorem: An Explicit Proof of Axioms and Equations for Applying iSAFT in Polymer Field Theory | Maximo T. Estrada et.al. | 2511.22760 | null |
| 2025-11-27 | Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction | Boyao Zhou et.al. | 2511.22704 | null |
| 2025-11-27 | Emergent Extreme-View Geometry in 3D Foundation Models | Yiwen Zhang et.al. | 2511.22686 | null |
| 2025-11-27 | Spatially Aware Dictionary-Free Eigenfunction Identification for Modeling and Control of Nonlinear Dynamical Systems | David Grasev et.al. | 2511.22648 | null |
| 2025-11-27 | Non-Gaussianity in SMICA | M. Citran et.al. | 2511.22641 | null |
| 2025-11-27 | Bringing Your Portrait to 3D Presence | Jiawei Zhang et.al. | 2511.22553 | link |
| 2025-11-27 | DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA | Ahmad Mohammadshirazi et.al. | 2511.22521 | null |
| 2025-11-27 | Design of Cycles by Impulsive Feedback: Application to Discrete Dosing | Alexander Medvedev et.al. | 2511.22417 | null |
| 2025-11-27 | FADiff: Fusion-Aware Differentiable Optimization for DNN Scheduling on Tensor Accelerators | Shuao Jia et.al. | 2511.22348 | null |
| 2025-11-27 | NOMA Assisted Downlink Power Allocation in Pinching Antenna Systems Using Convolutional Neural Network | Saeed Mohammadzadeh et.al. | 2511.22328 | null |
| 2025-11-27 | UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries | Hoang-Bao Le et.al. | 2511.22253 | null |
| 2025-11-26 | Machine Learning Approaches to Clinical Risk Prediction: Multi-Scale Temporal Alignment in Electronic Health Records | Wei-Chen Chang et.al. | 2511.21561 | null |
| 2025-11-26 | CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation | Shizhe Sun et.al. | 2511.21503 | link |
| 2025-11-26 | Scaling limits of critical FK-decorated random planar maps with $q=4$ | William Da Silva et.al. | 2511.21480 | null |
| 2025-11-26 | $\texttt{CRLS}$: Convolutional Regularized Least Squares Framework for Reduced Order Modeling of Transonic Flows | Muhammad Bilal et.al. | 2511.21425 | null |
| 2025-11-26 | Bombyx: OpenCilk Compilation for FPGA Hardware Acceleration | Mohamed Shahawy et.al. | 2511.21346 | null |
| 2025-11-26 | Discovery and recovery of crystalline materials with property-conditioned transformers | Cyprien Bone et.al. | 2511.21299 | link |
| 2025-11-26 | Rigidity of bounded-type Siegel polynomials | Kostiantyn Drach et.al. | 2511.21246 | null |
| 2025-11-26 | When barchan dunes move over craters | Paulo Vitor Ribeiro Plácido et.al. | 2511.21177 | null |
| 2025-11-26 | Referring Video Object Segmentation with Cross-Modality Proxy Queries | Baoli Sun et.al. | 2511.21139 | link |
| 2025-11-26 | MNM: Multi-level Neuroimaging Meta-analysis with Hyperbolic Brain-Text Representations | Seunghun Baek et.al. | 2511.21092 | null |
| 2025-11-26 | Witness wedges in fidelity-deviation plane: separating teleportation advantage and Bell-inequality violation | Kyoungho Cho et.al. | 2511.21079 | null |
| 2025-11-25 | Exploring Time-Step Size in Reinforcement Learning for Sepsis Treatment | Yingchuan Sun et.al. | 2511.20913 | null |
| 2025-11-25 | Restoring a Missing Meta-Symmetry of Quantum Mechanics | Sheng Ran et.al. | 2511.20907 | null |
| 2025-11-25 | Primal: A Unified Deterministic Framework for Quasi-Orthogonal Hashing and Manifold Learning | Vladimer Khasia et.al. | 2511.20839 | null |
| 2025-11-25 | Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model | Ziyue Wang et.al. | 2511.20636 | null |
| 2025-11-25 | Quantum Key Distribution: Bridging Theoretical Security Proofs, Practical Attacks, and Error Correction for Quantum-Augmented Networks | Nitin Jha et.al. | 2511.20602 | null |
| 2025-11-25 | Time-Domain Linear Model-based Framework for Passive Acoustic Mapping of Cavitation Activity | Tatiana Gelvez-Barrera et.al. | 2511.20551 | null |
| 2025-11-25 | Wide Area Surface Dosimetry with Conformal Scintillator Array for External Beam Radiotherapy | Roman Vasyltsiv et.al. | 2511.20472 | null |
| 2025-11-25 | MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts | Zilong Huang et.al. | 2511.20415 | null |
| 2025-11-26 | VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild | Xin Ming et.al. | 2511.20366 | null |
| 2025-11-25 | Quality-guided UAV Surface Exploration for 3D Reconstruction | Benjamin Sportich et.al. | 2511.20353 | null |
| 2025-11-25 | Plumbing Analog of Molecular Computation | Roger D. Jones et.al. | 2511.20339 | null |
| 2025-11-25 | Data Augmentation Techniques to Reverse-Engineer Neural Network Weights from Input-Output Queries | Alexander Beiser et.al. | 2511.20312 | null |
| 2025-11-25 | In-Context Compositional Learning via Sparse Coding Transformer | Wei Chen et.al. | 2511.20194 | null |
| 2025-11-25 | Alzheimers Disease Progression Prediction Based on Manifold Mapping of Irregularly Sampled Longitudinal Data | Xin Hong et.al. | 2511.20154 | null |
| 2025-11-25 | Designs on the Tautological bundle | Ikeda Yuya et.al. | 2511.20114 | null |
| 2025-11-25 | ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction | Yuanzhe Li et.al. | 2511.20020 | null |
| 2025-11-25 | iRadioDiff: Physics-Informed Diffusion Model for Indoor Radio Map Construction and Localization | Xiucheng Wang et.al. | 2511.20015 | null |
| 2025-11-25 | HybriDLA: Hybrid Generation for Document Layout Analysis | Yufan Chen et.al. | 2511.19919 | null |
| 2025-11-25 | MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization | Chengyue Huang et.al. | 2511.19878 | null |
| 2025-11-25 | DOGE: Differentiable Bezier Graph Optimization for Road Network Extraction | Jiahui Sun et.al. | 2511.19850 | null |
| 2025-11-25 | Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation | Xuewen Liu et.al. | 2511.19835 | link |
| 2025-11-24 | Rigidity of $\mathbf{SU(2)}$ and $\mathbf{SO(3)}$ quantum representations of mapping class groups at prime levels | Pierre Godfard et.al. | 2511.19795 | null |
| 2025-11-24 | Flow Map Distillation Without Data | Shangyuan Tong et.al. | 2511.19428 | null |
| 2025-11-24 | Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection | Zixuan Wang et.al. | 2511.19306 | null |
| 2025-11-24 | IOMMU Support for Virtual-Address Remote DMA in an ARMv8 environment | Antonis Psistakis et.al. | 2511.19258 | null |
| 2025-11-24 | SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control | Yuxuan Wang et.al. | 2511.19236 | null |
| 2025-11-27 | Learning Plug-and-play Memory for Guiding Video Diffusion Models | Selena Song et.al. | 2511.19229 | null |
| 2025-11-24 | In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings | Teresa Guallart-Naval et.al. | 2511.19226 | null |
| 2025-11-24 | MambaRefine-YOLO: A Dual-Modality Small Object Detector for UAV Imagery | Shuyu Cao et.al. | 2511.19134 | null |
| 2025-11-24 | Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation | Siqi Ding et.al. | 2511.19114 | null |
| 2025-11-24 | The TAG array of a multiple sequence alignment | Jannik Olbrich et.al. | 2511.19068 | null |
| 2025-11-26 | Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors | Yuchen Zhou et.al. | 2511.19031 | null |
| 2025-11-24 | 3D Dynamic Radio Map Prediction Using Vision Transformers for Low-Altitude Wireless Networks | Nguyen Duc Minh Quang et.al. | 2511.19019 | null |
| 2025-11-24 | Web of Non-invertible Dualities for (2+1) Dimensional Models with Subsystem Symmetries | Avijit Maity et.al. | 2511.18969 | null |
| 2025-11-24 | GContextFormer: A global context-aware hybrid multi-head attention approach with scaled additive aggregation for multimodal trajectory prediction | Yuzhi Chen et.al. | 2511.18874 | null |
| 2025-11-24 | Mitigating Long-Tail Bias in HOI Detection via Adaptive Diversity Cache | Yuqiu Jiang et.al. | 2511.18811 | null |
| 2025-11-24 | Sentiment Analysis of Financial Text Using Quantum Language Processing QDisCoCirc | Takayuki Sakuma et.al. | 2511.18804 | null |
| 2025-11-24 | ChronoGS: Disentangling Invariants and Changes in Multi-Period Scenes | Zhongtao Wang et.al. | 2511.18794 | null |
| 2025-11-24 | SAOT: An Enhanced Locality-Aware Spectral Transformer for Solving PDEs | Chenhong Zhou et.al. | 2511.18777 | null |
| 2025-11-24 | From Features to Reference Points: Lightweight and Adaptive Fusion for Cooperative Autonomous Driving | Yongqi Zhu et.al. | 2511.18757 | null |
| 2025-11-24 | Robust Multimodal Sentiment Analysis with Distribution-Based Feature Recovery and Fusion | Daiqing Wu et.al. | 2511.18751 | null |
| 2025-11-24 | Seeing What Matters: Visual Preference Policy Optimization for Visual Generation | Ziqi Ni et.al. | 2511.18719 | null |
| 2025-11-21 | DSeq-JEPA: Discriminative Sequential Joint-Embedding Predictive Architecture | Xiangteng He et.al. | 2511.17354 | link |
| 2025-11-21 | Learning Latent Transmission and Glare Maps for Lens Veiling Glare Removal | Xiaolong Qian et.al. | 2511.17353 | null |
| 2025-11-21 | Phase-adjusted realification of a $\mathbb{C}^3$ Kochen-Specker configuration into $\mathbb{R}^6$ | Andrei Khrennikov et.al. | 2511.17223 | null |
| 2025-11-21 | FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception | Shubham Sonarghare et.al. | 2511.17210 | null |
| 2025-11-21 | SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors | Kunyi Li et.al. | 2511.17207 | null |
| 2025-11-21 | A lightweight detector for real-time detection of remote sensing images | Qianyi Wang et.al. | 2511.17147 | null |
| 2025-11-21 | Generative MIMO Beam Map Construction for Location Recovery and Beam Tracking | Wangqian Chen et.al. | 2511.17007 | null |
| 2025-11-21 | MirrorMind: Empowering OmniScientist with the Expert Perspectives and Collective Knowledge of Human Scientists | Qingbin Zeng et.al. | 2511.16997 | null |
| 2025-11-21 | MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis | Di Luo et.al. | 2511.16957 | null |
| 2025-11-21 | UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation | Chi Zhang et.al. | 2511.16917 | null |
| 2025-11-21 | A deep ALMA Band 3 survey of HDFS/MUSE3D: Survey description and initial results | Hugo Messias et.al. | 2511.16909 | null |
| 2025-11-20 | Evolution mapping III: A new recipe for the halo mass function | Andrea Fiorilli et.al. | 2511.16730 | null |
| 2025-11-20 | Quasiparticle Variational Quantum Eigensolver | Saavanth Velury et.al. | 2511.16721 | null |
| 2025-11-20 | SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation | Zhenyuan Qin et.al. | 2511.16666 | null |
| 2025-11-20 | Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems | Elias Lumer et.al. | 2511.16654 | null |
| 2025-11-20 | Toward Artificial Palpation: Representation Learning of Touch on Soft Bodies | Zohar Rimon et.al. | 2511.16596 | null |
| 2025-11-21 | POMA-3D: The Point Map Way to 3D Scene Understanding | Ye Mao et.al. | 2511.16567 | null |
| 2025-11-20 | TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval | Özay Ezerceli et.al. | 2511.16528 | null |
| 2025-11-20 | Flow-Aided Flight Through Dynamic Clutters From Point To Motion | Bowen Xu et.al. | 2511.16372 | null |
| 2025-11-20 | Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling | Minseok Seo et.al. | 2511.16301 | null |
| 2025-11-20 | Optimizing 3D Gaussian Splattering for Mobile GPUs | Md Musfiqur Rahman Sanim et.al. | 2511.16298 | null |
| 2025-11-20 | Exponential map in DT theory | Sarunas Kaubrys et.al. | 2511.16261 | null |
| 2025-11-20 | Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning | Yibin Huang et.al. | 2511.16160 | null |
| 2025-11-20 | VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation | Chenyang Wu et.al. | 2511.16124 | null |
| 2025-11-20 | Clustered Error Correction with Grouped 4D Gaussian Splatting | Taeho Kang et.al. | 2511.16112 | null |
| 2025-11-20 | Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2511.16091 | null |
| 2025-11-20 | JWST observations of cosmic-ray-excited H$_2$ in Barnard 68: spatial variations and constraints on cosmic-ray attenuation | David A. Neufeld et.al. | 2511.16003 | null |
| 2025-11-20 | Self-supervised and Multi-fidelity Learning for Extended Predictive Soil Spectroscopy | Luning Sun et.al. | 2511.15965 | null |
| 2025-11-20 | A Simple and Robust Multi-Fidelity Data Fusion Method for Effective Modeling of Citizen-Science Air Pollution Data | Camilla Andreozzi et.al. | 2511.15942 | null |
| 2025-11-19 | A scattering perspective on gravitational lensing | Mariana Carrillo Gonzalez et.al. | 2511.15797 | null |
| 2025-11-19 | Joint Semantic-Channel Coding and Modulation for Token Communications | Jingkai Ying et.al. | 2511.15699 | null |
| 2025-11-19 | The JWST weather report from the nearest brown dwarfs III: Heterogeneous clouds and Thermochemical instabilities as possible drivers of WISE 1049AB’s spectroscopic variability | Natalia Oliveros-Gomez et.al. | 2511.15667 | null |
| 2025-11-19 | Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography | Shourov Joarder et.al. | 2511.15640 | null |
| 2025-11-19 | Cartan meets Cramér-Rao | Sunder Ram Krishnan et.al. | 2511.15612 | null |
| 2025-11-19 | From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers | Huiyuan Tian et.al. | 2511.15572 | null |
| 2025-11-19 | RS-CA-HSICT: A Residual and Spatial Channel Augmented CNN Transformer Framework for Monkeypox Detection | Rashid Iqbal et.al. | 2511.15476 | null |
| 2025-11-19 | Fidelity-Preserving Quantum Encoding for Quantum Neural Networks | Yuhu Lu et.al. | 2511.15363 | null |
| 2025-11-19 | Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection | Spyridon Loukovitis et.al. | 2511.15343 | null |
| 2025-11-19 | Physics-Based Benchmarking Metrics for Multimodal Synthetic Images | Kishor Datta Gupta et.al. | 2511.15204 | null |
| 2025-11-19 | SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection | Chun-Jung Lin et.al. | 2511.15153 | null |
| 2025-11-19 | CASPER: Cross-modal Alignment of Spatial and single-cell Profiles for Expression Recovery | Amit Kumar et.al. | 2511.15139 | null |
| 2025-11-19 | Proper derivation of subspace mapping from whole space mapping in boson expansion theory | Kimikazu Taniguchi et.al. | 2511.15129 | null |
| 2025-11-19 | WiCo-MG: Wireless Channel Foundation Model for Multipath Generation via Synesthesia of Machines | Zengrui Han et.al. | 2511.15026 | null |
| 2025-11-18 | EGSA-PT: Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects | Gbenga Omotara et.al. | 2511.14970 | null |
| 2025-11-18 | From minimal-length quantum theory to modified gravity | Rocco D’Agostino et.al. | 2511.14869 | null |
| 2025-11-18 | Geometry of Generalized Density Functional Theories | Chih-Chun Wang et.al. | 2511.14822 | null |
| 2025-11-18 | High-resolution weak lensing mass mapping from DES-Y3 data using diffusion-based prior | Supranta S. Boruah et.al. | 2511.14667 | null |
| 2025-11-18 | Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains | Qingwei Ben et.al. | 2511.14625 | null |
| 2025-11-18 | A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder | Dengyun Huang et.al. | 2511.14600 | null |
| 2025-11-18 | A Bayesian INLA-SPDE Approach to Spatio-Temporal Point-Grid Fusion with Change-of-Support and Misaligned Covariates | Weiyue Zheng et.al. | 2511.14535 | null |
| 2025-11-18 | A Generative Data Framework with Authentic Supervision for Underwater Image Restoration and Enhancement | Yufeng Tian et.al. | 2511.14521 | null |
| 2025-11-18 | Adversarial Learning-Based Radio Map Reconstruction for Fingerprinting Localization | Jiaming Zhang et.al. | 2511.14495 | null |
| 2025-11-18 | Covariance-based Imaging and Multi-View Fusion for Networked Sensing | Junyuan Gao et.al. | 2511.14490 | null |
| 2025-11-18 | Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation | Aditi Agarwal et.al. | 2511.14481 | null |
| 2025-11-18 | Going Places: Place Recognition in Artificial and Natural Systems | Michael Milford et.al. | 2511.14341 | null |
| 2025-11-18 | MA-SLAM: Active SLAM in Large-Scale Unknown Environment using Map Aware Deep Reinforcement Learning | Yizhen Yin et.al. | 2511.14330 | null |
| 2025-11-18 | LSP-YOLO: A Lightweight Single-Stage Network for Sitting Posture Recognition on Embedded Devices | Nanjun Li et.al. | 2511.14322 | null |
| 2025-11-18 | Secure parameter identification of ARX systems with CKKS cryptosystem | Jialong Chen et.al. | 2511.14267 | null |
| 2025-11-18 | Harnessing Deep LLM Participation for Robust Entity Linking | Jiajun Hou et.al. | 2511.14181 | null |
| 2025-11-18 | SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts | Fan Zhang et.al. | 2511.14093 | null |
| 2025-11-17 | Structural Flexibility of the TCF7L2-DNA Complex with the Type 2 Diabetes SNP rs7903146 | Karthik Venuturimilli et.al. | 2511.13916 | null |
| 2025-11-17 | TaoSearchEmb: A Multi-Objective Reinforcement Learning Framework for Dense Retrieval in Taobao Search | Xingxian Liu et.al. | 2511.13885 | null |
| 2025-11-17 | Dynamic state estimation of hybrid systems: Inverters that switch between grid-following and grid-forming control schemes | Bukunmi G. Odunlami et.al. | 2511.13872 | null |
| 2025-11-17 | GRLoc: Geometric Representation Regression for Visual Localization | Changyang Li et.al. | 2511.13864 | null |
| 2025-11-17 | RSPose: Ranking Based Losses for Human Pose Estimation | Muhammed Can Keles et.al. | 2511.13857 | null |
| 2025-11-17 | Aletheia: Emulating the non-linear matter power spectrum in the context of evolution mapping | Ariel G. Sanchez et.al. | 2511.13826 | null |
| 2025-11-17 | Bosonisation Cohomology: Spin Structure Summation in Every Dimension | Philip Boyle Smith et.al. | 2511.13718 | null |
| 2025-11-17 | Composition and Coherence: The Syntax of Operator Networks | Shih-Yu Chang et.al. | 2511.13706 | null |
| 2025-11-17 | Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting | Jiangnan Ye et.al. | 2511.13684 | null |
| 2025-11-17 | HilbMult: A Banach-Enriched Multicategory for Operator Algebras | Shih-Yu Chang et.al. | 2511.13674 | null |
| 2025-11-17 | Universal Kernel Models for Iterated Completely Positive Maps | James Tian et.al. | 2511.13599 | null |
| 2025-11-17 | Electron Correlation by Exchange Mapping in Electronic Structure Calculations | Jerry L. Whitten et.al. | 2511.13570 | null |
| 2025-11-17 | Sequences of Bivariate Bicycle Codes from Covering Graphs | Benjamin C. B. Symons et.al. | 2511.13560 | null |
| 2025-11-17 | Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew | Farhin Farhad Riya et.al. | 2511.13535 | null |
| 2025-11-17 | FUSE: A Flow-based Mapping Between Shapes | Lorenzo Olearo et.al. | 2511.13431 | null |
| 2025-11-17 | Learning Cosmology from Nearest Neighbour Statistics | Atrideb Chatterjee et.al. | 2511.13393 | null |
| 2025-11-17 | Cognitive Maps in Language Models: A Mechanistic Analysis of Spatial Planning | Caroline Baumgartner et.al. | 2511.13371 | null |
| 2025-11-17 | Unifying points of interest taxonomies: mapping OpenStreetMap tags to the Foursquare category system | Lilou Soulas et.al. | 2511.13369 | null |
| 2025-11-17 | Computer Vision based group activity detection and action spotting | Narthana Sivalingam et.al. | 2511.13315 | null |
| 2025-11-17 | PyPeT: A Python Perfusion Tool for Automated Quantitative Brain CT and MR Perfusion Analysis | Marijn Borghouts et.al. | 2511.13310 | null |
| 2025-11-17 | The free Banach $f$-algebra generated by a Banach space | David Muñoz-Lahoz et.al. | 2511.13299 | null |
| 2025-11-17 | Vortex creep heating in neutron star cooling with direct Urca processes in heavy neutron stars | Yoonhak Nam et.al. | 2511.13263 | null |
| 2025-11-17 | GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry | Chiyun Noh et.al. | 2511.13216 | null |
| 2025-11-17 | Collision-Free Navigation of Mobile Robots via Quadtree-Based Model Predictive Control | Osama Al Sheikh Ali et.al. | 2511.13188 | null |
| 2025-11-17 | Region-Point Joint Representation for Effective Trajectory Similarity Learning | Hao Long et.al. | 2511.13125 | null |
| 2025-11-17 | A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features | Hanzhe Liang et.al. | 2511.13115 | null |
| 2025-11-14 | Estimating Total Effects in Bipartite Experiments with Spillovers and Partial Eligibility | Albert Tan et.al. | 2511.11564 | null |
| 2025-11-14 | Terrain Costmap Generation via Scaled Preference Conditioning | Luisa Mao et.al. | 2511.11529 | null |
| 2025-11-14 | Lipschitz modulus of the argmin mapping in convex quadratic optimization | María Josefa Cánovas et.al. | 2511.11455 | null |
| 2025-11-14 | VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation | Maximilian Rokuss et.al. | 2511.11450 | null |
| 2025-11-14 | Robust inverse material design with physical guarantees using the Voigt-Reuss Net | Sanath Keshav et.al. | 2511.11388 | null |
| 2025-11-14 | SCL Decoding of Non-Binary Linear Block Codes | Jingyu Lin et.al. | 2511.11256 | null |
| 2025-11-14 | Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs | Jitesh Chavan et.al. | 2511.11243 | null |
| 2025-11-14 | RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting | Ruocheng Wu et.al. | 2511.11213 | null |
| 2025-11-14 | Galactic foreground residual biases in CMB lensing convergence reconstruction and delensing of B-mode maps | Kishan Deka et.al. | 2511.11147 | null |
| 2025-11-14 | Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation | Zhiwei Zhang et.al. | 2511.11011 | null |
| 2025-11-14 | Binary Verification for Zero-Shot Vision | Jeffrey Liu et.al. | 2511.10983 | null |
| 2025-11-14 | A proposal to construct the dark-matter-only counterpart of the observed Universe combining weak lensing and baryon censuses | Shuren Zhou et.al. | 2511.10975 | null |
| 2025-11-14 | Heterogeneous Complementary Distillation | Liuchi Xu et.al. | 2511.10942 | null |
| 2025-11-14 | A Compilation Framework for Quantum Circuits with Mid-Circuit Measurement Error Awareness | Ming Zhong et.al. | 2511.10921 | null |
| 2025-11-13 | The Map of Misbelief: Tracing Intrinsic and Extrinsic Hallucinations Through Attention Patterns | Elyes Hajji et.al. | 2511.10837 | null |
| 2025-11-13 | Neural Local Wasserstein Regression | Inga Girshfeld et.al. | 2511.10824 | null |
| 2025-11-13 | Universal Thermodynamic Uncertainty Relation for Quantum $f$-Divergences | Domingos S. P. Salazar et.al. | 2511.10817 | null |
| 2025-11-13 | Transformers know more than they can tell – Learning the Collatz sequence | François Charton et.al. | 2511.10811 | null |
| 2025-11-13 | Semantic Property Maps for Driving Applications | Marcus Greiff et.al. | 2511.10798 | null |
| 2025-11-13 | Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces | Farhan Sheth et.al. | 2511.10793 | null |
| 2025-11-13 | Domination between non-Fuchsian representations and anti-de Sitter geometry | Farid Diaf et.al. | 2511.10570 | null |
| 2025-11-13 | OmniVGGT: Omni-Modality Driven Visual Geometry Grounded | Haosong Peng et.al. | 2511.10560 | link |
| 2025-11-13 | A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space | Huijie Liu et.al. | 2511.10555 | link |
| 2025-11-13 | Noncommutative tensor triangular geometry: modules, bimodules, and unipotent Hopf algebras | Øyvind Solberg et.al. | 2511.10531 | null |
| 2025-11-13 | Learning Post-Newtonian Corrections from Numerical Relativity | Jooheon Yoo et.al. | 2511.10522 | null |
| 2025-11-13 | LLM-YOLOMS: Large Language Model-based Semantic Interpretation and Fault Diagnosis for Wind Turbine Components | Yaru Li et.al. | 2511.10394 | null |
| 2025-11-13 | Modeling Layout Abstractions Using Integer Set Relations | Somashekaracharya G Bhaskaracharya et.al. | 2511.10374 | null |
| 2025-11-13 | Ancilla-Free Fast-Forwarding Lindbladian Simulation Algorithms by Hamiltonian Twirling | Minbo Gao et.al. | 2511.10253 | null |
| 2025-11-13 | VISTA: A Vision and Intent-Aware Social Attention Framework for Multi-Agent Trajectory Prediction | Stephane Da Silva Martins et.al. | 2511.10203 | null |
| 2025-11-13 | GPR: Towards a Generative Pre-trained One-Model Paradigm for Large-Scale Advertising Recommendation | Jun Zhang et.al. | 2511.10138 | null |
| 2025-11-13 | Tailored Three Dimensional Betatron Dynamics in UltraStable Hybrid Laser Plasma RF Accelerators | A. A. Molavi Choobini et.al. | 2511.10096 | null |
| 2025-11-13 | Tree-Based Stochastic Optimization for Solving Large-Scale Urban Network Security Games | Shuxin Zhuang et.al. | 2511.10072 | null |
| 2025-11-13 | DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection | Feiyang Jia et.al. | 2511.10035 | null |
| 2025-11-13 | Convergent series of Stokes wave of arbitrary height in deep water via machine learning | Chong Lin et.al. | 2511.09927 | null |
| 2025-11-13 | Compensating Distribution Drifts in Class-incremental Learning of Pre-trained Vision Transformers | Xuan Rao et.al. | 2511.09926 | null |
| 2025-11-12 | A Smooth Penalty-Based Feedback Law for Reactive Obstacle Avoidance with Convergence Guarantees | Lyes Smaili et.al. | 2511.09799 | null |
| 2025-11-12 | Towards model-free stellar chemical abundances. Potential applications in the search for chemically peculiar stars in large spectroscopic surveys | Theosamuele Signor et.al. | 2511.09733 | null |
| 2025-11-12 | Efficient Hyperdimensional Computing with Modular Composite Representations | Marco Angioli et.al. | 2511.09708 | null |
| 2025-11-12 | Entanglement, Yang-Mills, and the Scattering Matrix as an SU(N)-equivariant Kernel | Kun-Feng Lyu et.al. | 2511.09623 | null |
| 2025-11-12 | Probing the Critical Behavior of a Sign-Problematic Model with Monte Carlo Simulations | Ye Ling et.al. | 2511.09356 | null |
| 2025-11-05 | SENT Map – Semantically Enhanced Topological Maps with Foundation Models | Raj Surya Rajendran Kathirvel et.al. | 2511.03165 | null |
| 2025-11-03 | An Adjoint Method for Differentiable Fluid Simulation on Flow Maps | Zhiqi Li et.al. | 2511.01259 | null |
| 2025-11-15 | OmniNWM: Omniscient Driving Navigation World Models | Bohan Li et.al. | 2510.18313 | null |
| 2025-10-17 | HEADER: Hierarchical Robot Exploration via Attention-Based Deep Reinforcement Learning with Expert-Guided Reward | Yuhong Cao et.al. | 2510.15679 | null |
| 2025-11-13 | UniGS: Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering | Yusen Xie et.al. | 2510.12174 | null |
| 2025-10-13 | ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training | Leonard Bruns et.al. | 2510.11605 | null |
| 2025-10-10 | Robust Visual Teach-and-Repeat Navigation with Flexible Topo-metric Graph Map Representation | Jikai Wang et.al. | 2510.09089 | null |
| 2025-10-06 | OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS | Simon Boche et.al. | 2510.04612 | null |
| 2025-10-05 | Constructing coherent spatial memory in LLM agents through graph rectification | Puzhen Zhang et.al. | 2510.04195 | null |
| 2025-10-01 | A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features | Axel Barroso-Laguna et.al. | 2510.00978 | null |
| 2025-09-30 | Updates to the WFC3/UVIS Saturation Map | Mitchell Revalski et.al. | 2510.00097 | null |
| 2025-09-30 | Memory-Efficient 2D/3D Shape Assembly of Robot Swarms | Shuoyu Yue et.al. | 2509.26518 | null |
| 2025-09-30 | Classical feature map surrogates and metrics for quantum control landscapes | Martino Calzavara et.al. | 2509.25930 | null |
| 2025-09-25 | Neural Integrated Sensing and Communication for the MIMO-OFDM Downlink | Ziyi Wang et.al. | 2509.21118 | null |
| 2025-09-18 | Semantic-LiDAR-Inertial-Wheel Odometry Fusion for Robust Localization in Large-Scale Dynamic Environments | Haoxuan Jiang et.al. | 2509.14999 | null |
| 2025-09-17 | Charting trajectories of human thought using large language models | Matthew M Nour et.al. | 2509.14455 | null |
| 2025-10-30 | FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph | Xiaolin Zhou et.al. | 2509.13733 | null |
| 2025-09-16 | Maps for Autonomous Driving: Full-process Survey and Frontiers | Pengxin Chen et.al. | 2509.12632 | null |
| 2025-09-15 | Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing | Bingyu Li et.al. | 2509.12040 | null |
| 2025-09-11 | ObjectReact: Learning Object-Relative Control for Visual Navigation | Sourav Garg et.al. | 2509.09594 | null |
| 2025-09-01 | Hierarchical Motion Captioning Utilizing External Text Data Source | Clayton Leite et.al. | 2509.01471 | null |
| 2025-08-19 | MMIS-Net for Retinal Fluid Segmentation and Detection | Nchongmaje Ndipenocha et.al. | 2508.13936 | null |
| 2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
| 2025-07-29 | MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving | Thomas Monninger et.al. | 2507.21423 | null |
| 2025-09-15 | RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction | Yuqing Lan et.al. | 2507.17594 | null |
| 2025-07-15 | Mapping Fusion: Improving FPGA Technology Mapping with ASIC Mapper | Cunxi Yu et.al. | 2507.10912 | null |
| 2025-07-07 | Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR | Tao Du et.al. | 2507.04662 | null |
| 2025-07-11 | Learning to Generate Vectorized Maps at Intersections with Multiple Roadside Cameras | Quanxin Zheng et.al. | 2507.02899 | null |
| 2025-06-27 | Norm-dependent Lamperti-type MAP representations of stable processes and Brownian motions in the orthant | Andreas E. Kyprianou et.al. | 2506.22020 | null |
| 2025-06-26 | CURL-SLAM: Continuous and Compact LiDAR Mapping | Kaicheng Zhang et.al. | 2506.21077 | null |
| 2025-06-25 | Communication-Aware Map Compression for Online Path-Planning: A Rate-Distortion Approach | Ali Reza Pedram et.al. | 2506.20579 | null |
| 2025-07-16 | Cross-Layer Discrete Concept Discovery for Interpreting Language Models | Ankur Garg et.al. | 2506.20040 | null |
| 2025-06-17 | TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping | Jeewon Kim et.al. | 2506.14178 | null |
| 2025-06-16 | Complexity of Coexistence Regions in the GRHT Map | Sishu Shankar Muni et.al. | 2506.13515 | null |
| 2025-06-09 | ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving | Yongkang Li et.al. | 2506.08052 | null |
| 2025-06-07 | Multimodal Spatial Language Maps for Robot Navigation and Manipulation | Chenguang Huang et.al. | 2506.06862 | null |
| 2025-08-19 | Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU | Wenhao Dai et.al. | 2506.06095 | null |
| 2025-07-31 | X-ray Polarization Detection of the Pulsar Wind Nebula in G21.5-0.9 with IXPE | Niccolò Di Lalla et.al. | 2506.05630 | null |
| 2025-08-13 | DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes | Jiajun Jiang et.al. | 2506.01950 | null |
| 2025-06-05 | ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real | Youwei Yu et.al. | 2506.01759 | null |
| 2025-06-01 | Globally Consistent RGB-D SLAM with 2D Gaussian Splatting | Xingguang Zhong et.al. | 2506.00970 | null |
| 2025-05-29 | Bridging Scales in Map Generation: A scale-aware cascaded generative mapping framework for seamless and consistent multi-scale cartographic representation | Chenxing Sun et.al. | 2502.04991 | null |
| 2025-02-10 | MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction | Xiaoshuai Hao et.al. | 2502.04377 | null |
| 2025-02-07 | Construction of an invertible mapping to boundary conforming coordinates for arbitrarily shaped toroidal domains | Robert Babin et.al. | 2411.04683 | null |
| 2024-12-30 | Local Map Construction with SDMap: A Comprehensive Survey | Jiaqi Li et.al. | 2409.02415 | null |
| 2024-10-15 | MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping | Jiacheng Chen et.al. | 2403.15951 | null |
| 2023-06-08 | NeMO: Neural Map Growing System for Spatiotemporal Fusion in Bird’s-Eye-View and BDD-Map Benchmark | Xi Zhu et.al. | 2306.04540 | null |
| 2023-03-07 | Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit Representation | Xingrui Yang et.al. | 2210.15858 | null |
| 2022-10-17 | Fast genomic optical map assembly algorithm using binary representation | Przemysław Stawczyk et.al. | 2210.06865 | null |
| 2023-08-17 | Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping | Bodong Zhou et.al. | 2208.11061 | link |
| 2023-05-18 | LiDAR Road-Atlas: An Efficient Map Representation for General 3D Urban Environment | Banghe Wu et.al. | 2204.05727 | link |
| 2023-08-08 | NeuralBlox: Real-Time Neural Representation Fusion for Robust Volumetric Mapping | Stefan Lionar et.al. | 2110.09415 | null |
| 2021-04-08 | VGF-Net: Visual-Geometric Fusion Learning for Simultaneous Drone Navigation and Height Mapping | Yilin Liu et.al. | 2104.03109 | null |
| 2022-09-23 | Distributed Dynamic Map Fusion via Federated Learning for Intelligent Networked Vehicles | Zijian Zhang et.al. | 2103.03786 | null |
| 2020-03-13 | Learning word-referent mappings and concepts from raw inputs | Wai Keen Vong et.al. | 2003.05573 | null |
| 2019-08-01 | Recovery Map for Fermionic Gaussian Channels | Brian Swingle et.al. | 1811.04956 | null |
| 2017-11-15 | Finiteness of Mapping Class Group Representations from Twisted Dijkgraaf-Witten Theory | Paul Gustafson et.al. | 1610.06069 | null |
| 2017-10-18 | The moment map on symplectic vector space and oscillator representation | Takashi Hashimoto et.al. | 1408.6597 | null |
| 2015-06-15 | Maximum Likelihood Fusion of Stochastic Maps | Brandon Jones et.al. | 1303.6170 | null |
| 2008-03-13 | Quantum Reference Frames and the Classification of Rotationally-Invariant Maps | J. -C. Boileau et.al. | 0709.0142 | null |

(<a href=#updated-on-20260404>back to top</a>)

| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-03-22 | Domain Elastic Transform: Bayesian Function Registration for High-Dimensional Scientific Data | Osamu Hirose et.al. | 2603.21235 | null |
| 2026-03-20 | Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement | Chunlei Zhang et.al. | 2603.19623 | link |
| 2026-03-02 | Preoperative-to-intraoperative Liver Registration for Laparoscopic Surgery via Latent-Grounded Correspondence Constraints | Ruize Cui et.al. | 2603.01720 | null |
| 2026-01-22 | Coarse-to-Fine Non-rigid Multi-modal Image Registration for Historical Panel Paintings based on Crack Structures | Aline Sindel et.al. | 2601.16348 | null |
| 2026-01-09 | Deformation-Aware Observation Modeling for Radar-Based Human Sensing via 3D Scan-Depth Sequence Fusion | Guangqi Shi et.al. | 2601.05676 | null |
| 2025-12-27 | Multimodal Diffeomorphic Registration with Neural ODEs and Structural Descriptors | Salvador Rodriguez-Sanz et.al. | 2512.22689 | null |
| 2025-12-16 | Test Time Optimized Generalized AI-based Medical Image Registration Method | Sneha Sree C. et.al. | 2512.14556 | null |
| 2025-12-01 | Robust Rigid and Non-Rigid Medical Image Registration Using Learnable Edge Kernels | Ahsan Raza Siyal et.al. | 2512.01771 | null |
| 2025-11-19 | Coarse-to-Fine Non-Rigid Registration for Side-Scan Sonar Mosaicking | Can Lei et.al. | 2512.00052 | null |
| 2025-11-06 | Systematic Evaluation of Preprocessing Techniques for Accurate Image Registration in Digital Pathology | Fatemehzahra Darzi et.al. | 2511.04171 | null |
| 2025-11-25 | CORE – A Cell-Level Coarse-to-Fine Image Registration Engine for Multi-stain Image Alignment | Esha Sadia Nasir et.al. | 2511.03826 | null |
| 2025-12-06 | Structural Stress as a Predictor of the Rate and Spatial Location of Aortic Growth in Uncomplicated Type B Aortic Dissection | Yuhang Du et.al. | 2511.03287 | null |
| 2025-10-30 | Simultaneous optimization of non-coplanar beam orientations and cumulative EQD2 distribution for high-dose reirradiation of locoregionally recurrent non-small cell lung cancer | Nathan Torelli et.al. | 2510.26272 | null |
| 2025-10-21 | MorphModes: Non-rigid Registration via Adaptive Skinning Eigenmodes | Gabrielle Browne et.al. | 2510.18658 | null |
| 2025-10-17 | ERNet: Efficient Non-Rigid Registration Network for Point Sequences | Guangzhao He et.al. | 2510.15800 | null |
| 2026-02-24 | A geometric feature tracking approach for noninvasive patient specific estimation of leaflet strain from 3D images of heart valves | Wensi Wu et.al. | 2510.06578 | null |
| 2025-09-12 | Human Body Segment Volume Estimation with Two RGB-D Cameras | Giulia Bassani et.al. | 2509.10429 | null |
| 2025-09-09 | A Comprehensive Pipeline for Aortic Segmentation and Shape Analysis | Nairouz Shehata et.al. | 2509.09718 | null |
| 2025-08-19 | Shape-from-Template with Generalised Camera | Agniva Sengupta et.al. | 2508.13791 | null |
| 2025-08-24 | FractMorph: A Fractional Fourier-Based Multi-Domain Transformer for Deformable Image Registration | Shayan Kebriti et.al. | 2508.12445 | null |
| 2025-07-27 | PIVOTS: Aligning unseen Structures using Preoperative to Intraoperative Volume-To-Surface Registration for Liver Navigation | Peng Liu et.al. | 2507.20337 | null |
| 2025-07-10 | X-RAFT: Cross-Modal Non-Rigid Registration of Blue and White Light Neurosurgical Hyperspectral Images | Charlie Budd et.al. | 2507.07747 | null |
| 2025-11-12 | Geo-Registration of Terrestrial LiDAR Point Clouds with Satellite Images without GNSS | Xinyu Wang et.al. | 2507.05999 | null |
| 2025-07-06 | Robot-assisted Transcranial Magnetic Stimulation (Robo-TMS): A Review | Wenzhi Bai et.al. | 2507.04345 | null |
| 2025-07-28 | ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction | Juming Xiong et.al. | 2506.21923 | null |
| 2025-06-26 | A Novel Framework for Integrating 3D Ultrasound into Percutaneous Liver Tumour Ablation | Shuwei Xing et.al. | 2506.21162 | null |
| 2025-05-29 | VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration | Ben Li et.al. | 2505.23439 | null |
| 2025-05-28 | NFR: Neural Feature-Guided Non-Rigid Shape Registration | Puhua Jiang et.al. | 2505.22445 | null |
| 2025-05-19 | GuidedMorph: Two-Stage Deformable Registration for Breast MRI | Yaqian Chen et.al. | 2505.13414 | null |
| 2025-05-28 | GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats | Simeon Adebola et.al. | 2505.10923 | link |
| 2025-04-21 | Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection | Jun Zhou et.al. | 2504.15152 | null |
| 2025-04-11 | X2BR: High-Fidelity 3D Bone Reconstruction from a Planar X-Ray Image with Hybrid Neural Implicit Methods | Gokce Guven et.al. | 2504.08675 | null |
| 2025-03-22 | MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability | Paul Hill et.al. | 2503.17700 | null |
| 2025-02-26 | An anatomically-informed correspondence initialisation method to improve learning-based registration for radiotherapy | Edward G. A. Henderson et.al. | 2502.19101 | null |
| 2025-02-15 | Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy | Mingyang Zhao et.al. | 2502.10704 | null |
| 2025-04-03 | MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography | Xinkai Wang et.al. | 2502.08786 | null |
| 2025-01-10 | A Steerable Deep Network for Model-Free Diffusion MRI Registration | Gianfranco Cortes et.al. | 2501.04794 | null |
| 2024-10-31 | UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration | Geng Li et.al. | 2410.22909 | null |
| 2025-06-06 | SynBench: A Synthetic Benchmark for Non-rigid 3D Point Cloud Registration | Sara Monji-Azad et.al. | 2409.14474 | null |
| 2025-10-01 | SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration | Yuxin Yao et.al. | 2405.20188 | link |
| 2024-04-19 | DeeperHistReg: Robust Whole Slide Images Registration Framework | Marek Wodzinski et.al. | 2404.14434 | null |
| 2024-04-26 | RegWSI: Whole Slide Image Registration using Combined Deep Feature- and Intensity-Based Methods: Winner of the ACROBAT 2023 Challenge | Marek Wodzinski et.al. | 2404.13108 | null |
| 2024-01-05 | Partition-based Nonrigid Registration for 3D Face Model | Yuping Ye et.al. | 2401.02607 | null |
| 2023-03-20 | Deep Graph-based Spatial Consistency for Robust Non-rigid Point Cloud Registration | Zheng Qin et.al. | 2303.09950 | link |
| 2023-02-21 | Fast and Robust Non-Rigid Registration Using Accelerated Majorization-Minimization | Yuxin Yao et.al. | 2206.03410 | null |
| 2022-10-06 | Non-rigid Point Cloud Registration with Neural Deformation Pyramid | Yang Li et.al. | 2205.12796 | null |
| 2022-05-21 | Myocardial Segmentation of Late Gadolinium Enhanced MR Images by Propagation of Contours from Cine MR Images | Dong Wei et.al. | 2205.10595 | null |
| 2022-03-18 | A Survey of Non-Rigid 3D Registration | Bailin Deng et.al. | 2203.07858 | null |
| 2021-12-23 | Geodesic squared exponential kernel for non-rigid shape registration | Florent Jousse et.al. | 2112.11853 | null |
| 2021-04-27 | Deep Convolutional Neural Network for Non-rigid Image Registration | Eduard F. Durech et.al. | 2104.12034 | null |
| 2020-06-15 | Nonrigid registration using Gaussian processes and local likelihood estimation | Ashton Wiens et.al. | 2006.06864 | null |
| 2020-07-28 | Cortical surface registration using unsupervised learning | Jieyu Cheng et.al. | 2004.04617 | null |
| 2020-04-10 | Quasi-Newton Solver for Robust Non-Rigid Registration | Yuxin Yao et.al. | 2004.04322 | link |
| 2020-01-14 | A Comparative Study for Non-rigid Image Registration and Rigid Image Registration | Xiaoran Zhang et.al. | 2001.03831 | null |
| 2019-04-02 | Automatic Nonrigid Histological Image Registration with Adaptive Multistep Algorithm | Marek Wodzinski et.al. | 1904.00982 | null |
| 2019-04-07 | Symmetry-guided nonrigid registration: the case for distortion correction in multidimensional photoemission spectroscopy | Rui Patrick Xian et.al. | 1901.00312 | null |
| 2018-12-25 | A Survey on Non-rigid 3D Shape Analysis | Hamid Laga et.al. | 1812.10111 | null |
| 2019-06-20 | Robust Non-Rigid Registration with Reweighted Position and Transformation Sparsity | Kun Li et.al. | 1703.04861 | null |
| 2015-04-15 | A Multicomponent Approach to Nonrigid Registration of Diffusion Tensor Images | Mohammed Khader et.al. | 1504.01800 | null |
| 2014-03-27 | Optimized imaging using non-rigid registration | Benjamin Berkels et.al. | 1403.6774 | null |
| 2013-04-03 | Scale Selection of Adaptive Kernel Regression by Joint Saliency Map for Nonrigid Image Registration | Zhuangming Shen et.al. | 1303.0479 | null |
| 2013-04-15 | Local Structure Matching Driven by Joint-Saliency-Structure Adaptive Kernel Regression | Binjie Qin et.al. | 1302.0494 | null |
| 2011-04-22 | A Meshless Method for Variational Nonrigid 2-D Shape Registration | Wei Liu et.al. | 1104.4168 | null |

(<a href=#updated-on-20260404>back to top</a>)

| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-04-02 | The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level | Jeremy Herbst et.al. | 2604.02178 | null |
| 2026-04-02 | FlatAttention: Dataflow and Fabric Collectives Co-Optimization for Large Attention-Based Model Inference on Tile-Based Accelerators | Chi Zhang et.al. | 2604.02110 | null |
| 2026-04-02 | SURE: Synergistic Uncertainty-aware Reasoning for Multimodal Emotion Recognition in Conversations | Yiqiang Cai et.al. | 2604.01916 | null |
| 2026-04-02 | FourierMoE: Fourier Mixture-of-Experts Adaptation of Large Language Models | Juyong Jiang et.al. | 2604.01762 | null |
| 2026-04-02 | M3D-BFS: a Multi-stage Dynamic Fusion Strategy for Sample-Adaptive Multi-Modal Brain Network Analysis | Rui Dong et.al. | 2604.01667 | null |
| 2026-04-02 | Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models | Shuibai Zhang et.al. | 2604.01622 | null |
| 2026-04-02 | DWDP: Distributed Weight Data Parallelism for High-Performance LLM Inference on NVL72 | Wanqian Li et.al. | 2604.01621 | null |
| 2026-04-01 | Learning When to See and When to Feel: Adaptive Vision-Torque Fusion for Contact-Aware Manipulation | Jiuzhou Lei et.al. | 2604.01414 | null |
| 2026-04-01 | Sparse Spectral LoRA: Routed Experts for Medical VLMs | Omid Nejati Manzari et.al. | 2604.01310 | null |
| 2026-04-01 | Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning | Mohammad R. Abu Ayyash et.al. | 2604.01152 | null |
| 2026-04-02 | Asymptotically Optimal Sequential Testing with Heterogeneous LLMs | Guokai Li et.al. | 2604.01086 | null |
| 2026-04-01 | PHASOR: Anatomy- and Phase-Consistent Volumetric Diffusion for CT Virtual Contrast Enhancement | Zilong Li et.al. | 2604.01053 | null |
| 2026-04-01 | KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection | Abdullah Al Shafi et.al. | 2604.00878 | null |
| 2026-04-01 | Cost-Penalized Fitness in FMA-Orchestrated Mixture of Experts: Experimental Evidence for Molecular Memory in Domain Adaptation | Martin Jaraiz et.al. | 2604.00812 | null |
| 2026-04-01 | Routing-Free Mixture-of-Experts | Yilun Liu et.al. | 2604.00801 | null |
| 2026-04-01 | Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer | Dharma Teja Vooturi et.al. | 2604.00785 | null |
| 2026-04-01 | Toward Optimal Sampling Rate Selection and Unbiased Classification for Precise Animal Activity Recognition | Axiu Mao et.al. | 2604.00517 | null |
| 2026-04-01 | Self-Routing: Parameter-Free Expert Routing from Hidden States | Jama Hussein Mohamud et.al. | 2604.00421 | null |
| 2026-03-31 | From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters | Jinghan Yao et.al. | 2604.00317 | null |
| 2026-03-31 | Directly visualizing the energy level structure of quantum dot molecules | Heun Mo Yoo et.al. | 2604.00232 | null |
| 2026-03-31 | Short proofs in combinatorics and number theory | Boris Alexeev et.al. | 2603.29961 | null |
| 2026-03-31 | First energy scan measurement of $e^{+}e^{-}\to K^{+}K^{-}$ around the $ψ(2S)$ resonance | BESIII Collaboration et.al. | 2603.29854 | null |
| 2026-03-31 | Counterfactual Analysis of Brain Network Dynamics | Moo K. Chung et.al. | 2603.29843 | null |
| 2026-03-31 | Training-Free Dynamic Upcycling of Expert Language Models | Eros Fanì et.al. | 2603.29765 | null |
| 2026-03-31 | TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification | Qing He et.al. | 2603.29520 | null |
| 2026-03-31 | Aligning Multimodal Sequential Recommendations via Robust Direct Preference Optimization with Sparse MoE | Hejin Huang et.al. | 2603.29259 | null |
| 2026-03-31 | Route-Induced Density and Stability (RIDE): Controlled Intervention and Mechanism Analysis of Routing-Style Meta Prompts on LLM Internal States | Dianxing Zhang et.al. | 2603.29206 | null |
| 2026-03-31 | BiMoE: Brain-Inspired Experts for EEG-Dominant Affective State Recognition | Hongyu Zhu et.al. | 2603.29205 | null |
| 2026-03-30 | Rethinking Language Model Scaling under Transferable Hypersphere Optimization | Liliang Ren et.al. | 2603.28743 | null |
| 2026-03-30 | StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation | Yiran Shi et.al. | 2603.28565 | null |
| 2026-03-30 | Observation of $Λ^+_c\to nπ^+η$ and search for $Λ^+_c\to na_0(980)^+$ | BESIII Collaboration et.al. | 2603.28232 | null |
| 2026-03-30 | Graph Vector Field: A Unified Framework for Multimodal Health Risk Assessment from Heterogeneous Wearable and Environmental Data Streams | Silvano Coletti et.al. | 2603.28115 | null |
| 2026-03-30 | ExFusion: Efficient Transformer Training via Multi-Experts Fusion | Jiacheng Ruan et.al. | 2603.27965 | null |
| 2026-03-31 | MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation | Ruiyao Liu et.al. | 2603.27959 | null |
| 2026-03-29 | KAT-Coder-V2 Technical Report | Fengxiang Li et.al. | 2603.27703 | null |
| 2026-03-29 | LVRPO: Language-Visual Alignment with GRPO for Multimodal Understanding and Generation | Shentong Mo et.al. | 2603.27693 | null |
| 2026-03-29 | PRBench: End-to-end Paper Reproduction in Physics Research | Shi Qiu et.al. | 2603.27646 | null |
| 2026-03-29 | Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling | Songchen Ma et.al. | 2603.27624 | null |
| 2026-03-29 | Fully Spiking Neural Networks with Target Awareness for Energy-Efficient UAV Tracking | Pengzhi Zhong et.al. | 2603.27493 | null |
| 2026-03-29 | On Token’s Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models | Chongyang Zhao et.al. | 2603.27481 | null |
| 2026-03-28 | Unveiling Code Clones in the Eclipse IIoT Software Ecosystem | Zengyang Li et.al. | 2603.27308 | null |
| 2026-03-28 | Persistent Memory Through Triple-Loop Consolidation in a Non-Gradient Dissipative Cognitive Architecture | Jianwei Lou et.al. | 2603.27188 | null |
| 2026-03-28 | Routing Sensitivity Without Controllability: A Diagnostic Study of Fairness in MoE Language Models | Junhyeok Lee et.al. | 2603.27141 | null |
| 2026-03-27 | TAPS: Task Aware Proposal Distributions for Speculative Sampling | Mohamad Zbib et.al. | 2603.27027 | null |
| 2026-03-27 | Learning to Commit: Generating Organic Pull Requests via Online Repository Memory | Mo Li et.al. | 2603.26664 | null |
| 2026-03-27 | Sustainability Is Not Linear: Quantifying Performance, Energy, and Privacy Trade-offs in On-Device Intelligence | Eziyo Ehsani et.al. | 2603.26603 | null |
| 2026-03-26 | Can Small Models Reason About Legal Documents? A Comparative Study | Snehit Vaddi et.al. | 2603.25944 | null |
| 2026-03-26 | Narrowband searches for continuous gravitational waves from known pulsars in the first two parts of the fourth LIGO–Virgo–KAGRA observing run | The LIGO Scientific Collaboration et.al. | 2603.25938 | null |
| 2026-03-26 | AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer’s Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study | Wenlong Hou et.al. | 2603.25322 | null |
| 2026-03-26 | SliderQuant: Accurate Post-Training Quantization for LLMs | Shigeng Wang et.al. | 2603.25284 | null |
| 2026-03-26 | A Wireless World Model for AI-Native 6G Networks | Ziqi Chen et.al. | 2603.25216 | null |
| 2026-03-26 | MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation | Ranxu Zhang et.al. | 2603.25126 | null |
| 2026-03-26 | MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting | Huyen Ngoc Tran et.al. | 2603.25046 | null |
| 2026-03-26 | MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models | Dohwan Ko et.al. | 2603.24984 | null |
| 2026-03-26 | CROSS: A Mixture-of-Experts Reinforcement Learning Framework for Generalizable Large-Scale Traffic Signal Control | Xibei Chen et.al. | 2603.24930 | null |
| 2026-03-25 | OptiSAR-Net++: A Large-Scale Benchmark and Transformer-Free Framework for Cross-Domain Remote Sensing Visual Grounding | Xiaoyu Tang et.al. | 2603.24876 | null |
| 2026-03-25 | Enes Causal Discovery | Alexis Kafantaris et.al. | 2603.24436 | null |
| 2026-03-25 | Cross Section Measurements of $\bar{n}p \rightarrow K^{+}K^{-}π^{+}(π^{0})$ via Antineutrons Produced by $J/ψ\to p π^{-} \bar{n}$ Decays | BESIII Collaboration et.al. | 2603.24272 | null |
| 2026-03-25 | B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition | Nishit Poddar et.al. | 2603.24245 | null |
| 2026-03-25 | Sequence-aware Large Language Models for Explainable Recommendation | Gangyi Zhang et.al. | 2603.24136 | null |
| 2026-03-25 | PCHC: Enabling Preference Conditioned Humanoid Control via Multi-Objective Reinforcement Learning | Huanyu Li et.al. | 2603.24047 | null |
| 2026-03-25 | LGEST: Dynamic Spatial-Spectral Expert Routing for Hyperspectral Image Classification | Jiawen Wen et.al. | 2603.24045 | null |
| 2026-03-25 | MoE-Sieve: Routing-Guided LoRA for Efficient MoE Fine-Tuning | Andrea Manzoni et.al. | 2603.24044 | null |
| 2026-03-25 | SiftMoE: Similarity-Aware Energy-Efficient Expert Selection for Wireless Distributed MoE Inference | Qian Chen et.al. | 2603.23888 | null |
| 2026-03-24 | Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters | Nan Cui et.al. | 2603.23780 | null |
| 2026-03-24 | The Diminishing Returns of Early-Exit Decoding in Modern LLMs | Rui Wei et.al. | 2603.23701 | null |
| 2026-03-24 | VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs | Haoran Yuan et.al. | 2603.23481 | null |
| 2026-03-24 | Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning | Connor Mclaughlin et.al. | 2603.23436 | null |
| 2026-03-24 | Amplitude Analysis of the Isospin-Violating Decay $J/ψ\rightarrowγηπ^{0}$ | BESIII Collaboration et.al. | 2603.23081 | null |
| 2026-03-24 | IntentWeave: A Progressive Entry Ladder for Multi-Surface Browser Agents in Cloud Portals | Wanying Mo et.al. | 2603.22917 | null |
| 2026-03-24 | Search for the radiative decays $D^0\to γ\bar K_1(1270)^0$ and $D^+\to γK_1(1270)^+$ | BESIII Collaboration et.al. | 2603.22804 | null |
| 2026-03-24 | KALAVAI: Predicting When Independent Specialist Fusion Works – A Quantitative Model for Post-Hoc Cooperative LLM Training | Ramchand Kumaresan et.al. | 2603.22755 | null |
| 2026-03-24 | Why Database Manuals Are Not Enough: Efficient and Reliable Configuration Tuning for DBMSs via Code-Driven LLM Agents | Xinyi Zhang et.al. | 2603.22708 | null |
| 2026-03-23 | Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning | Jihyun Janice Ahn et.al. | 2603.22619 | null |
| 2026-03-23 | FullCircle: Effortless 3D Reconstruction from Casual 360$^\circ$ Captures | Yalda Foroutan et.al. | 2603.22572 | null |
| 2026-03-23 | 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing | Haoyu Zhen et.al. | 2603.22279 | null |
| 2026-03-23 | A bending in the size-mass relation of star-forming galaxies across $0.5 < z < 6.0$ at a critical stellar mass of $10^{10}M_\odot$ revealed by JWST | Longyue Chen et.al. | 2603.22239 | null |
| 2026-03-23 | Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning | Daniel Shao et.al. | 2603.22198 | null |
| 2026-03-23 | ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive Text-to-Image Retrieval | Zhuocheng Zhang et.al. | 2603.21886 | null |
| 2026-03-23 | Holistic Scaling Laws for Optimal Mixture-of-Experts Architecture Optimization | Weilin Wan et.al. | 2603.21862 | null |
| 2026-03-23 | DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers | Tianyu Cao et.al. | 2603.21608 | null |
| 2026-03-22 | Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity | Zihan Fang et.al. | 2603.21276 | null |
| 2026-03-22 | QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression | Zhongyang Li et.al. | 2603.21232 | null |
| 2026-03-22 | MI-DPG: Decomposable Parameter Generation Network Based on Mutual Information for Multi-Scenario Recommendation | Wenzhuo Cheng et.al. | 2603.21209 | null |
| 2026-03-22 | Diffusion-based Probabilistic Air Quality Forecasting with Mechanistic Insight | Ao Ding et.al. | 2603.21131 | null |
| 2026-03-22 | Mixture of Chapters: Scaling Learnt Memory in Transformers | Tasmay Pankaj Tibrewal et.al. | 2603.21096 | null |
| 2026-03-22 | CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models | Nan Zhou et.al. | 2603.21077 | null |
| 2026-03-22 | LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning | Jianing Wang et.al. | 2603.21065 | null |
| 2026-03-21 | Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models | Yifan Yang et.al. | 2603.20697 | null |
| 2026-03-21 | CFNN: Continued Fraction Neural Network | Chao Wang et.al. | 2603.20634 | null |
| 2026-03-21 | A 4R-supported circular product-service system for luxury branded events | Ke Ma et.al. | 2603.20613 | null |
| 2026-03-20 | AE-LLM: Adaptive Efficiency Optimization for Large Language Models | Kaito Tanaka et.al. | 2603.20492 | null |
| 2026-03-20 | Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation | Marcus Armstrong et.al. | 2603.20406 | null |
| 2026-03-20 | Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech? | Lokesh Kumar et.al. | 2603.19831 | null |
| 2026-03-20 | Making Video Models Adhere to User Intent with Minor Adjustments | Daniel Ajisafe et.al. | 2603.19672 | null |
| 2026-03-20 | Structured Prompting for Arabic Essay Proficiency: A Trait-Centric Evaluation Approach | Salim Al Mandhari et.al. | 2603.19668 | null |
| 2026-03-20 | CS-MUNet: A Channel-Spatial Dual-Stream Mamba Network for Multi-Organ Segmentation | Yuyang Zheng et.al. | 2603.19659 | null |
| 2026-03-20 | UniBioTransfer: A Unified Framework for Multiple Biometrics Transfer | Caiyi Sun et.al. | 2603.19637 | null |
| 2026-03-19 | Scalable Prompt Routing via Fine-Grained Latent Task Discovery | Yunyi Zhang et.al. | 2603.19415 | null |
| 2026-03-22 | Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation | Zhuolin Yang et.al. | 2603.19220 | null |
| 2026-03-19 | DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge | Yuegui Huang et.al. | 2603.19172 | null |
| 2026-03-19 | ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning | Weihang Huang et.al. | 2603.19029 | null |
| 2026-03-19 | GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants | The LIGO Scientific Collaboration et.al. | 2603.19021 | null |
| 2026-03-19 | GWTC-4.0: Tests of General Relativity. II. Parameterized Tests | The LIGO Scientific Collaboration et.al. | 2603.19020 | null |
| 2026-03-19 | GWTC-4.0: Tests of General Relativity. I. Overview and General Tests | The LIGO Scientific Collaboration et.al. | 2603.19019 | null |
| 2026-03-19 | DriftGuard: Mitigating Asynchronous Data Drift in Federated Learning | Yizhou Han et.al. | 2603.18872 | null |
| 2026-03-19 | Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision–Language–Motion Diffusion Architecture | Fuze Sun et.al. | 2603.18771 | null |
| 2026-03-19 | Observation of $D_s^+ \to a_0(980)^+f_0(500)$ in the Amplitude Analysis of $D_s^+ \to π^+ π^0 π^0 η$ | BESIII Collaboration et.al. | 2603.18521 | null |
| 2026-03-19 | AIMER: Calibration-Free Task-Agnostic MoE Pruning | Zongfang Liu et.al. | 2603.18492 | null |
| 2026-03-19 | AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba | Yan Li et.al. | 2603.18462 | null |
| 2026-03-19 | Spatially Indirect Exciton Condensation in Two-Dimensional Strongly Correlated Semimetals | Yao Zeng et.al. | 2603.18445 | null |
| 2026-03-18 | Path-Constrained Mixture-of-Experts | Zijin Gu et.al. | 2603.18297 | null |
| 2026-03-18 | CORE: Robust Out-of-Distribution Detection via Confidence and Orthogonal Residual Scoring | Jin Mo Yang et.al. | 2603.18290 | null |
| 2026-03-18 | Resonance-enhanced integrated acousto-optic beam steering | Yue Yu et.al. | 2603.18191 | null |
| 2026-03-18 | Understanding Task Aggregation for Generalizable Ultrasound Foundation Models | Fangyijie Wang et.al. | 2603.18123 | null |
| 2026-03-18 | DebugLM: Learning Traceable Training Data Provenance for LLMs | Wenjie Jacky Mo et.al. | 2603.17884 | null |
| 2026-03-18 | The 1/W Law: An Analytical Study of Context-Length Routing Topology and GPU Generation Gains for LLM Inference Energy Efficiency | Huamin Chen et.al. | 2603.17280 | null |
| 2026-03-17 | Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency | Lucas Bandarkar et.al. | 2603.17102 | null |
| 2026-03-17 | Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection | Haitian Wang et.al. | 2603.17069 | null |
| 2026-03-17 | SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding | D. Darankoum et.al. | 2603.16739 | null |
| 2026-03-17 | HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture | Aojie Yuan et.al. | 2603.16679 | null |
| 2026-03-19 | Mixture of Style Experts for Diverse Image Stylization | Shihao Zhu et.al. | 2603.16649 | null |
| 2026-03-17 | Tarab: A Multi-Dialect Corpus of Arabic Lyrics and Poetry | Mo El-Haj et.al. | 2603.16601 | null |
| 2026-03-17 | Visual Distraction Undermines Moral Reasoning in Vision-Language Models | Xinyi Yang et.al. | 2603.16445 | null |
| 2026-03-18 | EngGPT2: Sovereign, Efficient and Open Intelligence | G. Ciarfaglia et.al. | 2603.16430 | null |
| 2026-03-17 | PlotTwist: A Creative Plot Generation Framework with Small Language Models | Abhinav Thorat et.al. | 2603.16410 | null |
| 2026-03-17 | DynamicGate MLP Conditional Computation via Learned Structural Dropout and Input Dependent Gating for Functional Plasticity | Yong Il Choi et.al. | 2603.16367 | null |
| 2026-03-17 | Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits | Jia Qing Yap et.al. | 2603.16335 | null |
| 2026-03-17 | AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection | Hongwei Lin et.al. | 2603.16261 | null |
| 2026-03-17 | Accelerating Approximate Analytical Join Queries over Unstructured Data with Statistical Guarantees | Yuxuan Zhu et.al. | 2603.16153 | null |
| 2026-03-16 | Confidently Wrong: Why Ignoring Binaries Biases IMF Inference at Large Sample Sizes | Anna L. Rosen et.al. | 2603.15779 | null |
| 2026-03-16 | Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning | Ye Wang et.al. | 2603.15708 | null |
| 2026-03-16 | Bridging Local and Global Knowledge: Cascaded Mixture-of-Experts Learning for Near-Shortest Path Routing | Yung-Fu Chen et.al. | 2603.15541 | null |
| 2026-03-16 | Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis | Penny Chong et.al. | 2603.15483 | null |
| 2026-03-16 | A Closer Look into LLMs for Table Understanding | Jia Wang et.al. | 2603.15402 | null |
| 2026-03-16 | MoE-ACT: Scaling Multi-Task Bimanual Manipulation with Sparse Language-Conditioned Mixture-of-Experts Transformers | Kangjun Guo et.al. | 2603.15265 | null |
| 2026-03-17 | Tracking the Discriminative Axis: Dual Prototypes for Test-Time OOD Detection Under Covariate Shift | Wooseok Lee et.al. | 2603.15213 | null |
| 2026-03-16 | ForceVLA2: Unleashing Hybrid Force-Position Control with Force Awareness for Contact-Rich Manipulation | Yang Li et.al. | 2603.15169 | null |
| 2026-03-16 | M2IR: Proactive All-in-One Image Restoration via Mamba-style Modulation and Mixture-of-Experts | Shiwei Wang et.al. | 2603.14816 | null |
| 2026-03-16 | Genetic Algorithms in Regression | Mo Li et.al. | 2603.14801 | null |
| 2026-03-16 | Universe Routing: Why Self-Evolving Agents Need Epistemic Control | Zhaohui Geoffrey Wang et.al. | 2603.14799 | null |
| 2026-03-15 | TopoCL: Topological Contrastive Learning for Medical Imaging | Guangyu Meng et.al. | 2603.14647 | null |
| 2026-03-15 | A measurement of gas rotation in galaxy groups via the kinetic Sunyaev-Zeldovich effect | Tianyi Yang et.al. | 2603.14494 | null |
| 2026-03-15 | Towards One-for-All Anomaly Detection for Tabular Data | Shiyuan Li et.al. | 2603.14407 | null |
| 2026-03-15 | WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems | Yuchen Wang et.al. | 2603.14392 | null |
| 2026-03-15 | M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling | Mayank Mishra et.al. | 2603.14360 | null |
| 2026-03-15 | A Physically-Grounded Attack and Adaptive Defense Framework for Real-World Low-Light Image Enhancement | Tongshun Zhang et.al. | 2603.14304 | null |
| 2026-03-15 | All-sky Searches for Continuous Gravitational Waves from Isolated Neutron Stars in the Data from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run | The LIGO Scientific Collaboration et.al. | 2603.14168 | null |
| 2026-03-14 | PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting | Xinyu Xiao et.al. | 2603.13818 | null |
| 2026-03-14 | Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control | Grayson Lee et.al. | 2603.13733 | null |
| 2026-03-14 | Sparse-Dense Mixture of Experts Adapter for Multi-Modal Tracking | Yabin Zhu et.al. | 2603.13719 | null |
| 2026-03-13 | NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL | Amos Goldman et.al. | 2603.13606 | null |
| 2026-03-13 | MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models | Md. Abdul Awal et.al. | 2603.13213 | null |
| 2026-03-13 | Reference-Free Image Quality Assessment for Virtual Try-On via Human Feedback | Yuki Hirakawa et.al. | 2603.13057 | null |
| 2026-03-13 | Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach | Elena Ryumina et.al. | 2603.13056 | null |
| 2026-03-13 | Multimodal Protein Language Models for Enzyme Kinetic Parameters: From Substrate Recognition to Conformational Adaptation | Fei Wang et.al. | 2603.12845 | null |
| 2026-03-13 | Serving Hybrid LLM Loads with SLO Guarantees Using CPU-GPU Attention Piggybacking | Zizhao Mo et.al. | 2603.12831 | null |
| 2026-03-13 | LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing | Jiawei Hao et.al. | 2603.12645 | null |
| 2026-03-13 | CarPLAN: Context-Adaptive and Robust Planning with Dynamic Scene Awareness for Autonomous Driving | Junyong Yun et.al. | 2603.12607 | null |
| 2026-03-13 | Spectral Dataset of Stripped-Envelope Supernovae from the Tsinghua Supernova Group | Danfeng Xiang et.al. | 2603.12604 | null |
| 2026-03-13 | Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation | Jia-Chen Zhang et.al. | 2603.12577 | null |
| 2026-03-13 | Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation | Alaa Dalaq et.al. | 2603.12538 | null |
| 2026-03-12 | TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition | Prabhu Vellaisamy et.al. | 2603.12465 | null |
| 2026-03-12 | NeuroLoRA: Context-Aware Neuromodulation for Parameter-Efficient Multi-Task Adaptation | Yuxin Yang et.al. | 2603.12378 | null |
| 2026-03-12 | A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition | Jiajun Sun et.al. | 2603.12221 | null |
| 2026-03-12 | CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation | Ziqi Ye et.al. | 2603.12008 | null |
| 2026-03-12 | AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization | Qiyang Li et.al. | 2603.11873 | null |
| 2026-03-12 | Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing | Hanchi Sun et.al. | 2603.11535 | null |
| 2026-03-11 | Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers | Mynampati Sri Ranganadha Avinash et.al. | 2603.11114 | null |
| 2026-03-11 | Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High dimensions | Kangke Cheng et.al. | 2603.10721 | null |
| 2026-03-11 | UniStitch: Unifying Semantic and Geometric Features for Image Stitching | Yuan Mei et.al. | 2603.10568 | null |
| 2026-03-11 | Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design | Junzhuo Li et.al. | 2603.10379 | null |
| 2026-03-12 | The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance | Jesse Yu et.al. | 2603.10323 | null |
| 2026-03-10 | Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions | Mingyang Song et.al. | 2603.09938 | null |
| 2026-03-10 | Quantifying the Necessity of Chain of Thought through Opaque Serial Depth | Jonah Brown-Cohen et.al. | 2603.09786 | null |
| 2026-03-10 | MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants | Zuhao Zhang et.al. | 2603.09652 | null |
| 2026-03-10 | MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning | Xiang Yuan et.al. | 2603.09478 | null |
| 2026-03-12 | Multi-tasking through quantum annealing | Jargalsaikhan Artag et.al. | 2603.09468 | null |
| 2026-03-10 | Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers | Albus Yizhuo Li et.al. | 2603.09453 | null |
| 2026-03-10 | Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking | Shilei Wang et.al. | 2603.09287 | null |
| 2026-03-10 | Acoustic and Semantic Modeling of Emotion in Spoken Language | Soumya Dutta et.al. | 2603.09212 | null |
| 2026-03-10 | GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models | Md Selim Sarowar et.al. | 2603.09079 | null |
| 2026-03-09 | The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference | Vignesh Adhinarayanan et.al. | 2603.08960 | null |
| 2026-03-09 | ConFu: Contemplate the Future for Better Speculative Sampling | Zongyue Qin et.al. | 2603.08899 | null |
| 2026-03-09 | Microwave response of electrically driven spins in a three-qubit quantum processor | Tanner M. Janda et.al. | 2603.08577 | null |
| 2026-03-09 | LAR-MoE: Latent-Aligned Routing for Mixture of Experts in Robotic Imitation Learning | Ariel Rodriguez et.al. | 2603.08476 | null |
| 2026-03-09 | Amplitude Analysis of Singly Cabibbo-Suppressed Decay $Λ^{+}_{c}\to p K^{+} K^{-}$ | BESIII Collaboration et.al. | 2603.08469 | null |
| 2026-03-09 | IronEngine: Towards General AI Assistant | Xi Mo et.al. | 2603.08425 | null |
| 2026-03-09 | Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows | Shentong Mo et.al. | 2603.08126 | null |
| 2026-03-09 | An improved measurement of $η^\prime\rightarrow e^{+}e^{-}ω$ | BESIII Collaboration et.al. | 2603.08120 | null |
| 2026-03-09 | SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving | Zihan You et.al. | 2603.08113 | null |
| 2026-03-09 | Deterministic Differentiable Structured Pruning for Large Language Models | Weiyu Huang et.al. | 2603.08065 | null |
| 2026-03-09 | Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization | Jingwei Li et.al. | 2603.08022 | null |
| 2026-03-09 | Scaling Machine Learning Interatomic Potentials with Mixtures of Experts | Yuzhi Liu et.al. | 2603.07977 | null |
| 2026-03-09 | Structural Design and Performance Analysis of Laser Transmitting Telescope for Space Gravitational Wave Detection | Long Yongtao et.al. | 2603.07967 | null |
| 2026-03-09 | SGG-R$^{\rm 3}$: From Next-Token Prediction to End-to-End Unbiased Scene Graph Generation | Jiaye Feng et.al. | 2603.07961 | null |
| 2026-03-09 | SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans | Hansi Zeng et.al. | 2603.07853 | null |
| 2026-03-08 | Scalable Training of Mixture-of-Experts Models with Megatron Core | Zijie Yan et.al. | 2603.07685 | null |
| 2026-03-08 | AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots | Likui Zhang et.al. | 2603.07648 | null |
| 2026-03-08 | Mixed Effects Mixture of Experts: Modeling Double Heterogeneous Trajectories | Xinkai Yue et.al. | 2603.07479 | null |
| 2026-03-08 | UnSCAR: Universal, Scalable, Controllable, and Adaptable Image Restoration | Debabrata Mandal et.al. | 2603.07406 | null |
| 2026-03-07 | Scheduling Parallel Optical Circuit Switches for AI Training | Kevin Liang et.al. | 2603.07373 | null |
| 2026-03-07 | Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures | Shuqing Luo et.al. | 2603.07006 | null |
| 2026-03-06 | Swimba: Switch Mamba Model Scales State Space Models | Zhixu Du et.al. | 2603.06938 | null |
| 2026-03-06 | PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection | Zhengjian Kang et.al. | 2603.06917 | null |
| 2026-03-06 | PICS: Pairwise Image Compositing with Spatial Interactions | Hang Zhou et.al. | 2603.06873 | null |
| 2026-03-06 | ButterflyViT: 354$\times$ Expert Compression for Edge Vision Transformers | Aryan Karmore et.al. | 2603.06746 | null |
| 2026-03-06 | RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering | Gaia A. Bertolino et.al. | 2603.06542 | null |
| 2026-03-06 | A Mixture-of-Experts Framework for Practical Hybrid-Quantum Models in Credit Card Fraud Detection | Rodrigo Chaves et.al. | 2603.06473 | null |
| 2026-03-06 | MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis | Dongqing Xie et.al. | 2603.06378 | null |
| 2026-03-06 | MoEless: Efficient MoE LLM Serving via Serverless Computing | Hanfei Yu et.al. | 2603.06350 | null |
| 2026-03-06 | WMoE-CLIP: Wavelet-Enhanced Mixture-of-Experts Prompt Learning for Zero-Shot Anomaly Detection | Peng Chen et.al. | 2603.06313 | null |
| 2026-03-06 | GazeMoE: Perception of Gaze Target with Mixture-of-Experts | Zhuangzhuang Dai et.al. | 2603.06256 | null |
| 2026-03-06 | EvoESAP: Non-Uniform Expert Pruning for Sparse MoE | Zongfang Liu et.al. | 2603.06003 | link |
| 2026-03-06 | MoE Lens – An Expert Is All You Need | Marmik Chaudhari et.al. | 2603.05806 | null |
| 2026-03-06 | Sparse Crosscoders for diffing MoEs and Dense models | Marmik Chaudhari et.al. | 2603.05805 | null |
| 2026-03-05 | Change Point Detection for Cell Populations Measured via Flow Cytometry | Yik Lun Kei et.al. | 2603.05700 | null |
| 2026-03-05 | FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation | Hung Nguyen Huy et.al. | 2603.05690 | null |
| 2026-03-05 | Multi-channel joint analysis of the exotic charmonium-like state $T_{c\bar{c}}(4020)$ | BESIII Collaboration et.al. | 2603.05564 | null |
| 2026-03-05 | VietJobs: A Vietnamese Job Advertisement Dataset | Hieu Pham Dinh et.al. | 2603.05262 | null |
| 2026-03-05 | NeuronMoE: Neuron-Guided Mixture-of-Experts for Efficient Multilingual LLM Extension | Rongzhi Li et.al. | 2603.05046 | null |
| 2026-03-05 | Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation | Yilong Chen et.al. | 2603.04971 | null |
| 2026-03-05 | Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling | Yong Liu et.al. | 2603.04791 | null |
| 2026-03-05 | TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings | Yebo Wu et.al. | 2603.04772 | null |
| 2026-03-04 | ECG-MoE: Mixture-of-Expert Electrocardiogram Foundation Model | Yuhao Xu et.al. | 2603.04589 | null |
| 2026-03-04 | Augmenting representations with scientific papers | Nicolò Oreste Pinciroli Vago et.al. | 2603.04516 | null |
| 2026-03-04 | RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation | Yixin Chen et.al. | 2603.04348 | null |
| 2026-03-04 | CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation | Jinfeng Xu et.al. | 2603.04320 | null |
| 2026-03-04 | Precise measurement of the form factors in $D^0\rightarrow K^{*}(892)^-\ell^+ν_{\ell}$ and observation of $D^0\rightarrow K_2^{*}(1430)^-\ell^+ν_{\ell}$ | BESIII Collaboration et.al. | 2603.04136 | null |
| 2026-03-04 | UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization | Qianfeng Yang et.al. | 2603.03967 | null |
| 2026-03-04 | Glass Segmentation with Fusion of Learned and General Visual Features | Risto Ojala et.al. | 2603.03718 | null |
| 2026-03-04 | Plasmonic polaron in self-intercalated 1T-TiS2 | Byoung Ki Choi et.al. | 2603.03663 | null |
| 2026-03-03 | Modeling Cross-vision Synergy for Unified Large Vision Model | Shengqiong Wu et.al. | 2603.03564 | null |
| 2026-03-03 | Beyond Language Modeling: An Exploration of Multimodal Pretraining | Shengbang Tong et.al. | 2603.03276 | null |
| 2026-03-03 | Search for a massless particle beyond the Standard Model in the $Ξ^0\toΛ+ \text{invisible}$ decay | BESIII Collaboration et.al. | 2603.03199 | null |
| 2026-03-04 | MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection | Jun Yeong Park et.al. | 2603.03101 | null |
| 2026-03-03 | CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots | Shihao Ma et.al. | 2603.03067 | null |
| 2026-03-03 | EduVQA: Benchmarking AI-Generated Video Quality Assessment for Education | Baoliang Chen et.al. | 2603.03066 | null |
| 2026-03-03 | Practical FP4 Training for Large-Scale MoE Models on Hopper GPUs | Wuyue Zhang et.al. | 2603.02731 | null |
| 2026-03-03 | TenExp: Mixture-of-Experts-Based Tensor Decomposition Structure Search Framework | Ting-Wei Zhou et.al. | 2603.02720 | null |
| 2026-03-03 | MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration | Lingshun Kong et.al. | 2603.02710 | null |
| 2026-03-03 | Addressing Missing and Noisy Modalities in One Solution: Unified Modality-Quality Framework for Low-quality Multimodal Data | Sijie Mai et.al. | 2603.02695 | null |
| 2026-03-03 | Robust Heterogeneous Analog-Digital Computing for Mixture-of-Experts Models with Theoretical Generalization Guarantees | Mohammed Nowaz Rabbani Chowdhury et.al. | 2603.02633 | null |
| 2026-03-02 | Search for the charmonium weak decay $ψ(2S)\to D_s^-π^+ + c.c.$ and $ψ(2S)\to D_s^-ρ^+ + c.c.$ | BESIII Collaboration et.al. | 2603.01777 | null |
| 2026-03-02 | DynaMoE: Dynamic Token-Level Expert Activation with Layer-Wise Adaptive Capacity for Mixture-of-Experts Neural Networks | Gökdeniz Gülmez et.al. | 2603.01697 | null |
| 2026-03-02 | PathMoE: Interpretable Multimodal Interaction Experts for Pediatric Brain Tumor Classification | Jian Yu et.al. | 2603.01547 | null |
| 2026-03-02 | Multimodal Mixture-of-Experts with Retrieval Augmentation for Protein Active Site Identification | Jiayang Wu et.al. | 2603.01511 | null |
| 2026-03-02 | DOCFORGE-BENCH: A Comprehensive Benchmark for Document Forgery Detection and Analysis | Zengqi Zhao et.al. | 2603.01433 | null |
| 2026-03-03 | UETrack: A Unified and Efficient Framework for Single Object Tracking | Ben Kang et.al. | 2603.01412 | null |
| 2026-03-02 | Fed-GAME: Personalized Federated Learning with Graph Attention Mixture-of-Experts For Time-Series Forecasting | Yi Li et.al. | 2603.01363 | null |
| 2026-03-01 | Truth as a Trajectory: What Internal Representations Reveal About Large Language Model Reasoning | Hamed Damirchi et.al. | 2603.01326 | null |
| 2026-03-01 | Fast Confidence-Aware Human Prediction via Hardware-accelerated Bayesian Inference for Safe Robot Navigation | Michael Lu et.al. | 2603.01122 | null |
| 2026-03-01 | TriMoE: Augmenting GPU with AMX-Enabled CPU and DIMM-NDP for High-Throughput MoE Inference via Offloading | Yudong Pan et.al. | 2603.01058 | null |
| 2026-03-01 | Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving | Xubo Zhu et.al. | 2603.01007 | null |
| 2026-02-28 | MME: Mixture of Mesh Experts with Random Walk Transformer Gating | Amir Belder et.al. | 2603.00828 | null |
| 2026-02-28 | First Amplitude Analysis of $D^0\rightarrow K^-π^0e^+ν_e$ and Observation of $D^0\rightarrow K^*_2(1430)^-e^+ν_e$ | BESIII Collaboration et.al. | 2603.00743 | null |
| 2026-02-28 | K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control | Zhe Wu et.al. | 2603.00676 | null |
| 2026-02-28 | Precise Measurement and Control of Radon Progeny on Detector Surfaces | C. B. Z. Luo et.al. | 2603.00647 | null |
| 2026-02-28 | CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging | Jie Cao et.al. | 2603.00573 | null |
| 2026-02-27 | CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning | Yuxuan Liu et.al. | 2602.24142 | null |
| 2026-02-27 | Precision Studies and Searches for CP Asymmetries in the Inclusive Decay $Λ_{c}^{+}\to ΛX$ | BESIII Collaboration et.al. | 2602.24089 | null |
| 2026-02-27 | Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization | Chenwei Jia et.al. | 2602.24059 | null |
| 2026-02-27 | Measurement of Born Cross Sections for $e^+e^-\toΣ^-\barΣ^+$ at $\sqrt{s}=3.51-4.95$ GeV and Observation of $ψ(3770)\toΣ^-\barΣ^+$ | BESIII Collaboration et.al. | 2602.23835 | null |
| 2026-02-27 | ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation | Jiangyuan Wang et.al. | 2602.23716 | null |
| 2026-02-26 | Brain-OF: An Omnifunctional Foundation Model for fMRI, EEG and MEG | Hanning Guo et.al. | 2602.23410 | null |
| 2026-02-26 | A Mixture-of-Experts Model for Multimodal Emotion Recognition in Conversations | Soumya Dutta et.al. | 2602.23300 | null |
| 2026-02-26 | Learning Physical Operators using Neural Operators | Vignesh Gopakumar et.al. | 2602.23113 | null |
| 2026-02-26 | Residual Koopman Spectral Profiling for Predicting and Preventing Transformer Training Instability | Bum Jun Kim et.al. | 2602.22988 | null |
| 2026-02-26 | pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation | Shentong Mo et.al. | 2602.22938 | null |
| 2026-02-26 | MEDNA-DFM: A Dual-View FiLM-MoE Model for Explainable DNA Methylation Prediction | Yi He et.al. | 2602.22850 | null |
| 2026-02-26 | DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation | Hao Zheng et.al. | 2602.22839 | null |
| 2026-02-26 | Productivity and Collaboration in Hybrid Agile Teams: An Interview Study | Elisabeth Mo et.al. | 2602.22835 | null |
| 2026-02-26 | Measurements of branching fractions of $Λ_{c}^{+}\toΣ^{0}K_{S}^{0}π^{+}$ and $Λ_{c}^{+}\toΣ^{0}K_{S}^{0}K^{+}$ | BESIII Collaboration et.al. | 2602.22754 | null |
| 2026-02-26 | IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation | Yanpei Guo et.al. | 2602.22700 | null |
| 2026-02-26 | Switch-Hurdle: A MoE Encoder with AR Hurdle Decoder for Intermittent Demand Forecasting | Fabian Muşat et.al. | 2602.22685 | null |
| 2026-02-26 | Accelerating LLM Pre-Training through Flat-Direction Dynamics Enhancement | Shuchen Zhu et.al. | 2602.22681 | null |
| 2026-02-26 | Predictive variational inference for flexible regression models | Lucas Kock et.al. | 2602.22582 | null |
| 2026-02-26 | Towards Dynamic Dense Retrieval with Routing Strategy | Zhan Su et.al. | 2602.22547 | null |
| 2026-02-25 | NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training | Dengdi Sun et.al. | 2602.22059 | null |
| 2026-02-25 | Excitation: Momentum For Experts | Sagi Shaier et.al. | 2602.21798 | null |
| 2026-02-25 | Learning from Yesterday’s Error: An Efficient Online Learning Method for Traffic Demand Prediction | Xiannan Huang et.al. | 2602.21757 | null |
| 2026-02-25 | TiMi: Empower Time Series Transformers with Multimodal Mixture of Experts | Jiafeng Lin et.al. | 2602.21693 | null |
| 2026-02-25 | Multi-Layer Scheduling for MoE-Based LLM Reasoning | Yifan Sun et.al. | 2602.21626 | null |
| 2026-02-24 | A Path to an All-Sky Survey with Roman | Jiwon Jesse Han et.al. | 2602.21280 | null |
| 2026-02-24 | On infinite sets with no $3$ on a line | Moe Putterman et.al. | 2602.21275 | null |
| 2026-02-24 | ReviveMoE: Fast Recovery for Hardware Failures in Large-Scale MoE LLM Inference Deployments | Haley Li et.al. | 2602.21140 | null |
| 2026-02-24 | MUSE: Harnessing Precise and Diverse Semantics for Few-Shot Whole Slide Image Classification | Jiahao Xu et.al. | 2602.20873 | null |
| 2026-02-25 | GeCo-SRT: Geometry-aware Continual Adaptation for Robotic Cross-Task Sim-to-Real Transfer | Wenbo Yu et.al. | 2602.20871 | null |
| 2026-02-24 | Multi-time Loewner energy: rate function for large deviation | Mo Chen et.al. | 2602.20642 | null |
| 2026-02-24 | Precise Measurement of Matter-Antimatter Asymmetry with Entangled Hyperon Antihyperon Pairs | BESIII Collaboration et.al. | 2602.20524 | null |
| 2026-02-24 | Search for Light-Mass Fractionally Charged Particles in Space with DAMPE Experiment | F. Alemanno et.al. | 2602.20519 | null |
| 2026-02-24 | Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA | Nuocheng Yang et.al. | 2602.20492 | null |
| 2026-02-23 | Learning Discriminative and Generalizable Anomaly Detector for Dynamic Graph with Limited Supervision | Yuxing Tian et.al. | 2602.20019 | null |
| 2026-02-23 | Counterfactual Understanding via Retrieval-aware Multimodal Modeling for Time-to-Event Survival Prediction | Ha-Anh Hoang Nguyen et.al. | 2602.19987 | link |
| 2026-02-23 | ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting | Yuxing Tian et.al. | 2602.19969 | null |
| 2026-02-23 | A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs | Zijie Liu et.al. | 2602.19938 | null |
| 2026-02-23 | Towards Dexterous Embodied Manipulation via Deep Multi-Sensory Fusion and Sparse Expert Scaling | Yirui Sun et.al. | 2602.19764 | null |
| 2026-02-23 | Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis | Junhyeok Choi et.al. | 2602.19756 | null |
| 2026-02-23 | RAID: Retrieval-Augmented Anomaly Detection | Mingxiu Cai et.al. | 2602.19611 | null |
| 2026-02-23 | EMS-FL: Federated Tuning of Mixture-of-Experts in Satellite-Terrestrial Networks via Expert-Driven Model Splitting | Angzi Xu et.al. | 2602.19485 | null |
| 2026-02-22 | RegionRoute: Regional Style Transfer with Diffusion Model | Bowen Chen et.al. | 2602.19254 | null |
| 2026-02-22 | Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts | Toshihide Ubukata et.al. | 2602.19244 | null |
| 2026-02-22 | SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation | Yujie Lu et.al. | 2602.19213 | null |
| 2026-02-22 | JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation | Kai Liu et.al. | 2602.19163 | null |
| 2026-02-22 | K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model | Shiyi Cao et.al. | 2602.19128 | null |
| 2026-02-22 | Routing-Aware Explanations for Mixture of Experts Graph Models in Malware Detection | Hossein Shokouhinejad et.al. | 2602.19025 | null |
| 2026-02-21 | NeuroWise: A Multi-Agent LLM “Glass-Box” System for Practicing Double-Empathy Communication with Autistic Partners | Albert Tang et.al. | 2602.18962 | null |
| 2026-02-21 | Give Users the Wheel: Towards Promptable Recommendation Paradigm | Fuyuan Lyu et.al. | 2602.18929 | null |
| 2026-02-21 | Diverse properties of electron Forbush decreases revealed by the Dark Matter Particle Explorer | F. Alemanno et.al. | 2602.18743 | null |
| 2026-02-21 | Comprehensive measurement of $η^\prime$ photoproduction off the proton at $E_γ< 2.4$ $\mathrm{GeV}$ | N. Muramatsu et.al. | 2602.18675 | null |
| 2026-02-20 | Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory | Vatsal Agarwal et.al. | 2602.18434 | null |
| 2026-02-20 | RamanSeg: Interpretability-driven Deep Learning on Raman Spectra for Cancer Diagnosis | Chris Tomy et.al. | 2602.18119 | null |
| 2026-02-20 | DeepSVU: Towards In-depth Security-oriented Video Understanding via Unified Physical-world Regularized MoE | Yujie Jin et.al. | 2602.18019 | null |
| 2026-02-19 | Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds | Ibne Farabi Shihab et.al. | 2602.17798 | null |
| 2026-02-19 | Phase-Aware Mixture of Experts for Agentic Reinforcement Learning | Shengtian Yang et.al. | 2602.17038 | null |
| 2026-02-19 | Arcee Trinity Large Technical Report | Varun Singh et.al. | 2602.17004 | null |
| 2026-02-19 | Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation | Yan Wang et.al. | 2602.16990 | null |
| 2026-02-18 | Claim Automation using Large Language Model | Zhengda Mo et.al. | 2602.16836 | null |
| 2026-02-18 | Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning | Zifan Wang et.al. | 2602.16796 | null |
| 2026-02-18 | Geometric Neural Operators via Lie Group-Constrained Latent Dynamics | Jiaquan Zhang et.al. | 2602.16209 | null |
| 2026-02-18 | OmniCT: Towards a Unified Slice-Volume LVLM for Comprehensive CT Analysis | Tianwei Lin et.al. | 2602.16110 | null |
| 2026-02-18 | Federated Graph AGI for Cross-Border Insider Threat Intelligence in Government Financial Schemes | Srikumar Nayak et.al. | 2602.16109 | null |
| 2026-02-17 | MoE-Spec: Expert Budgeting for Efficient Speculative Decoding | Bradley McDanel et.al. | 2602.16052 | null |
| 2026-02-17 | ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns | Ziyu Zhao et.al. | 2602.15521 | null |
| 2026-02-17 | GMAIL: Generative Modality Alignment for generated Image Learning | Shentong Mo et.al. | 2602.15368 | null |
| 2026-02-16 | Mixture-of-Experts under Finite-Rate Gating: Communication–Generalization Trade-offs | Ali Khalesi et.al. | 2602.15091 | null |
| 2026-02-13 | RynnBrain: Open Embodied Foundation Models | Ronghao Dang et.al. | 2602.14979 | null |
| 2026-02-16 | Topological and arithmetic characteristics about products of projective lines with complex tori | Jia-Li Mo et.al. | 2602.14745 | null |
| 2026-02-16 | DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving | Chenxu Dang et.al. | 2602.14577 | null |
| 2026-02-15 | DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices | Songyuan Li et.al. | 2602.14301 | null |
| 2026-02-15 | MILD: Multi-Intent Learning and Disambiguation for Proactive Failure Prediction in Intent-based Networking | Md. Kamrul Hossain et.al. | 2602.14283 | null |
| 2026-02-15 | Multi-Agent Debate: A Unified Agentic Framework for Tabular Anomaly Detection | Pinqiao Wang et.al. | 2602.14251 | null |
| 2026-02-15 | Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws | Jinbo Wang et.al. | 2602.14208 | null |
| 2026-02-15 | Synergistic Intra- and Cross-Layer Regularization Losses for MoE Expert Specialization | Rizhen Hu et.al. | 2602.14159 | null |
| 2026-02-15 | REAL: Resolving Knowledge Conflicts in Knowledge-Intensive Visual Question Answering via Reasoning-Pivot Alignment | Kai Ye et.al. | 2602.14065 | null |
| 2026-02-15 | LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts | Yang Liu et.al. | 2602.14060 | null |
| 2026-02-15 | Geometry-Preserving Aggregation for Mixture-of-Experts Embedding Models | Sajjad Kachuee et.al. | 2602.14039 | null |
| 2026-02-15 | Eureka-Audio: Triggering Audio Intelligence in Compact Language Models | Dan Zhang et.al. | 2602.13954 | null |
| 2026-02-14 | Assessing Cybersecurity Risks and Traffic Impact in Connected Autonomous Vehicles | Saurav Silwal et.al. | 2602.13898 | null |
| 2026-02-14 | Mixture-of-experts Wishart model for covariance matrices with an application to Cancer drug screening | The Tien Mai et.al. | 2602.13888 | null |
| 2026-02-13 | Dyad: a binary-star dynamics and statistics library for Python | Amery Gration et.al. | 2602.13388 | null |
| 2026-02-13 | Improved measurements of the coherence factors and strong-phase differences in $D\to K^-π^+π^+π^-$ and $D\to K^-π^+π^0$ with quantum-correlated $D\bar{D}$ decays | BESIII Collaboration et.al. | 2602.13002 | null |
| 2026-02-13 | Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews | Hamidreza Kazemi Taskooh et.al. | 2602.12778 | null |
| 2026-02-13 | Mixture of Predefined Experts: Maximizing Data Usage on Vertical Federated Learning | Jon Irureta et.al. | 2602.12708 | null |
| 2026-02-13 | Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers | Anrui Chen et.al. | 2602.12587 | null |
| 2026-02-13 | SD-MoE: Spectral Decomposition for Effective Expert Specialization | Ruijun Huang et.al. | 2602.12556 | null |
| 2026-02-13 | Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR | Jaeyoung Lee et.al. | 2602.12546 | null |
| 2026-02-12 | Query-focused and Memory-aware Reranker for Long Context Processing | Yuqing Li et.al. | 2602.12192 | null |
| 2026-02-12 | Measurement of the singly Cabibbo-suppressed decay $Λ_c^+\to pη'$ with Deep Learning | BESIII Collaboration et.al. | 2602.11974 | null |
| 2026-02-12 | Extending Puzzle for Mixture-of-Experts Reasoning Models with Application to GPT-OSS Acceleration | Akhiad Bercovich et.al. | 2602.11937 | null |
| 2026-02-12 | Deep Kernel Fusion for Transformers | Zixi Zhang et.al. | 2602.11808 | null |
| 2026-02-12 | LAER-MoE: Load-Adaptive Expert Re-layout for Efficient Mixture-of-Experts Training | Xinyi Liu et.al. | 2602.11686 | null |
| 2026-02-12 | Evolutionary Router Feature Generation for Zero-Shot Graph Anomaly Detection with Mixture-of-Experts | Haiyang Jiang et.al. | 2602.11622 | null |
| 2026-02-12 | Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm | Jinrui Zhang et.al. | 2602.11543 | null |
| 2026-02-12 | Adaptive Milestone Reward for GUI Agents | Congmin Zheng et.al. | 2602.11524 | null |
| 2026-02-12 | Observation of a New Excited $Σ$ State in $ψ(3686)\to\bar{p}K^+Σ^0+c.c.$ | BESIII Collaboration et.al. | 2602.11501 | null |
| 2026-02-11 | Charting Empirical Laws for LLM Fine-Tuning in Scientific Multi-Discipline Learning | Lintao Wang et.al. | 2602.11215 | null |
| 2026-02-11 | MoEEdit: Efficient and Routing-Stable Knowledge Editing for Mixture-of-Experts LLMs | Yupu Gu et.al. | 2602.10965 | null |
| 2026-02-11 | CMAD: Cooperative Multi-Agent Diffusion via Stochastic Optimal Control | Riccardo Barbano et.al. | 2602.10933 | null |
| 2026-02-11 | VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training | Guobin Shen et.al. | 2602.10693 | null |
| 2026-02-11 | Multimodal Priors-Augmented Text-Driven 3D Human-Object Interaction Generation | Yin Wang et.al. | 2602.10659 | null |
| 2026-02-11 | A Vision-Language Foundation Model for Zero-shot Clinical Collaboration and Automated Concept Discovery in Dermatology | Siyuan Yan et.al. | 2602.10624 | null |
| 2026-02-11 | Supercharging Packet-level Network Simulation of Large Model Training via Memoization and Fast-Forwarding | Fei Long et.al. | 2602.10615 | null |
| 2026-02-11 | Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters | Ailin Huang et.al. | 2602.10604 | null |
| 2026-02-11 | Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity | Guangzhi Xiong et.al. | 2602.10585 | null |
| 2026-02-12 | 3DXTalker: Unifying Identity, Lip Sync, Emotion, and Spatial Dynamics in Expressive 3D Talking Avatars | Zhongju Wang et.al. | 2602.10516 | null |
| 2026-02-10 | Area-Efficient In-Memory Computing for Mixture-of-Experts via Multiplexing and Caching | Hanyuan Gao et.al. | 2602.10254 | link |
| 2026-02-10 | TDE 2025abcr: A Tidal Disruption Event in the Outskirts of a Massive Galaxy | Robert Stein et.al. | 2602.10180 | null |
| 2026-02-10 | MalMoE: Mixture-of-Experts Enhanced Encrypted Malicious Traffic Detection Under Graph Drift | Yunpeng Tan et.al. | 2602.10157 | null |
| 2026-02-10 | Diverse Skill Discovery for Quadruped Robots via Unsupervised Learning | Ruopeng Cui et.al. | 2602.09767 | null |
| 2026-02-10 | Revealing the Challenges of Attention-FFN Disaggregation for Modern MoE Models and Hardware Systems | Guowei Liu et.al. | 2602.09721 | null |
| 2026-02-10 | First observation of the $η_{c}\toΞ^{0} \barΞ^{0}$ decay | BESIII Collaboration et.al. | 2602.09652 | null |
| 2026-02-10 | DR.Experts: Differential Refinement of Distortion-Aware Experts for Blind Image Quality Assessment | Bohan Fu et.al. | 2602.09531 | null |
| 2026-02-10 | SMES: Towards Scalable Multi-Task Recommendation via Expert Sparsity | Yukun Zhang et.al. | 2602.09386 | null |
| 2026-02-10 | Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density | Zhendong Mi et.al. | 2602.09316 | null |
| 2026-02-09 | Generalizing GNNs with Tokenized Mixture of Experts | Xiaoguang Guo et.al. | 2602.09258 | null |
| 2026-02-09 | UI-Venus-1.5 Technical Report | Venus-Team et.al. | 2602.09082 | null |
| 2026-02-09 | DirMoE: Dirichlet-routed Mixture of Experts | Amirhossein Vahidi et.al. | 2602.09001 | null |
| 2026-02-09 | OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation | Yehua Huang et.al. | 2602.08896 | link |
| 2026-02-09 | FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models | Annemette Brok Pirchert et.al. | 2602.08818 | null |
| 2026-02-10 | MOVA: Towards Scalable and Synchronized Video-Audio Generation | SII-OpenMOSS Team et.al. | 2602.08794 | null |
| 2026-02-10 | Redundancy-Free View Alignment for Multimodal Human Activity Recognition with Arbitrarily Missing Views | Duc-Anh Nguyen et.al. | 2602.08755 | null |
| 2026-02-09 | Large Language Lobotomy: Jailbreaking Mixture-of-Experts via Expert Silencing | Jona te Lintelo et.al. | 2602.08741 | null |
| 2026-02-09 | 6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks | Mohamed Amine Ferrag et.al. | 2602.08675 | null |
| 2026-02-10 | Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models | Mingzi Cao et.al. | 2602.08658 | null |
| 2026-02-09 | Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs | Yukun Jiang et.al. | 2602.08621 | null |
| 2026-02-09 | Giant Magnetocaloric Effect in a High-Spin Shastry-Sutherland Dipolar Magnet | Jianjian Gong et.al. | 2602.08497 | null |
| 2026-02-09 | TEAM: Temporal-Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration | Linye Wei et.al. | 2602.08404 | null |
| 2026-02-09 | Tighnari v2: Mitigating Label Noise and Distribution Shift in Multimodal Plant Distribution Prediction via Mixture of Experts and Weakly Supervised Learning | Haixu Liu et.al. | 2602.08282 | null |
| 2026-02-09 | Large Language Models in Peer-Run Community Behavioral Health Services: Understanding Peer Specialists and Service Users’ Perspectives on Opportunities, Risks, and Mitigation Strategies | Cindy Peng et.al. | 2602.08187 | null |
| 2026-02-08 | Multimodal normative modeling in Alzheimer's Disease with introspective variational autoencoders | Sayantan Kumar et.al. | 2602.08077 | null |
| 2026-02-08 | Efficient and Adaptable Detection of Malicious LLM Prompts via Bootstrap Aggregation | Shayan Ali Hassan et.al. | 2602.08062 | null |
| 2026-02-08 | Enhanced Mixture 3D CGAN for Completion and Generation of 3D Objects | Yahia Hamdi et.al. | 2602.08046 | null |
| 2026-02-08 | The Rise of Sparse Mixture-of-Experts: A Survey from Algorithmic Foundations to Decentralized Architectures and Vertical Domain Applications | Dong Pan et.al. | 2602.08019 | null |
| 2026-02-08 | Fast Model Selection and Stable Optimization for Softmax-Gated Multinomial-Logistic Mixture of Experts Models | TrungKhang Tran et.al. | 2602.07997 | null |
| 2026-02-08 | Thinking in Structures: Evaluating Spatial Intelligence through Reasoning on Constrained Manifolds | Chen Yang et.al. | 2602.07864 | null |
| 2026-02-07 | SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models | Juntong Wu et.al. | 2602.07616 | null |
| 2026-02-07 | MSN: A Memory-based Sparse Activation Scaling Framework for Large-scale Industrial Recommendation | Shikang Wu et.al. | 2602.07526 | null |
| 2026-02-07 | From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection | Mo Wang et.al. | 2602.07497 | null |
| 2026-02-07 | Wavelet-Domain Masked Image Modeling for Color-Consistent HDR Video Reconstruction | Yang Zhang et.al. | 2602.07393 | link |
| 2026-02-07 | When the Model Said ‘No Comment’, We Knew Helpfulness Was Dead, Honesty Was Alive, and Safety Was Terrified | Gautam Siddharth Kashyap et.al. | 2602.07381 | null |
| 2026-02-07 | Semantic Search At LinkedIn | Fedor Borisyuk et.al. | 2602.07309 | null |
| 2026-02-06 | XShare: Collaborative in-Batch Expert Sharing for Faster MoE Inference | Daniil Vankov et.al. | 2602.07265 | null |
| 2026-02-06 | DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos | Shenyuan Gao et.al. | 2602.06949 | null |
| 2026-02-06 | Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing | Meng Lou et.al. | 2602.06862 | null |
| 2026-02-06 | POP: Online Structural Pruning Enables Efficient Inference of Large Foundation Models | Yi Chen et.al. | 2602.06822 | null |
| 2026-02-06 | SaDiT: Efficient Protein Backbone Design via Latent Structural Tokenization and Diffusion Transformers | Shentong Mo et.al. | 2602.06706 | null |
| 2026-02-06 | Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making | Baichuan-M3 Team et.al. | 2602.06570 | null |
| 2026-02-06 | TokenMixer-Large: Scaling Up Large Ranking Models in Industrial Recommenders | Yuchen Jiang et.al. | 2602.06563 | null |
| 2026-02-06 | HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction | Shengxuan Qiu et.al. | 2602.06527 | null |
| 2026-02-05 | GRP-Obliteration: Unaligning LLMs With a Single Unlabeled Prompt | Mark Russinovich et.al. | 2602.06258 | null |
| 2026-02-05 | To 2:4 Sparsity and Beyond: Neuron-level Activation Function to Accelerate LLM Pre-Training | Meghana Madhyastha et.al. | 2602.06183 | null |
| 2026-02-05 | MoSE: Mixture of Slimmable Experts for Efficient and Adaptive Language Models | Nurbek Tastan et.al. | 2602.06154 | null |
| 2026-02-05 | OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale | Jingze Shi et.al. | 2602.05711 | null |
| 2026-02-05 | Hidden simplicity in AdS spinning Mellin amplitudes via scaffolding | Song He et.al. | 2602.05568 | null |
| 2026-02-05 | M$^2$-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining | Rui Lv et.al. | 2602.05429 | null |
| 2026-02-05 | Mergers Drive Structural Complexity but Not Starbursts in Lyman-$α$ Emitters at $3 < z < 4$: A JWST Spatially Resolved View | Qi Song et.al. | 2602.05411 | null |
| 2026-02-05 | Decision-Focused Sequential Experimental Design: A Directional Uncertainty-Guided Approach | Beichen Wan et.al. | 2602.05340 | null |
| 2026-02-05 | Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink | Guozhi Liu et.al. | 2602.05228 | null |
| 2026-02-04 | Rule-Based Spatial Mixture-of-Experts U-Net for Explainable Edge Detection | Bharadwaj Dogga et.al. | 2602.05100 | null |
| 2026-02-04 | Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism | Chenwei Cui et.al. | 2602.04870 | null |
| 2026-02-04 | PDF-HR: Pose Distance Fields for Humanoid Robots | Yi Gu et.al. | 2602.04851 | null |
| 2026-02-04 | ERNIE 5.0 Technical Report | Haifeng Wang et.al. | 2602.04705 | null |
| 2026-02-04 | Let Experts Feel Uncertainty: A Multi-Expert Label Distribution Approach to Probabilistic Time Series Forecasting | Zhen Zhou et.al. | 2602.04678 | null |
| 2026-02-04 | RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models | Jiacheng Liang et.al. | 2602.04448 | null |
| 2026-02-04 | Mixture of Masters: Sparse Chess Language Models with Player Routing | Giacomo Frisoni et.al. | 2602.04447 | null |
| 2026-02-04 | Study of $\barΛ$-$p$ Annihilation into Light Mesons | BESIII Collaboration et.al. | 2602.04276 | null |
| 2026-02-04 | Universal Quantized Berry-Dipole Flat Bands | Qingyang Mo et.al. | 2602.04194 | null |
| 2026-02-04 | OMG-Agent: Toward Robust Missing Modality Generation with Decoupled Coarse-to-Fine Agentic Workflows | Ruiting Dai et.al. | 2602.04144 | null |
| 2026-02-04 | Expert Selections In MoE Models Reveal (Almost) As Much As Text | Amir Nuriyev et.al. | 2602.04105 | null |
| 2026-02-03 | SpecMD: A Comprehensive Study On Speculative Expert Prefetching | Duc Hoang et.al. | 2602.03921 | null |
| 2026-02-03 | UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining | Changhao Wang et.al. | 2602.03772 | null |
| 2026-02-03 | HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing | Yizhao Gao et.al. | 2602.03560 | null |
| 2026-02-03 | DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs | Zeyu Zhu et.al. | 2602.03495 | null |
| 2026-02-03 | Scaling Continual Learning with Bi-Level Routing Mixture-of-Experts | Meng Lou et.al. | 2602.03473 | null |
| 2026-02-03 | VIRAL: Visual In-Context Reasoning via Analogy in Diffusion Transformers | Zhiwen Li et.al. | 2602.03210 | null |
| 2026-02-03 | Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry | Ye Su et.al. | 2602.03204 | null |
| 2026-02-03 | Aligning Forest and Trees in Images and Long Captions for Visually Grounded Understanding | Byeongju Woo et.al. | 2602.02977 | null |
| 2026-02-02 | Decision-Focused Optimal Transport | Suhan Liu et.al. | 2602.02800 | null |
| 2026-02-02 | Loss mechanisms of microwave frequency acoustic waves in thin film lithium niobate | Qixuan Lin et.al. | 2602.02797 | null |
| 2026-02-02 | SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning | Qifan Yu et.al. | 2602.02472 | null |
| 2026-02-02 | Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE | Yuanteng Chen et.al. | 2602.02443 | null |
| 2026-02-02 | DFKI-Speech System for WildSpoof Challenge: A robust framework for SASV In-the-Wild | Arnab Das et.al. | 2602.02286 | null |
| 2026-02-02 | MoLF: Mixture-of-Latent-Flow for Pan-Cancer Spatial Gene Expression Prediction from Histology | Susu Hu et.al. | 2602.02282 | null |
| 2026-02-02 | Kimi K2.5: Visual Agentic Intelligence | Kimi Team et.al. | 2602.02276 | null |
| 2026-02-02 | vLLM-Omni: Fully Disaggregated Serving for Any-to-Any Multimodal Models | Peiqi Yin et.al. | 2602.02204 | null |
| 2026-02-02 | No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs | Liyan Xu et.al. | 2602.02103 | null |
| 2026-02-02 | Edge-Aligned Initialization of Kernels for Steered Mixture-of-Experts | Martin Determann et.al. | 2602.02031 | null |
| 2026-02-02 | SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning | Zhen-Hao Xie et.al. | 2602.01990 | null |
| 2026-02-02 | Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition | Wonjun Lee et.al. | 2602.01967 | null |
| 2026-02-02 | SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures | Liangtao Lin et.al. | 2602.01858 | null |
| 2026-02-02 | From Knowing to Doing Precisely: A General Self-Correction and Termination Framework for VLA models | Wentao Zhang et.al. | 2602.01811 | null |
| 2026-02-02 | Mutual-Guided Expert Collaboration for Cross-Subject EEG Classification | Zhi Zhang et.al. | 2602.01728 | null |
| 2026-02-02 | AdNanny: One Reasoning LLM for All Offline Ads Recommendation Tasks | Nan Hu et.al. | 2602.01563 | null |
| 2026-02-01 | A Statistical Theory of Gated Attention through the Lens of Hierarchical Mixture of Experts | Viet Nguyen et.al. | 2602.01468 | null |
| 2026-02-01 | Rethinking Multinomial Logistic Mixture of Experts with Sigmoid Gating Function | Tuan Minh Pham et.al. | 2602.01466 | null |
| 2026-02-01 | Exposing and Defending the Achilles’ Heel of Video Mixture-of-Experts | Songping Wang et.al. | 2602.01369 | null |
| 2026-02-01 | Observation of $\barΛp\to K^{+}π^{+}π^{-}π^{0}$ and $\barΛp\to K^{+}π^{+}π^{-}2π^{0}$ | BESIII Collaboration et.al. | 2602.01282 | null |
| 2026-02-01 | MiTA Attention: Efficient Fast-Weight Scaling via a Mixture of Top-$k$ Activations | Qishuai Wen et.al. | 2602.01219 | null |
| 2026-02-01 | Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse | Zizhuo Fu et.al. | 2602.01203 | null |
| 2026-01-30 | Omni-fMRI: A Universal Atlas-Free fMRI Foundation Model | Mo Wang et.al. | 2601.23090 | null |
| 2026-01-30 | UrbanMoE: A Sparse Multi-Modal Mixture-of-Experts Framework for Multi-Task Urban Region Profiling | Pingping Liu et.al. | 2601.22746 | null |
| 2026-01-30 | A Cross-Domain Graph Learning Protocol for Single-Step Molecular Geometry Refinement | Chengchun Liu et.al. | 2601.22723 | null |
| 2026-01-30 | A Step Back: Prefix Importance Ratio Stabilizes Policy Optimization | Shiye Lei et.al. | 2601.22718 | null |
| 2026-01-30 | A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation | Haonan He et.al. | 2601.22708 | null |
| 2026-01-30 | Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments | Jinwoo Jang et.al. | 2601.22647 | null |
| 2026-01-30 | SpanNorm: Reconciling Training Stability and Performance in Deep Transformers | Chao Wang et.al. | 2601.22580 | null |
| 2026-01-30 | SHED Light on Segmentation for Dense Prediction | Seung Hyun Lee et.al. | 2601.22529 | null |
| 2026-01-30 | Continual Policy Distillation from Distributed Reinforcement Learning Teachers | Yuxuan Li et.al. | 2601.22475 | null |
| 2026-01-29 | ECO: Quantized Training without Full-Precision Master Weights | Mahdi Nikdan et.al. | 2601.22101 | null |
| 2026-01-29 | Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference | Yiren Zhao et.al. | 2601.22001 | null |
| 2026-01-29 | MoE-ACT: Improving Surgical Imitation Learning Policies through Supervised Mixture-of-Experts | Lorenzo Mazza et.al. | 2601.21971 | null |
| 2026-01-29 | MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts | Evandro S. Ortigossa et.al. | 2601.21866 | null |
| 2026-01-29 | OneMall: One Model, More Scenarios – End-to-End Generative Recommender Family at Kuaishou E-Commerce | Kun Zhang et.al. | 2601.21770 | null |
| 2026-01-29 | Seg-MoE: Multi-Resolution Segment-wise Mixture-of-Experts for Time Series Forecasting Transformers | Evandro S. Ortigossa et.al. | 2601.21641 | null |
| 2026-01-29 | Depth-Recurrent Attention Mixtures: Giving Latent Reasoning the Attention it Deserves | Jonas Knupp et.al. | 2601.21582 | null |
| 2026-01-29 | Multi-Modal Time Series Prediction via Mixture of Modulated Experts | Lige Zhang et.al. | 2601.21547 | null |
| 2026-01-29 | ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory | Yang Zhao et.al. | 2601.21545 | null |
| 2026-01-30 | L$^3$: Large Lookup Layers | Albert Tseng et.al. | 2601.21461 | null |
| 2026-01-29 | ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation | Zihao Huang et.al. | 2601.21420 | null |
| 2026-01-29 | L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts | Minghao Yang et.al. | 2601.21349 | null |
| 2026-01-29 | Abstracting Robot Manipulation Skills via Mixture-of-Experts Diffusion Policies | Ce Hao et.al. | 2601.21251 | null |
| 2026-01-29 | Scaling Embeddings Outperforms Scaling Experts in Language Models | Hong Liu et.al. | 2601.21204 | null |
| 2026-01-29 | ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling | Yuchen Yang et.al. | 2601.21198 | null |
| 2026-01-29 | Precise measurements of $D^0 \to K^-\ell^+ν_\ell$ and $D^+ \to \bar K^0\ell^+ν_\ell$ decays | BESIII Collaboration et.al. | 2601.21196 | null |
| 2026-01-29 | Search for $ψ_0(4360)\rightarrow ηψ(2S)$ through the process $e^+e^- \rightarrow ηηψ(2S)$ | BESIII Collaboration et.al. | 2601.21190 | null |
| 2026-01-29 | First Experimental Constraint on the Scalar Current in the $D^{0(+)}\to \bar K\ell^+ν_{\ell}$ Transition | BESIII Collaboration et.al. | 2601.21185 | null |
| 2026-01-29 | BrainStack: Neuro-MoE with Functionally Guided Expert Routing for EEG-Based Language Decoding | Ziyi Zhao et.al. | 2601.21148 | null |
| 2026-01-29 | TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning | Shicheng Fan et.al. | 2601.21135 | null |
| 2026-01-28 | ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler | Bohua Zou et.al. | 2601.20755 | null |
| 2026-01-28 | ShieldedCode: Learning Robust Representations for Virtual Machine Protected Code | Mingqiao Mo et.al. | 2601.20679 | null |
| 2026-01-28 | Unsupervised Ensemble Learning Through Deep Energy-based Models | Ariel Maymon et.al. | 2601.20556 | null |
| 2026-01-28 | OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution | Le Zhang et.al. | 2601.20380 | null |
| 2026-01-28 | OSDEnhancer: Taming Real-World Space-Time Video Super-Resolution with One-Step Diffusion | Shuoyan Wei et.al. | 2601.20308 | null |
| 2026-01-28 | MiLorE-SSL: Scaling Multilingual Capabilities in Self-Supervised Models without Forgetting | Jing Xu et.al. | 2601.20300 | null |
| 2026-01-28 | HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-BENCH | Yueyang Wang et.al. | 2601.20255 | null |
| 2026-01-28 | Hyperparameter Transfer with Mixture-of-Expert Layers | Tianze Jiang et.al. | 2601.20205 | null |
| 2026-01-28 | Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery | Zhipeng Zhang et.al. | 2601.20193 | null |
| 2026-01-27 | Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts | TrungKhang Tran et.al. | 2601.19811 | null |
| 2026-01-27 | Component-Level Lesioning of Language Models Reveals Clinically Aligned Aphasia Phenotypes | Yifan Wang et.al. | 2601.19723 | null |
| 2026-01-27 | LoPRo: Enhancing Low-Rank Quantization via Permuted Block-Wise Rotation | Hongyaoxing Gu et.al. | 2601.19675 | null |
| 2026-01-27 | GMS-CAVP: Improving Audio-Video Correspondence with Multi-Scale Contrastive and Generative Pretraining | Shentong Mo et.al. | 2601.19606 | null |
| 2026-01-27 | Search for the isospin-violating decays $\boldsymbol{χ_{cJ}\toΛ\barΣ^{0}+c.c.}$ and $\boldsymbol{η_{c}\toΛ\barΣ^{0}+c.c.}$ | BESIII Collaboration et.al. | 2601.19493 | null |
| 2026-01-27 | Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition | Isha Pandey et.al. | 2601.19451 | null |
| 2026-01-26 | Superlinear Multi-Step Attention | Yufeng Huang et.al. | 2601.18401 | null |
| 2026-01-26 | FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning | Zhaopeng Qiu et.al. | 2601.18150 | null |
| 2026-01-26 | Beyond Static Datasets: Robust Offline Policy Optimization via Vetted Synthetic Transitions | Pedram Agand et.al. | 2601.18107 | null |
| 2026-01-26 | OneVoice: One Model, Triple Scenarios-Towards Unified Zero-shot Voice Conversion | Zhichao Wang et.al. | 2601.18094 | null |
| 2026-01-26 | LatentMoE: Toward Optimal Accuracy per FLOP and Parameter in Mixture of Experts | Venmugil Elango et.al. | 2601.18089 | null |
| 2026-01-25 | Domain-Expert-Guided Hybrid Mixture-of-Experts for Medical AI: Integrating Data-Driven Learning with Clinical Priors | Jinchen Gu et.al. | 2601.17977 | null |
| 2026-01-25 | EntWorld: A Holistic Environment and Benchmark for Verifiable Enterprise GUI Agents | Ying Mo et.al. | 2601.17722 | null |
| 2026-01-25 | $\infty$-MoE: Generalizing Mixture of Experts to Infinite Experts | Shota Takashiro et.al. | 2601.17680 | null |
| 2026-01-25 | Health-ORSC-Bench: A Benchmark for Measuring Over-Refusal and Safety Completion in Health Context | Zhihao Zhang et.al. | 2601.17642 | null |
| 2026-01-24 | PILOT: A Perceptive Integrated Low-level Controller for Loco-manipulation over Unstructured Scenes | Xinru Cui et.al. | 2601.17440 | null |
| 2026-01-24 | Topological Protection by Local Support Symmetry and Destructive Interference | Jun-Won Rhim et.al. | 2601.17272 | null |
| 2026-01-23 | Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts | Xuan-Phi Nguyen et.al. | 2601.17111 | null |
| 2026-01-23 | First evidence for $D_s^+ \to f_1(1420) e^+ν_e$ and search for $D_s^+ \to f_1(1285) e^+ν_e$ | BESIII Collaboration et.al. | 2601.16938 | null |
| 2026-01-23 | Coarse-Grained Geometric Quantum Dynamics in the Tensor Network Representation | Mo Sha et.al. | 2601.16913 | null |
| 2026-01-23 | GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints | Andy Zhu et.al. | 2601.16905 | null |
| 2026-01-23 | Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation | Tims Pecerskis et.al. | 2601.16863 | null |
| 2026-01-23 | SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents | Yuhang Wang et.al. | 2601.16746 | null |
| 2026-01-23 | LongCat-Flash-Thinking-2601 Technical Report | Meituan LongCat Team et.al. | 2601.16725 | null |
| 2026-01-23 | Search for the radiative decay $D^+_s \to γK^*(892)^+$ | BESIII Collaboration et.al. | 2601.16476 | null |
| 2026-01-22 | proto-Lightspeed: a high-speed, ultra-low read noise imager on the Magellan Clay Telescope | Christopher Layden et.al. | 2601.16268 | null |
| 2026-01-22 | Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning | Moo Jin Kim et.al. | 2601.16163 | null |
| 2026-01-22 | Universal Refusal Circuits Across LLMs: Cross-Model Transfer via Trajectory Replay and Concept-Basis Reconstruction | Tony Cristofano et.al. | 2601.16034 | null |
| 2026-01-22 | Search for the reaction channel $e^+ e^- \to ηη\,J/ψ$ and the isospin partner of the $Z_c(3900)$ at center-of-mass energies $\sqrt{s} = 4.226-4.950$ GeV | BESIII Collaboration et.al. | 2601.15882 | null |
| 2026-01-22 | LL-GaussianImage: Efficient Image Representation for Zero-shot Low-Light Enhancement with 2D Gaussian Splatting | Yuhan Chen et.al. | 2601.15772 | null |
| 2026-01-22 | Redshift-Binned Constraints on the Hubble Constant under $Λ$CDM, CPL, and Padé Cosmography | Zhi-Yuan Mo et.al. | 2601.15765 | null |
| 2026-01-21 | On the diagonal of low bidegree hypersurfaces | Morten Lüders et.al. | 2601.15409 | null |
| 2026-01-21 | Improving MoE Compute Efficiency by Composing Weight and Data Sparsity | Maciej Kilian et.al. | 2601.15370 | null |
| 2026-01-21 | Pb4U-GNet: Resolution-Adaptive Garment Simulation via Propagation-before-Update Graph Network | Aoran Liu et.al. | 2601.15110 | null |
| 2026-01-21 | Mixture-of-Experts Models in Vision: Routing, Optimization, and Generalization | Adam Rokah et.al. | 2601.15021 | null |
| 2026-01-21 | SynPerf: A Hybrid Analytical-ML Framework for GPU Performance Prediction | Kaixuan Zhang et.al. | 2601.14910 | null |
| 2026-01-21 | Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation | Rui Qi et.al. | 2601.14896 | null |
| 2026-01-21 | UBATrack: Spatio-Temporal State Space Model for General Multi-Modal Tracking | Qihua Liang et.al. | 2601.14799 | null |
| 2026-01-21 | UniRoute: Unified Routing Mixture-of-Experts for Modality-Adaptive Remote Sensing Change Detection | Qingling Shu et.al. | 2601.14797 | null |
| 2026-01-21 | Robustness of Mixtures of Experts to Feature Noise | Dong Sun et.al. | 2601.14792 | null |
| 2026-01-21 | Online Linear Programming with Replenishment | Yuze Chen et.al. | 2601.14629 | null |
| 2026-01-20 | $π$MPC: A Parallel-in-horizon and Construction-free NMPC Solver | Liang Wu et.al. | 2601.14414 | null |
| 2026-01-20 | Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models | YuanLab.ai et.al. | 2601.14327 | null |
| 2026-01-20 | LLMOrbit: A Circular Taxonomy of Large Language Models - From Scaling Walls to Agentic AI Systems | Badri N. Patro et.al. | 2601.14053 | null |
| 2026-01-20 | Understanding Multilingualism in Mixture-of-Experts LLMs: Routing Mechanism, Expert Specialization, and Layerwise Steering | Yuxin Chen et.al. | 2601.14050 | null |
| 2026-01-20 | DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging | Adrien Meyer et.al. | 2601.13954 | null |
| 2026-01-20 | The R2Pub Telescopes for Surveying: An Overview and Performance Evaluation of the System | Xuan Song et.al. | 2601.13587 | null |
| 2026-01-20 | ButterflyMoE: Sub-Linear Ternary Experts via Structured Butterfly Orbits | Aryan Karmore et.al. | 2601.13563 | null |
| 2026-01-20 | MN-TSG: Continuous Time Series Generation with Irregular Observations | Xu Zhang et.al. | 2601.13534 | null |
| 2026-01-19 | CLIP-Guided Adaptable Self-Supervised Learning for Human-Centric Visual Tasks | Mingshuang Luo et.al. | 2601.13133 | null |
| 2026-01-19 | Agentic Conversational Search with Contextualized Reasoning via Reinforcement Learning | Fengran Mo et.al. | 2601.13115 | null |
| 2026-01-19 | Polychronous Wave Computing: Timing-Native Address Selection in Spiking Networks | Natalia G. Berloff et.al. | 2601.13079 | null |
| 2026-01-19 | Synthesizing Strong-Coupling Kohn-Luttinger Superconductivity in 2D Van der Waals materials | Shi-Cong Mo et.al. | 2601.13074 | null |
| 2026-01-19 | PASs-MoE: Mitigating Misaligned Co-drift among Router and Experts via Pathway Activation Subspaces for Continual Learning | Zhiyan Hou et.al. | 2601.13020 | null |
| 2026-01-19 | HT-GNN: Hyper-Temporal Graph Neural Network for Customer Lifetime Value Prediction in Baidu Ads | Xiaohui Zhao et.al. | 2601.13013 | null |
| 2026-01-19 | OFA-MAS: One-for-All Multi-Agent System Topology Design based on Mixture-of-Experts Graph Generative Models | Shiyuan Li et.al. | 2601.12996 | null |
| 2026-01-19 | PhyG-MoE: A Physics-Guided Mixture-of-Experts Framework for Energy-Efficient GNSS Interference Recognition | Zhihan Zeng et.al. | 2601.12798 | null |
| 2026-01-19 | Topology-Aware Multiscale Mixture of Experts for Efficient Molecular Property Prediction | Long D. Nguyen et.al. | 2601.12637 | null |
| 2026-01-18 | A Mixture of Experts Vision Transformer for High-Fidelity Surface Code Decoding | Hoang Viet Nguyen et.al. | 2601.12483 | null |
| 2026-01-18 | Learning Diverse Skills for Behavior Models with Mixture of Experts | Wangtian Shen et.al. | 2601.12397 | null |
| 2026-01-18 | NADIR: Differential Attention Flow for Non-Autoregressive Transliteration in Indic Languages | Lakshya Tomar et.al. | 2601.12389 | null |
| 2026-01-18 | GazeFormer-MoE: Context-Aware Gaze Estimation via CLIP and MoE Transformer | Xinyuan Zhao et.al. | 2601.12316 | null |
| 2026-01-18 | Facet-Aware Multi-Head Mixture-of-Experts Model with Text-Enhanced Pre-training for Sequential Recommendation | Mingrui Liu et.al. | 2601.12301 | null |
| 2026-01-16 | Reasoning in Trees: Improving Retrieval-Augmented Generation for Multi-Hop Question Answering | Yuling Shi et.al. | 2601.11255 | null |
| 2026-01-16 | First Measurement of the Absolute Branching Fraction of $η_c \to γγ$ | BESIII Collaboration et.al. | 2601.11236 | null |
| 2026-01-16 | Self-Augmented Mixture-of-Experts for QoS Prediction | Kecheng Cai et.al. | 2601.11036 | null |
| 2026-01-16 | RobuMTL: Enhancing Multi-Task Learning Robustness Against Weather Conditions | Tasneem Shaffee et.al. | 2601.10921 | null |
| 2026-01-15 | Search for sub-GeV dark particles in $η\toπ^0+\rm{invisible}$ decay | BESIII Collaboration et.al. | 2601.10597 | null |
| 2026-01-15 | Deterministic and scalable generation of large Fock states | Mo Xiong et.al. | 2601.10559 | null |
| 2026-01-15 | Algebraic Farkas Lemma and Strong Duality for Perturbed Conic Linear Programming | P. D. Khanh et.al. | 2601.10390 | null |
| 2026-01-15 | MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts | Yuxuan Lou et.al. | 2601.10272 | null |
| 2026-01-15 | A Highly Magnetic Ultra Massive White Dwarf with a 23-minute Rotation Period | Jincheng Guo et.al. | 2601.10188 | null |
| 2026-01-15 | What Gets Activated: Uncovering Domain and Driver Experts in MoE Language Models | Guimin Hu et.al. | 2601.10159 | null |
| 2026-01-15 | MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning | Yusong Wang et.al. | 2601.10157 | null |
| 2026-01-15 | Extremum Seeking Nonovershooting Control of Strict-Feedback Systems Under Unknown Control Direction | Kaixin Lu et.al. | 2601.09998 | null |
| 2026-01-14 | Progressive Mixture-of-Experts with autoencoder routing for continual RANS turbulence modelling | Haoyu Ji et.al. | 2601.09305 | null |
| 2026-01-14 | A Raman-Gas Spectral Compressor for High-Energy Femtosecond Laser Pulses | Zegui Wang et.al. | 2601.09234 | null |
| 2026-01-15 | A.X K1 Technical Report | Sung Jun Cheon et.al. | 2601.09200 | null |
| 2026-01-14 | WiFo-E: A Scalable Wireless Foundation Model for End-to-End FDD Precoding in Communication Networks | Weibo Wen et.al. | 2601.09186 | null |
| 2026-01-14 | Horseshoe Mixtures-of-Experts (HS-MoE) | Nick Polson et.al. | 2601.09043 | null |
| 2026-01-13 | OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG | Fengran Mo et.al. | 2601.09028 | null |
| 2026-01-12 | TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts | Yu Xu et.al. | 2601.08881 | null |
| 2026-01-13 | MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm | Bowen Zhou et.al. | 2601.08800 | null |
| 2026-01-13 | LWM-Spectro: A Foundation Model for Wireless Baseband Signal Spectrograms | Namhyun Kim et.al. | 2601.08780 | null |
| 2026-01-13 | M$^2$FMoE: Multi-Resolution Multi-View Frequency Mixture-of-Experts for Extreme-Adaptive Time Series Forecasting | Yaohui Huang et.al. | 2601.08631 | null |
| 2026-01-13 | Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances | Ziqi Ding et.al. | 2601.08516 | null |
| 2026-01-13 | Taxon: Hierarchical Tax Code Prediction with Semantically Aligned LLM Expert Guidance | Jihang Li et.al. | 2601.08418 | null |
| 2026-01-13 | Controlled LLM Training on Spectral Sphere | Tian Xie et.al. | 2601.08393 | null |
| 2026-01-13 | Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models | Bo Wang et.al. | 2601.08383 | null |
| 2026-01-13 | Towards Principled Design of Mixture-of-Experts Language Models under Memory and Inference Constraints | Seng Pei Liew et.al. | 2601.08215 | null |
| 2026-01-12 | Towards Specialized Generalists: A Multi-Task MoE-LoRA Framework for Domain-Specific LLM Adaptation | Yuxin Yang et.al. | 2601.07935 | null |
| 2026-01-12 | An eclipsing 8.56 minute orbital period mass-transferring binary | Emma T. Chickles et.al. | 2601.07925 | null |
| 2026-01-12 | Emotional Support Evaluation Framework via Controllable and Diverse Seeker Simulator | Chaewon Heo et.al. | 2601.07698 | null |
| 2026-01-12 | Amplitude analysis and branching fraction measurement of $J/ψ\to Λ\barΣ^0η+\mathrm{c.c}$ | BESIII Collaboration et.al. | 2601.07617 | null |
| 2026-01-12 | Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models | Xin Cheng et.al. | 2601.07372 | null |
| 2026-01-11 | PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation | Yuanzhe Liu et.al. | 2601.07060 | null |
| 2026-01-11 | Solar Open Technical Report | Sungrae Park et.al. | 2601.07022 | null |
| 2026-01-11 | Deep Learning Based Channel Extrapolation for Dual-Band Massive MIMO Systems | Qikai Xiao et.al. | 2601.06858 | null |
| 2026-01-11 | MoE-DisCo: Low Economy Cost Training Mixture-of-Experts Models | Xin Ye et.al. | 2601.06857 | null |
| 2026-01-11 | MoEScore: Mixture-of-Experts-Based Text-Audio Relevance Score Prediction for Text-to-Audio System Evaluation | Bochao Sun et.al. | 2601.06829 | null |
| 2026-01-11 | SecMoE: Communication-Efficient Secure MoE Inference via Select-Then-Compute | Bowen Shen et.al. | 2601.06790 | null |
| 2026-01-11 | AutoTour: Automatic Photo Tour Guide with Smartphones and LLMs | Huatao Xu et.al. | 2601.06781 | null |
| 2026-01-11 | MTMCS-Bench: Evaluating Contextual Safety of Multimodal Large Language Models in Multi-Turn Dialogues | Zheyuan Liu et.al. | 2601.06757 | null |
| 2026-01-10 | R-Estimation with Right-Censored Data | Glen A. Satten et.al. | 2601.06685 | null |
| 2026-01-10 | Efficient and Reliable Estimation of Named Entity Linking Quality: A Case Study on GutBrainIE | Marco Martinelli et.al. | 2601.06624 | null |
| 2026-01-10 | Hellinger Multimodal Variational Autoencoders | Huyen Khanh Vo et.al. | 2601.06572 | null |
| 2026-01-10 | Physics-guided foundation model for universal speckle removal in ultrathin multimode fiber imaging | Xianrui Zeng et.al. | 2601.06448 | null |
| 2026-01-10 | The Promise of Time-Series Foundation Models for Agricultural Forecasting: Evidence from Marketing Year Average Prices | Le Wang et.al. | 2601.06371 | null |
| 2026-01-09 | Monkey Jump: MoE-Style PEFT for Efficient Multi-Task Learning | Nusrat Jahan Prottasha et.al. | 2601.06356 | null |
| 2026-01-09 | AIConfigurator: Lightning-Fast Configuration Optimization for Multi-Framework LLM Serving | Tianhao Xu et.al. | 2601.06288 | null |
| 2026-01-09 | Orchestrating Tokens and Sequences: Dynamic Hybrid Policy Optimization for RLVR | Zijun Min et.al. | 2601.05607 | null |
| 2026-01-09 | Buffered AUC maximization for scoring systems via mixed-integer optimization | Moe Shiina et.al. | 2601.05544 | null |
| 2026-01-09 | Scalable Heterogeneous Graph Learning via Heterogeneous-aware Orthogonal Prototype Experts | Wei Zhou et.al. | 2601.05537 | null |
| 2026-01-08 | MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs | Jiyuan Zhang et.al. | 2601.05296 | null |
| 2026-01-08 | MoE3D: A Mixture-of-Experts Module for 3D Reconstruction | Zichen Wang et.al. | 2601.05208 | null |
| 2026-01-08 | FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts | Yiji Zhao et.al. | 2601.05174 | link |
| 2026-01-08 | How to Set the Learning Rate for Large-Scale Pre-training? | Yunhua Zhou et.al. | 2601.05049 | null |
| 2026-01-08 | CuMA: Aligning LLMs with Sparse Cultural Values via Demographic-Aware Mixture of Adapters | Ao Sun et.al. | 2601.04885 | null |
| 2026-01-08 | DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation | Guanzhi Deng et.al. | 2601.04823 | null |
| 2026-01-08 | Users Mispredict Their Own Preferences for AI Writing Assistance | Vivian Lai et.al. | 2601.04461 | null |
| 2026-01-08 | Re-Rankers as Relevance Judges | Chuan Meng et.al. | 2601.04455 | null |
| 2026-01-07 | Transitive Expert Error and Routing Problems in Complex AI Systems | Forest Mars et.al. | 2601.04416 | null |
| 2026-01-06 | Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language Models | Brady Steele et.al. | 2601.04254 | null |
| 2026-01-07 | When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life | Xinyue Lou et.al. | 2601.04043 | null |
| 2026-01-07 | A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems | Qi Wu et.al. | 2601.03992 | null |
| 2026-01-07 | Spectral Manifold Regularization for Stable and Modular Routing in Deep MoE Architectures | Ibrahim Delibasoglu et.al. | 2601.03889 | null |
| 2026-01-07 | PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation | Wenlong Huang et.al. | 2601.03782 | null |
| 2026-01-07 | Variational Inference, Entropy, and Orthogonality: A Unified Theory of Mixture-of-Experts | Ye Su et.al. | 2601.03577 | null |
| 2026-01-07 | CALM: Culturally Self-Aware Language Models | Lingzhi Shen et.al. | 2601.03483 | link |
| 2026-01-06 | The Illusion of Specialization: Unveiling the Domain-Invariant “Standing Committee” in Mixture-of-Experts Models | Yan Wang et.al. | 2601.03425 | null |
| 2026-01-06 | AT2024wpp: An Extremely Luminous Fast Ultraviolet Transient Powered by Accretion onto a Black Hole | Daniel A. Perley et.al. | 2601.03337 | null |
| 2026-01-06 | ReCCur: A Recursive Corner-Case Curation Framework for Robust Vision-Language Understanding in Open and Edge Scenarios | Yihan Wei et.al. | 2601.03011 | null |
| 2026-01-08 | MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free | Yishu Lei et.al. | 2601.02967 | null |
| 2026-01-06 | MixTTE: Multi-Level Mixture-of-Experts for Scalable and Adaptive Travel Time Estimation | Wenzhao Jiang et.al. | 2601.02943 | null |
| 2026-01-06 | MiMo-V2-Flash Technical Report | Bangjun Xiao et.al. | 2601.02780 | null |
| 2026-01-05 | Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts | Boxuan Lyu et.al. | 2601.02144 | null |
| 2026-01-05 | Cross section measurement of $e^{+}e^{-}\rightarrow π^{0}π^{0}ψ(3686)$ from $\sqrt{s}=$ 4.008 GeV to 4.951 GeV | BESIII Collaboration et.al. | 2601.02136 | null |
| 2026-01-07 | FormuLLA: A Large Language Model Approach to Generating Novel 3D Printable Formulations | Adeshola Okubena et.al. | 2601.02071 | null |
| 2026-01-05 | GCR: Geometry-Consistent Routing for Task-Agnostic Continual Anomaly Detection | Joongwon Chae et.al. | 2601.01856 | null |
| 2026-01-05 | First Observation of $D^{0(+)}\to \bar Kωe^+ν_e$ and Determination of the Branching Fraction of $\bar K_1(1270)\to \bar K ω$ | BESIII Collaboration et.al. | 2601.01817 | null |
| 2026-01-05 | Causality-Aware Temporal Projection for Video Understanding in Video-LLMs | Zhengjian Kang et.al. | 2601.01804 | null |
| 2026-01-05 | Measurements of the branching fractions of $χ_{cJ}\to 2K^+ 2K^- ω$ and $φK^+ K^- ω$ decays | BESIII Collaboration et.al. | 2601.01758 | null |
| 2026-01-05 | K-EXAONE Technical Report | Eunbi Choi et.al. | 2601.01739 | null |
| 2026-01-05 | Yuan3.0 Flash: An Open Multimodal Large Language Model for Enterprise Applications | YuanLab.ai et.al. | 2601.01718 | null |
| 2026-01-05 | Varying-Coefficient Mixture of Experts Model | Qicheng Zhao et.al. | 2601.01699 | null |
| 2026-01-06 | Measurements of the absolute branching fractions of the $Λ_{c}^{+}$ hadronic decays | BESIII Collaboration et.al. | 2601.01503 | null |
| 2026-01-04 | Multi-Subspace Multi-Modal Modeling for Diffusion Models: Estimation, Convergence and Mixture of Experts | Ruofeng Yang et.al. | 2601.01475 | null |
| 2026-01-06 | Making MoE-based LLM Inference Resilient with Tarragon | Songyu Zhang et.al. | 2601.01310 | null |
| 2026-01-03 | MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Efficient Clinical Assistance | Hamad Khan et.al. | 2601.01260 | null |
| 2026-01-02 | Reliability Under Randomness: An Empirical Analysis of Sparse and Dense Language Models Across Decoding Temperatures | Kabir Grover et.al. | 2601.00942 | null |
| 2026-01-02 | HFedMoE: Resource-aware Heterogeneous Federated Learning with Mixture-of-Experts | Zihan Fang et.al. | 2601.00583 | null |
| 2026-01-02 | A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR | Yuang Zheng et.al. | 2601.00557 | null |
| 2026-01-01 | Geometric Regularization in Mixture-of-Experts: The Disconnect Between Weights and Activations | Hyunjun Kim et.al. | 2601.00457 | null |
| 2026-01-01 | Traffic-MoE: A Sparse Foundation Model for Network Traffic Analysis | Jiajun Zhou et.al. | 2601.00357 | null |
| 2026-01-01 | Identification and Estimation under Multiple Versions of Treatment: Mixture-of-Experts Approach | Kohei Yoshikawa et.al. | 2601.00287 | null |
| 2025-12-31 | Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem | Weixun Wang et.al. | 2512.24873 | null |
| 2025-12-31 | Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models | Ákos Prucs et.al. | 2512.24776 | null |
| 2025-12-30 | Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning | Ziqing Fan et.al. | 2512.24265 | null |
| 2025-12-30 | Training Report of TeleChat3-MoE | Xinzhang Liu et.al. | 2512.24157 | null |
| 2025-12-30 | Skyrmion and Meron Crystals in Intermetallic Gd$_3$Ru$_4$Al$_{12}$: Microscopic Model Insights into Chiral Phases | Jiajun Mo et.al. | 2512.24071 | null |
| 2025-12-30 | RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress | Ruixuan Huang et.al. | 2512.23995 | null |
| 2025-12-30 | Towards a bottom-up formulation of spin kinetic theory | Zonglin Mo et.al. | 2512.23960 | null |
| 2026-01-02 | Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling | Chulun Zhou et.al. | 2512.23959 | null |
| 2025-12-30 | Learnable Query Aggregation with KV Routing for Cross-view Geo-localisation | Hualin Ye et.al. | 2512.23938 | null |
| 2025-12-29 | Observations of the Fermi bubbles and the Galactic center excess with the DArk Matter Particle Explorer | F. Alemanno et.al. | 2512.23458 | null |
| 2025-12-29 | Dynamic Subspace Composition: Efficient Adaptation via Contractive Basis Expansion | Vladimer Khasia et.al. | 2512.23448 | null |
| 2025-12-29 | Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss | Ang Lv et.al. | 2512.23447 | null |
| 2025-12-29 | Bitcoin-IPC: Scaling Bitcoin with a Network of Proof-of-Stake Subnets | Marko Vukolić et.al. | 2512.23439 | null |
| 2025-12-29 | Study of $\bar{K}^*(892)^0 η$ and $K_S^0 a_0(980)^0$ in the $D^{0} \to K_{S}^{0}π^0η$ decay | BESIII Collaboration et.al. | 2512.23389 | null |
| 2025-12-30 | YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection | Xu Lin et.al. | 2512.23273 | null |
| 2025-12-28 | Trust Region Masking for Long-Horizon LLM Reinforcement Learning | Yingru Li et.al. | 2512.23075 | null |
| 2025-12-28 | FLEX-MoE: Federated Mixture-of-Experts with Load-balanced Expert Assignment | Boyang Zhang et.al. | 2512.23070 | null |
| 2025-12-28 | Viability and Performance of a Private LLM Server for SMBs: A Benchmark Analysis of Qwen3-30B on Consumer-Grade Hardware | Alex Khalil et.al. | 2512.23029 | null |
| 2025-12-28 | Reach-Avoid Differential game with Reachability Analysis for UAVs: A decomposition approach | Minh Bui et.al. | 2512.22793 | null |
| 2025-12-28 | Text-Routed Sparse Mixture-of-Experts Model with Explanation and Temporal Alignment for Multi-Modal Sentiment Analysis | Dongning Rao et.al. | 2512.22741 | null |
| 2025-12-27 | RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure | Wei Gao et.al. | 2512.22560 | null |
| 2025-12-27 | Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection | Zihan Liu et.al. | 2512.22483 | null |
| 2025-12-27 | Bright 4B: Scaling Hyperspherical Learning for Segmentation in 3D Brightfield Microscopy | Amil Khan et.al. | 2512.22423 | null |
| 2025-12-26 | FUSCO: High-Performance Distributed Data Shuffling via Transformation-Communication Fusion | Zhuoran Zhu et.al. | 2512.22036 | null |
| 2025-12-26 | SWE-RM: Execution-free Feedback For Software Engineering Agents | KaShun Shum et.al. | 2512.21919 | null |
| 2025-12-26 | Accelerate Speculative Decoding with Sparse Computation in Verification | Jikai Wang et.al. | 2512.21911 | null |
| 2025-12-26 | MMCTOP: A Multimodal Textualization and Mixture-of-Experts Framework for Clinical Trial Outcome Prediction | Carolina Aparício et.al. | 2512.21897 | null |
| 2025-12-26 | CrownGen: Patient-customized Crown Generation via Point Diffusion Model | Juyoung Bae et.al. | 2512.21890 | null |
| 2025-12-26 | SLIM-Brain: A Data- and Training-Efficient Foundation Model for fMRI Data Analysis | Mo Wang et.al. | 2512.21881 | null |
| 2025-12-25 | Spatiotemporal-Untrammelled Mixture of Experts for Multi-Person Motion Prediction | Zheng Yin et.al. | 2512.21707 | null |
| 2025-12-25 | Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism | Xinglin Pan et.al. | 2512.21487 | null |
| 2025-12-24 | DeepCQ: General-Purpose Deep-Surrogate Framework for Lossy Compression Quality Prediction | Khondoker Mirazul Mumenin et.al. | 2512.21433 | null |
| 2025-12-24 | SparScene: Efficient Traffic Scene Representation via Sparse Graph Learning for Large-Scale Trajectory Generation | Xiaoyu Mo et.al. | 2512.21133 | null |
| 2025-12-26 | Identification with Orthogonal Basis Functions: Convergence Speed, Asymptotic Bias, and Rate-Optimal Pole Selection | Jiayun Li et.al. | 2512.21096 | null |
| 2025-12-25 | GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs | Lichao Wu et.al. | 2512.21008 | null |
| 2025-12-24 | SACodec: Asymmetric Quantization with Semantic Anchoring for Low-Bitrate High-Fidelity Neural Speech Codecs | Zhongren Dong et.al. | 2512.20944 | null |
| 2025-12-24 | RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks | Ningyuan Liu et.al. | 2512.20920 | null |
| 2025-12-24 | NVIDIA Nemotron 3: Efficient and Open Intelligence | NVIDIA et.al. | 2512.20856 | null |
| 2025-12-23 | Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning | NVIDIA et.al. | 2512.20848 | null |
| 2025-12-23 | Defending against adversarial attacks using mixture of experts | Mohammad Meymani et.al. | 2512.20821 | null |
| 2025-12-23 | MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts | Alexandros Christoforos et.al. | 2512.20604 | null |
| 2025-12-23 | Branch Learning in MRI: More Data, More Models, More Training | Yuyang Li et.al. | 2512.20330 | null |
| 2025-12-23 | Mixture-of-Experts with Gradient Conflict-Driven Subspace Topology Pruning for Emergent Modularity | Yuxing Gan et.al. | 2512.20291 | null |
| 2025-12-23 | Degradation-Aware Metric Prompting for Hyperspectral Image Restoration | Binfeng Wang et.al. | 2512.20251 | link |
| 2025-12-23 | AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model | Sofian Chaybouti et.al. | 2512.20157 | null |
| 2025-12-23 | Fun-Audio-Chat Technical Report | Qian Chen et.al. | 2512.20156 | null |
| 2025-12-23 | Bring My Cup! Personalizing Vision-Language-Action Models with Visual Attentive Prompting | Sangoh Lee et.al. | 2512.20014 | null |
| 2025-12-23 | Observation and branching fraction measurements of $χ_{cJ}\to p \bar p K^0_S K^0_S$ | BESIII Collaboration et.al. | 2512.19993 | null |
| 2025-12-22 | UCCL-EP: Portable Expert-Parallel Communication | Ziming Mao et.al. | 2512.19849 | null |
| 2025-12-21 | How Many Experts Are Enough? Towards Optimal Semantic Specialization for Mixture-of-Experts | Sumin Park et.al. | 2512.19765 | null |
| 2025-12-22 | Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios | Jiawen Wang et.al. | 2512.19551 | null |
| 2025-12-22 | EGM: Efficiently Learning General Motion Tracking Policy for High Dynamic Humanoid Whole-Body Control | Chao Yang et.al. | 2512.19043 | null |
| 2025-12-21 | Tempo as the Stable Cue: Hierarchical Mixture of Tempo and Beat Experts for Music to 3D Dance Generation | Guangtao Lyu et.al. | 2512.18804 | null |
| 2025-12-21 | Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with Prompts | Linwei Qiu et.al. | 2512.18718 | null |
| 2025-12-21 | Remoe: Towards Efficient and Low-Cost MoE Inference in Serverless Computing | Wentao Liu et.al. | 2512.18674 | null |
| 2025-12-21 | Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach | Zhe Li et.al. | 2512.18597 | null |
| 2025-12-20 | Secret mixtures of experts inside your LLM | Enric Boix-Adsera et.al. | 2512.18452 | link |
| 2025-12-20 | MoE Pathfinder: Trajectory-driven Expert Pruning | Xican Yang et.al. | 2512.18425 | link |
| 2025-12-20 | MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation | Kaixing Yang et.al. | 2512.18181 | null |
| 2025-12-20 | Cross section and parametrization of charmonium decay | Xiao-Hu Mo et.al. | 2512.18154 | null |
| 2025-12-19 | MoE-TransMov: A Transformer-based Model for Next POI Prediction in Familiar & Unfamiliar Movements | Ruichen Tan et.al. | 2512.17985 | null |
| 2025-12-19 | Interpreting the strong clustering of ultra-diffuse galaxies by halo spin bias | Qinglin Ma et.al. | 2512.17742 | null |
| 2025-12-19 | Cross sections measurement of $e^+e^-\to Ξ(1530)^0\barΞ^0 + c.c.$ and search for $ψ(3770)\toΞ(1530)^0\barΞ^0 + c.c.$ | BESIII Collaboration et.al. | 2512.17275 | null |
| 2025-12-19 | Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding | Yuqing Li et.al. | 2512.17220 | null |
| 2025-12-19 | Capturing Arbitrary Waveform without Absorption with Synthesis of Complex Frequencies | Zhaohua Tian et.al. | 2512.17156 | null |
| 2025-12-18 | Bandwidth-Efficient Adaptive Mixture-of-Experts via Low-Rank Compensation | Zhenyu Liu et.al. | 2512.17073 | null |
| 2025-12-18 | Compression is Routing: Reconstruction Error as an Intrinsic Signal for Modular Language Models | Zhongpan Tang et.al. | 2512.16963 | null |
| 2025-12-18 | LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation | Haichao Zhang et.al. | 2512.16891 | null |
| 2025-12-18 | The WINTER Observatory: A One-Degree InGaAs Survey Camera to study the Transient Infrared Sky | Danielle Frostig et.al. | 2512.16753 | null |
| 2025-12-18 | PoseMoE: Mixture-of-Experts Network for Monocular 3D Human Pose Estimation | Mengyuan Liu et.al. | 2512.16494 | null |
| 2025-12-18 | Efficient CPU-GPU Collaborative Inference for MoE-based LLMs on Memory-Limited Systems | En-Ming Huang et.al. | 2512.16473 | null |
| 2025-12-18 | Pretrained Battery Transformer (PBT): A battery life prediction foundation model | Ruifeng Tan et.al. | 2512.16334 | null |
| 2025-12-19 | Sigma-MoE-Tiny Technical Report | Qingguo Hu et.al. | 2512.16248 | null |
| 2025-12-18 | Open Ad-hoc Categorization with Contextualized Feature Learning | Zilin Wang et.al. | 2512.16202 | link |
| 2025-12-18 | INTELLECT-3: Technical Report | Prime Intellect Team et.al. | 2512.16144 | null |
| 2025-12-17 | Wake instability past a sphere settling in a strongly stratified flow | Chang-Fan Mo et.al. | 2512.15626 | null |
| 2025-12-17 | Measurements of the Absolute Branching Fraction of the Semileptonic Decay $\mathbf{Ξ^{-}\rightarrow Λe^- \barν_{e}}$ and the Axial Charge of the $\mathbfΞ^{-}$ | BESIII Collaboration et.al. | 2512.15273 | null |
| 2025-12-19 | VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments | Yuze Wu et.al. | 2512.15258 | null |
| 2025-12-17 | Search for the decays $X(3872)\to K_{S}^{0}K^{\pm}π^{\mp}$ and $K^*(892)\bar{K}$ at BESIII | BESIII Collaboration et.al. | 2512.15091 | null |
| 2025-12-19 | Let the Barbarians In: How AI Can Accelerate Systems Performance Research | Audrey Cheng et.al. | 2512.14806 | null |
| 2025-12-15 | SocialNav-MoE: A Mixture-of-Experts Vision Language Model for Socially Compliant Navigation with Reinforcement Fine-Tuning | Tomohito Kawabata et.al. | 2512.14757 | null |
| 2025-12-16 | Measurements of the branching fractions of $χ_{cJ}\to φφη, φφη^{\prime}$ and $φK^+K^-η$ | BESIII Collaboration et.al. | 2512.14369 | null |
| 2025-12-16 | SketchAssist: A Practical Assistant for Semantic Edits and Precise Local Redrawing | Han Zou et.al. | 2512.14140 | null |
| 2025-12-16 | SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations | Wentao Guo et.al. | 2512.14080 | null |
| 2025-12-16 | Sparsity-Controllable Dynamic Top-p MoE for Large Foundation Model Pre-training | Can Jin et.al. | 2512.13996 | null |
| 2025-12-15 | Connection between galaxy morphology and dark-matter halo structure II: predicting disk structure from dark-matter halo properties | Jinning Liang et.al. | 2512.13822 | null |
| 2025-12-13 | RAST-MoE-RL: A Regime-Aware Spatio-Temporal MoE Framework for Deep Reinforcement Learning in Ride-Hailing | Yuhan Tang et.al. | 2512.13727 | null |
| 2025-12-15 | StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion | Guransh Singh et.al. | 2512.13632 | null |
| 2025-12-16 | Janus: Disaggregating Attention and Experts for Scalable MoE Inference | Zhexiang Zhang et.al. | 2512.13525 | null |
| 2025-12-15 | SIGMA: An AI-Empowered Training Stack on Early-Life Hardware | Lei Qu et.al. | 2512.13488 | null |
| 2025-12-15 | Automated Information Flow Selection for Multi-scenario Multi-task Recommendation | Chaohua Yang et.al. | 2512.13396 | null |
| 2025-12-15 | Sharpen the Spec, Cut the Code: A Case for Generative File System with SYSSPEC | Qingyuan Liu et.al. | 2512.13047 | null |
| 2025-12-15 | Safe Control of Multi-Agent Systems with Minimal Communication | Mo Yang et.al. | 2512.13021 | null |
| 2025-12-15 | SliceMoE: Bit-Sliced Expert Caching under Miss-Rate Constraints for Efficient MoE Inference | Yuseon Choi et.al. | 2512.12990 | null |
| 2025-12-14 | Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution | Boyang Yan et.al. | 2512.12806 | null |
| 2025-12-14 | Bayesian Optimization Parameter Tuning Framework for a Lyapunov Based Path Following Controller | Zhewen Zheng et.al. | 2512.12649 | null |
| 2025-12-13 | Amplitude Analysis and Branching Fraction Measurement of $D^+ \to π^+π^0π^0$ | BESIII Collaboration et.al. | 2512.12397 | null |
| 2025-12-13 | Fine-Grained Zero-Shot Learning with Attribute-Centric Representations | Zhi Chen et.al. | 2512.12219 | null |
| 2025-12-13 | ALERT Open Dataset and Input-Size-Agnostic Vision Transformer for Driver Activity Recognition using IR-UWB | Jeongjun Park et.al. | 2512.12206 | null |
| 2025-12-13 | MixtureKit: A General Framework for Composing, Training, and Visualizing Mixture-of-Experts Models | Ahmad Chamma et.al. | 2512.12121 | null |
| 2025-12-12 | Measurement of the cosmic ray nickel energy spectrum from 10 GeV/n to 2 TeV/n with the DAMPE | F. Alemanno et.al. | 2512.11425 | null |
| 2025-12-11 | Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration | Sicheng Mo et.al. | 2512.10954 | null |
| 2025-12-11 | Unleashing Degradation-Carrying Features in Symmetric U-Net: Simpler and Stronger Baselines for All-in-One Image Restoration | Wenlong Jiao et.al. | 2512.10581 | null |
| 2025-12-11 | Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment | Han Li et.al. | 2512.10450 | null |
| 2025-12-12 | Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge | Junjie Bai et.al. | 2512.10071 | null |
| 2025-12-10 | Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach | Salvador Carrión et.al. | 2512.09910 | null |
| 2025-12-10 | DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation | Zhizhong Wang et.al. | 2512.09814 | null |
| 2025-12-10 | M3Net: A Multi-Metric Mixture of Experts Network Digital Twin with Graph Neural Networks | Blessed Guda et.al. | 2512.09797 | null |
| 2025-12-10 | First measurement of the absolute branching fractions of $Σ^+$ nonleptonic decays and test of the $ΔI = 1/2$ rule | BESIII Collaboration et.al. | 2512.09628 | null |
| 2025-12-10 | FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model | Xiang Chen et.al. | 2512.09282 | null |
| 2025-12-10 | Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens | Yanpeng Yu et.al. | 2512.09277 | null |
| 2025-12-10 | Bug Priority Change Prediction: An Exploratory Study on Apache Software | Guangzong Cai et.al. | 2512.09216 | null |
| 2025-12-09 | Ask, Answer, and Detect: Role-Playing LLMs for Personality Detection with Question-Conditioned Mixture-of-Experts | Yifan Lyu et.al. | 2512.08814 | null |
| 2025-12-09 | What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance | Athena Psalta et.al. | 2512.08697 | null |
| 2025-12-09 | Prismatic World Model: Learning Compositional Dynamics for Planning in Hybrid Systems | Mingwei Li et.al. | 2512.08411 | null |
| 2025-12-09 | FastBEV++: Fast by Algorithm, Deployable by Design | Yuanpeng Chen et.al. | 2512.08237 | null |
| 2025-12-08 | Relational Visual Similarity | Thao Nguyen et.al. | 2512.07833 | null |
| 2025-12-08 | Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE | Anxiang Zeng et.al. | 2512.07710 | null |
| 2025-12-08 | LongCat-Image Technical Report | Meituan LongCat Team et.al. | 2512.07584 | null |
| 2025-12-12 | MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer | Penghui Liu et.al. | 2512.07500 | null |
| 2025-12-08 | Equivariant Diffusion for Crystal Structure Prediction | Peijia Lin et.al. | 2512.07289 | null |
| 2025-12-08 | Measurement of the branching fraction of $η\to μ^+ μ^-$ and search for $η\to e^+ e^-$ | BESIII Collaboration et.al. | 2512.07144 | null |
| 2025-12-09 | TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning | Zebin Xing et.al. | 2512.07135 | null |
| 2025-12-08 | PlantBiMoE: A Bidirectional Foundation Model with SparseMoE for Plant Genomes | Kepeng Lin et.al. | 2512.07113 | null |
| 2025-12-07 | Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding | MinCheol Jeon et.al. | 2512.06929 | null |
| 2025-12-07 | Stable-MoE: Lyapunov-based Token Routing for Distributed Mixture-of-Experts Training over Edge Networks | Long Shi et.al. | 2512.06784 | null |
| 2025-12-07 | Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving | Wei-Bin Kou et.al. | 2512.06664 | null |
| 2025-12-06 | Enhancing Medical Cross-Modal Hashing Retrieval using Dropout-Voting Mixture-of-Experts Fusion | Jaewon Ahn et.al. | 2512.06449 | null |
| 2025-12-04 | The SAM2-to-SAM3 Gap in the Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation | Ranjan Sapkota et.al. | 2512.06032 | null |
| 2025-12-05 | HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies | Zhiying Du et.al. | 2512.05693 | link |
| 2025-12-05 | ProPhy: Progressive Physical Alignment for Dynamic World Simulation | Zijun Wang et.al. | 2512.05564 | null |
| 2025-12-04 | Evidence for the semileptonic decays $Λ_c^{+} \to Σ^{\pm} π^{\mp} e^+ ν_e$ | BESIII Collaboration et.al. | 2512.05178 | null |
| 2025-12-09 | EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture | Xin He et.al. | 2512.04810 | null |
| 2025-12-04 | Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild | Yigui Feng et.al. | 2512.04728 | null |
| 2025-12-04 | Study of the reaction $Ξ^{0}n\rightarrowΛΛX$ using $Ξ^{0}$-nucleus scattering | BESIII Collaboration et.al. | 2512.04701 | null |
| 2025-12-04 | Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space | Joey Hong et.al. | 2512.04601 | null |
| 2025-12-04 | The Binary Fraction of Stars in the Dwarf Galaxy Ursa Minor via Dark Energy Spectroscopic Instrument | Tian Qiu et.al. | 2512.04477 | null |
| 2025-12-04 | Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems | Zehao Fan et.al. | 2512.04476 | null |
| 2025-12-03 | Small Models Achieve Large Language Model Performance: Evaluating Reasoning-Enabled AI for Secure Child Welfare Research | Zia Qi et.al. | 2512.04261 | null |
| 2025-12-03 | Decoding Large Language Diffusion Models with Foreseeing Movement | Yichuan Mo et.al. | 2512.04135 | null |
| 2025-12-03 | Stable Signer: Hierarchical Sign Language Generative Model | Sen Fang et.al. | 2512.04048 | null |
| 2025-12-03 | OD-MoE: On-Demand Expert Loading for Cacheless Edge-Distributed MoE Inference | Liujianfu Wang et.al. | 2512.03927 | null |
| 2025-12-04 | A Theoretical Framework for Auxiliary-Loss-Free Load Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models | X. Y. Han et.al. | 2512.03915 | null |
| 2025-12-03 | Parsimonious Clustering of Covariance Matrices | Yixi Xu et.al. | 2512.03912 | null |
| 2025-12-03 | Measurement of the hyperon weak radiative decay $Ξ^0\toγΣ^0$ at BESIII | BESIII Collaboration et.al. | 2512.03877 | null |
| 2025-12-03 | Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation | Subin Kim et.al. | 2512.03534 | null |
| 2025-12-03 | CellScout: Visual Analytics for Mining Biomarkers in Cell State Discovery | Rui Sheng et.al. | 2512.03485 | null |
| 2025-12-03 | Unconventional Magneto-Optical Effects in Altermagnets | Yongpan Li et.al. | 2512.03435 | null |
| 2025-12-03 | SSLfmm: An R Package for Semi-Supervised Learning with a Mixed-Missingness Mechanism in Finite Mixture Models | Geoffrey J. McLachlan et.al. | 2512.03322 | null |
| 2025-12-02 | Intrinsic Second-Order Topological Superconductors with Tunable Majorana Zero Modes | Xiao-Jiao Wang et.al. | 2512.02775 | null |
| 2025-12-02 | Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction | Xiang Yuan et.al. | 2512.02584 | link |
| 2025-12-02 | SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of Experts | Jiaqi Liu et.al. | 2512.02517 | link |
| 2025-12-02 | A Fully First-Order Layer for Differentiable Optimization | Zihao Zhao et.al. | 2512.02494 | null |
| 2025-12-02 | Quasi-steady electron-excitonic complexes coupling in a two-dimensional semiconductor | Shangkun Mo et.al. | 2512.02490 | null |
| 2025-12-02 | Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention | Wenyi Xiong et.al. | 2512.02368 | null |
| 2025-12-02 | Understanding and Harnessing Sparsity in Unified Multimodal Models | Shwai He et.al. | 2512.02351 | link |
| 2025-12-02 | OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning | Boyu Zhu et.al. | 2512.02306 | null |
| 2025-12-01 | Towards Unified Video Quality Assessment | Chen Feng et.al. | 2512.02224 | null |
| 2025-12-01 | ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation | Chenyang Gu et.al. | 2512.02013 | null |
| 2025-12-01 | Multimodal Mixture-of-Experts for ISAC in Low-Altitude Wireless Networks | Kai Zhang et.al. | 2512.01750 | null |
| 2025-12-01 | GRASP: Guided Residual Adapters with Sample-wise Partitioning | Felix Nützel et.al. | 2512.01675 | null |
| 2025-12-01 | Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery | Zhicheng Zhao et.al. | 2512.01665 | null |
| 2025-12-01 | Cuffless Blood Pressure Estimation from Six Wearable Sensor Modalities in Multi-Motion-State Scenarios | Yiqiao Chen et.al. | 2512.01653 | null |
| 2025-12-01 | Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track | Mo Chen et.al. | 2512.01608 | null |
| 2025-12-01 | Personalized optimization of pediatric HD-tDCS for dose consistency and target engagement | Zeming Liu et.al. | 2512.01406 | null |
| 2025-12-02 | Stabilizing Reinforcement Learning with LLMs: Formulation and Practices | Chujie Zheng et.al. | 2512.01374 | null |
| 2025-12-01 | TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking | Hanzhi Guo et.al. | 2512.01329 | null |
| 2025-12-01 | Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe | Yahui Liu et.al. | 2512.01252 | null |
| 2025-11-30 | Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios | Jianxiang Zang et.al. | 2512.00920 | null |
| 2025-11-30 | Elastic Mixture of Rank-Wise Experts for Knowledge Reuse in Federated Fine-Tuning | Yebo Wu et.al. | 2512.00902 | null |
| 2025-11-30 | Upcycled and Merged MoE Reward Model for Mitigating Reward Hacking | Lingling Fu et.al. | 2512.00724 | null |
| 2025-11-29 | GCMCG: A Clustering-Aware Graph Attention and Expert Fusion Network for Multi-Paradigm, Multi-task, and Cross-Subject EEG Decoding | Yiqiao Chen et.al. | 2512.00574 | null |
| 2025-11-28 | Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model | Junshu Tang et.al. | 2511.23429 | null |
| 2025-11-28 | LFM2 Technical Report | Alexander Amini et.al. | 2511.23404 | null |
| 2025-11-28 | Chart2Code-MoLA: Efficient Multi-Modal Code Generation via Adaptive Expert Routing | Yifei Wang et.al. | 2511.23321 | null |
| 2025-11-28 | Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models | Xiang Hu et.al. | 2511.23319 | null |
| 2025-11-28 | Multi-Modal Scene Graph with Kolmogorov-Arnold Experts for Audio-Visual Question Answering | Zijian Fu et.al. | 2511.23304 | null |
| 2025-11-28 | Experts are all you need: A Composable Framework for Large Language Model Inference | Shrihari Sridharan et.al. | 2511.22955 | null |
| 2025-11-28 | EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model | Yuhao Xu et.al. | 2511.22935 | null |
| 2025-11-27 | Architecture Decoupling Is Not All You Need For Unified Multimodal Model | Dian Zheng et.al. | 2511.22663 | null |
| 2025-11-27 | OmniInfer: System-Wide Acceleration Techniques for Optimizing LLM Serving Throughput and Latency | Jun Wang et.al. | 2511.22481 | null |
| 2025-11-27 | Foundation Model for Intelligent Wireless Communications | Boxun Liu et.al. | 2511.22222 | null |
| 2025-11-27 | MoE3D: Mixture of Experts meets Multi-Modal 3D Understanding | Yu Li et.al. | 2511.22103 | null |
| 2025-11-27 | Convergence Dynamics of Over-Parameterized Score Matching for a Single Gaussian | Yiran Zhang et.al. | 2511.22069 | null |
| 2025-11-26 | Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models | Naifu Zhang et.al. | 2511.21663 | null |
| 2025-11-26 | Continual Error Correction on Low-Resource Devices | Kirill Paramonov et.al. | 2511.21652 | null |
| 2025-11-27 | Qwen3-VL Technical Report | Shuai Bai et.al. | 2511.21631 | null |
| 2025-11-26 | Enhanced Landmark Detection Model in Pelvic Fluoroscopy using 2D/3D Registration Loss | Chou Mo et.al. | 2511.21575 | null |
| 2025-11-26 | Scaling limits of critical FK-decorated random planar maps with $q=4$ | William Da Silva et.al. | 2511.21480 | null |
| 2025-11-26 | Study of the reactions $\bar{n} p \to 2π^{+}π^{-}$, $2π^{+}π^{-}π^{0}$, and $2π^{+}π^{-}2π^{0}$ using $J/ψ\to p π^{-}\bar{n}$ | BESIII Collaboration et.al. | 2511.21462 | null |
| 2025-11-26 | MemFine: Memory-Aware Fine-Grained Scheduling for MoE Training | Lu Zhao et.al. | 2511.21431 | null |
| 2025-11-26 | Do Reasoning Vision-Language Models Inversely Scale in Test-Time Compute? A Distractor-centric Empirical Analysis | Jiyun Bae et.al. | 2511.21397 | null |
| 2025-11-26 | Conditional Generative Modeling of Stochastic LTI Systems: A Behavioral Approach | Jiayun Li et.al. | 2511.21219 | null |
| 2025-11-26 | MLPMoE: Zero-Shot Architectural Metamorphosis of Dense LLM MLPs into Static Mixture-of-Experts | Ivan Novikov et.al. | 2511.21089 | null |
| 2025-11-25 | HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation | Xiang Wang et.al. | 2511.20520 | null |
| 2025-11-25 | Soft Adaptive Policy Optimization | Chang Gao et.al. | 2511.20347 | null |
| 2025-11-25 | ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories | Hai Ling et.al. | 2511.20169 | null |
| 2025-11-25 | Adaptive Knowledge Transfer for Cross-Disciplinary Cold-Start Knowledge Tracing | Yulong Deng et.al. | 2511.20009 | null |
| 2025-11-25 | SONIC: Spectral Optimization of Noise for Inpainting with Consistency | Seungyeon Baek et.al. | 2511.19985 | null |
| 2025-11-25 | Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models | Wentao Hu et.al. | 2511.19822 | null |
| 2025-11-22 | Exploiting the Experts: Unauthorized Compression in MoE-LLMs | Pinaki Prasad Guha Neogi et.al. | 2511.19480 | null |
| 2025-11-22 | Tracking and Segmenting Anything in Any Modality | Tianlu Zhang et.al. | 2511.19475 | null |
| 2025-11-24 | Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling | Long Tang et.al. | 2511.19024 | null |
| 2025-11-24 | OrdMoE: Preference Alignment via Hierarchical Expert Group Ranking in Multimodal Mixture-of-Experts LLMs | Yuting Gao et.al. | 2511.19023 | null |
| 2025-11-24 | Dynamic Mixture of Experts Against Severe Distribution Shifts | Donghu Kim et.al. | 2511.18987 | null |
| 2025-11-23 | HiFi-MambaV2: Hierarchical Shared-Routed MoE for High-Fidelity MRI Reconstruction | Pengcheng Fang et.al. | 2511.18534 | null |
| 2025-11-23 | AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixture of Expert | Yuting Gao et.al. | 2511.18314 | null |
| 2025-11-22 | PromptMoE: Generalizable Zero-Shot Anomaly Detection via Visually-Guided Prompt Mixtures | Yuheng Shao et.al. | 2511.18116 | null |
| 2025-11-22 | CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking | Hao Li et.al. | 2511.17967 | null |
| 2025-11-22 | Measuring the Impact of Lexical Training Data Coverage on Hallucination Detection in Large Language Models | Shuo Zhang et.al. | 2511.17946 | null |
| 2025-11-22 | FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning | Guoyang Xia et.al. | 2511.17885 | null |
| 2025-11-22 | Equivalence of Context and Parameter Updates in Modern Transformer Blocks | Adrian Goldwaser et.al. | 2511.17864 | null |
| 2025-11-21 | Unified Class and Domain Incremental Learning with Mixture of Experts for Indoor Localization | Akhil Singampalli et.al. | 2511.17829 | null |
| 2025-11-21 | Boosting Brain-inspired Path Integration Efficiency via Learning-based Replication of Continuous Attractor Neurodynamics | Zhangyu Ge et.al. | 2511.17687 | null |
| 2025-11-21 | Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required? | Sukwon Yun et.al. | 2511.17400 | null |
| 2025-11-21 | MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment | Huangbiao Xu et.al. | 2511.17397 | link |
| 2025-11-21 | Training Foundation Models on a Full-Stack AMD Platform: Compute, Networking, and System Design | Quentin Anthony et.al. | 2511.17127 | null |
| 2025-11-21 | Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters | Zhan Su et.al. | 2511.17044 | null |
| 2025-11-21 | VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions | Qianyi Shao et.al. | 2511.16998 | null |
| 2025-11-21 | RadioKMoE: Knowledge-Guided Radiomap Estimation with Kolmogorov-Arnold Networks and Mixture-of-Experts | Fupei Guo et.al. | 2511.16986 | null |
| 2025-11-21 | MicroMoE: Fine-Grained Load Balancing for Mixture-of-Experts with Token Scheduling | Chenqi Zhao et.al. | 2511.16947 | null |
| 2025-11-20 | Search for the charmonium weak decay $J/ψ\to\bar{D}^0\bar{K}^{*0}+{\rm c.c.}$ | BESIII Collaboration et.al. | 2511.16083 | null |
| 2025-11-20 | Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution | Xiao He et.al. | 2511.16024 | null |
| 2025-11-19 | AquaSentinel: Next-Generation AI System Integrating Sensor Networks for Urban Underground Water Pipeline Anomaly Detection via Collaborative MoE-LLM Agent Architecture | Qiming Guo et.al. | 2511.15870 | null |
| 2025-11-19 | MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping | Yushi Huang et.al. | 2511.15690 | null |
| 2025-11-19 | Search for the lepton number violating process $Ξ^- \rightarrow Σ^+ e^- e^- +c.c.$ | BESIII Collaboration et.al. | 2511.15394 | null |
| 2025-11-19 | VIRAL: Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation | Tairan He et.al. | 2511.15200 | null |
| 2025-11-19 | GPU-Initiated Networking for NCCL | Khaled Hamidouche et.al. | 2511.15076 | null |
| 2025-11-19 | WiCo-PG: Wireless Channel Foundation Model for Pathloss Map Generation via Synesthesia of Machines | Mingran Sun et.al. | 2511.15030 | null |
| 2025-11-19 | WiCo-MG: Wireless Channel Foundation Model for Multipath Generation via Synesthesia of Machines | Zengrui Han et.al. | 2511.15026 | null |
| 2025-11-19 | Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference | Kexin Chu et.al. | 2511.15015 | null |
| 2025-11-18 | HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation | Lai Wei et.al. | 2511.14756 | null |
| 2025-11-18 | Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching | Jintao Zhang et.al. | 2511.14488 | null |
| 2025-11-18 | MoE-SpeQ: Speculative Quantized Decoding with Proactive Expert Prefetching and Offloading for Mixture-of-Experts | Wenfeng Wang et.al. | 2511.14102 | null |
| 2025-11-18 | FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration | Jingren Liu et.al. | 2511.14099 | null |
| 2025-11-18 | SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts | Fan Zhang et.al. | 2511.14093 | null |
| 2025-11-17 | MoMoE: A Mixture of Expert Agent Model for Financial Sentiment Analysis | Peng Shu et.al. | 2511.13983 | null |
| 2025-11-17 | InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE | Lipeng Wang et.al. | 2511.13488 | null |
| 2025-11-18 | YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection | Ori Meiraz et.al. | 2511.13344 | null |
| 2025-11-17 | Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification | Rifen Lin et.al. | 2511.13150 | null |
| 2025-11-17 | Self-Adaptive Graph Mixture of Models | Mohit Meena et.al. | 2511.13062 | link |
| 2025-11-17 | Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based Recommendation | Yu Hou et.al. | 2511.12922 | null |
| 2025-11-17 | Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from Drawings | Zihao Lin et.al. | 2511.12880 | null |
| 2025-11-16 | Connectivity-Guided Sparsification of 2-FWL GNNs: Preserving Full Expressivity with Improved Efficiency | Rongqin Chen et.al. | 2511.12838 | null |
| 2025-11-16 | Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data | Yunxin Li et.al. | 2511.12609 | null |
| 2025-11-16 | SEMC: Structure-Enhanced Mixture-of-Experts Contrastive Learning for Ultrasound Standard Plane Recognition | Qing Cai et.al. | 2511.12559 | null |
| 2025-11-16 | MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven Semantics | Jing Li et.al. | 2511.12525 | link |
| 2025-11-16 | MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding | Zhanheng Nie et.al. | 2511.12449 | null |
| 2025-11-16 | Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection | Xi Xiao et.al. | 2511.12410 | link |
| 2025-11-15 | SAC-MoE: Reinforcement Learning with Mixture-of-Experts for Control of Hybrid Dynamical Systems with Uncertainty | Leroy D’Souza et.al. | 2511.12361 | null |
| 2025-11-15 | AMR-MoEGA: Antimicrobial Resistance Prediction using Mixture of Experts and Genetic Algorithms | Anshul Bagaria et.al. | 2511.12223 | null |
| 2025-11-15 | ViTE: Virtual Graph Trajectory Expert Router for Pedestrian Trajectory Prediction | Ruochen Li et.al. | 2511.12214 | null |
| 2025-11-14 | FarSkip-Collective: Unhobbling Blocking Communication in Mixture of Experts Models | Yonatan Dukler et.al. | 2511.11505 | null |
| 2025-11-14 | Rethinking Efficient Mixture-of-Experts for Remote Sensing Modality-Missing Classification | Qinghao Gao et.al. | 2511.11460 | null |
| 2025-11-14 | SPOT: Single-Shot Positioning via Trainable Near-Field Rainbow Beamforming | Yeyue Cai et.al. | 2511.11391 | null |
| 2025-11-14 | Parameter-Efficient MoE LoRA for Few-Shot Multi-Style Editing | Cong Cao et.al. | 2511.11236 | null |
| 2025-11-14 | DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding | Mingwei Xing et.al. | 2511.11232 | null |
| 2025-11-14 | ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization | Anzhe Cheng et.al. | 2511.10971 | null |
| 2025-11-14 | Go-UT-Bench: A Fine-Tuning Dataset for LLM-Based Unit Test Generation in Go | Yashshi Pipalani et.al. | 2511.10868 | null |
| 2025-11-13 | Generalizable Slum Detection from Satellite Imagery with Mixture-of-Experts | Sumin Lee et.al. | 2511.10300 | null |
| 2025-11-13 | RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo | Jueun Ko et.al. | 2511.10107 | null |
| 2025-11-13 | BuddyMoE: Exploiting Expert Redundancy to Accelerate Memory-Constrained Mixture-of-Experts Inference | Yun Wang et.al. | 2511.10054 | null |
| 2025-11-14 | HI-TransPA: Hearing Impairments Translation Personal Assistant | Zhiming Ma et.al. | 2511.09915 | link |
| 2025-11-13 | ConSurv: Multimodal Continual Learning for Survival Analysis | Dianzhi Yu et.al. | 2511.09853 | null |
| 2025-11-11 | Let the Experts Speak: Improving Survival Prediction & Calibration via Mixture-of-Experts Heads | Todd Morrill et.al. | 2511.09567 | null |
| 2025-11-12 | SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields | Sangheon Yang et.al. | 2511.09072 | null |
| 2025-11-12 | UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving | Ziyi Song et.al. | 2511.09013 | null |
| 2025-11-12 | Selective Sinkhorn Routing for Improved Sparse Mixture of Experts | Duc Anh Nguyen et.al. | 2511.08972 | null |
| 2025-11-12 | Bayesian Mixture of Experts For Large Language Models | Maryam Dialameh et.al. | 2511.08968 | null |
| 2025-11-12 | An Improved Dual-Attention Transformer-LSTM for Small-Sample Prediction of Modal Frequency and Actual Anchor Radius in Micro Hemispherical Resonator Design | Yuyi Yao et.al. | 2511.08900 | null |
| 2025-11-11 | OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild | Yuncheng Guo et.al. | 2511.08423 | null |
| 2025-11-11 | Text-based Aerial-Ground Person Retrieval | Xinyu Zhou et.al. | 2511.08369 | null |
| 2025-11-14 | Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency Differencing | Junkai Lu et.al. | 2511.08229 | null |
| 2025-11-13 | National Institute on Aging PREPARE Challenge: Early Detection of Cognitive Impairment Using Speech – The SpeechCARE Solution | Maryam Zolnoori et.al. | 2511.08132 | null |
| 2025-11-13 | Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression | Cheng Yuan et.al. | 2511.08066 | null |
| 2025-11-11 | TouchWalker: Real-Time Avatar Locomotion from Touchscreen Finger Walking | Geuntae Park et.al. | 2511.07860 | null |
| 2025-11-10 | One Router to Route Them All: Homogeneous Expert Routing for Heterogeneous Graph Transformers | Georgiy Shakirov et.al. | 2511.07603 | null |
| 2025-11-12 | Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs | Zhongyang Li et.al. | 2511.07419 | null |
| 2025-11-11 | Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction | Hyeryun Park et.al. | 2511.07392 | null |
| 2025-11-10 | AgenticSciML: Collaborative Multi-Agent Systems for Emergent Discovery in Scientific Machine Learning | Qile Jiang et.al. | 2511.07262 | null |
| 2025-11-10 | Two Heads are Better than One: Distilling Large Language Model Features Into Small Models with Feature Decomposition and Mixture | Tianhao Fu et.al. | 2511.07110 | null |
| 2025-11-10 | CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition | Hung-Yang Sung et.al. | 2511.06860 | null |
| 2025-11-10 | S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous Reasoning | Jiangwen Dong et.al. | 2511.06727 | null |
| 2025-11-10 | Multi-Modal Continual Learning via Cross-Modality Adapters and Representation Alignment with Knowledge Preservation | Evelyn Chee et.al. | 2511.06723 | null |
| 2025-11-09 | Route Experts by Sequence, not by Token | Tiansheng Wen et.al. | 2511.06494 | null |
| 2025-11-09 | HyMoERec: Hybrid Mixture-of-Experts for Sequential Recommendation | Kunrong Li et.al. | 2511.06388 | null |
| 2025-11-09 | DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation | Speed Zhu et.al. | 2511.06307 | null |
| 2025-11-09 | A Mixture-of-Experts Framework with Log-Logistic Components for Survival Analysis on Histopathology Images | Ardhendu Sekhar et.al. | 2511.06266 | null |
| 2025-11-08 | MoSKA: Mixture of Shared KV Attention for Efficient Long-Sequence LLM Inference | Myunghyun Rhee et.al. | 2511.06010 | null |
| 2025-11-08 | DiA-gnostic VLVAE: Disentangled Alignment-Constrained Vision Language Variational AutoEncoder for Robust Radiology Reporting with Missing Modalities | Nagur Shareef Shaik et.al. | 2511.05968 | null |
| 2025-11-08 | MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering | Jian Zhu et.al. | 2511.05876 | null |
| 2025-11-08 | In-depth Analysis on Caching and Pre-fetching in Mixture of Experts Offloading | Shuning Lin et.al. | 2511.05814 | null |
| 2025-11-07 | Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder | Zhen Xu et.al. | 2511.05745 | null |
| 2025-11-07 | BrainCSD: A Hierarchical Consistency-Driven MoE Foundation Model for Unified Connectome Synthesis and Multitask Brain Trait Prediction | Xiongri Shen et.al. | 2511.05630 | null |
| 2025-11-07 | Quantum-Uncertainty-Governed Spin Dynamics in s-d Coupled Systems | Jie Zheng et.al. | 2511.05388 | null |
| 2025-11-07 | OvA-LP: A Simple and Efficient Framework for Federated Learning on Non-IID Data | Dongjin Park et.al. | 2511.05028 | null |
| 2025-11-07 | MoE-DP: An MoE-Enhanced Diffusion Policy for Robust Long-Horizon Robotic Manipulation with Skill Decomposition and Failure Recovery | Baiye Cheng et.al. | 2511.05007 | null |
| 2025-11-06 | PuzzleMoE: Efficient Compression of Large Mixture-of-Experts Models via Sparse Expert Merging and Bit-packed inference | Yushu Zhao et.al. | 2511.04805 | null |
| 2025-11-06 | GNN-MoE: Context-Aware Patch Routing using GNNs for Parameter-Efficient Domain Generalization | Mahmoud Soliman et.al. | 2511.04008 | null |
| 2025-11-05 | GMoPE: A Prompt-Expert Mixture Framework for Graph Foundation Models | Zhibin Wang et.al. | 2511.03251 | null |
| 2025-11-04 | From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos | Xun Wang et.al. | 2511.02762 | null |
| 2025-11-04 | Verifying LLM Inference to Prevent Model Weight Exfiltration | Roy Rinberg et.al. | 2511.02620 | null |
| 2025-11-04 | RoME: Domain-Robust Mixture-of-Experts for MILP Solution Prediction across Domains | Tianle Pu et.al. | 2511.02331 | null |
| 2025-11-04 | FP8-Flow-MoE: A Casting-Free FP8 Recipe without Double Quantization Error | Fengjuan Wang et.al. | 2511.02302 | null |
| 2025-11-04 | Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining | Costin-Andrei Oncescu et.al. | 2511.02237 | null |
| 2025-11-03 | Towards Efficient Federated Learning of Networked Mixture-of-Experts for Mobile Edge Computing | Song Gao et.al. | 2511.01743 | null |
| 2025-11-03 | HMVLM: Human Motion-Vision-Language Model via MoE LoRA | Lei Hu et.al. | 2511.01463 | null |
| 2025-11-04 | CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing | Yifan Zhou et.al. | 2511.01197 | null |
| 2025-11-03 | DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection | Guoxin Ma et.al. | 2511.01192 | null |
| 2025-11-01 | OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback | Kai Luo et.al. | 2511.00510 | null |
| 2025-10-31 | LongCat-Flash-Omni Technical Report | Meituan LongCat Team et.al. | 2511.00279 | link |
| 2025-10-31 | Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals | Xiangyu Fan et.al. | 2510.27684 | null |
| 2025-10-31 | RDMA Point-to-Point Communication for LLM Systems | Nandor Licker et.al. | 2510.27656 | null |
| 2025-10-31 | MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts | Jingnan Gao et.al. | 2510.27234 | null |
| 2025-10-31 | AFM-Net: Advanced Fusing Hierarchical CNN Visual Priors with Global Sequence Modeling for Remote Sensing Image Scene Classification | Yuanhao Tang et.al. | 2510.27155 | null |
| 2025-10-30 | Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement | Aaditya Shukla et.al. | 2510.27051 | null |
| 2025-10-30 | Mixture-of-Transformers Learn Faster: A Theoretical Study on Classification Problems | Hongbo Li et.al. | 2510.27004 | null |
| 2025-10-30 | MoME: Mixture of Visual Language Medical Experts for Medical Imaging Segmentation | Arghavan Rezvani et.al. | 2510.26996 | null |
| 2025-10-30 | ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference | Zixu Shen et.al. | 2510.26730 | null |
| 2025-10-30 | Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications | Chuang Zhang et.al. | 2510.26628 | null |
| 2025-10-30 | Asymptotic meshes from $r$-variational adaptation methods for static problems in one dimension | Darith Hun et.al. | 2510.26375 | null |
| 2025-10-30 | MossNet: Mixture of State-Space Experts is a Multi-Head Attention | Shikhar Tuli et.al. | 2510.26182 | null |
| 2025-10-29 | Dual Mixture-of-Experts Framework for Discrete-Time Survival Analysis | Hyeonjun Lee et.al. | 2510.26014 | null |
| 2025-10-31 | Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training | Hong Wang et.al. | 2510.25803 | null |
| 2025-10-29 | Revisiting scalable sequential recommendation with Multi-Embedding Approach and Mixture-of-Experts | Qiushi Pan et.al. | 2510.25285 | null |
| 2025-10-29 | MoEntwine: Unleashing the Potential of Wafer-scale Chips for Large-scale Expert Parallel Inference | Xinru Tang et.al. | 2510.25258 | null |
| 2025-10-29 | H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts | Peilin Tan et.al. | 2510.25091 | null |
| 2025-10-28 | Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation | Inclusion AI et.al. | 2510.24821 | null |
| 2025-10-28 | Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance | Yujie Wei et.al. | 2510.24711 | null |
| 2025-10-28 | Language-Conditioned Representations and Mixture-of-Experts Policy for Robust Multi-Task Robotic Manipulation | Xiucheng Zhang et.al. | 2510.24055 | null |
| 2025-10-26 | Sparsity and Superposition in Mixture of Experts | Marmik Chaudhari et.al. | 2510.23671 | null |
| 2025-10-27 | EMTSF: Extraordinary Mixture of SOTA Models for Time Series Forecasting | Musleh Alharthi et.al. | 2510.23396 | null |
| 2025-10-27 | Rethinking GSPO: The Perplexity-Entropy Equivalence | Chi Liu et.al. | 2510.23142 | null |
| 2025-10-27 | Knocking-Heads Attention | Zhanchao Zhou et.al. | 2510.23052 | null |
| 2025-10-27 | Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts | Di Zhang et.al. | 2510.23027 | null |
| 2025-10-27 | MoEMeta: Mixture-of-Experts Meta Learning for Few-Shot Relational Learning | Han Wu et.al. | 2510.23013 | null |
| 2025-10-25 | Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation | Ling-Team et.al. | 2510.22115 | null |
| 2025-10-23 | Addressing Corner Cases in Autonomous Driving: A World Model-based Approach with Mixture of Experts and LLMs | Haicheng Liao et.al. | 2510.21867 | null |
| 2025-10-24 | PINN Balls: Scaling Second-Order Methods for PINNs with Domain Decomposition and Adaptive Sampling | Andrea Bonfanti et.al. | 2510.21262 | null |
| 2025-10-24 | Adaptive Graph Mixture of Residual Experts: Unsupervised Learning on Diverse Graphs with Heterogeneous Specialization | Yunlong Chu et.al. | 2510.21207 | null |
| 2025-10-24 | Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts | Yanguang Sun et.al. | 2510.21114 | null |
| 2025-10-24 | MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning | Siyong Chen et.al. | 2510.21093 | null |
| 2025-10-23 | Bayesian Jammer Localization with a Hybrid CNN and Path-Loss Mixture of Experts | Mariona Jaramillo-Civill et.al. | 2510.20666 | null |
| 2025-10-23 | xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion | Quan Li et.al. | 2510.20651 | null |
| 2025-10-23 | Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning | Xiaohan Lan et.al. | 2510.20519 | null |
| 2025-10-23 | A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization | LinFeng Li et.al. | 2510.20291 | null |
| 2025-10-23 | AsyncHZP: Hierarchical ZeRO Parallelism with Asynchronous Scheduling for Scalable LLM Training | Huawei Bai et.al. | 2510.20111 | null |
| 2025-10-22 | HybridEP: Scaling Expert Parallelism to Cross-Datacenter Scenario via Hybrid Expert/Data Transmission | Weihao Yang et.al. | 2510.19470 | null |
| 2025-10-22 | MoE-Prism: Disentangling Monolithic Experts for Elastic MoE Services via Model-System Co-Designs | Xinfeng Xia et.al. | 2510.19366 | null |
| 2025-10-22 | Modeling Turn-Taking with Semantically Informed Gestures | Varsha Suresh et.al. | 2510.19350 | null |
| 2025-10-23 | RailS: Load Balancing for All-to-All Communication in Distributed Mixture-of-Experts Training | Heng Xu et.al. | 2510.19262 | null |
| 2025-10-22 | A Design Science Blueprint for an Orchestrated AI Assistant in Doctoral Supervision | Teo Susnjak et.al. | 2510.19227 | null |
| 2025-10-23 | MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting | In-Hwan Jin et.al. | 2510.19210 | null |
| 2025-10-25 | Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model | Ling Team et.al. | 2510.18855 | null |
| 2025-10-21 | Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework | Yujie Xing et.al. | 2510.18825 | null |
| 2025-10-21 | Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification | Bin Gu et.al. | 2510.18533 | null |
| 2025-10-21 | Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study | Gangda Deng et.al. | 2510.18370 | null |
| 2025-10-21 | DeepSeek-OCR: Contexts Optical Compression | Haoran Wei et.al. | 2510.18234 | link |
| 2025-10-22 | L-MoE: End-to-End Training of a Lightweight Mixture of Low-Rank Adaptation Experts | Shihao Ji et.al. | 2510.17898 | null |
| 2025-10-20 | Towards 3D Objectness Learning in an Open World | Taichi Liu et.al. | 2510.17686 | null |
| 2025-10-20 | Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model | Xinwei Zhang et.al. | 2510.17684 | null |
| 2025-10-20 | Learned Inertial Odometry for Cycling Based on Mixture of Experts Algorithm | Hao Qiao et.al. | 2510.17604 | null |
| 2025-10-23 | Photon radiation induced by rescattering in strong-interacting medium with a magnetic field | Yue Zhang et.al. | 2510.17597 | null |
| 2025-10-20 | ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts | Zheyue Tan et.al. | 2510.17483 | null |
| 2025-10-19 | Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures | Pingzhi Li et.al. | 2510.16968 | null |
| 2025-10-19 | End-to-end Listen, Look, Speak and Act | Siyin Wang et.al. | 2510.16756 | null |
| 2025-10-18 | NeurIPT: Foundation Model for Neural Interfaces | Zitao Fang et.al. | 2510.16548 | link |
| 2025-10-18 | Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts | Yongxiang Hua et.al. | 2510.16448 | null |
| 2025-10-18 | Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures | Minh-Khoi Nguyen-Nhat et.al. | 2510.16411 | null |
| 2025-10-17 | Expert Merging in Sparse Mixture of Experts with Nash Bargaining | Dung V. Nguyen et.al. | 2510.16138 | null |
| 2025-10-17 | Human or AI? Comparing Design Thinking Assessments by Teaching Assistants and Bots | Sumbul Khan et.al. | 2510.16069 | null |
| 2025-10-17 | Mixture of Experts Approaches in Dense Retrieval Tasks | Effrosyni Sokli et.al. | 2510.15683 | null |
| 2025-10-17 | FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification | Zhen Sun et.al. | 2510.15595 | null |
| 2025-10-17 | Backdoor or Manipulation? Graph Mixture of Experts Can Defend Against Various Graph Adversarial Attacks | Yuyuan Feng et.al. | 2510.15333 | null |
| 2025-10-17 | MTmixAtt: Integrating Mixture-of-Experts with Multi-Mix Attention for Large-Scale Recommendation | Xianyang Qi et.al. | 2510.15286 | null |
| 2025-10-17 | Adaptive Individual Uncertainty under Out-Of-Distribution Shift with Expert-Routed Conformal Prediction | Amitesh Badkul et.al. | 2510.15233 | null |
| 2025-10-16 | Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models | Guinan Su et.al. | 2510.14853 | null |
| 2025-10-16 | MergeMoE: Efficient Compression of MoE Models via Expert Output Merging | Ruijie Miao et.al. | 2510.14436 | null |
| 2025-10-16 | Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning | Weijie Shen et.al. | 2510.14300 | null |
| 2025-10-16 | MACE: Mixture-of-Experts Accelerated Coordinate Encoding for Large-Scale Scene Localization and Rendering | Mingkai Liu et.al. | 2510.14251 | null |
| 2025-10-16 | Demonstrating Exoplanet Transit Photometry from Space with a 15-mm Aperture Optical Navigation Camera on Hayabusa2 | Koki Yumoto et.al. | 2510.14229 | null |
| 2025-10-15 | REAP the Experts: Why Pruning Prevails for One-Shot MoE compression | Mike Lasby et.al. | 2510.13999 | null |
| 2025-10-15 | Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module | Ruitao Feng et.al. | 2510.13558 | null |
| 2025-10-15 | ExpressNet-MoE: A Hybrid Deep Neural Network for Emotion Recognition | Deeptimaan Banerjee et.al. | 2510.13493 | null |
| 2025-10-15 | Who Speaks for the Trigger? Dynamic Expert Routing in Backdoored Mixture-of-Experts Transformers | Xin Zhao et.al. | 2510.13462 | null |
| 2025-10-15 | Toward Efficient Inference Attacks: Shadow Model Sharing via Mixture-of-Experts | Li Bai et.al. | 2510.13451 | null |
| 2025-10-15 | UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE | Zhenyu Liu et.al. | 2510.13344 | null |
| 2025-10-15 | GatePro: Parameter-Free Expert Selection Optimization for Mixture-of-Experts Models | Chen Zheng et.al. | 2510.13079 | null |
| 2025-10-17 | Scope: Selective Cross-modal Orchestration of Visual Perception Experts | Tianyu Zhang et.al. | 2510.12974 | null |
| 2025-10-14 | Dendrograms of Mixing Measures for Softmax-Gated Gaussian Mixture of Experts: Consistency without Model Sweeps | Do Tien Hai et.al. | 2510.12744 | null |
| 2025-10-14 | Proof of Cloud: Data Center Execution Assurance for Confidential VMs | Filip Rezabek et.al. | 2510.12469 | null |
| 2025-10-14 | MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts | Yushu Zhao et.al. | 2510.12357 | null |
| 2025-10-14 | DE3S: Dual-Enhanced Soft-Sparse-Shape Learning for Medical Early Time-Series Classification | Tao Xie et.al. | 2510.12214 | null |
| 2025-10-13 | Enhancing the Quality of 3D Lunar Maps Using JAXA’s Kaguya Imagery | Yumi Iwashita et.al. | 2510.11817 | null |
| 2025-10-13 | Beyond ‘Templates’: Category-Agnostic Object Pose, Size, and Shape Estimation from a Single View | Jinyu Zhang et.al. | 2510.11687 | null |
| 2025-10-13 | Robust Ego-Exo Correspondence with Long-Term Memory | Yijun Hu et.al. | 2510.11417 | null |
| 2025-10-13 | Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers | Wenhan Ma et.al. | 2510.11370 | null |
| 2025-10-13 | What to expect from microscopic nuclear modelling for $k_{\rm eff}$ calculations? | D. Rochman et.al. | 2510.11256 | null |
| 2025-10-13 | DND: Boosting Large Language Models with Dynamic Nested Depth | Tieyuan Chen et.al. | 2510.11001 | null |
| 2025-10-13 | MC#: Mixture Compressor for Mixture-of-Experts Large Models | Wei Huang et.al. | 2510.10962 | null |
| 2025-10-12 | Crisis-Aware Regime-Conditioned Diffusion with CVaR Allocation | Ali Atiah Alzahrani et.al. | 2510.10807 | null |
| 2025-10-12 | Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection | Shizhen Zhao et.al. | 2510.10584 | null |
| 2025-10-12 | Hierarchical LoRA MoE for Efficient CTR Model Scaling | Zhichen Zeng et.al. | 2510.10432 | null |
| 2025-10-11 | SP-MoE: Speculative Decoding and Prefetching for Accelerating MoE-based Model Inference | Liangkun Chen et.al. | 2510.10302 | null |
| 2025-10-10 | MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest | Xiao Yang et.al. | 2510.09857 | null |
| 2025-10-10 | ARROW: An Adaptive Rollout and Routing Method for Global Weather Forecasting | Jindong Tian et.al. | 2510.09734 | null |
| 2025-10-10 | Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation | Youwei Zheng et.al. | 2510.09094 | null |
| 2025-10-09 | LinearSR: Unlocking Linear Attention for Stable and Efficient Image Super-Resolution | Xiaohui Li et.al. | 2510.08771 | null |
| 2025-10-13 | dInfer: An Efficient Inference Framework for Diffusion Language Models | Yuxin Ma et.al. | 2510.08666 | null |
| 2025-10-08 | Dynamic Mixture-of-Experts for Visual Autoregressive Model | Jort Vincenti et.al. | 2510.08629 | null |
| 2025-10-09 | FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts | Heming Zou et.al. | 2510.08396 | link |
| 2025-10-09 | Mix- and MoE-DPO: A Variational Inference Approach to Direct Preference Optimization | Jason Bohne et.al. | 2510.08256 | null |
| 2025-10-09 | From Tokens to Layers: Redefining Stall-Free Scheduling for LLM Serving with Layered Prefill | Gunjun Lee et.al. | 2510.08055 | null |
| 2025-10-09 | Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training | Ruizhe Wang et.al. | 2510.08008 | null |
| 2025-10-09 | Multilingual Knowledge Graph Completion via Efficient Multilingual Knowledge Sharing | Cunli Mao et.al. | 2510.07736 | null |
| 2025-10-09 | Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision | Xiaoxu Ma et.al. | 2510.07703 | null |
| 2025-10-09 | LiveThinking: Enabling Real-Time Efficient Reasoning for AI-Powered Livestreaming via Reinforcement Learning | Yuhan Sun et.al. | 2510.07685 | null |
| 2025-10-08 | MoGU: Mixture-of-Gaussians with Uncertainty-based Gating for Time Series Forecasting | Yoli Shavit et.al. | 2510.07459 | null |
| 2025-10-08 | Less is More: Strategic Expert Selection Outperforms Ensemble Complexity in Traffic Forecasting | Walid Guettala et.al. | 2510.07426 | null |
| 2025-10-08 | Guided by the Experts: Provable Feature Learning Dynamic of Soft-Routed Mixture-of-Experts | Fangshuo Liao et.al. | 2510.07205 | null |
| 2025-10-08 | A Bridge from Audio to Video: Phoneme-Viseme Alignment Allows Every Face to Speak Multiple Languages | Zibo Su et.al. | 2510.06612 | null |
| 2025-10-09 | SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation | Shuang Cheng et.al. | 2510.06303 | null |
| 2025-10-06 | Reproducibility Study of “XRec: Large Language Models for Explainable Recommendation” | Ranjan Mishra et.al. | 2510.06275 | null |
| 2025-10-10 | Barbarians at the Gate: How AI is Upending Systems Research | Audrey Cheng et.al. | 2510.06189 | null |
| 2025-10-07 | CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits | Kangyu Wang et.al. | 2510.06133 | null |
| 2025-10-07 | Rasterized Steered Mixture of Experts for Efficient 2D Image Regression | Yi-Hsin Li et.al. | 2510.05814 | null |
| 2025-10-07 | Mixture of Neuron Experts | Runxi Cheng et.al. | 2510.05781 | link |
| 2025-10-07 | MSF-SER: Enriching Acoustic Modeling with Multi-Granularity Semantics for Speech Emotion Recognition | Haoxun Li et.al. | 2510.05749 | null |
| 2025-10-07 | Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting | Zhongkai Yu et.al. | 2510.05497 | null |
| 2025-10-06 | Stratum: System-Hardware Co-Design with Tiered Monolithic 3D-Stackable DRAM for Efficient MoE Serving | Yue Pan et.al. | 2510.05245 | null |
| 2025-10-06 | REN: Anatomically-Informed Mixture-of-Experts for Interstitial Lung Disease Diagnosis | Alec K. Peltekian et.al. | 2510.04923 | null |
| 2025-10-06 | LMM-Incentive: Large Multimodal Model-based Incentive Design for User-Generated Content in Web 3.0 | Jinbo Wen et.al. | 2510.04765 | null |
| 2025-10-06 | Multilingual Routing in Mixture-of-Experts | Lucas Bandarkar et.al. | 2510.04694 | null |
| 2025-10-06 | Improving Multimodal Brain Encoding Model with Dynamic Subject-awareness Routing | Xuanhua Yin et.al. | 2510.04670 | null |
| 2025-10-06 | Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space | Tomas Figliolia et.al. | 2510.04476 | null |
| 2025-10-05 | HoRA: Cross-Head Low-Rank Adaptation with Joint Hypernetworks | Nghiem T. Diep et.al. | 2510.04295 | null |
| 2025-10-05 | SliceMoE: Routing Embedding Slices Instead of Tokens for Fine-Grained and Balanced Transformer Scaling | Harshil Vejendla et.al. | 2510.04286 | null |
| 2025-10-05 | MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition | Umberto Cappellazzo et.al. | 2510.04136 | null |
| 2025-10-03 | Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective | Yehuda Dar et.al. | 2510.03151 | null |
| 2025-10-02 | ElasticMoE: An Efficient Auto Scaling Method for Mixture-of-Experts Models | Gursimran Singh et.al. | 2510.02613 | null |
| 2025-10-02 | UpSafe$^\circ$C: Upcycling for Controllable Safety in Large Language Models | Yuhao Sun et.al. | 2510.02194 | null |
| 2025-10-02 | LadderMoE: Ladder-Side Mixture of Experts Adapters for Bronze Inscription Recognition | Rixin Zhou et.al. | 2510.01651 | null |
| 2025-10-01 | Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs | Leyla Mirvakhabova et.al. | 2510.01185 | null |
| 2025-10-01 | Learning Compact Representations of LLM Abilities via Item Response Theory | Jianhao Chen et.al. | 2510.00844 | null |
| 2025-10-01 | Graph Integrated Multimodal Concept Bottleneck Model | Jiakai Lin et.al. | 2510.00701 | null |
| 2025-10-01 | FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression | Yifei Gao et.al. | 2510.00621 | null |
| 2025-10-01 | Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning | Minghao Yang et.al. | 2510.00570 | null |
| 2025-10-07 | FlowMoE: A Scalable Pipeline Scheduling Framework for Distributed Mixture-of-Experts Training | Yunqi Gao et.al. | 2510.00207 | null |
| 2025-09-30 | Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization | Yaoxiang Wang et.al. | 2509.26520 | null |
| 2025-09-30 | Nephrobase Cell+: Multimodal Single-Cell Foundation Model for Decoding Kidney Biology | Chenyu Li et.al. | 2509.26223 | null |
| 2025-09-30 | Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline | Haiyang Li et.al. | 2509.25991 | null |
| 2025-09-30 | UniMMAD: Unified Multi-Modal and Multi-Class Anomaly Detection via MoE-Driven Feature Decompression | Yuan Zhao et.al. | 2509.25934 | null |
| 2025-09-30 | Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel | Chuanyang Zheng et.al. | 2509.25913 | null |
| 2025-10-01 | A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI | Arvind Murari Vepa et.al. | 2509.25889 | null |
| 2025-09-30 | Collaborative Compression for Large-Scale MoE Deployment on Edge | Yixiao Chen et.al. | 2509.25689 | link |
| 2025-09-30 | LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts | Yuan Zhuang et.al. | 2509.25684 | null |
| 2025-09-30 | Guiding Mixture-of-Experts with Temporal Multimodal Interactions | Xing Han et.al. | 2509.25678 | null |
| 2025-09-29 | K-Prism: A Knowledge-Guided and Prompt Integrated Universal Medical Image Segmentation Model | Bangwei Guo et.al. | 2509.25594 | null |
| 2025-09-29 | MAESTRO: Adaptive Sparse Attention and Robust Learning for Multimodal Dynamic Time Series | Payal Mohapatra et.al. | 2509.25278 | null |
| 2025-09-29 | GRACE-MoE: Grouping and Replication with Locality-Aware Routing for Efficient Distributed MoE Inference | Yu Han et.al. | 2509.25041 | null |
| 2025-09-29 | LEAF: A Robust Expert-Based Framework for Few-Shot Continual Event Detection | Bao-Ngoc Dao et.al. | 2509.24547 | null |
| 2025-09-29 | One-Prompt Strikes Back: Sparse Mixture of Experts for Prompt-based Continual Learning | Minh Le et.al. | 2509.24483 | null |
| 2025-09-29 | Muon: Training and Trade-offs with Latent Attention and MoE | Sushant Mehta et.al. | 2509.24406 | null |
| 2025-09-29 | LLaDA-MoE: A Sparse MoE Diffusion Language Model | Fengqi Zhu et.al. | 2509.24389 | null |
| 2025-09-29 | Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning | Zhisheng Chen et.al. | 2509.24222 | null |
| 2025-09-28 | HunyuanImage 3.0 Technical Report | Siyu Cao et.al. | 2509.23951 | null |
| 2025-09-28 | Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms | Jiahao Ying et.al. | 2509.23933 | link |
| 2025-09-28 | Bayesian Mixture-of-Experts: Towards Making LLMs Know What They Don’t Know | Albus Yizhuo Li et.al. | 2509.23830 | link |
| 2025-09-28 | A Modality-Tailored Graph Modeling Framework for Urban Region Representation via Contrastive Learning | Yaya Zhao et.al. | 2509.23772 | null |
| 2025-09-28 | Towards a Comprehensive Scaling Law of Mixture-of-Experts | Guoliang Zhao et.al. | 2509.23678 | null |
| 2025-09-28 | PreScope: Unleashing the Power of Prefetching for Resource-Constrained MoE Inference | Enda Yu et.al. | 2509.23638 | null |
| 2025-09-27 | Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions | Mingyi Luo et.al. | 2509.23248 | null |
| 2025-09-27 | MoE-PHDS: One MoE checkpoint for flexible runtime sparsity | Lauren A. Hannah et.al. | 2509.23012 | null |
| 2025-09-26 | Tiny-QMoE | Jack Cashman et.al. | 2509.22951 | null |
| 2025-09-26 | Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time | Yixuan Han et.al. | 2509.22572 | null |
| 2025-09-26 | Learning to Ball: Composing Policies for Long-Horizon Basketball Moves | Pei Xu et.al. | 2509.22442 | link |
| 2025-09-26 | Role-Aware Multi-modal federated learning system for detecting phishing webpages | Bo Wang et.al. | 2509.22369 | null |
| 2025-09-26 | HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space | Ke Li et.al. | 2509.22299 | link |
| 2025-09-26 | Unlocking the Power of Mixture-of-Experts for Task-Aware Time Series Analytics | Xingjian Wu et.al. | 2509.22279 | null |
| 2025-09-26 | MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning | Tao Wu et.al. | 2509.21953 | link |
| 2025-09-26 | Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts | Naibin Gu et.al. | 2509.21892 | null |
| 2025-09-26 | ChaosNexus: A Foundation Model for Universal Chaotic System Forecasting with Multi-scale Representations | Chang Liu et.al. | 2509.21802 | null |
| 2025-09-26 | LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE | Yu Shang et.al. | 2509.21790 | null |
| 2025-09-24 | MIXRAG: Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering | Lihui Liu et.al. | 2509.21391 | null |
| 2025-09-25 | Distributed Specialization: Rare-Token Neurons in Large Language Models | Jing Liu et.al. | 2509.21163 | null |
| 2025-09-26 | Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns | Xuemiao Zhang et.al. | 2509.21124 | null |
| 2025-09-25 | Physics Informed Neural Networks for design optimisation of diamond particle detectors for charged particle fast-tracking at high luminosity hadron colliders | Alessandro Bombini et.al. | 2509.21123 | null |
| 2025-09-24 | Dynamic Reasoning Chains through Depth-Specialized Mixture-of-Experts in Transformer Architectures | Sampurna Roy et.al. | 2509.20577 | null |
| 2025-09-24 | Developer Productivity With and Without GitHub Copilot: A Longitudinal Mixed-Methods Case Study | Viktoria Stray et.al. | 2509.20353 | null |
| 2025-09-24 | SHMoAReg: Spark Deformable Image Registration via Spatial Heterogeneous Mixture of Experts and Attention Heads | Yuxi Zheng et.al. | 2509.20073 | null |
| 2025-09-24 | Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference | Ziyi Han et.al. | 2509.19781 | null |
| 2025-09-23 | Human-AI Narrative Synthesis to Foster Shared Understanding in Civic Decision-Making | Cassandra Overney et.al. | 2509.19643 | null |
| 2025-09-21 | A Statistical Mixture-of-Experts Framework for EMG Artifact Removal in EEG: Empirical Insights and a Proof-of-Concept Application | Benjamin J. Choi et.al. | 2509.19385 | null |
| 2025-09-23 | DevFD: Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces | Tianshuo Zhang et.al. | 2509.19230 | link |
| 2025-09-23 | Frequency-Domain Decomposition and Recomposition for Robust Audio-Visual Segmentation | Yunzhe Shen et.al. | 2509.18912 | null |
| 2025-09-23 | LongCat-Flash-Thinking Technical Report | Meituan LongCat Team et.al. | 2509.18883 | null |
| 2025-09-23 | PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving | Chengran Yuan et.al. | 2509.18609 | null |
| 2025-09-23 | Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts | Qi Wang et.al. | 2509.18542 | null |
| 2025-09-23 | StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models | Haoxin Yang et.al. | 2509.17993 | null |
| 2025-09-23 | Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark | Siu Hang Ho et.al. | 2509.17894 | link |
| 2025-09-22 | Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving | Ziming Liu et.al. | 2509.17863 | null |
| 2025-09-22 | SSNet: Flexible and robust channel extrapolation for fluid antenna systems enabled by a self-supervised learning framework | Yuan Gao et.al. | 2509.17797 | null |
| 2025-09-22 | Qwen3-Omni Technical Report | Jin Xu et.al. | 2509.17765 | null |
| 2025-09-22 | Attention-based Mixture of Experts for Robust Speech Deepfake Detection | Viola Negroni et.al. | 2509.17585 | null |
| 2025-09-22 | Robust Mixture Models for Algorithmic Fairness Under Latent Heterogeneity | Siqi Li et.al. | 2509.17411 | link |
| 2025-09-21 | MoEs Are Stronger than You Think: Hyper-Parallel Inference Scaling with RoE | Soheil Zibakhsh et.al. | 2509.17238 | null |
| 2025-09-21 | A community-driven optimization framework for redrawing school attendance boundaries | Hongzhao Guan et.al. | 2509.17130 | link |
| 2025-09-21 | CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception | Lingzhao Kong et.al. | 2509.17107 | null |
| 2025-09-21 | Dynamic Expert Specialization: Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation | Junzhuo Li et.al. | 2509.16882 | null |
| 2025-09-20 | KungfuBot2: Learning Versatile Motion Skills for Humanoid Whole-Body Control | Jinrui Han et.al. | 2509.16638 | null |
| 2025-09-19 | DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning | Sikai Bai et.al. | 2509.16105 | null |
| 2025-09-19 | MoE-CE: Enhancing Generalization for Deep Learning based Channel Estimation via a Mixture-of-Experts Framework | Tianyu Li et.al. | 2509.15964 | null |
| 2025-09-19 | pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation | Tong Wang et.al. | 2509.15638 | null |
| 2025-09-19 | MEC-Quant: Maximum Entropy Coding for Extremely Low Bit Quantization-Aware Training | Junbiao Pang et.al. | 2509.15514 | null |
| 2025-09-18 | SPH-Net: A Co-Attention Hybrid Model for Accurate Stock Price Prediction | Yiyang Wu et.al. | 2509.15414 | null |
| 2025-09-18 | Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing | Zichen Wu et.al. | 2509.15361 | null |
| 2025-09-18 | Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting | Liran Nochumsohn et.al. | 2509.15105 | null |
| 2025-09-18 | Adaptive LoRA Experts Allocation and Selection for Federated Fine-Tuning | Lei Wang et.al. | 2509.15087 | null |
| 2025-09-18 | EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence | Chaoyin She et.al. | 2509.14977 | null |
| 2025-09-18 | FURINA: Free from Unmergeable Router via LINear Aggregation of mixed experts | Jiayi Han et.al. | 2509.14900 | null |
| 2025-09-18 | CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human | Nan Sun et.al. | 2509.14889 | null |
| 2025-09-15 | SparseDoctor: Towards Efficient Chat Doctor with Mixture of Experts Enhanced Large Language Models | Zhang Jianbin et.al. | 2509.14269 | null |
| 2025-09-17 | CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts | Leonard Hackel et.al. | 2509.14104 | null |
| 2025-09-18 | SAIL-VL2 Technical Report | Weijie Yin et.al. | 2509.14033 | null |
| 2025-09-17 | Mixture of Low-Rank Adapter Experts in Generalizable Audio Deepfake Detection | Janne Laakkonen et.al. | 2509.13878 | null |
| 2025-09-17 | Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation | Nguyen Lan Vi Vu et.al. | 2509.13834 | null |
| 2025-09-18 | Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers | Manan Mittal et.al. | 2509.13548 | null |
| 2025-09-18 | GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR | Yujie Guo et.al. | 2509.13093 | null |
| 2025-09-16 | Dual-Stage Reweighted MoE for Long-Tailed Egocentric Mistake Detection | Boyu Han et.al. | 2509.12990 | null |
| 2025-09-16 | Bridging Perception and Planning: Towards End-to-End Planning for Signal Temporal Logic Tasks | Bowen Ye et.al. | 2509.12813 | null |
| 2025-09-16 | MEGAN: Mixture of Experts for Robust Uncertainty Estimation in Endoscopy Videos | Damola Agbelese et.al. | 2509.12772 | null |
| 2025-09-17 | NavMoE: Hybrid Model- and Learning-based Traversability Estimation for Local Navigation via Mixture of Experts | Botao He et.al. | 2509.12747 | null |
| 2025-09-16 | AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models | Heng Zhang et.al. | 2509.12715 | null |
| 2025-09-18 | Ensembling Large Language Models for Code Vulnerability Detection: An Empirical Evaluation | Zhihong Sun et.al. | 2509.12629 | null |
| 2025-09-15 | A high fraction of close massive binary stars at low metallicity | H. Sana et.al. | 2509.12488 | null |
| 2025-09-16 | When MoE Meets Blockchain: A Trustworthy Distributed Framework of Large Models | Weihao Zhu et.al. | 2509.12141 | null |
| 2025-09-15 | Dynamic Adaptive Parsing of Temporal and Cross-Variable Patterns for Network State Classification | Yuan Gao et.al. | 2509.11601 | null |
| 2025-09-15 | RadioLAM: A Large AI Model for Fine-Grained 3D Radio Map Estimation | Zhiyuan Liu et.al. | 2509.11571 | null |
| 2025-09-14 | Knowledge-Guided Adaptive Mixture of Experts for Precipitation Prediction | Chen Jiang et.al. | 2509.11459 | null |
| 2025-09-14 | MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation | Syed Talal Wasim et.al. | 2509.11394 | null |
| 2025-09-14 | On Linear Mode Connectivity of Mixture-of-Experts Architectures | Viet-Hoang Tran et.al. | 2509.11348 | null |
| 2025-09-13 | Lightweight Metadata-Aware Mixture-of-Experts Masked Autoencoder for Earth Observation | Mohanad Albughdadi et.al. | 2509.10919 | null |
| 2025-09-12 | RefactorCoderQA: Benchmarking LLMs for Multi-Domain Coding Question Solutions in Cloud and Edge Deployment | Shadikur Rahman et.al. | 2509.10436 | null |
| 2025-09-12 | Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs | Yixiao Zhou et.al. | 2509.10377 | null |
| 2025-09-12 | Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts | Strahinja Nikolic et.al. | 2509.10025 | null |
| 2025-09-11 | Combining Textual and Spectral Features for Robust Classification of Pilot Communications | Abdullah All Tanvir et.al. | 2509.09752 | null |
| 2025-09-11 | Steering MoE LLMs via Expert (De)Activation | Mohsen Fayyaz et.al. | 2509.09660 | null |
| 2025-09-11 | HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing | Haochen Huang et.al. | 2509.09420 | null |
| 2025-09-11 | MoLEx: Mixture of LoRA Experts in Speech Self-Supervised Models for Audio Deepfake Detection | Zihan Pan et.al. | 2509.09175 | null |
| 2025-09-11 | Compass-v3: Scaling Domain-Specific LLMs for Multilingual E-Commerce in Southeast Asia | Sophia Maria et.al. | 2509.09121 | null |
| 2025-09-10 | MoWE: A Mixture of Weather Experts | Dibyajyoti Chakraborty et.al. | 2509.09052 | null |
| 2025-09-15 | Too Helpful, Too Harmless, Too Honest or Just Right? | Gautam Siddharth Kashyap et.al. | 2509.08486 | null |
| 2025-09-10 | Joint Learning using Mixture-of-Expert-Based Representation for Enhanced Speech Generation and Robust Emotion Recognition | Jing-Tong Tzeng et.al. | 2509.08470 | null |
| 2025-09-10 | Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism | Jiaming Yan et.al. | 2509.08342 | null |
| 2025-09-09 | SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery | Fengyu She et.al. | 2509.08032 | null |
| 2025-09-09 | One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning | Yuan Pu et.al. | 2509.07945 | null |
| 2025-09-09 | MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model? | Songkai Ma et.al. | 2509.07727 | null |
| 2025-09-09 | DuoServe-MoE: Dual-Phase Expert Prefetch and Cache Scheduling for Efficient MoE LLM Inference | Yuning Zhang et.al. | 2509.07379 | null |
| 2025-09-11 | PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions | Yixuan Tang et.al. | 2509.07370 | null |
| 2025-09-11 | CAME-AB: Cross-Modality Attention with Mixture-of-Experts for Antibody Binding Site Prediction | Hongzong Li et.al. | 2509.06465 | null |
| 2025-09-08 | Ban&Pick: Achieving Free Performance Gains and Inference Speedup via Smarter Routing in MoE-LLMs | Yuanteng Chen et.al. | 2509.06346 | null |
| 2025-09-08 | MCTuner: Spatial Decomposition-Enhanced Database Tuning via LLM-Guided Exploration | Zihan Yan et.al. | 2509.06298 | null |
| 2025-09-05 | SpikingBrain Technical Report: Spiking Brain-inspired Large Models | Yuqi Pan et.al. | 2509.05276 | null |
| 2025-09-05 | Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers | Svetlana Pavlitska et.al. | 2509.05086 | null |
| 2025-09-05 | Phase-field and lip-field approaches for fracture with extreme mesh deformation (X-Mesh): a one-dimensional study | Nicolas Moës et.al. | 2509.04971 | null |
| 2025-09-05 | A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing | Chengkai Xu et.al. | 2509.04853 | null |
| 2025-09-05 | REMOTE: A Unified Multimodal Relation Extraction Framework with Multilevel Optimal Transport and Mixture-of-Experts | Xinkui Lin et.al. | 2509.04844 | null |
| 2025-09-05 | Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation | Svetlana Pavlitska et.al. | 2509.04816 | null |
| 2025-09-04 | Wav2DF-TSL: Two-stage Learning with Efficient Pre-training and Hierarchical Experts Fusion for Robust Audio Deepfake Detection | Yunqi Hao et.al. | 2509.04161 | null |
| 2025-09-03 | Hierarchical Federated Foundation Models over Wireless Networks for Multi-Modal Multi-Task Intelligence: Integration of Edge Learning with D2D/P2P-Enabled Fog Learning Architectures | Payam Abdisarabshali et.al. | 2509.03695 | null |
| 2025-09-03 | OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation | Han Li et.al. | 2509.03498 | null |
| 2025-09-02 | LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference | Krishna Teja Chitty-Venkata et.al. | 2509.02753 | null |
| 2025-09-02 | Acrobotics: A Generalist Approach To Quadrupedal Robots’ Parkour | Guillaume Gagné-Labelle et.al. | 2509.02727 | null |
| 2025-09-02 | MoPEQ: Mixture of Mixed Precision Quantized Experts | Krishna Teja Chitty-Venkata et.al. | 2509.02512 | null |
| 2025-09-02 | Cache Management for Mixture-of-Experts LLMs – extended version | Spyros Angelopoulos et.al. | 2509.02408 | null |
| 2025-09-02 | OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds | Longrong Yang et.al. | 2509.02322 | null |
| 2025-09-01 | Automatic Screening of Parkinson’s Disease from Visual Explorations | Maria F. Alcala-Durand et.al. | 2509.01326 | null |
| 2025-09-01 | LongCat-Flash Technical Report | Meituan LongCat Team et.al. | 2509.01322 | link |
| 2025-09-01 | SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation | Chenyang Le et.al. | 2509.01200 | null |
| 2025-09-06 | Joint Information Extraction Across Classical and Modern Chinese with Tea-MOELoRA | Xuemei Tang et.al. | 2509.01158 | null |
| 2025-08-31 | MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper | Runjia Zeng et.al. | 2509.00996 | link |
| 2025-08-31 | Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling | Junfeng Ran et.al. | 2509.00679 | null |
| 2025-11-03 | Accelerating Mixture-of-Experts Inference by Hiding Offloading Latency with Speculative Decoding | Zhibin Wang et.al. | 2508.21706 | null |
| 2025-07-01 | Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging | Lujun Li et.al. | 2506.23266 | null |
| 2025-09-23 | GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors | Hengyuan Zhang et.al. | 2506.14646 | null |
| 2025-06-02 | Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts | Xuweiyi Chen et.al. | 2505.23926 | null |
| 2025-05-29 | Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity | Yehui Tang et.al. | 2505.21411 | null |
| 2025-05-27 | FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models | Hao Kang et.al. | 2505.20225 | link |
| 2025-05-22 | MoE-Loco: Mixture of Experts for Multitask Locomotion | Runhan Huang et.al. | 2503.08564 | null |
| 2025-03-06 | Convergence Rates for Softmax Gating Mixture of Experts | Huy Nguyen et.al. | 2503.03213 | null |
| 2025-01-29 | Mixture of Experts (MoE): A Big Data Perspective | Wensheng Gan et.al. | 2501.16352 | null |
| 2024-12-02 | MH-MoE: Multi-Head Mixture-of-Experts | Shaohan Huang et.al. | 2411.16205 | null |
| 2024-10-24 | ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Xin He et.al. | 2410.17954 | null |
| 2024-10-11 | MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts | Peng Jin et.al. | 2410.07348 | link |
| 2024-05-21 | Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts | Yunxin Li et.al. | 2405.11273 | null |
| 2024-05-31 | Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Xudong Lu et.al. | 2402.14800 | null |
| 2024-10-29 | GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts | Shirley Wu et.al. | 2312.04693 | null |
| 2023-09-12 | Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning | Ted Zadouri et.al. | 2309.05444 | null |
| 2023-04-25 | Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism | Xin Chen et.al. | 2304.11414 | null |
| 2018-06-22 | Mixtures of Experts Models | Isobel Claire Gormley et.al. | 1806.08200 | link |

(<a href=#updated-on-20260404>back to top</a>)

| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-04-02 | PARD-SSM: Probabilistic Cyber-Attack Regime Detection via Variational Switching State-Space Models | Prakul Sunil Hiremath et.al. | 2604.02299 | null |
| 2026-04-02 | Selective State-Space Models for Koopman-based Data-driven Distribution System State Estimation | Bader Alabdulrazzaq et.al. | 2604.02273 | null |
| 2026-04-02 | AEGIS: Adversarial Entropy-Guided Immune System – Thermodynamic State Space Models for Zero-Day Network Evasion Detection | Vickson Ferrel et.al. | 2604.02149 | null |
| 2026-04-02 | Thinking While Listening: Fast-Slow Recurrence for Long-Horizon Sequential Modeling | Shota Takashiro et.al. | 2604.01577 | null |
| 2026-04-01 | Parallelized Hierarchical Connectome: A Spatiotemporal Recurrent Framework for Spiking State-Space Models | Po-Han Chiang et.al. | 2604.01295 | null |
| 2026-04-01 | A Benchmark of State-Space Models vs. Transformers and BiLSTM-based Models for Historical Newspaper OCR | Merveilles Agbeti-messan et.al. | 2604.00725 | null |
| 2026-04-01 | MATHENA: Mamba-based Architectural Tooth Hierarchical Estimator and Holistic Evaluation Network for Anatomy | Kyeonghun Kim et.al. | 2604.00537 | null |
| 2026-03-31 | MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control | Sahil Kumar et.al. | 2604.00292 | null |
| 2026-03-31 | Compressive sensing inspired self-supervised single-pixel imaging | Jijun Lu et.al. | 2603.29732 | null |
| 2026-03-31 | Learning Surrogate LPV State-Space Models with Uncertainty Quantification | E. Javier Olucha et.al. | 2603.29532 | null |
| 2026-03-31 | HCLSM: Hierarchical Causal Latent State Machines for Object-Centric World Modeling | Jaber Jaber et.al. | 2603.29090 | null |
| 2026-03-30 | Bridging the Geometry Mismatch: Frequency-Aware Anisotropic Serialization for Thin-Structure SSMs | Jin Bai et.al. | 2603.28503 | null |
| 2026-03-30 | A Probabilistic Generative Model for Spectral Speech Enhancement | Marco Hidalgo-Araya et.al. | 2603.28436 | null |
| 2026-04-01 | Self-Organizing Score-based Data Assimilation | Yuma Yamaoka et.al. | 2603.28048 | null |
| 2026-03-27 | WiMamba: Linear-Scale Wireless Foundation Model | Tomer Raviv et.al. | 2603.26367 | null |
| 2026-03-26 | Accelerating Bayesian Optimization for Nonlinear State-Space System Identification with Application to Lithium-Ion Batteries | Hao Tu et.al. | 2603.25840 | null |
| 2026-03-26 | A Mamba-based Perceptual Loss Function for Learning-based UGC Transcoding | Zihao Qi et.al. | 2603.25566 | null |
| 2026-03-26 | Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification | Giampaolo Bovenzi et.al. | 2603.25507 | null |
| 2026-03-26 | Towards Controllable Low-Light Image Enhancement: A Continuous Multi-illumination Dataset and Efficient State Space Framework | Hongru Han et.al. | 2603.25296 | null |
| 2026-03-26 | Vision Hopfield Memory Networks | Jianfeng Wang et.al. | 2603.25157 | null |
| 2026-03-26 | RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation | Kai Zhu et.al. | 2603.24295 | null |
| 2026-03-25 | S$^{3}$G: Stock State Space Graph for Enhanced Stock Trend Prediction | Yao Lu et.al. | 2603.24236 | null |
| 2026-03-25 | State-space fading memory | Gustave Bainier et.al. | 2603.23814 | null |
| 2026-03-24 | The Diminishing Returns of Early-Exit Decoding in Modern LLMs | Rui Wei et.al. | 2603.23701 | null |
| 2026-03-24 | Markov State–Space Modeling and Channel Characterization for DNA-Based Molecular Communication | Ruifeng Zheng et.al. | 2603.23394 | null |
| 2026-03-24 | Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning | Konstantinos Barmpounakis et.al. | 2603.23295 | null |
| 2026-03-23 | Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures | Hector Borobia et.al. | 2603.22473 | null |
| 2026-03-20 | Graph Signal Processing Meets Mamba2: Adaptive Filter Bank via Delta Modulation | Yehjin Shin et.al. | 2603.22333 | null |
| 2026-03-23 | Multi-View Deformable Convolution Meets Visual Mamba for Coronary Artery Segmentation | Xiaochan Yuan et.al. | 2603.21829 | null |
| 2026-03-20 | MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models | Puskal Khadka et.al. | 2603.20074 | null |
| 2026-03-20 | Grid-following and Grid-forming Switching Control for Grid-connected Inverters Considering Small-signal Security Region | Qiping Lai et.al. | 2603.19618 | null |
| 2026-03-20 | ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization | Danish Gufran et.al. | 2603.19594 | null |
| 2026-03-19 | Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders | Shang-Jui Ray Kuo et.al. | 2603.19209 | null |
| 2026-03-19 | The Exponentially Weighted Signature | Alexandre Bloch et.al. | 2603.19198 | null |
| 2026-03-19 | LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling | Danaé Broustail et.al. | 2603.19100 | null |
| 2026-03-19 | DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection | Haochen Li et.al. | 2603.18757 | null |
| 2026-03-19 | Deceiving Flexibility: A Stealthy False Data Injection Model in Vehicle-to-Grid Coordination | Kaan T. Gun et.al. | 2603.18424 | null |
| 2026-03-18 | Atomic Trajectory Modeling with State Space Models for Biomolecular Dynamics | Liang Shi et.al. | 2603.17633 | null |
| 2026-03-17 | Koopman Lifted Finite Memory Identification via Truncated Grunwald Letnikov Kernels | Navid Mojahed et.al. | 2603.16851 | null |
| 2026-03-17 | SF-Mamba: Rethinking State Space Model for Vision | Masakazu Yoshimura et.al. | 2603.16423 | null |
| 2026-03-17 | RASLF: Representation-Aware State Space Model for Light Field Super-Resolution | Zeqiang Wei et.al. | 2603.16243 | null |
| 2026-03-16 | Mamba-3: Improved Sequence Modeling using State Space Principles | Aakash Lahoti et.al. | 2603.15569 | null |
| 2026-03-16 | DUET: Disaggregated Hybrid Mamba-Transformer LLMs with Prefill and Decode-Specific Packages | Alish Kanani et.al. | 2603.15530 | null |
| 2026-03-16 | AnoleVLA: Lightweight Vision-Language-Action Model with Deep State Space Models for Mobile Manipulation | Yusuke Takagi et.al. | 2603.15046 | null |
| 2026-03-14 | Enhancing Eye Feature Estimation from Event Data Streams through Adaptive Inference State Space Modeling | Viet Dung Nguyen et.al. | 2603.14077 | null |
| 2026-03-13 | State-space models through the lens of ensemble control | Ye Feng et.al. | 2603.13587 | null |
| 2026-03-13 | Robust Automatic Differentiation of Square-Root Kalman Filters via Gramian Differentials | Adrien Corenflos et.al. | 2603.13559 | null |
| 2026-03-13 | From Gradients to Riccati Geometry: Kalman World Models for Single-Pass Learning | Andrew Kiruluta et.al. | 2603.13423 | null |
| 2026-03-12 | SpectralGuard: Detecting Memory Collapse Attacks in State Space Models | Davi Bonetto et.al. | 2603.12414 | null |
| 2026-03-12 | Spatial PDE-aware Selective State-space with Nested Memory for Mobile Traffic Grid Forecasting | Zineddine Bettouche et.al. | 2603.12353 | null |
| 2026-03-12 | CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks | Alexandre Le Mercier et.al. | 2603.12206 | null |
| 2026-03-12 | SEMamba++: A General Speech Restoration Framework Leveraging Global, Local, and Periodic Spectral Patterns | Yongjoon Lee et.al. | 2603.11669 | null |
| 2026-03-11 | Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild | Jun Yu et.al. | 2603.11306 | null |
| 2026-03-11 | Single molecule localization microscopy challenge: a biologically inspired benchmark for long-sequence modeling | Fatemeh Valeh et.al. | 2603.11296 | null |
| 2026-03-11 | DysonNet: Constant-Time Local Updates for Neural Quantum States | Lucas Winter et.al. | 2603.11189 | null |
| 2026-03-10 | Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference | Cosmo Santoni et.al. | 2603.09555 | null |
| 2026-03-10 | Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking | Shilei Wang et.al. | 2603.09287 | null |
| 2026-03-10 | Progressive Split Mamba: Effective State Space Modelling for Image Restoration | Mohammed Hassanin et.al. | 2603.09171 | null |
| 2026-03-10 | Rotation Equivariant Mamba for Vision Tasks | Zhongchen Zhao et.al. | 2603.09138 | null |
| 2026-03-10 | WS-Net: Weak-Signal Representation Learning and Gated Abundance Reconstruction for Hyperspectral Unmixing via State-Space and Weak Signal Attention Fusion | Zekun Long et.al. | 2603.09037 | null |
| 2026-03-09 | Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models | John Cooper et.al. | 2603.08859 | null |
| 2026-03-07 | Permutation-Equivariant 2D State Space Models: Theory and Canonical Architecture for Multivariate Time Series | Seungwoo Jeong et.al. | 2603.08753 | null |
| 2026-03-09 | BuildMamba: A Visual State-Space Based Model for Multi-Task Building Segmentation and Height Estimation from Satellite Images | Sinan U. Ulu et.al. | 2603.08523 | null |
| 2026-03-08 | Dissecting Spectral Granger Causality through Partial Information Decomposition | Luca Faes et.al. | 2603.07634 | null |
| 2026-03-07 | Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving | Jiazhuo Li et.al. | 2603.07264 | null |
| 2026-03-07 | Inter-Image Pixel Shuffling for Multi-focus Image Fusion | Huangxing Lin et.al. | 2603.07120 | null |
| 2026-03-06 | Swimba: Switch Mamba Model Scales State Space Models | Zhixu Du et.al. | 2603.06938 | null |
| 2026-03-06 | DLRMamba: Distilling Low-Rank Mamba for Edge Multispectral Fusion Object Detection | Qianqian Zhang et.al. | 2603.06920 | null |
| 2026-03-06 | Latent Autoencoder Ensemble Kalman Filter for Data assimilation | Xin T. Tong et.al. | 2603.06752 | null |
| 2026-03-06 | MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis | Dongqing Xie et.al. | 2603.06378 | null |
| 2026-03-06 | Two Localization Strategies for Sequential MCMC Data Assimilation with Applications to Nonlinear Non-Gaussian Geophysical Models | Hamza Ruzayqat et.al. | 2603.05817 | null |
| 2026-03-05 | Warm Starting State-Space Models with Automata Learning | William Fishell et.al. | 2603.05694 | null |
| 2026-03-05 | Why Depth Matters in Parallelizable Sequence Models: A Lie Algebraic View | Gyuryang Heo et.al. | 2603.05573 | null |
| 2026-03-05 | BLINK: Behavioral Latent Modeling of NK Cell Cytotoxicity | Iman Nematollahi et.al. | 2603.05110 | null |
| 2026-03-05 | DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization | Xiaodong Zhu et.al. | 2603.04882 | null |
| 2026-03-04 | When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift | Kevin Vogt-Lowell et.al. | 2603.04648 | null |
| 2026-03-04 | Mask-aware inference with State-Space Models | Ignasi Mas et.al. | 2603.04568 | null |
| 2026-03-04 | Architectural Proprioception in State Space Models: Thermodynamic Training Induces Anticipatory Halt Detection | Jay Noon et.al. | 2603.04180 | null |
| 2026-03-04 | Efficient Bayesian Estimation of Dynamic Structural Equation Models via State Space Marginalization | Øystein Sørensen et.al. | 2603.04003 | null |
| 2026-03-04 | Harnessing Selective State Space Models to Enhance Semianalytical Design of Fabrication-Ready Multilayered Huygens’ Metasurfaces: Part II - Generative Inverse Design (MetaMamba) | Natanel Nissan et.al. | 2603.03877 | null |
| 2026-03-04 | Harnessing Selective State Space Models to Enhance Semianalytical Design of Fabrication-Ready Multilayered Huygens’ Metasurfaces: Part I - Field-based Semianalytical Synthesis | Sherman W. Marcus et.al. | 2603.03837 | null |
| 2026-03-04 | Separators in Enhancing Autoregressive Pretraining for Vision Mamba | Hanpeng Liu et.al. | 2603.03806 | null |
| 2026-03-03 | MaBERT: A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling | Jinwoong Kim et.al. | 2603.03001 | null |
| 2026-03-03 | Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures | Georgios Pantazopoulos et.al. | 2603.02874 | null |
| 2026-03-02 | The Expressive Limits of Diagonal SSMs for State-Tracking | Mehran Shakerinava et.al. | 2603.01959 | null |
| 2026-03-02 | Deep Learning for Financial Time Series: A Large-Scale Benchmark of Risk-Adjusted Performance | Adir Saly-Kaufmann et.al. | 2603.01820 | null |
| 2026-03-01 | Efficient Extractive Summarization with MAMBA-Transformer Hybrids for Low-Resource Scenarios | Nisrine Ait Khayi et.al. | 2603.01288 | null |
| 2026-03-01 | VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification | Abdellah Zakaria Sellam et.al. | 2603.01174 | null |
| 2026-03-01 | GRAD-Former: Gated Robust Attention-based Differential Transformer for Change Detection | Durgesh Ameta et.al. | 2603.01161 | null |
| 2026-02-28 | Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation | Jinhan Xu et.al. | 2603.00576 | null |
| 2026-02-28 | Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling | Xueyang Li et.al. | 2603.00439 | null |
| 2026-02-27 | BiM-GeoAttn-Net: Linear-Time Depth Modeling with Geometry-Aware Attention for 3D Aortic Dissection CTA Segmentation | Yuan Zhang et.al. | 2602.23803 | null |
| 2026-02-26 | SpectralMamba-UNet: Frequency-Disentangled State Space Modeling for Texture-Structure Consistent Medical Image Segmentation | Fuhao Zhang et.al. | 2602.23103 | null |
| 2026-02-26 | Latent Matters: Learning Deep State-Space Models | Alexej Klushyn et.al. | 2602.23050 | null |
| 2026-02-26 | A guided residual search for nonlinear state-space identification | Merijn Floren et.al. | 2602.22964 | null |
| 2026-02-26 | Interpreting and Steering State-Space Models via Activation Subspace Bottlenecks | Vamshi Sunku Mohan et.al. | 2602.22719 | null |
| 2026-02-26 | SPMamba-YOLO: An Underwater Object Detection Network Based on Multi-Scale Feature Enhancement and Global Context Modeling | Guanghao Liao et.al. | 2602.22674 | null |
| 2026-02-25 | WaveSSM: Multiscale State-Space Models for Non-stationary Signal Attention | Ruben Solozabal et.al. | 2602.22266 | null |
| 2026-02-23 | CrossLLM-Mamba: Multimodal State Space Fusion of LLMs for RNA Interaction Prediction | Rabeya Tus Sadia et.al. | 2602.22236 | null |
| 2026-02-25 | Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration | Chen Wu et.al. | 2602.21917 | null |
| 2026-02-25 | Mamba Meets Scheduling: Learning to Solve Flexible Job Shop Scheduling with Efficient Sequence Modeling | Zhi Cao et.al. | 2602.21546 | null |
| 2026-02-25 | When Learning Hurts: Fixed-Pole RNN for Real-Time Online Training | Alexander Morgan et.al. | 2602.21454 | null |
| 2026-02-24 | Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting | Sunki Hong et.al. | 2602.21415 | null |
| 2026-02-24 | HiPPO Zoo: Explicit Memory Mechanisms for Interpretable State Space Models | Jack Goffinet et.al. | 2602.21340 | null |
| 2026-02-24 | Scaling State-Space Models on Multiple GPUs with Tensor Parallelism | Anurag Dutt et.al. | 2602.21144 | null |
| 2026-02-21 | NeXt2Former-CD: Efficient Remote Sensing Change Detection with Modern Vision Architectures | Yufan Wang et.al. | 2602.18717 | null |
| 2026-02-19 | COMBA: Cross Batch Aggregation for Learning Large Graphs with Context Gating State Space Models | Jiajun Shen et.al. | 2602.17893 | null |
| 2026-02-19 | Bayesian Optimality of In-Context Learning with Selective State Spaces | Di Zhang et.al. | 2602.17744 | null |
| 2026-02-18 | StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation | Zeyu Ren et.al. | 2602.16915 | null |
| 2026-02-16 | Is Mamba Reliable for Medical Imaging? | Banafsheh Saber Latibari et.al. | 2602.16723 | null |
| 2026-02-17 | Tracking Time-Varying Multipath Channels for Active Sonar Applications | Ashwani Koul et.al. | 2602.15555 | null |
| 2026-02-15 | Chemical Language Models for Natural Products: A State-Space Model Approach | Ho-Hsuan Wang et.al. | 2602.13958 | null |
| 2026-02-14 | Backward Smoothing versus Fixed-Lag Smoothing in Particle Filters | Genshiro Kitagawa et.al. | 2602.13635 | null |
| 2026-02-13 | Federated Learning of Nonlinear Temporal Dynamics with Graph Attention-based Cross-Client Interpretability | Ayse Tursucular et.al. | 2602.13485 | null |
| 2026-02-09 | DriveMamba: Task-Centric Scalable State Space Model for Efficient End-to-End Autonomous Driving | Haisheng Su et.al. | 2602.13301 | null |
| 2026-02-13 | Efficient Plug-and-Play method for Dynamic Imaging Via Kalman Smoothing | Benjamin Hawkes et.al. | 2602.13043 | null |
| 2026-02-13 | A Theoretical Analysis of Mamba’s Training Dynamics: Filtering Relevant Features for Generalization in State Space Models | Mugunthan Shandirasegaran et.al. | 2602.12499 | null |
| 2026-02-12 | Learning to Forget Attention: Memory Consolidation for Adaptive Compute Reduction | Ibne Farabi Shihab et.al. | 2602.12204 | null |
| 2026-02-12 | Improved state mixing in higher-order and block diagonal linear recurrent networks | Igor Dubinin et.al. | 2602.12021 | null |
| 2026-02-12 | RI-Mamba: Rotation-Invariant Mamba for Robust Text-to-Shape Retrieval | Khanh Nguyen et.al. | 2602.11673 | null |
| 2026-02-20 | Jailbreaking Leaves a Trace: Understanding and Detecting Jailbreak Attacks from Internal Representations of Large Language Models | Sri Durga Sai Sowmya Kadali et.al. | 2602.11495 | null |
| 2026-02-11 | Retrieval-Aware Distillation for Transformer-SSM Hybrids | Aviv Bick et.al. | 2602.11374 | null |
| 2026-02-11 | LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation | Lei Yao et.al. | 2602.11007 | null |
| 2026-02-11 | VFGS-Net: Frequency-Guided State-Space Learning for Topology-Preserving Retinal Vessel Segmentation | Ruiqi Song et.al. | 2602.10978 | null |
| 2026-02-11 | Trajectory-based data-driven predictive control and the state-space predictor | Levi D. Reyes Premer et.al. | 2602.10936 | null |
| 2026-02-10 | Can Image Splicing and Copy-Move Forgery Be Detected by the Same Model? Forensim: An Attention-Based State-Space Approach | Soumyaroop Nandi et.al. | 2602.10079 | null |
| 2026-02-10 | BabyMamba-HAR: Lightweight Selective State Space Models for Efficient Human Activity Recognition on Resource Constrained Devices | Mridankan Mandal et.al. | 2602.09872 | null |
| 2026-02-09 | DMamba: Decomposition-enhanced Mamba for Time Series Forecasting | Ruxuan Chen et.al. | 2602.09081 | null |
| 2026-02-12 | MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection | Venkatraman Narayanan et.al. | 2602.08126 | null |
| 2026-02-06 | Behavior Score Prediction in Resting-State Functional MRI by Deep State Space Modeling | Javier Salazar Cavazos et.al. | 2602.07131 | null |
| 2026-02-06 | Towards Understanding What State Space Models Learn About Code | Jiali Wu et.al. | 2602.06774 | link |
| 2026-02-06 | Efficient Online Variational Estimation via Monte Carlo Sampling | Mathis Chagneux et.al. | 2602.06579 | null |
| 2026-02-06 | AS-Mamba: Asymmetric Self-Guided Mamba Decoupled Iterative Network for Metal Artifact Reduction | Bowen Ning et.al. | 2602.06350 | null |
| 2026-02-05 | MambaVF: State Space Model for Efficient Video Fusion | Zixiang Zhao et.al. | 2602.06017 | null |
| 2026-02-05 | A Decomposition-based State Space Model for Multivariate Time-Series Forecasting | Shunya Nagashima et.al. | 2602.05389 | null |
| 2026-02-05 | HealthMamba: An Uncertainty-aware Spatiotemporal Graph State Space Model for Effective and Reliable Healthcare Facility Visit Prediction | Dahai Yu et.al. | 2602.05286 | null |
| 2026-02-04 | Partial Ring Scan: Revisiting Scan Order in Vision State Space Models | Yi-Kuan Hsieh et.al. | 2602.04170 | null |
| 2026-02-03 | Systematic review of self-supervised foundation models for brain network representation using electroencephalography | Hannah Portmann et.al. | 2602.03269 | null |
| 2026-02-03 | Bayesian Methods for the Navier-Stokes Equations | Nicholas Polson et.al. | 2602.02945 | null |
| 2026-02-02 | A Multi-scale Linear-time Encoder for Whole-Slide Image Analysis | Jagan Mohan Reddy Dwarampudi et.al. | 2602.02918 | null |
| 2026-02-01 | Learnable Koopman-Enhanced Transformer-Based Time Series Forecasting with Spectral Control | Ali Forootani et.al. | 2602.02592 | null |
| 2026-02-02 | SMTrack: State-Aware Mamba for Efficient Temporal Modeling in Visual Tracking | Yinchao Ma et.al. | 2602.01677 | null |
| 2026-02-02 | ASGMamba: Adaptive Spectral Gating Mamba for Multivariate Time Series Forecasting | Qianyang Li et.al. | 2602.01668 | null |
| 2026-02-02 | Samba+: General and Accurate Salient Object Detection via A More Unified Mamba-based Framework | Wenzhuo Zhao et.al. | 2602.01593 | null |
| 2026-02-02 | HandMCM: Multi-modal Point Cloud-based Correspondence State Space Model for 3D Hand Pose Estimation | Wencan Cheng et.al. | 2602.01586 | null |
| 2026-02-02 | Rotation-free Online Handwritten Character Recognition Using Linear Recurrent Units | Zhe Ling et.al. | 2602.01533 | null |
| 2026-02-04 | BioTamperNet: Affinity-Guided State-Space Model Detecting Tampered Biomedical Images | Soumyaroop Nandi et.al. | 2602.01435 | null |
| 2026-01-31 | OCTOPUS: Enhancing the Spatial-Awareness of Vision SSMs with Multi-Dimensional Scans and Traversal Selection | Kunal Mahatha et.al. | 2602.00904 | null |
| 2026-01-31 | Cognitive-Flexible Control via Latent Model Reorganization with Predictive Safety Guarantees | Thanana Nuchkrua et.al. | 2602.00812 | null |
| 2026-01-31 | A Hybrid Mamba-SAM Architecture for Efficient 3D Medical Image Segmentation | Mohammadreza Gholipour Shahraki et.al. | 2602.00650 | null |
| 2026-01-31 | AIRE-Prune: Asymptotic Impulse-Response Energy for State Pruning in State Space Models | Apurba Prasad Padhy et.al. | 2602.00534 | null |
| 2026-01-30 | GaussianOcc3D: A Gaussian-Based Adaptive Multi-modal 3D Occupancy Prediction | A. Enes Doruk et.al. | 2601.22729 | null |
| 2026-01-30 | Learning to Defer in Non-Stationary Time Series via Switching State-Space Models | Yannis Montreuil et.al. | 2601.22538 | null |
| 2026-01-30 | Elastic Spectral State Space Models for Budgeted Inference | Dachuan Song et.al. | 2601.22488 | null |
| 2026-01-29 | Spectral Filtering for Learning Quantum Dynamics | Elad Hazan et.al. | 2601.22400 | null |
| 2026-01-29 | ParalESN: Enabling parallel information processing in Reservoir Computing | Matteo Pinna et.al. | 2601.22296 | null |
| 2026-01-29 | MAR: Efficient Large Language Models via Module-aware Architecture Refinement | Junhong Cai et.al. | 2601.21503 | null |
| 2026-01-29 | Towards Geometry-Aware and Motion-Guided Video Human Mesh Recovery | Hongjun Chen et.al. | 2601.21376 | null |
| 2026-01-29 | Model-Free Neural State Estimation in Nonlinear Dynamical Systems: A Comparative Study of Neural Architectures and Classical Filters | Zhuochen Liu et.al. | 2601.21266 | null |
| 2026-01-28 | CCMamba: Selective State-Space Models for Higher-Order Graph Learning on Combinatorial Complexes | Jiawen Chen et.al. | 2601.20518 | null |
| 2026-01-27 | QuaMo: Quaternion Motions for Vision-based 3D Human Kinematics Capture | Cuong Le et.al. | 2601.19580 | null |
| 2026-01-27 | Scale-Consistent State-Space Dynamics via Fractal of Stationary Transformations | Geunhyeok Yu et.al. | 2601.19551 | null |
| 2026-01-27 | On the Expressiveness of State Space Models via Temporal Logics | Eric Alsmann et.al. | 2601.19467 | null |
| 2026-01-24 | Fluxamba: Topology-Aware Anisotropic State Space Models for Geological Lineament Segmentation in Multi-Source Remote Sensing | Jin Bai et.al. | 2601.17288 | null |
| 2026-01-23 | From Noisy News Sentiment Scores to Interpretable Temporal Dynamics: A Bayesian State-Space Model | Ian Carbó Casals et.al. | 2601.16769 | null |
| 2026-01-23 | PanopMamba: Vision State Space Modeling for Nuclei Panoptic Segmentation | Ming Kang et.al. | 2601.16631 | null |
| 2026-01-23 | Omni-directional attention mechanism based on Mamba for speech separation | Ke Xue et.al. | 2601.16603 | null |
| 2026-01-23 | Variational Dimension Lifting for Robust Tracking of Nonlinear Stochastic Dynamics | Yonatan L. Ashenafi et.al. | 2601.16470 | null |
| 2026-01-22 | NeuroMamba: Multi-Perspective Feature Interaction with Visual Mamba for Neuron Segmentation | Liuyun Jiang et.al. | 2601.15929 | null |
| 2026-01-22 | Design, Modelling, and Control of Magnetic Ball Suspension System | Sampson E. Nwachukwu et.al. | 2601.15622 | null |
| 2026-01-20 | A Dual-Head Transformer-State-Space Architecture for Neurocircuit Mechanism Decomposition from fMRI | Cole Korponay et.al. | 2601.15344 | null |
| 2026-01-21 | UBATrack: Spatio-Temporal State Space Model for General Multi-Modal Tracking | Qihua Liang et.al. | 2601.14799 | null |
| 2026-01-21 | Training-Efficient Text-to-Music Generation with State-Space Modeling | Wei-Jaw Lee et.al. | 2601.14786 | null |
| 2026-01-24 | M2I2HA: Multi-modal Object Detection Based on Intra- and Inter-Modal Hypergraph Attention | Xiaofan Yang et.al. | 2601.14776 | null |
| 2026-01-21 | Spatially Generalizable Mobile Manipulation via Adaptive Experience Selection and Dynamic Imagination | Ping Zhong et.al. | 2601.14649 | link |
| 2026-01-20 | PAS-Mamba: Phase-Amplitude-Spatial State Space Model for MRI Reconstruction | Xiaoyan Kui et.al. | 2601.14530 | null |
| 2026-01-20 | Gaussian Based Adaptive Multi-Modal 3D Semantic Occupancy Prediction | A. Enes Doruk et.al. | 2601.14448 | null |
| 2026-01-20 | ASBA: A-line State Space Model and B-line Attention for Sparse Optical Doppler Tomography Reconstruction | Zhenghong Li et.al. | 2601.14165 | null |
| 2026-01-20 | GeoDynamics: A Geometric State-Space Neural Network for Understanding Brain Dynamics on Riemannian Manifolds | Tingting Dan et.al. | 2601.13570 | null |
| 2026-01-19 | On the Relation of State Space Models and Hidden Markov Models | Aydin Ghojogh et.al. | 2601.13357 | null |
| 2026-01-19 | ConvMambaNet: A Hybrid CNN-Mamba State Space Architecture for Accurate and Real-Time EEG Seizure Detection | Md. Nishan Khan et.al. | 2601.13234 | null |
| 2026-01-19 | Analysis of Long Range Dependency Understanding in State Space Models | Srividya Ravikumar et.al. | 2601.13048 | null |
| 2026-01-15 | Online identification of nonlinear time-varying systems with uncertain information | He Ren et.al. | 2601.10379 | null |
| 2026-01-14 | Parallelizable memory recurrent units | Florent De Geeter et.al. | 2601.09495 | null |
| 2026-01-14 | Late Breaking Results: Quamba-SE: Soft-edge Quantizer for Activations in State Space Models | Yizhi Chen et.al. | 2601.09451 | null |
| 2026-01-13 | SfMamba: Efficient Source-Free Domain Adaptation via Selective Scan Modeling | Xi Chen et.al. | 2601.08608 | null |
| 2026-01-13 | Particle Filtering for a Class of State-Space Models with Low and Degenerate Observational Noise | Abylay Zhumekenov et.al. | 2601.08411 | null |
| 2026-01-12 | Rescind: Countering Image Misconduct in Biomedical Publications with Vision-Language and State-Space Modeling | Soumyaroop Nandi et.al. | 2601.08040 | null |
| 2026-01-12 | Language markers of emotion flexibility predict depression and anxiety treatment outcomes | Benjamin Brindle et.al. | 2601.07961 | null |
| 2026-01-11 | Conditional Normalizing Flows for Forward and Backward Joint State and Parameter Estimation | Luke S. Lagunowich et.al. | 2601.07013 | null |
| 2026-01-11 | Deep Recurrent Hidden Markov Learning Framework for Multi-Stage Advanced Persistent Threat Prediction | Saleem Ishaq Tijjani et.al. | 2601.06734 | null |
| 2026-01-08 | Akasha 2: Hamiltonian State Space Duality and Visual-Language Joint Embedding Predictive Architecture | Yani Meziani et.al. | 2601.06212 | null |
| 2026-01-02 | Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs | Andrew Kiruluta et.al. | 2601.06100 | null |
| 2026-01-09 | Dynamic Mortality Forecasting via Mixed-Frequency State-Space Models | Runze Li et.al. | 2601.05702 | null |
| 2026-01-09 | DIFF-MF: A Difference-Driven Channel-Spatial State Space Model for Multi-Modal Image Fusion | Yiming Sun et.al. | 2601.05538 | null |
| 2026-01-08 | DB-MSMUNet: Dual Branch Multi-scale Mamba UNet for Pancreatic CT Scans Segmentation | Qiu Guan et.al. | 2601.04676 | null |
| 2026-01-07 | Unified and Efficient Analysis of Machining Chatter and Surface Location Error | Woraphrut Kornmaneesang et.al. | 2601.03819 | null |
| 2026-01-06 | Time-Aware Synthetic Control | Saeyoung Rho et.al. | 2601.03099 | null |
| 2026-01-06 | Fast Surrogate Models for Adaptive Aircraft Trajectory Prediction in En route Airspace | Nick Pepper et.al. | 2601.03075 | null |
| 2026-01-06 | XLSR-MamBo: Scaling the Hybrid Mamba-Attention Backbone for Audio Deepfake Detection | Kwok-Ho Ng et.al. | 2601.02944 | null |
| 2026-01-05 | AMC26: VSSEA robust position control | Emre Sariyildiz et.al. | 2601.02557 | null |
| 2026-01-05 | Scalable Gaussian Processes for Integrated and Overlapping Measurements Via Augmented State Space Models | Ryan A. Rubenzahl et.al. | 2601.02527 | null |
| 2026-01-02 | SpikySpace: A Spiking State Space Model for Energy-Efficient Time Series Forecasting | Kaiwen Tang et.al. | 2601.02411 | null |
| 2026-01-05 | A Mamba-Based Model for Automatic Chord Recognition | Chunyu Yuan et.al. | 2601.02101 | null |
| 2026-01-06 | Hidden State Poisoning Attacks against Mamba-based Language Models | Alexandre Le Mercier et.al. | 2601.01972 | null |
| 2026-01-08 | Reliable Grid Forecasting: State Space Models for Safety-Critical Energy Systems | Jisoo Lee et.al. | 2601.01410 | null |
| 2026-01-04 | LinMU: Multimodal Understanding Made Linear | Hongjie Wang et.al. | 2601.01322 | null |
| 2026-01-03 | MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Efficient Clinical Assistance | Hamad Khan et.al. | 2601.01260 | null |
| 2026-01-03 | Benchmarking the Computational and Representational Efficiency of State Space Models against Transformers on Long-Context Dyadic Sessions | Abidemi Koledoye et.al. | 2601.01237 | null |
| 2026-01-03 | NeuroSSM: Multiscale Differential State-Space Modeling for Context-Aware fMRI Analysis | Furkan Genç et.al. | 2601.01229 | null |
| 2026-01-01 | Depth-Synergized Mamba Meets Memory Experts for All-Day Image Reflection Separation | Siyan Fang et.al. | 2601.00322 | null |
| 2026-01-08 | Modern Neuromorphic AI: From Intra-Token to Inter-Token Processing | Osvaldo Simeone et.al. | 2601.00245 | null |
| 2025-12-30 | Bridging the Perception-Cognition Gap: Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis | Hao Wu et.al. | 2512.24013 | null |
| 2025-12-29 | MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling | Mahdi Karami et.al. | 2512.23824 | null |
| 2025-12-28 | Breaking the Memory Wall: Exact Analytical Differentiation via Tiled Operator-Space Evolution | Shuhuan Wang et.al. | 2512.23068 | null |
| 2025-12-28 | Nonlinear Dynamical Modeling of Human Intracranial Brain Activity with Flexible Inference | Kiarash Vaziri et.al. | 2512.22785 | null |
| 2025-12-25 | UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation | Linxuan Fan et.al. | 2512.21584 | null |
| 2025-12-24 | A Mechanistic Analysis of Transformers for Dynamical Systems | Gregory Duthé et.al. | 2512.21113 | null |
| 2025-12-25 | Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning | Mojtaba Safari et.al. | 2512.19676 | null |
| 2025-12-22 | Generative Krylov Subspace Representations for Scalable Quantum Eigensolvers | Changwon Lee et.al. | 2512.19420 | null |
| 2025-12-22 | Lag Operator SSMs: A Geometric Framework for Structured State Space Modeling | Sutashu Tomonaga et.al. | 2512.18965 | null |
| 2025-12-21 | State-Space Modeling of Time-Varying Spillovers on Networks | Marios Papamichalis et.al. | 2512.18584 | link |
| 2025-12-19 | Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers | Zeyuan Allen-Zhu et.al. | 2512.17351 | null |
| 2025-12-18 | KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse Signals | Shuting Zhao et.al. | 2512.16791 | null |
| 2025-12-18 | KOSS: Kalman-Optimal Selective State Spaces for Long-Term Sequence Modeling | Lei Wang et.al. | 2512.16723 | null |
| 2025-12-18 | CPMamba: Selective State Space Models for MIMO Channel Prediction in High-Mobility Environments | Sheng Luo et.al. | 2512.16315 | null |
| 2025-12-17 | BarcodeMamba+: Advancing State-Space Models for Fungal Biodiversity Research | Tiancheng Gao et.al. | 2512.15931 | null |
| 2025-12-22 | COBRA: Catastrophic Bit-flip Reliability Analysis of State-Space Models | Sanjay Das et.al. | 2512.15778 | null |
| 2025-12-17 | Characterizing Mamba’s Selective Memory using Auto-Encoders | Tamanna Hossain et.al. | 2512.15653 | null |
| 2025-12-17 | On non-stationarity of the Poisson gamma state space models | Kaoru Irie et.al. | 2512.15128 | null |
| 2025-12-17 | How Many Heads Make an SSM? A Unified Framework for Attention and State Space Models | Ali Ghodsi et.al. | 2512.15115 | null |
| 2025-12-16 | XAI-Driven Diagnosis of Generalization Failure in State-Space Cerebrovascular Segmentation Models: A Case Study on Domain Shift Between RSNA and TopCoW Datasets | Youssef Abuzeid et.al. | 2512.13977 | null |
| 2025-12-15 | Temporal parallelisation of continuous-time maximum-a-posteriori trajectory estimation | Hassan Razavi et.al. | 2512.13319 | null |
| 2025-12-14 | Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics | Jingdi Lei et.al. | 2512.12602 | link |
| 2025-12-13 | HydroDiffusion: Diffusion-Based Probabilistic Streamflow Forecasting with a State Space Backbone | Yihan Wang et.al. | 2512.12183 | null |
| 2025-12-12 | TSkel-Mamba: Temporal Dynamic Modeling via State Space Model for Human Skeleton-based Action Recognition | Yanan Liu et.al. | 2512.11503 | link |
| 2025-12-11 | On a class of constrained Bayesian filters and their numerical implementation in high-dimensional state-space Markov models | Utku Erdogan et.al. | 2512.11012 | null |
| 2025-12-11 | Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation | Yiheng Lyu et.al. | 2512.10353 | null |
| 2025-12-10 | Inertial Magnetic SLAM Systems Using Low-Cost Sensors | Chuan Huang et.al. | 2512.10128 | null |
| 2025-12-10 | Neural posterior inference with state-space models for calibrating ice sheet simulators | Bao Anh Vu et.al. | 2512.09561 | null |
| 2025-12-11 | StateSpace-SSL: Linear-Time Self-supervised Learning for Plant Disease Detection | Abdullah Al Mamun et.al. | 2512.09492 | null |
| 2025-12-08 | How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline | Chunhui Zhang et.al. | 2512.07385 | null |
| 2025-12-07 | Always Keep Your Promises: DynamicLRP, A Model-Agnostic Solution To Layer-Wise Relevance Propagation | Kevin Lee et.al. | 2512.07010 | null |
| 2025-12-07 | FGE: A Fast Free-Boundary Grad-Shafranov Evolutive Solver | Cosmas Heiß et.al. | 2512.06847 | null |
| 2025-12-07 | TextMamba: Scene Text Detector with Mamba | Qiyan Zhao et.al. | 2512.06657 | null |
| 2025-12-06 | Assessing the Information Content of Individual Spikes in Population-Level Models of Neural Spiking Activity | Azar Ghahari et.al. | 2512.06280 | null |
| 2025-12-05 | Speech World Model: Causal State-Action Planning with Explicit Reasoning for Speech | Xuanru Zhou et.al. | 2512.05933 | null |
| 2025-12-05 | World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty | Zhiting Mei et.al. | 2512.05927 | null |
| 2025-12-05 | Measurements of Light Nuclei (d, t, $^3$He)-$Λ$ Correlations in Au+Au Collisions at $\sqrt{s_{NN}}=3$ GeV from STAR | Xialei Jiang et.al. | 2512.05885 | null |
| 2025-12-05 | Vague Knowledge: Information without Transitivity and Partitions | Kerry Xiao et.al. | 2512.05833 | null |
| 2025-12-05 | Ferroelectricity in dipolar liquids: from an exactly solvable model in the large-dimensional limit to finite dimensions | M. G. Izzo et.al. | 2512.05758 | null |
| 2025-12-05 | Comparing the latent features of universal machine-learning interatomic potentials | Sofiia Chorna et.al. | 2512.05717 | null |
| 2025-12-05 | LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving | Yiming Shu et.al. | 2512.05686 | null |
| 2025-12-05 | Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation | Dhorasso Temfack et.al. | 2512.05650 | null |
| 2025-12-05 | DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model | Pasquale De Marinis et.al. | 2512.05613 | null |
| 2025-12-05 | Supervisory Measurement-Guided Noise Covariance Estimation: Discussing Forward and Reverse Differentiation | Haoying Li et.al. | 2512.05604 | null |
| 2025-12-05 | CureAgent: A Training-Free Executor-Analyst Framework for Clinical Reasoning | Ting-Ting Xie et.al. | 2512.05576 | null |
| 2025-12-05 | MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models | Chuang Yu et.al. | 2512.05530 | null |
| 2025-12-05 | UniFS: Unified Multi-Contrast MRI Reconstruction via Frequency-Spatial Fusion | Jialin Li et.al. | 2512.05481 | null |
| 2025-12-05 | TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression | Cheng-Yuan Ho et.al. | 2512.05446 | null |
| 2025-12-05 | BEAVER: An Efficient Deterministic LLM Verifier | Tarun Suresh et.al. | 2512.05439 | null |
| 2025-12-05 | Computing Supported Models via Transformation to Stable Models | Fang Li et.al. | 2512.05437 | null |
| 2025-12-05 | RevoNAD: Reflective Evolutionary Exploration for Neural Architecture Design | Gyusam Chang et.al. | 2512.05403 | null |
| 2025-12-05 | Group Orthogonal Low-Rank Adaptation for RGB-T Tracking | Zekai Shao et.al. | 2512.05359 | null |
| 2025-12-04 | Nested State and Degradation Estimation of a Satellite Battery with In-flight Data | Linda Bolay et.al. | 2512.05255 | null |
| 2025-12-04 | The deep Hilbert space of all-to-all interacting SU(3) atoms: from quantum to classical | Federico Balducci et.al. | 2512.05184 | null |
| 2025-12-04 | Global phase diagram of two-dimensional dirty hyperbolic Dirac liquids | Christopher A. Leong et.al. | 2512.05109 | null |
| 2025-12-04 | Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction | Vincent Pauline et.al. | 2512.05092 | null |
| 2025-12-04 | RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation | Nicolas Houdré et.al. | 2512.05025 | null |
| 2025-12-04 | Reflection Removal through Efficient Adaptation of Diffusion Transformers | Daniyar Zakarin et.al. | 2512.05000 | link |
| 2025-12-04 | PENCO: A Physics-Energy-Numerical-Consistent Operator for 3D Phase Field Modeling | Mostafa Bamdad et.al. | 2512.04863 | null |
| 2025-12-04 | Model-Based and Sample-Efficient AI-Assisted Math Discovery in Sphere Packing | Rasul Tutunov et.al. | 2512.04829 | null |
| 2025-12-04 | LaFiTe: A Generative Latent Field for 3D Native Texturing | Chia-Hao Chen et.al. | 2512.04786 | null |
| 2025-12-04 | Probing false vacuum decay and bubble nucleation in a Rydberg atom array | Yu-Xin Chao et.al. | 2512.04637 | null |
| 2025-12-04 | Temporal and Spatial Decomposition for Prospective Studies in Energy Systems under Uncertainty | Camila Martinez Parra et.al. | 2512.04622 | null |
| 2025-12-04 | TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification | Zishuo Wan et.al. | 2512.04576 | null |
| 2025-12-04 | VideoMem: Enhancing Ultra-Long Video Understanding via Adaptive Memory Management | Hongbo Jin et.al. | 2512.04540 | null |
| 2025-12-04 | PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement | Yu-Wei Zhan et.al. | 2512.04532 | null |
| 2025-12-04 | VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory | Yifei Yu et.al. | 2512.04519 | null |
| 2025-12-04 | BiTAgent: A Task-Aware Modular Framework for Bidirectional Coupling between Multimodal Large Language Models and World Models | Yu-Wei Zhan et.al. | 2512.04513 | null |
| 2025-12-04 | DeRA: Decoupled Representation Alignment for Video Tokenization | Pengbo Guo et.al. | 2512.04483 | null |
| 2025-12-04 | ELG $\times$ LRG distribution through dark matter halo dynamics | Ginevra Favole et.al. | 2512.04362 | null |
| 2025-12-04 | Distance Is All You Need: Radial Dispersion for Uncertainty Estimation in Large Language Models | Manh Nguyen et.al. | 2512.04351 | null |
| 2025-12-04 | Cosmological implications of Bumblebee theory on an FLRW background | Manuel Gonzalez-Espinoza et.al. | 2512.04349 | null |
| 2025-12-03 | Driving Beyond Privilege: Distilling Dense-Reward Knowledge into Sparse-Reward Policies | Feeza Khan Khanzada et.al. | 2512.04279 | null |
| 2025-12-03 | Inflation with a Growing Fifth Dimension | Rashmish K. Mishra et.al. | 2512.04177 | null |
| 2025-12-03 | SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL | Siyi Chen et.al. | 2512.04069 | null |
| 2025-12-03 | Training-Free Policy Violation Detection via Activation-Space Whitening in LLMs | Oren Rachmil et.al. | 2512.03994 | null |
| 2025-12-03 | Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization | Lianyu Pang et.al. | 2512.03964 | null |
| 2025-12-03 | Collective dynamics of trail-interacting particles | Paul Pineau et.al. | 2512.03950 | null |
| 2025-12-03 | Rethinking Collapse: Coupling Quantum States to Classical Bits with quasi-probabilities | Dagomir Kaszlikowski et.al. | 2512.03929 | null |
| 2025-12-03 | Acceleration of Parallel Tempering for Markov Chain Monte Carlo methods | Aingeru Ramos et.al. | 2512.03825 | null |
| 2025-12-03 | MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving | Jia Hu et.al. | 2512.03795 | null |
| 2025-12-03 | A comparison between initialization strategies for the infinite hidden Markov model | Federico P. Cortese et.al. | 2512.03777 | null |
| 2025-12-03 | Tutorial on Large Language Model-Enhanced Reinforcement Learning for Wireless Networks | Lingyi Cai et.al. | 2512.03722 | link |
| 2025-12-03 | Consistent Projection of Langevin Dynamics: Preserving Thermodynamics and Kinetics in Coarse-Grained Models | Vahid Nateghi et.al. | 2512.03706 | null |
| 2025-12-03 | State Space Models for Bioacoustics: A comparative Evaluation with Transformers | Chengyu Tang et.al. | 2512.03563 | null |
| 2025-12-03 | Edge bits in average symmetry protected topological mixed state | Yoshihito Kuno et.al. | 2512.03530 | null |
| 2025-12-03 | Seasonal trend assessment of US extreme precipitation via changepoint segmentation | Jaechoul Lee et.al. | 2512.03513 | null |
| 2025-12-03 | CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving | Zhijian Qiao et.al. | 2512.03510 | null |
| 2025-12-03 | Procedural Mistake Detection via Action Effect Modeling | Wenliang Guo et.al. | 2512.03474 | null |
| 2025-12-03 | DM3D: Deformable Mamba via Offset-Guided Gaussian Sequencing for Point Cloud Understanding | Bin Liu et.al. | 2512.03424 | null |
| 2025-12-03 | Comparative algorithm performance evaluation and prediction for the maximum clique problem using instance space analysis | Bharat Sharman et.al. | 2512.03419 | null |
| 2025-12-06 | UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs | Hung-Yueh Chiang et.al. | 2512.03383 | null |
| 2025-12-03 | Generative Refinement: A New Paradigm for Determining Single Crystal Structures Directly from HKL Data | Wen-Lin Luo et.al. | 2512.03365 | null |
| 2025-12-02 | Adaptive Regime-Switching Forecasts with Distribution-Free Uncertainty: Deep Switching State-Space Models Meet Conformal Prediction | Echo Diyun LU et.al. | 2512.03298 | null |
| 2025-12-02 | PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer | Xiaoshui Huang et.al. | 2512.03111 | null |
| 2025-12-02 | The Hilbert space of gauge theories: group averaging and the quantization of Jackiw-Teitelboim gravity | Elba Alonso-Monsalve et.al. | 2512.03030 | null |
| 2025-12-02 | Unrolled Networks are Conditional Probability Flows in MRI Reconstruction | Kehan Qi et.al. | 2512.03020 | null |
| 2025-12-02 | TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond | Yifei Zeng et.al. | 2512.02993 | link |
| 2025-12-08 | AutoNeural: Co-Designing Vision-Language Models for NPU Inference | Wei Chen et.al. | 2512.02924 | null |
| 2025-12-02 | Statistical-Symbolic Verification of Perception-Based Autonomous Systems using State-Dependent Conformal Prediction | Yuang Geng et.al. | 2512.02893 | null |
| 2025-12-02 | MICCAI STSR 2025 Challenge: Semi-Supervised Teeth and Pulp Segmentation and CBCT-IOS Registration | Yaqi Wang et.al. | 2512.02867 | null |
| 2025-12-02 | Tempering the Bayes Filter towards Improved Model-Based Estimation | Menno van Zutphen et.al. | 2512.02823 | null |
| 2025-12-02 | Invariance under Structure Translation as the Origin of Host Immune Capacity Conservation from Noether’s Theorem | Yexing Chen et.al. | 2512.02730 | null |
| 2025-12-02 | DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions | Yifan Zhou et.al. | 2512.02727 | null |
| 2025-12-02 | Graph VQ-Transformer (GVT): Fast and Accurate Molecular Generation via High-Fidelity Discrete Latents | Haozhuo Zheng et.al. | 2512.02667 | null |
| 2025-12-02 | Efficient Simulation of the 2D Hubbard Model via Hilbert Space-Filling Curve Mapping | Ashkan Abedi et.al. | 2512.02666 | null |
| 2025-12-02 | SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization | Zhengcheng Wang et.al. | 2512.02631 | null |
| 2025-12-02 | Excitation function of femtoscopic Lévy source parameters of pion pairs in EPOS4 | Yan Huang et.al. | 2512.02560 | null |
| 2025-12-02 | Deep Learning-Based Joint Uplink-Downlink CSI Acquisition for Next-Generation Upper Mid-Band Systems | Xuan He et.al. | 2512.02557 | null |
| 2025-12-02 | CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning | Songqiao Su et.al. | 2512.02551 | null |
| 2025-12-02 | Detection of photon-level signals embedded in sunlight with an atomic photodetector | Laura Zarraoa et.al. | 2512.02521 | null |
| 2025-12-02 | ClusterStyle: Modeling Intra-Style Diversity with Prototypical Clustering for Stylized Motion Generation | Kerui Chen et.al. | 2512.02453 | null |
| 2025-12-02 | WSCF-MVCC: Weakly-supervised Calibration-free Multi-view Crowd Counting | Bin Li et.al. | 2512.02359 | null |
| 2025-12-02 | Enhancing Cross Domain SAR Oil Spill Segmentation via Morphological Region Perturbation and Synthetic Label-to-SAR Generation | Andre Juarez et.al. | 2512.02290 | null |
| 2025-12-01 | High-Precision Simulations of the Parity Conserving Directed Percolation Universality Class in 1+1 Dimensions | Peter Grassberger et.al. | 2512.02241 | null |
| 2025-12-01 | TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models | Zhiheng Liu et.al. | 2512.02014 | null |
| 2025-12-01 | Low-Rank Prehab: Preparing Neural Networks for SVD Compression | Haoran Qin et.al. | 2512.01980 | link |
| 2025-12-01 | Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design | Danny Reidenbach et.al. | 2512.01976 | null |
| 2025-12-01 | Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies | Bailiang Jian et.al. | 2512.01913 | null |
| 2025-12-01 | Delays in Spiking Neural Networks: A State Space Model Approach | Sanja Karilanova et.al. | 2512.01906 | null |
| 2025-12-01 | Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos | Xavier Thomas et.al. | 2512.01803 | null |
| 2025-12-01 | Quantum dynamics of monitored free fermions | Igor Poboiko et.al. | 2512.01772 | null |
| 2025-12-01 | Mofasa: A Step Change in Metal-Organic Framework Generation | Vaidotas Simkus et.al. | 2512.01756 | null |
| 2025-12-01 | ViT$^3$: Unlocking Test-Time Training in Vision | Dongchen Han et.al. | 2512.01643 | null |
| 2025-12-01 | Improved Disease Outbreak Detection from Out-of-sequence measurements Using Markov-switching Fixed-lag Particle Filters | Conor Rosato et.al. | 2512.01639 | null |
| 2025-12-01 | Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval | Xin Wang et.al. | 2512.01636 | null |
| 2025-12-01 | Parallel Delayed Memory Units for Enhanced Temporal Modeling in Biomedical and Bioacoustic Signal Analysis | Pengfei Sun et.al. | 2512.01626 | null |
| 2025-12-01 | Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation | Thao Thi Phuong Dao et.al. | 2512.01589 | null |
| 2025-12-01 | Real-Space Spectral Approach to Orbital Magnetization | Kevin J. U. Vidarte et.al. | 2512.01575 | null |
| 2025-12-01 | Q2D2: A Geometry-Aware Audio Codec Leveraging Two-Dimensional Quantization | Tal Shuster et.al. | 2512.01537 | null |
| 2025-12-01 | Multi-Path Collaborative Reasoning via Reinforcement Learning | Jindi Lv et.al. | 2512.01485 | link |
| 2025-12-01 | Language-Guided Open-World Anomaly Segmentation | Klara Reichard et.al. | 2512.01427 | null |
| 2025-12-01 | Fourier Neural Operators Explained: A Practical Perspective | Valentin Duruisseaux et.al. | 2512.01421 | null |
| 2025-12-01 | PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications | Yunze Liu et.al. | 2512.01383 | null |
| 2025-12-01 | InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision | Chenting Wang et.al. | 2512.01342 | null |
| 2025-12-01 | Gaussian Process State-Space Modeling and Particle Filtering for Time Series Decomposition and Nonlinear Signal Extraction | Genshiro Kitagawa et.al. | 2512.01162 | null |
| 2025-11-30 | Upper Approximation Bounds for Neural Oscillators | Zifeng Huang et.al. | 2512.01015 | null |
| 2025-11-29 | A State-Space Approach to Modeling Tire Degradation in Formula 1 Racing | Cole Cappello et.al. | 2512.00640 | null |
| 2025-11-29 | DPNet: Doppler LiDAR Motion Planning for Highly-Dynamic Environments | Wei Zuo et.al. | 2512.00375 | null |
| 2025-11-28 | ReactionMamba: Generating Short & Long Human Reaction Sequences | Hajra Anwar Beg et.al. | 2512.00208 | null |
| 2025-11-28 | Wilson loops, symmetries, and selective bulk-boundary correspondence in higher-order topological insulators | Suman Aich et.al. | 2511.23471 | null |
| 2025-11-28 | Visual Generation Tuning | Jiahao Guo et.al. | 2511.23469 | null |
| 2025-11-28 | SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments | Xinyi Li et.al. | 2511.23465 | null |
| 2025-11-28 | Kinetic Mixing and the Phantom Illusion: Axion-Dilaton Quintessence in Light of DESI DR2 | Michael W. Toomey et.al. | 2511.23463 | null |
| 2025-11-28 | DisMo: Disentangled Motion Representations for Open-World Motion Transfer | Thomas Ressler-Antal et.al. | 2511.23428 | null |
| 2025-11-28 | Hilbert space fragmentation in driven-dephasing Rydberg atom array | Tianyi Yan et.al. | 2511.23395 | null |
| 2025-11-28 | Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon | Isabel Whiteley Tscherniak et.al. | 2511.23384 | null |
| 2025-11-28 | Functional Program Synthesis with Higher-Order Functions and Recursion Schemes | Matheus Campos Fernandes et.al. | 2511.23354 | null |
| 2025-11-28 | Data-driven Reachability Verification with Probabilistic Guarantees under Koopman Spectral Uncertainty | Jianqiang Ding et.al. | 2511.23322 | null |
| 2025-11-28 | Magnetic Dipole Portal Vector Dark Matter at Fixed-Targets | Avik Banerjee et.al. | 2511.23259 | null |
| 2025-11-28 | SDE-Attention: Latent Attention in SDE-RNNs for Irregularly Sampled Time Series with Missing Data | Yuting Fang et.al. | 2511.23238 | null |
| 2025-11-28 | Incorporating Ephemeral Traffic Waves in A Data-Driven Framework for Microsimulation in CARLA | Alex Richardson et.al. | 2511.23236 | null |
| 2025-11-28 | Constraining the Inert Doublet Model at the LHC | Jayita Lahiri et.al. | 2511.23133 | null |
| 2025-11-28 | Einstein’s 1935 Letters to Schrödinger and Popper and the Boundaries of the PBR $ψ$-Epistemic Framework | Galina Weinstein et.al. | 2511.23125 | null |
| 2025-11-28 | Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM | Mengjie Liu et.al. | 2511.23119 | null |
| 2025-11-28 | Time Extrapolation with Graph Convolutional Autoencoder and Tensor Train Decomposition | Yuanhong Chen et.al. | 2511.23037 | null |
| 2025-11-28 | Joint Bayesian Inference of Parameter and Discretization Error Uncertainties in ODE Models | Shoji Toyota et.al. | 2511.23010 | null |
| 2025-11-28 | SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving | Wonjeong Ryu et.al. | 2511.22865 | null |
| 2025-11-28 | TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE | Jiawen Wei et.al. | 2511.22853 | null |
| 2025-11-28 | PerfMamba: Performance Analysis and Pruning of Selective State Space Models | Abdullah Al Asif et.al. | 2511.22849 | null |
| 2025-11-26 | TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos | Seungjae Lee et.al. | 2511.21690 | null |
| 2025-11-26 | G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning | Wenbo Hu et.al. | 2511.21688 | null |
| 2025-11-26 | Visualizing LLM Latent Space Geometry Through Dimensionality Reduction | Alex Ning et.al. | 2511.21594 | null |
| 2025-11-26 | Machine Learning Approaches to Clinical Risk Prediction: Multi-Scale Temporal Alignment in Electronic Health Records | Wei-Chen Chang et.al. | 2511.21561 | null |
| 2025-11-26 | MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors | Thai-Khanh Nguyen et.al. | 2511.21550 | null |
| 2025-11-26 | Simulations of high-energy neutrino emissions from blazars with the LeHa-Paris code | Francesco Carenini et.al. | 2511.21532 | null |
| 2025-11-26 | Sector theory of Levin-Wen models I : Classification of Anyon Sectors | Alex Bols et.al. | 2511.21521 | null |
| 2025-11-26 | Nested ensemble Kalman filter for static parameter inference in nonlinear state-space models | Andrew Golightly et.al. | 2511.21497 | null |
| 2025-11-26 | Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning | Taehoon Kim et.al. | 2511.21490 | null |
| 2025-11-26 | SONAR: Spectral-Contrastive Audio Residuals for Generalizable Deepfake Detection | Ido Nitzan Hidekel et.al. | 2511.21325 | null |
| 2025-11-26 | PathMamba: A Hybrid Mamba-Transformer for Topologically Coherent Road Segmentation in Satellite Imagery | Jules Decaestecker et.al. | 2511.21298 | null |
| 2025-11-26 | Exploring muonphilic dark matter with the $Z_2$-even mediator at muon colliders | Wanyun Chen et.al. | 2511.21290 | null |
| 2025-11-26 | Floquet thermalization by power-law induced permutation symmetry breaking | Manju C et.al. | 2511.21284 | null |
| 2025-11-26 | I-GLIDE: Input Groups for Latent Health Indicators in Degradation Estimation | Lucas Thil et.al. | 2511.21208 | null |
| 2025-11-26 | Vortex-Enhanced Zitterbewegung in Relativistic Electron Wave Packets | Zhongze Guo et.al. | 2511.21142 | null |
| 2025-11-26 | Referring Video Object Segmentation with Cross-Modality Proxy Queries | Baoli Sun et.al. | 2511.21139 | null |
| 2025-11-26 | DeepRFTv2: Kernel-level Learning for Image Deblurring | Xintian Mao et.al. | 2511.21132 | null |
| 2025-11-26 | OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection | Chujie Wang et.al. | 2511.21064 | null |
| 2025-11-26 | Gated KalmaNet: A Fading Memory Layer Through Test-Time Ridge Regression | Liangzu Peng et.al. | 2511.21016 | null |
| 2025-11-26 | SpaceX: Exploring metrics with the SPACE model for developer productivity | Sanchit Kaul et.al. | 2511.20955 | null |
| 2025-11-25 | DINO-Tok: Adapting DINO for Visual Tokenizers | Mingkai Jia et.al. | 2511.20565 | link |
| 2025-11-25 | From Features to States: Data-Driven Selection of Measured State Variables via RFE-DMDc | Haoyu Wang et.al. | 2511.20552 | null |
| 2025-11-25 | Physically Interpretable Interatomic Potentials via Symbolic Regression and Reinforcement Learning | Bilvin Varughese et.al. | 2511.20506 | null |
| 2025-11-25 | Generative Modeling with Manifold Percolation | Rui Tong et.al. | 2511.20503 | null |
| 2025-11-25 | Universe of Thoughts: Enabling Creative Reasoning with Large Language Models | Yuto Suzuki et.al. | 2511.20471 | null |
| 2025-11-25 | Advances and Challenges in Solar Flare Prediction: A Review | Mingfu Shao et.al. | 2511.20465 | null |
| 2025-11-25 | STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow | Jiatao Gu et.al. | 2511.20462 | null |
| 2025-11-25 | Towards Trustworthy Wi-Fi Sensing: Systematic Evaluation of Deep Learning Model Robustness to Adversarial Attacks | Shreevanth Krishnaa Gopalakrishnan et.al. | 2511.20456 | null |
| 2025-11-25 | Adaptive Meshing for CPA Lyapunov Function Synthesis | Amy K. Strong et.al. | 2511.20443 | null |
| 2025-11-25 | The effect of sound speed on the gravitational wave spectrum of first order phase transitions in the early universe | Mika Mäki et.al. | 2511.20436 | null |
| 2025-11-25 | BRIC: Bridging Kinematic Plans and Physical Control at Test Time | Dohun Lim et.al. | 2511.20431 | null |
| 2025-11-25 | Proximity driven photon-tunneling in chiral quantum hybrid systems | Aryan Pratap Srivastava et.al. | 2511.20357 | null |
| 2025-11-25 | Active Inference in Discrete State Spaces from First Principles | Patrick Kenny et.al. | 2511.20321 | null |
| 2025-11-25 | Improving Language Agents through BREW | Shashank Kirtania et.al. | 2511.20297 | null |
| 2025-11-25 | DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion | Yinghui Li et.al. | 2511.20278 | link |
| 2025-11-25 | PromptMoG: Enhancing Diversity in Long-Prompt Image Generation via Prompt Embedding Mixture-of-Gaussian Sampling | Bo-Kai Ruan et.al. | 2511.20251 | link |
| 2025-11-25 | POMDP-Based Routing for DTNs with Partial Knowledge and Dependent Failures | Gregory F. Stock et.al. | 2511.20241 | null |
| 2025-11-25 | Communication-Efficient Learning for Satellite Constellations | Ruxandra-Stefania Tudose et.al. | 2511.20220 | null |
| 2025-11-25 | Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis | Mohammad Mahdi et.al. | 2511.20186 | null |
| 2025-11-25 | Alzheimers Disease Progression Prediction Based on Manifold Mapping of Irregularly Sampled Longitudinal Data | Xin Hong et.al. | 2511.20154 | null |
| 2025-11-24 | Cloud4D | Jacob Lin et.al. | 2511.19431 | null |
| 2025-11-24 | Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection | Zixuan Wang et.al. | 2511.19306 | null |
| 2025-11-24 | Innovative Modular Design and Kinematic Approach based on Screw Theory for Triple Scissors Links Deployable Space Antenna Mechanism | Mamoon Aamir et.al. | 2511.19287 | null |
| 2025-11-24 | What is the signature of a trion in photoemission? | Jinyuan Wu et.al. | 2511.19280 | null |
| 2025-11-24 | Solar-GECO: Perovskite Solar Cell Property Prediction with Geometric-Aware Co-Attention | Lucas Li et.al. | 2511.19263 | null |
| 2025-11-24 | LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models | Shuai Wang et.al. | 2511.19261 | null |
| 2025-11-24 | Learning Plug-and-play Memory for Guiding Video Diffusion Models | Selena Song et.al. | 2511.19229 | link |
| 2025-11-24 | Reference-Free Sampling-Based Model Predictive Control | Fabian Schramm et.al. | 2511.19204 | null |
| 2025-11-24 | Information Physics of Intelligence: Unifying Logical Depth and Entropy under Thermodynamic Constraints | Jianfeng Xu et.al. | 2511.19156 | null |
| 2025-11-24 | Fast-Converging and Asymptotic-Preserving DSMC | Bin Hu et.al. | 2511.19061 | null |
| 2025-11-24 | Latent-Space Non-Linear Model Predictive Control for Partially-Observable Systems | Luigi Marra et.al. | 2511.19056 | null |
| 2025-11-24 | Multigrid with Linear Storage Complexity | Daniel Bauer et.al. | 2511.19036 | null |
| 2025-11-24 | Web of Non-invertible Dualities for (2+1) Dimensional Models with Subsystem Symmetries | Avijit Maity et.al. | 2511.18969 | null |
| 2025-11-24 | BSN-V: The First Detailed Light Curve Modeling of Eight Totally Eclipsing Contact Binary Stars Using Ground-Based and TESS Observations | Atila Poro et.al. | 2511.18909 | null |
| 2025-11-24 | MFmamba: A Multi-function Network for Panchromatic Image Resolution Restoration Based on State-Space Model | Qian Jiang et.al. | 2511.18888 | null |
| 2025-11-24 | KernelBand: Boosting LLM-based Kernel Optimization with a Hierarchical and Hardware-aware Multi-armed Bandit | Dezhi Ran et.al. | 2511.18868 | null |
| 2025-11-24 | SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation | Nimeshika Udayangani et.al. | 2511.18816 | null |
| 2025-11-24 | ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection | Ruize Ma et.al. | 2511.18780 | null |
| 2025-11-24 | SAOT: An Enhanced Locality-Aware Spectral Transformer for Solving PDEs | Chenhong Zhou et.al. | 2511.18777 | link |
| 2025-11-24 | Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers | Yiqing Shi et.al. | 2511.18673 | null |
| 2025-11-21 | Counterfactual World Models via Digital Twin-conditioned Video Diffusion | Yiqing Shen et.al. | 2511.17481 | null |
| 2025-11-21 | Moving superfluids in the rotating universe | Jose Beltrán Jiménez et.al. | 2511.17472 | null |
| 2025-11-21 | SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding | Nikolay Nikolov et.al. | 2511.17411 | null |
| 2025-11-21 | Selective Rotary Position Embedding | Sajad Movahedi et.al. | 2511.17388 | null |
| 2025-11-21 | ReBaPL: Repulsive Bayesian Prompt Learning | Yassir Bendou et.al. | 2511.17339 | null |
| 2025-11-21 | Parameter Inference from Final-State Entanglement in Higgs Decays | Jia Liu et.al. | 2511.17321 | null |
| 2025-11-21 | SpatialGeo: Boosting Spatial Reasoning in Multimodal LLMs via Geometry-Semantics Fusion | Jiajie Guo et.al. | 2511.17308 | null |
| 2025-11-21 | SAVeD: Semantic Aware Version Discovery | Artem Frenk et.al. | 2511.17298 | null |
| 2025-11-21 | PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention | Yipeng Chen et.al. | 2511.17185 | null |
| 2025-11-21 | On the Predictive Skill of Artificial Intelligence-based Weather Models for Extreme Events using Uncertainty Quantification | Rodrigo Almeida et.al. | 2511.17176 | null |
| 2025-11-21 | Dark Matter Admixed White Dwarfs: A Single-Fluid Approach | Rajasmita Sahoo et.al. | 2511.17120 | null |
| 2025-11-21 | RL-AD-Net: Reinforcement Learning Guided Adaptive Displacement in Latent Space for Refined Point Cloud Completion | Bhanu Pratap Paregi et.al. | 2511.17054 | null |
| 2025-11-21 | Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters | Zhan Su et.al. | 2511.17044 | null |
| 2025-11-21 | CLLMRec: LLM-powered Cognitive-Aware Concept Recommendation via Semantic Alignment and Prerequisite Knowledge Distillation | Xiangrui Xiong et.al. | 2511.17041 | null |
| 2025-11-21 | Generative MIMO Beam Map Construction for Location Recovery and Beam Tracking | Wangqian Chen et.al. | 2511.17007 | null |
| 2025-11-21 | FLUID: Training-Free Face De-identification via Latent Identity Substitution | Jinhyeong Park et.al. | 2511.17005 | null |
| 2025-11-21 | Stable Offline Hand-Eye Calibration for any Robot with Just One Mark | Sicheng Xie et.al. | 2511.17001 | null |
| 2025-11-21 | The Finer the Better: Towards Granular-aware Open-set Domain Generalization | Yunyun Wang et.al. | 2511.16979 | null |
| 2025-11-21 | Flow-Guided Implicit Neural Representation for Motion-Aware Dynamic MRI Reconstruction | Baoqing Li et.al. | 2511.16948 | null |
| 2025-11-21 | Improving Latent Reasoning in LLMs via Soft Concept Mixing | Kang Wang et.al. | 2511.16885 | null |
| 2025-11-20 | Dataset Distillation for Pre-Trained Self-Supervised Vision Models | George Cazenavette et.al. | 2511.16674 | null |
| 2025-11-20 | Strained hyperbolic Dirac fermions: Zero modes, flat bands, and competing orders | Christopher A. Leong et.al. | 2511.16667 | null |
| 2025-11-20 | Time dependent loss reweighting for flow matching and diffusion models is theoretically justified | Lukas Billera et.al. | 2511.16599 | null |
| 2025-11-20 | TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding | Boshen Xu et.al. | 2511.16595 | null |
| 2025-11-20 | Comment on: “Scaling and Universality at Noisy Quench Dynamical Quantum Phase Transitions” | J. Sirker et.al. | 2511.16509 | null |
| 2025-11-20 | Order-by-disorder from Schwinger bosons in a frustrated honeycomb ferromagnet | Arnaud Ralko et.al. | 2511.16429 | null |
| 2025-11-20 | Search for Higgsinos in final states with low-momentum lepton-track pairs at 13 TeV | CMS Collaboration et.al. | 2511.16394 | null |
| 2025-11-20 | Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning | Mohammad Areeb Qazi et.al. | 2511.16333 | null |
| 2025-11-20 | SeSE: A Structural Information-Guided Uncertainty Quantification Framework for Hallucination Detection in LLMs | Xingtao Zhao et.al. | 2511.16275 | null |
| 2025-11-20 | SwiTrack: Tri-State Switch for Cross-Modal Object Tracking | Boyue Xu et.al. | 2511.16227 | null |
| 2025-11-20 | CausalMamba: Interpretable State Space Modeling for Temporal Rumor Causality | Xiaotong Zhan et.al. | 2511.16191 | null |
| 2025-11-20 | Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion | Lirui Zhang et.al. | 2511.16161 | null |
| 2025-11-20 | How Noise Benefits AI-generated Image Detection | Jiazhen Yan et.al. | 2511.16136 | null |
| 2025-11-20 | Decoupling Complexity from Scale in Latent Diffusion Model | Tianxiong Zhong et.al. | 2511.16117 | null |
| 2025-11-20 | Parallelizable Complex Neural Dynamics Models for PMSM Temperature Estimation with Hardware Acceleration | Xinyuan Liao et.al. | 2511.16093 | null |
| 2025-11-20 | A Hybrid Proactive And Predictive Framework For Edge Cloud Resource Management | Hrikshesh Kumar et.al. | 2511.16075 | null |
| 2025-11-20 | High-Throughput Exploration of Refractory High-Entropy Alloys for Strength and Plasticity | Stephen A. Giles et.al. | 2511.16057 | null |
| 2025-11-20 | Exploiting Inter-Sample Information for Long-tailed Out-of-Distribution Detection | Nimeshika Udayangani et.al. | 2511.16015 | null |
| 2025-11-20 | Synergizing Deconfounding and Temporal Generalization For Time-series Counterfactual Outcome Estimation | Yiling Liu et.al. | 2511.16006 | null |
| 2025-11-19 | Breaking the Bottleneck with DiffuApriel: High-Throughput Diffusion LMs with Mamba Backbone | Vaibhav Singh et.al. | 2511.15927 | null |
| 2025-11-19 | From Qubits to Couplings: A Hybrid Quantum Machine Learning Framework for LHC Physics | Marwan Ait Haddou et.al. | 2511.15672 | null |
| 2025-11-19 | SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models | Senyu Fei et.al. | 2511.15605 | null |
| 2025-11-19 | Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi | Kayo Tei et.al. | 2511.15581 | null |
| 2025-11-19 | Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA | Yukun Du et.al. | 2511.15551 | null |
| 2025-11-19 | Partial-Wave Unitarity Bounds on Higher-Dimensional Operators from 2-to-$N$ Scattering | Céline Degrande et.al. | 2511.15524 | null |
| 2025-11-19 | Probing the disk-jet coupling in M87 | Ainara Saiz-Pérez et.al. | 2511.15482 | null |
| 2025-11-19 | Robust H-infinity control and worst-case search in constrained parametric space | Ervan Kassarian et.al. | 2511.15480 | null |
| 2025-11-19 | Proximal Approximate Inference in State-Space Models | Hany Abdulsamad et.al. | 2511.15409 | null |
| 2025-11-19 | C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models | Nayoung Oh et.al. | 2511.15333 | null |
| 2025-11-19 | SkyEgg: Joint Implementation Selection and Scheduling for Hardware Synthesis using E-graphs | Youwei Xiao et.al. | 2511.15323 | null |
| 2025-11-19 | Tensor-network approach to quantum optical state evolution beyond the Fock basis | Nikolay Kapridov et.al. | 2511.15295 | null |
| 2025-11-19 | Reinforcement Learning in Queue-Reactive Models: Application to Optimal Execution | Tomas Espana et.al. | 2511.15262 | null |
| 2025-11-19 | PLATONT: Learning a Platonic Representation for Unified Network Tomography | Chengze Du et.al. | 2511.15251 | null |
| 2025-11-19 | Modelling and Model-Checking a ROS2 Multi-Robot System using Timed Rebeca | Hiep Hong Trinh et.al. | 2511.15227 | null |
| 2025-11-19 | Well-posedness and time-asymptotic of Boltzmann equations for monatomic and polyatomic mixtures | Ricardo Alonso et.al. | 2511.15185 | null |
| 2025-11-19 | Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance | Songze Li et.al. | 2511.15164 | null |
| 2025-11-19 | Robust outlier-adjusted mean-shift estimation of state-space models | Rajan Shankar et.al. | 2511.15155 | null |
| 2025-11-19 | TiCAL: Typicality-Based Consistency-Aware Learning for Multimodal Emotion Recognition | Wen Yin et.al. | 2511.15085 | null |
| 2025-11-19 | Fourier-KAN-Mamba: A Novel State-Space Equation Approach for Time-Series Anomaly Detection | Xiancheng Wang et.al. | 2511.15083 | null |
| 2025-11-19 | MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation | Shengjing Tian et.al. | 2511.15077 | null |
| 2025-11-18 | From Random Determinants to the Ground State | Hao Zhang et.al. | 2511.14734 | null |
| 2025-11-18 | Charged Higgs bosons associated with neutral gauge bosons at future multi–TeV muon colliders | Khiem Hong Phan et.al. | 2511.14525 | null |
| 2025-11-18 | Neural Networks-Enabled Channel Reconstruction for Fluid Antenna Systems: A Data-Driven Approach | Haoyu Liang et.al. | 2511.14520 | null |
| 2025-11-18 | D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images | Taifour Yousra Nabila et.al. | 2511.14518 | null |
| 2025-11-18 | Full Atom Peptide Design via Riemannian Euclidean Bayesian Flow Networks | Hao Qian et.al. | 2511.14516 | null |
| 2025-11-18 | Parameter Aware Mamba Model for Multi-task Dense Prediction | Xinzhuo Yu et.al. | 2511.14503 | null |
| 2025-11-18 | An introduction to Coupling | Artur O. Lopes et.al. | 2511.14489 | null |
| 2025-11-18 | Towards a Comprehensive Theory of Reservoir Computing | Denis Kleyko et.al. | 2511.14484 | null |
| 2025-11-18 | Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation | Aditi Agarwal et.al. | 2511.14481 | null |
| 2025-11-18 | Hölder regularity in bang-bang type affine optimal control problems | Alberto Domínguez Corella et.al. | 2511.14459 | null |
| 2025-11-18 | H-LDM: Hierarchical Latent Diffusion Models for Controllable and Interpretable PCG Synthesis from Clinical Metadata | Chenyang Xu et.al. | 2511.14312 | null |
| 2025-11-18 | Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation | Weimin Bai et.al. | 2511.14271 | null |
| 2025-11-18 | Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction | Juncheng Hu et.al. | 2511.14237 | null |
| 2025-11-18 | EBind: a practical approach to space binding | Jim Broadbent et.al. | 2511.14229 | null |
| 2025-11-18 | InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior | Weimin Bai et.al. | 2511.14208 | null |
| 2025-11-18 | FreeMusco: Motion-Free Learning of Latent Control for Morphology-Adaptive Locomotion in Musculoskeletal Characters | Minkwan Kim et.al. | 2511.14205 | null |
| 2025-11-18 | Learning Representation and Synergy Invariances: A Provable Framework for Generalized Multimodal Face Anti-Spoofing | Xun Lin et.al. | 2511.14157 | null |
| 2025-11-18 | State-Space Representation of INGARCH Models and Their Application in Insurance | Jae Youn Ahn et.al. | 2511.14091 | null |
| 2025-11-18 | Cosmological dynamics of interacting dark matter-dark energy in generalized Rastall gravity | Manuel Gonzalez-Espinoza et.al. | 2511.14089 | null |
| 2025-11-18 | Enhancing Non-classical Properties of Entangled Coherent States via Post-Selected von Neumann Measurements | Janarbek Yuanbek et.al. | 2511.14079 | null |
| 2025-11-17 | Open-shell frozen natural orbital approach for quantum eigensolvers | Angela F. Harper et.al. | 2511.13677 | null |
| 2025-11-17 | Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly? | Chunqiu Steven Xia et.al. | 2511.13646 | null |
| 2025-11-17 | Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification | Linhan Zhou et.al. | 2511.13575 | null |
| 2025-11-17 | Coclique level structure for stochastic chemical reaction networks | Simone Bruno et.al. | 2511.13569 | null |
| 2025-11-17 | A Quantum Tensor Network-Based Viewpoint for Modeling and Analysis of Time Series Data | Pragatheeswaran Vipulananthan et.al. | 2511.13514 | null |
| 2025-11-17 | Naga: Vedic Encoding for Deep State Space Models | Melanie Schaller et.al. | 2511.13510 | null |
| 2025-11-17 | Explainable RL Policies by Distilling to Locally-Specialized Linear Policies with Voronoi State Partitioning | Senne Deproost et.al. | 2511.13322 | null |
| 2025-11-17 | Voltage-Based Unsupervised Learning Framework for Bridge Damage Detection in Simultaneous Energy Harvesting and Sensing Systems | S. Yao et.al. | 2511.13291 | null |
| 2025-11-17 | Spectroscopic signatures of emergent elementary excitations in a kinetically constrained long-range interacting two-dimensional spin system | Tobias Kaltenmark et.al. | 2511.13279 | null |
| 2025-11-17 | MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI | Malek Al Abed et.al. | 2511.13232 | null |
| 2025-11-17 | 3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale | Yijia Fan et.al. | 2511.13211 | null |
| 2025-11-17 | Modeling group heterogeneity in spatio-temporal data via physics-informed semiparametric regression | Marco F. De Sanctis et.al. | 2511.13203 | null |
| 2025-11-17 | Video Spatial Reasoning with Object-Centric 3D Rollout | Haoran Tang et.al. | 2511.13190 | null |
| 2025-11-17 | Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework | Diego Ortego et.al. | 2511.13189 | null |
| 2025-11-17 | WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection | Longhui Zheng et.al. | 2511.13138 | null |
| 2025-11-17 | Departures: Distributional Transport for Single-Cell Perturbation Prediction with Neural Schrödinger Bridges | Changxi Chi et.al. | 2511.13124 | null |
| 2025-11-17 | Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining | Zhaocheng Yu et.al. | 2511.13113 | null |
| 2025-11-17 | DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection | Jiazhen Yan et.al. | 2511.13108 | null |
| 2025-11-17 | Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact | Satyanarayan Pati et.al. | 2511.13057 | null |
| 2025-11-17 | Monocular 3D Lane Detection via Structure Uncertainty-Aware Network with Curve-Point Queries | Ruixin Liu et.al. | 2511.13055 | null |
| 2025-11-14 | Coherent-state path integrals in quantum thermodynamics | Luca Salasnich et.al. | 2511.11547 | null |
| 2025-11-14 | Bridging Hidden States in Vision-Language Models | Benjamin Fein-Ashley et.al. | 2511.11526 | null |
| 2025-11-14 | Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective | Nhat Chung et.al. | 2511.11478 | null |
| 2025-11-14 | Unsupervised Motion-Compensated Decomposition for Cardiac MRI Reconstruction via Neural Representation | Xuanyu Tian et.al. | 2511.11436 | null |
| 2025-11-14 | Lorentz Transformation in Quantum Mechanics | Marcello Baldo et.al. | 2511.11342 | null |
| 2025-11-14 | BOA Constrictor: A Mamba-based lossless compressor for High Energy Physics data | Akshat Gupta et.al. | 2511.11337 | null |
| 2025-11-14 | RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms | Yitian Kou et.al. | 2511.11323 | null |
| 2025-11-14 | Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs | Jitesh Chavan et.al. | 2511.11243 | null |
| 2025-11-14 | Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation | Quoc-Huy Trinh et.al. | 2511.11177 | null |
| 2025-11-14 | Non-Gaussianity-induced enhanced target-finding dynamics of confined colloids | Guirec de Tournemire et.al. | 2511.11117 | null |
| 2025-11-14 | CPT symmetry in the mirror universe | Natalia Gorobey et.al. | 2511.11109 | null |
| 2025-11-14 | On the accuracy of the model predictive control method | Georgi Angelov et.al. | 2511.11098 | null |
| 2025-11-14 | A Space-Time Transformer for Precipitation Forecasting | Levi Harris et.al. | 2511.11090 | null |
| 2025-11-14 | Evaluating Latent Generative Paradigms for High-Fidelity 3D Shape Completion from a Single Depth Image | Matthias Humt et.al. | 2511.11074 | null |
| 2025-11-14 | Autonomous motion in changing environment, fibrations and reaction mechanisms | Michael Farber et.al. | 2511.11042 | null |
| 2025-11-14 | Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation | Zhiwei Zhang et.al. | 2511.11011 | null |
| 2025-11-14 | ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization | Anzhe Cheng et.al. | 2511.10971 | null |
| 2025-11-14 | Subgrid Stress Modelling with Multi-dimensional State Space Sequence Models | Andy Wu et.al. | 2511.10910 | null |
| 2025-11-14 | Tracking EEG Thalamic and Cortical Focal Brain Activity using Standardized Kalman Filtering with Kinematics Modeling | Veikka Piispa et.al. | 2511.10877 | null |
| 2025-11-13 | Adaptive Digital Twin of Sheet Metal Forming via Proper Orthogonal Decomposition-Based Koopman Operator with Model Predictive Control | Yi-Ping Chen et.al. | 2511.10852 | null |
| 2025-11-13 | Impacts of Decoder Latency on Utility-Scale Quantum Computer Architectures | Abdullah Khalid et.al. | 2511.10633 | null |
| 2025-11-13 | OmniVGGT: Omni-Modality Driven Visual Geometry Grounded | Haosong Peng et.al. | 2511.10560 | link |
| 2025-11-13 | Friction terms in multi-fluid description of heavy-ion collisions | Clemens Werthmann et.al. | 2511.10487 | null |
| 2025-11-13 | From Local Nonclassicality to Entanglement: A Convexity Law for Single-Excitation Dynamics | Atta ur Rahman et.al. | 2511.10470 | null |
| 2025-11-13 | Continuous Branching Processes with Settlement in Cancer Metastasis: Stochastic Modelling and the Feller Property | Ivan Biočić et.al. | 2511.10456 | null |
| 2025-11-13 | Chromatic Zeros on the Limit $G^{(p,\ell)}_\infty$ of the Family $G^{(p,\ell)}_m$ of Hierarchical Graphs | Shu-Chiuan Chang et.al. | 2511.10405 | null |
| 2025-11-13 | FOUND: Fourier-based von Mises Distribution for Robust Single Domain Generalization in Object Detection | Mengzhu Wang et.al. | 2511.10352 | null |
| 2025-11-13 | Out-of-Context Misinformation Detection via Variational Domain-Invariant Learning with Test-Time Training | Xi Yang et.al. | 2511.10213 | null |
| 2025-11-13 | Scalable data-driven modeling of microstructure evolution by learning local dependency and spatiotemporal translation invariance rules in phase field simulation | Zishuo Lan et.al. | 2511.10171 | link |
| 2025-11-13 | RI-Loss: A Learnable Residual-Informed Loss for Time Series Forecasting | Jieting Wang et.al. | 2511.10130 | null |
| 2025-11-13 | Geometric foundations of thermodynamics in the quantum regime | Álvaro Tejero et.al. | 2511.10125 | null |
| 2025-11-13 | T2IBias: Uncovering Societal Bias Encoded in the Latent Space of Text-to-Image Generative Models | Abu Sufian et.al. | 2511.10089 | null |
| 2025-11-13 | Efficient Thought Space Exploration through Strategic Intervention | Ziheng Li et.al. | 2511.10038 | null |
| 2025-11-13 | The Age-Structured Chemostat with Substrate Dynamics as a Control System | Iasson Karafyllis et.al. | 2511.09963 | null |
| 2025-11-13 | A Universal Block Error Rate Bound for Fluid Antenna Systems | Zhentian Zhang et.al. | 2511.09929 | null |
| 2025-11-13 | Boosting In-Silicon Directed Evolution with Fine-Tuned Protein Language Model and Tree Search | Yaodong Yang et.al. | 2511.09900 | null |
| 2025-11-13 | Interaction-induced Dimension Reduction for Bound States in Microwave-Shielded Ultracold Molecules | Haitian Wang et.al. | 2511.09856 | null |
| 2025-11-12 | Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models | Konstantinos M. Dafnis et.al. | 2511.09809 | null |
| 2025-11-12 | A Robust Task-Level Control Architecture for Learned Dynamical Systems | Eshika Pathak et.al. | 2511.09790 | null |
| 2025-11-12 | Ksurf-Drone: Attention Kalman Filter for Contextual Bandit Optimization in Cloud Resource Allocation | Michael Dang’ana et.al. | 2511.09766 | null |
| 2025-11-12 | CloudMamba: Grouped Selective State Spaces for Point Cloud Analysis | Kanglin Qu et.al. | 2511.07823 | link |
| 2025-11-10 | On the Redundant Distributed Observability of Mixed Traffic Transportation Systems | M. Doostmohammadian et.al. | 2511.06950 | null |
| 2025-11-10 | Dual Mamba for Node-Specific Representation Learning: Tackling Over-Smoothing with Selective State Space Modeling | Xin He et.al. | 2511.06756 | null |
| 2025-11-08 | L2T-Hyena: Enhancing State-Space Models with an Adaptive Learn-to-Teach Framework | Fatemeh Sobati et.al. | 2511.05926 | null |
| 2025-11-07 | Sequential Markov chain Monte Carlo for Filtering of State-Space Models with Low or Degenerate Observation Noise | Abylay Zhumekenov et.al. | 2511.04975 | null |
| 2025-11-06 | Generative Bayesian Filtering and Parameter Learning | Edoardo Marcelli et.al. | 2511.04552 | null |
| 2025-11-06 | Online Bayesian Experimental Design for Partially Observed Dynamical Systems | Sara Pérez-Vieites et.al. | 2511.04403 | null |
| 2025-11-05 | FAPEX: Fractional Amplitude-Phase Expressor for Robust Cross-Subject Seizure Prediction | Ruizhe Zheng et.al. | 2511.03263 | null |
| 2025-11-04 | Apriel-H1: Towards Efficient Enterprise Reasoning Models | Oleksiy Ostapenko et.al. | 2511.02651 | null |
| 2025-11-10 | MM-UNet: Morph Mamba U-shaped Convolutional Networks for Retinal Vessel Segmentation | Jiawen Liu et.al. | 2511.02193 | null |
| 2025-11-03 | MVSMamba: Multi-View Stereo with State Space Model | Jianfei Jiang et.al. | 2511.01315 | null |
| 2025-10-31 | MambaNetLK: Enhancing Colonoscopy Point Cloud Registration with Mamba | Linzhe Jiang et.al. | 2511.00260 | null |
| 2025-10-31 | Context-Gated Cross-Modal Perception with Visual Mamba for PET-CT Lung Tumor Segmentation | Elena Mulero Ayllón et.al. | 2510.27508 | null |
| 2025-10-31 | Versatile and Efficient Medical Image Super-Resolution Via Frequency-Gated Mamba | Wenfeng Huang et.al. | 2510.27296 | null |
| 2025-10-31 | Higher-order Linear Attention | Yifan Zhang et.al. | 2510.27258 | null |
| 2025-10-30 | Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling | Hyunji Lee et.al. | 2510.26912 | null |
| 2025-11-04 | PyDPF: A Python Package for Differentiable Particle Filtering | John-Joseph Brady et.al. | 2510.25693 | null |
| 2025-10-21 | Stable-by-Design Neural Network-Based LPV State-Space Models for System Identification | Ahmet Eren Sertbaş et.al. | 2510.24757 | null |
| 2025-10-28 | DeshadowMamba: Deshadowing as 1D Sequential Similarity | Zhaotong Yang et.al. | 2510.24260 | null |
| 2025-10-27 | Deep Active Inference with Diffusion Policy and Multiple Timescale World Model for Real-World Exploration and Navigation | Riko Yokozawa et.al. | 2510.23258 | null |
| 2025-10-30 | Hankel Singular Value Regularization for Highly Compressible State Space Models | Paul Schwerdtner et.al. | 2510.22951 | null |
| 2025-10-27 | GTR-Mamba: Geometry-to-Tangent Routing for Hyperbolic POI Recommendation | Zhuoxuan Li et.al. | 2510.22942 | null |
| 2025-10-26 | Beyond Semantics: How Temporal Biases Shape Retrieval in Transformer and State-Space Models | Anooshka Bajaj et.al. | 2510.22752 | null |
| 2025-10-26 | Scalable Neural Decoders for Practical Real-Time Quantum Error Correction | Changwon Lee et.al. | 2510.22724 | null |
| 2025-10-24 | Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging | Ying Xue et.al. | 2510.21654 | null |
| 2025-11-03 | ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models | Federico Danieli et.al. | 2510.21450 | null |
| 2025-10-23 | LLM-Integrated Bayesian State Space Models for Multimodal Time-Series Forecasting | Sungjun Cho et.al. | 2510.20952 | null |
| 2025-10-22 | PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation | Zhuoyang Xie et.al. | 2510.19475 | null |
| 2025-10-23 | Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge | Penghao Wang et.al. | 2510.19266 | null |
| 2025-10-21 | $Δ$t-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction | Zhengbo Zhou et.al. | 2510.19003 | null |
| 2025-10-23 | MLMA: Towards Multilingual ASR With Mamba-based Architectures | Mohamed Nabih Ali et.al. | 2510.18684 | null |
| 2025-10-15 | DMTrack: Deformable State-Space Modeling for UAV Multi-Object Tracking with Kalman Fusion and Uncertainty-Aware Association | Zenghuang Fu et.al. | 2510.17860 | null |
| 2025-10-20 | S4ECG: Exploring the impact of long-range interactions for arrhythmia prediction | Tiezhi Wang et.al. | 2510.17406 | null |
| 2025-10-20 | CausalMamba: Scalable Conditional State Space Models for Neural Causal Inference | Sangyoon Bae et.al. | 2510.17318 | null |
| 2025-10-20 | Understanding and Improving Length Generalization in Hierarchical Sparse Attention Models | Jiaqi Leng et.al. | 2510.17196 | null |
| 2025-10-19 | Schrödinger Bridge Mamba for One-Step Speech Enhancement | Jing Yang et.al. | 2510.16834 | null |
| 2025-10-17 | VM-BeautyNet: A Synergistic Ensemble of Vision Transformer and Mamba for Facial Beauty Prediction | Djamel Eddine Boukhari et.al. | 2510.16220 | null |
| 2025-10-17 | StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales | Nyle Siddiqui et.al. | 2510.16209 | null |
| 2025-10-17 | Recursive Inference for Heterogeneous Multi-Output GP State-Space Models with Arbitrary Moment Matching | Tengjie Zheng et.al. | 2510.15390 | null |
| 2025-10-17 | Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding | Shuntaro Suzuki et.al. | 2510.15371 | null |
| 2025-10-16 | To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models | Eran Malach et.al. | 2510.14826 | null |
| 2025-10-16 | State-Space Models for Tabular Prior-Data Fitted Networks | Felix Koch et.al. | 2510.14573 | null |
| 2025-10-16 | A Deep State-Space Model Compression Method using Upper Bound on Output Error | Hiroki Sakamoto et.al. | 2510.14542 | null |
| 2025-10-16 | SHaRe-SSM: An Oscillatory Spiking Neural Network for Target Variable Modeling in Long Sequences | Kartikay Agrawal et.al. | 2510.14386 | null |
| 2025-10-16 | DRBD-Mamba for Robust and Efficient Brain Tumor Segmentation with Analytical Insights | Danish Ali et.al. | 2510.14383 | null |
| 2025-10-15 | Context-Selective State Space Models: Feedback is All You Need | Riccardo Zattra et.al. | 2510.14027 | null |
| 2025-10-16 | The Mechanistic Emergence of Symbol Grounding in Language Models | Shuyu Wu et.al. | 2510.13796 | null |
| 2025-10-14 | One Dimensional CNN ECG Mamba for Multilabel Abnormality Classification in 12 Lead ECG | Huawei Jiang et.al. | 2510.13046 | null |
| 2025-10-14 | State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding | Jiahuan Zhou et.al. | 2510.12160 | link |
| 2025-10-14 | Chimera: State Space Models Beyond Sequences | Aakash Lahoti et.al. | 2510.12111 | link |
| 2025-10-13 | Argus: JAX state-space filtering for gravitational wave detection with a pulsar timing array | Tom Kimpson et.al. | 2510.11077 | null |
| 2025-10-13 | High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation | Runyang Feng et.al. | 2510.11017 | null |
| 2025-10-16 | MSF-Mamba: Motion-aware State Fusion Mamba for Efficient Micro-Gesture Recognition | Deng Li et.al. | 2510.10478 | null |
| 2025-10-10 | Design Principles for Sequence Models via Coefficient Dynamics | Jerome Sieber et.al. | 2510.09389 | null |
| 2025-10-10 | Task-Level Insights from Eigenvalues across Sequence Models | Rahel Rickenbach et.al. | 2510.09379 | null |
| 2025-10-10 | Minkowski-MambaNet: A Point Cloud Framework with Selective State Space Models for Forest Biomass Quantification | Jinxiang Tu et.al. | 2510.09367 | null |
| 2025-10-10 | MambaH-Fit: Rethinking Hyper-surface Fitting-based Point Cloud Normal Estimation via State Space Modelling | Weijia Wang et.al. | 2510.09088 | null |
| 2025-10-13 | Revisiting Node Affinity Prediction in Temporal Graphs | Krishna Sri Ipsit Mantri et.al. | 2510.06940 | null |
| 2025-10-08 | DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining | Zhiliang Zhu et.al. | 2510.06746 | null |
| 2025-10-08 | A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures | Nhat M. Hoang et.al. | 2510.06640 | null |
| 2025-10-09 | Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection? | Sri Durga Sai Sowmya Kadali et.al. | 2510.06594 | null |
| 2025-10-09 | High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training | Zhuoyi Huang et.al. | 2510.05492 | null |
| 2025-10-06 | The End of Transformers? On Challenging Attention and the Rise of Sub-Quadratic Architectures | Alexander M. Fichtl et.al. | 2510.05364 | null |
| 2025-10-06 | Rivaling Transformers: Multi-Scale Structured State-Space Mixtures for Agentic 6G O-RAN | Farhad Rezazadeh et.al. | 2510.05255 | null |
| 2025-10-06 | On Structured State-Space Duality | Jerry Yao-Chieh Hu et.al. | 2510.04944 | null |
| 2025-10-06 | MCMC for State Space models | Paul Fearnhead et.al. | 2510.04932 | null |
| 2025-10-06 | Hybrid Architectures for Language Models: Systematic Analysis and Design Insights | Sangmin Bae et.al. | 2510.04800 | null |
| 2025-10-06 | Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba | Baher Mohammad et.al. | 2510.04738 | null |
| 2025-10-05 | Wave-PDE Nets: Trainable Wave-Equation Layers as an Alternative to Attention | Harshil Vejendla et.al. | 2510.04304 | null |
| 2025-10-09 | The Curious Case of In-Training Compression of State Space Models | Makram Chahine et.al. | 2510.02823 | null |
| 2025-10-02 | Accurate linear modeling of EEG-based cortical activity during a passive motor task with input: a sub-space identification approach | Sanna Bakels et.al. | 2510.02596 | null |
| 2025-10-02 | Bridging the Prediction Error Method and Subspace Identification: A Weighted Null Space Fitting Method | Jiabao He et.al. | 2510.02529 | null |
| 2025-10-01 | Linear RNNs for autoregressive generation of long music samples | Konrad Szewczyk et.al. | 2510.02401 | null |
| 2025-09-30 | Dynamic Modeling and Control System Analysis for Continuous-Disc Filters in Pulp Mill Operations | Jose M. Campos-Salazar et.al. | 2510.02385 | null |
| 2025-10-02 | Knots and variance ordering of sequential Monte Carlo algorithms | Joshua J Bon et.al. | 2510.01901 | null |
| 2025-10-01 | Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model | Hyun-kyu Ko et.al. | 2510.00862 | null |
| 2025-10-01 | Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models | JingChuan Guan et.al. | 2510.00563 | null |
| 2025-09-30 | PRISM: Progressive Rain removal with Integrated State-space Modeling | Pengze Xue et.al. | 2509.26413 | null |
| 2025-09-30 | Neural Network State-Space Estimators | Minxing Sun et.al. | 2509.25959 | null |
| 2025-09-30 | Bringing Emerging Architectures to Sequence Labeling in NLP | Ana Ezquerro et.al. | 2509.25918 | null |
| 2025-09-29 | Benchmarking ECG Foundational Models: A Reality Check Across Clinical Tasks | M A Al-Masud et.al. | 2509.25095 | link |
| 2025-09-29 | DyMoDreamer: World Modeling with Dynamic Modulation | Boxuan Zhang et.al. | 2509.24804 | link |
| 2025-09-29 | Q-Net: Transferable Queue Length Estimation via Kalman-based Neural Networks | Ting Gao et.al. | 2509.24725 | null |
| 2025-09-29 | Wavelet-Assisted Mamba for Satellite-Derived Sea Surface Temperature Super-Resolution | Wankun Chen et.al. | 2509.24334 | null |
| 2025-09-29 | Similarity-Aware Selective State-Space Modeling for Semantic Correspondence | Seungwook Kim et.al. | 2509.24318 | link |
| 2025-09-28 | HyMaTE: A Hybrid Mamba and Transformer Model for EHR Representation Learning | Md Mozaharul Mottalib et.al. | 2509.24118 | link |
| 2025-09-28 | Hazy Pedestrian Trajectory Prediction via Physical Priors and Graph-Mamba | Jian Chen et.al. | 2509.24020 | null |
| 2025-09-28 | Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression | Jiarui Jiang et.al. | 2509.23779 | null |
| 2025-10-04 | EfficientMIL: Efficient Linear-Complexity MIL Method for WSI Classification | Chengying She et.al. | 2509.23640 | link |
| 2025-09-26 | TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses | Sahar Dastani et.al. | 2509.22813 | link |
| 2025-09-26 | StateX: Enhancing RNN Recall via Post-training State Expansion | Xingyu Shen et.al. | 2509.22630 | null |
| 2025-09-26 | Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models | Aleksandar Terzić et.al. | 2509.22284 | null |
| 2025-09-25 | MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation | Xinyu Liu et.al. | 2509.21265 | null |
| 2025-09-26 | Aligning Inductive Bias for Data-Efficient Generalization in State Space Models | Qiyu Chen et.al. | 2509.20789 | null |
| 2025-09-24 | SpecMamba: Accelerating Mamba Inference on FPGA with Speculative Decoding | Linfeng Zhong et.al. | 2509.19873 | null |
| 2025-09-24 | RoboSSM: Scalable In-context Imitation Learning via State-Space Models | Youngju Yoo et.al. | 2509.19658 | null |
| 2025-09-23 | Mamba Modulation: On the Length Generalization of Mamba | Peng Lu et.al. | 2509.19633 | null |
| 2025-09-23 | Tractable Approximation of Labeled Multi-Object Posterior Densities | Thi Hong Thai Nguyen et.al. | 2509.18780 | null |
| 2025-09-23 | An overview of neural architectures for self-supervised audio representation learning from masked spectrograms | Sarthak Yadav et.al. | 2509.18691 | null |
| 2025-09-23 | LEAF-Mamba: Local Emphatic and Adaptive Fusion State Space Model for RGB-D Salient Object Detection | Lanhu Wu et.al. | 2509.18683 | null |
| 2025-09-23 | LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA | Zeyi Kang et.al. | 2509.18576 | null |
| 2025-09-22 | Bayesian Nonhomogeneous hidden Markov models to leverage routine in physical activity monitoring with informative wear time | Beatrice Cantoni et.al. | 2509.17806 | null |
| 2025-09-22 | DA-Mamba: Dialogue-aware selective state-space model for multimodal engagement estimation | Shenwei Kang et.al. | 2509.17711 | null |
| 2025-09-22 | Achilles’ Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data | Tianyi Chen et.al. | 2509.17514 | null |
| 2025-09-21 | SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction | Djamel Eddine Boukhari et.al. | 2509.17172 | null |
| 2025-09-21 | Communication over LQG Control Systems: A Convex Optimization Approach to Capacity | Aharon Rips et.al. | 2509.17002 | null |
| 2025-09-19 | Estimating Clinical Lab Test Result Trajectories from PPG using Physiological Foundation Model and Patient-Aware State Space Model – a UNIPHY+ Approach | Minxiao Wang et.al. | 2509.16345 | null |
| 2025-09-19 | Mamba-2 audio captioning: design space exploration and analysis | Taehan Lee et.al. | 2509.15680 | null |
| 2025-09-19 | De-crackling Virtual Analog Controls with Asymptotically Stable Recurrent Neural Networks | Valtteri Kallinen et.al. | 2509.15622 | null |
| 2025-09-19 | DC-Mamba: Bi-temporal deformable alignment and scale-sparse enhancement for remote sensing change detection | Min Sun et.al. | 2509.15563 | null |
| 2025-09-17 | Classification Filtering | Ilker Bayram et.al. | 2509.13975 | null |
| 2025-09-17 | Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models | Motonari Kambara et.al. | 2509.13839 | null |
| 2025-09-17 | CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling | Hanfang Liang et.al. | 2509.13784 | null |
| 2025-09-17 | State Space Models over Directed Graphs | Junzhi She et.al. | 2509.13735 | null |
| 2025-09-16 | Multivariate Low-Rank State-Space Model with SPDE Approach for High-Dimensional Data | Jacopo Rodeschini et.al. | 2509.12825 | null |
| 2025-09-15 | U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT | Zhi Qin Tan et.al. | 2509.12069 | null |
| 2025-09-15 | AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective | Yuchen Deng et.al. | 2509.12052 | null |
| 2025-09-15 | Joint-octamamba: an octa joint segmentation network based on feature enhanced mamba | Chuang Liu et.al. | 2509.11649 | null |
| 2025-09-14 | MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation | Syed Talal Wasim et.al. | 2509.11394 | null |
| 2025-09-14 | MEMBOT: Memory-Based Robot in Intermittent POMDP | Youzhi Liang et.al. | 2509.11225 | null |
| 2025-09-12 | FLARE-SSM: Deep State Space Models with Influence-Balanced Loss for 72-Hour Solar Flare Prediction | Yusuke Takagi et.al. | 2509.09988 | null |
| 2025-09-12 | MAESTRO: Multi-modal Adaptive Estimation for Temporal Respiratory Disease Outbreak | Hong Liu et.al. | 2509.08578 | null |
| 2025-09-10 | First-order State Space Model for Lightweight Image Super-resolution | Yujie Zhu et.al. | 2509.08458 | null |
| 2025-09-09 | A kernel-based approach to physics-informed nonlinear system identification | Cesare Donati et.al. | 2509.07634 | null |
| 2025-09-07 | Recursive State Inference for Linear PASFA | Vishal Rishi et.al. | 2509.07028 | null |
| 2025-09-06 | Hyperbolic Large Language Models | Sarang Patil et.al. | 2509.05757 | null |
| 2025-09-05 | A Bayesian Gaussian Process Dynamic Factor Model | Tony Chernis et.al. | 2509.04928 | null |
| 2025-09-05 | CD-Mamba: Cloud detection with long-range spatial dependency modeling | Tianxiang Xue et.al. | 2509.04729 | link |
| 2025-09-04 | VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation | Mustafa Munir et.al. | 2509.04669 | null |
| 2025-09-04 | Echo State Networks as State-Space Models: A Systems Perspective | Pradeep Singh et.al. | 2509.04422 | null |
| 2025-09-04 | Rethinking the long-range dependency in Mamba/SSM and transformer models | Cong Ma et.al. | 2509.04226 | null |
| 2025-09-03 | Time-Scaling State-Space Models for Dense Video Captioning | AJ Piergiovanni et.al. | 2509.03426 | null |
| 2025-09-03 | S2M2ECG: Spatio-temporal bi-directional State Space Model Enabled Multi-branch Mamba for ECG | Huaicheng Zhang et.al. | 2509.03066 | null |
| 2025-09-02 | Mentality: A Mamba-based Approach towards Foundation Models for EEG | Saarang Panchavati et.al. | 2509.02746 | null |
| 2025-09-02 | ESTM: An Enhanced Dual-Branch Spectral-Temporal Mamba for Anomalous Sound Detection | Chengyuan Ma et.al. | 2509.02471 | null |
| 2025-09-02 | AudioRWKV: Efficient and Stable Bidirectional RWKV for Audio Pattern Recognition | Jiayu Xiong et.al. | 2509.02167 | null |
| 2025-09-01 | A Mathematical Model of Hybrid Microgrid With Pole Placement Controller Using State Feedback For Stability Improvement | Yangyadatta Tripathy et.al. | 2509.01749 | null |
| 2025-09-01 | Mamba-CNN: A Hybrid Architecture for Efficient and Accurate Facial Beauty Prediction | Djamel Eddine Boukhari et.al. | 2509.01431 | null |
| 2025-09-01 | StoxLSTM: A Stochastic Extended Long Short-Term Memory Network for Time Series Forecasting | Zihao Wang et.al. | 2509.01187 | null |
| 2025-09-01 | SpectMamba: Integrating Frequency and State Space Models for Enhanced Medical Image Detection | Yao Wang et.al. | 2509.01080 | null |
| 2025-08-31 | Prospects of Imitating Trading Agents in the Stock Market | Mateusz Wilinski et.al. | 2509.00982 | null |
| 2025-08-31 | CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification | Qingyu Wang et.al. | 2509.00677 | null |
| 2025-08-31 | MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation | Aviral Chharia et.al. | 2509.00649 | link |
| 2025-08-30 | COMET: A Framework for Modeling Compound Operation Dataflows with Explicit Collectives | Shubham Negi et.al. | 2509.00599 | null |
| 2025-08-30 | SemaMIL: Semantic Reordering with Retrieval-Guided State Space Modeling for Whole Slide Image Classification | Lubin Gan et.al. | 2509.00442 | null |
| 2025-08-29 | Quantum-Optimized Selective State Space Model for Efficient Time Series Prediction | Stefan-Alexandru Jura et.al. | 2509.00259 | null |
| 2025-07-24 | PointLAMA: Latent Attention meets Mamba for Efficient Point Cloud Pretraining | Xuanyu Lin et.al. | 2507.17296 | null |
| 2025-06-17 | MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration | Bingxi Liu et.al. | 2506.13183 | null |
| 2025-05-20 | Mamba-Adaptor: State Space Model Adaptor for Visual Recognition | Fei Xie et.al. | 2505.12685 | null |
| 2025-03-18 | TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba | Jiaxu Liu et.al. | 2503.13004 | null |
| 2025-03-21 | MambaTron: Efficient Cross-Modal Point Cloud Enhancement using Aggregate Selective State Space Modeling | Sai Tarun Inaganti et.al. | 2501.16384 | null |
| 2025-02-27 | Spatial-Mamba: Effective Visual State Space Models via Structure-aware State Fusion | Chaodong Xiao et.al. | 2410.15091 | link |
| 2024-07-18 | Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model | Tao Wang et.al. | 2407.12319 | null |
| 2025-01-14 | Pamba: Enhancing Global Interaction in Point Clouds via State Space Model | Zhuoyuan Li et.al. | 2406.17442 | null |
| 2024-06-11 | PointABM: Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis | Jia-wei Chen et.al. | 2406.06069 | null |
| 2024-06-18 | PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis | Zicheng Wang et.al. | 2405.15463 | link |
| 2024-05-08 | Vision Mamba: A Comprehensive Survey and Taxonomy | Xiao Liu et.al. | 2405.04404 | link |
| 2024-11-12 | Visual Mamba: A Survey and New Outlooks | Rui Xu et.al. | 2404.18861 | link |
| 2024-04-29 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
| 2025-01-10 | 3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering | Qingyuan Zhou et.al. | 2404.05522 | link |
| 2024-03-19 | Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy | Jiuming Liu et.al. | 2403.06467 | null |
| 2024-06-25 | MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection | Tianxiang Chen et.al. | 2403.02148 | link |
| 2024-10-14 | Point Cloud Mamba: Point Cloud Learning via State Space Model | Tao Zhang et.al. | 2403.00762 | link |
| 2024-11-26 | PointMamba: A Simple State Space Model for Point Cloud Analysis | Dingkang Liang et.al. | 2402.10739 | link |
| 2024-11-15 | Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Lianghui Zhu et.al. | 2401.09417 | link |

(<a href=#updated-on-20260404>back to top</a>)

| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2026-04-02 | Cosine-Normalized Attention for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2604.01763 | null |
| 2026-03-27 | Learnable Quantum Efficiency Filters for Urban Hyperspectral Segmentation | Imad Ali Shah et.al. | 2603.26528 | null |
| 2026-03-27 | HyVIC: A Metric-Driven Spatio-Spectral Hyperspectral Image Compression Architecture Based on Variational Autoencoders | Martin Hermann Paul Fuchs et.al. | 2603.26468 | null |
| 2026-03-26 | Narrowband searches for continuous gravitational waves from known pulsars in the first two parts of the fourth LIGO–Virgo–KAGRA observing run | The LIGO Scientific Collaboration et.al. | 2603.25938 | null |
| 2026-04-02 | Searches for Continuous Gravitational Waves from Supernova Remnants in the first part of the LIGO-Virgo-KAGRA Fourth Observing run | The LIGO Scientific Collaboration et.al. | 2603.25808 | null |
| 2026-03-26 | Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case | Koldo Basterretxea et.al. | 2603.25510 | null |
| 2026-03-26 | Underdetermined Blind Source Separation via Weighted Simplex Shrinkage Regularization and Quantum Deep Image Prior | Chia-Hsiang Lin et.al. | 2603.25384 | null |
| 2026-03-25 | Connecting Meteorite Spectra to Lunar Surface Composition Using Hyperspectral Imaging and Machine Learning | Fatemeh Fazel Hesar et.al. | 2603.24323 | null |
| 2026-03-25 | LGEST: Dynamic Spatial-Spectral Expert Routing for Hyperspectral Image Classification | Jiawen Wen et.al. | 2603.24045 | null |
| 2026-03-23 | A Latent Representation Learning Framework for Hyperspectral Image Emulation in Remote Sensing | Chedly Ben Azizi et.al. | 2603.21911 | null |
| 2026-03-23 | Hyperspectral imaging solutions for brain tissue metabolic and haemodynamic monitoring: an updated perspective | Luca Giannoni et.al. | 2603.21732 | null |
| 2026-04-01 | Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability | Jiahui Song et.al. | 2603.21510 | null |
| 2026-03-19 | HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting | Songfeng Zhu et.al. | 2603.20292 | null |
| 2026-03-19 | GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants | The LIGO Scientific Collaboration et.al. | 2603.19021 | null |
| 2026-03-19 | GWTC-4.0: Tests of General Relativity. II. Parameterized Tests | The LIGO Scientific Collaboration et.al. | 2603.19020 | null |
| 2026-03-19 | GWTC-4.0: Tests of General Relativity. I. Overview and General Tests | The LIGO Scientific Collaboration et.al. | 2603.19019 | null |
| 2026-03-17 | Spectral Property-Driven Data Augmentation for Hyperspectral Single-Source Domain Generalization | Taiqin Chen et.al. | 2603.16662 | null |
| 2026-03-17 | 3D Fourier-based Global Feature Extraction for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2603.16426 | null |
| 2026-03-16 | HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions | Yukang Cao et.al. | 2603.15612 | null |
| 2026-03-15 | All-sky Searches for Continuous Gravitational Waves from Isolated Neutron Stars in the Data from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run | The LIGO Scientific Collaboration et.al. | 2603.14168 | null |
| 2026-03-14 | Bidirectional Cross-Attention Fusion of High-Res RGB and Low-Res HSI for Multimodal Automated Waste Sorting | Jonas V. Funk et.al. | 2603.13941 | null |
| 2026-03-12 | Blind Hyperspectral and Multispectral Images Fusion: A Unified Tensor Fusion Framework from Coupled Inverse Problem Perspective | Ying Gao et.al. | 2603.11530 | null |
| 2026-03-09 | Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning | Yingkai Zhang et.al. | 2603.07918 | null |
| 2026-03-24 | Spectral Gaps and Spatial Priors: Studying Hyperspectral Downstream Adaptation Using TerraMind | Julia Anna Leonardi et.al. | 2603.06690 | null |
| 2026-03-03 | Unmixing microinfrared spectroscopic images of cross-sections of historical oil paintings | Shivam Pande et.al. | 2603.06673 | null |
| 2026-03-05 | Towards 3D Scene Understanding of Gas Plumes in LWIR Hyperspectral Images Using Neural Radiance Fields | Scout Jarman et.al. | 2603.05473 | null |
| 2026-03-05 | A Benchmark Study of Neural Network Compression Methods for Hyperspectral Image Classification | Sai Shi et.al. | 2603.04720 | null |
| 2026-03-03 | mHC-HSI: Clustering-Guided Hyper-Connection Mamba for Hyperspectral Image Classification | Yimin Zhu et.al. | 2603.03418 | null |
| 2026-03-02 | RoboGPU: Accelerating GPU Collision Detection for Robotics | Lufei Liu et.al. | 2603.01517 | null |
| 2026-03-01 | VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification | Abdellah Zakaria Sellam et.al. | 2603.01174 | null |
| 2026-02-18 | HS-3D-NeRF: 3D Surface and Hyperspectral Reconstruction From Stationary Hyperspectral Images Using Multi-Channel NeRFs | Kibon Ku et.al. | 2602.16950 | null |
| 2026-02-11 | Benchmarking Deep Learning and Statistical Target Detection Methods for PFM-1 Landmine Detection in UAV Hyperspectral Imagery | Sagar Lekhak et.al. | 2602.10434 | null |
| 2026-02-04 | DMS2F-HAD: A Dual-branch Mamba-based Spatial-Spectral Fusion Network for Hyperspectral Anomaly Detection | Aayushma Pant et.al. | 2602.04102 | null |
| 2026-02-02 | DSXFormer: Dual-Pooling Spectral Squeeze-Expansion and Dynamic Context Attention Transformer for Hyperspectral Image Classification | Farhan Ullah et.al. | 2602.01906 | null |
| 2026-01-31 | HSI-VAR: Rethinking Hyperspectral Restoration through Spatial-Spectral Visual Autoregression | Xiangming Wang et.al. | 2602.00749 | null |
| 2026-01-31 | From Prompt to Graph: Comparing LLM-Based Information Extraction Strategies in Domain-Specific Ontology Development | Xuan Liu et.al. | 2602.00699 | null |
| 2026-01-31 | HSSDCT: Factorized Spatial-Spectral Correlation for Hyperspectral Image Fusion | Chia-Ming Lee et.al. | 2602.00490 | null |
| 2026-01-30 | Cross-Domain Few-Shot Learning for Hyperspectral Image Classification Based on Mixup Foundation Model | Naeem Paeedeh et.al. | 2601.22581 | null |
| 2026-01-29 | SR$^{2}$-Net: A General Plug-and-Play Model for Spectral Refinement in Hyperspectral Image Super-Resolution | Ji-Xuan He et.al. | 2601.21338 | null |
| 2026-01-27 | Dynamic Worlds, Dynamic Humans: Generating Virtual Human-Scene Interaction Motion in Dynamic Scenes | Yin Wang et.al. | 2601.19484 | null |
| 2026-01-26 | AI-enabled Satellite Edge Computing: A Single-Pixel Feature based Shallow Classification Model for Hyperspectral Imaging | Li Fang et.al. | 2601.18560 | null |
| 2026-01-26 | Cross-Domain Transfer with Self-Supervised Spectral-Spatial Modeling for Hyperspectral Image Classification | Jianshu Chao et.al. | 2601.18088 | null |
| 2026-01-26 | Semi-Supervised Hyperspectral Image Classification with Edge-Aware Superpixel Label Propagation and Adaptive Pseudo-Labeling | Yunfei Qiu et.al. | 2601.18049 | null |
| 2026-01-24 | HyDeMiC: A Deep Learning-based Mineral Classifier using Hyperspectral Data | M. L. Mamud et.al. | 2601.17352 | null |
| 2026-01-22 | Clustering-Guided Spatial-Spectral Mamba for Hyperspectral Image Classification | Zack Dewis et.al. | 2601.16098 | null |
| 2026-01-22 | Multimodal Imaging System Combining Hyperspectral and Laser Speckle Imaging for In Vivo Hemodynamic and Metabolic Monitoring | Junda Wang et.al. | 2601.15947 | null |
| 2026-01-22 | White-Box mHC: Electromagnetic Spectrum-Aware and Interpretable Stream Interactions for Hyperspectral Image Classification | Yimin Zhu et.al. | 2601.15757 | null |
| 2026-01-20 | SHARE: A Fully Unsupervised Framework for Single Hyperspectral Image Restoration | Jiangwei Xie et.al. | 2601.13987 | null |
| 2026-01-18 | Utilizing the Score of Data Distribution for Hyperspectral Anomaly Detection | Jiahui Sheng et.al. | 2601.12379 | null |
| 2026-01-18 | Turbo-GoDec: Exploiting the Cluster Sparsity Prior for Hyperspectral Anomaly Detection | Jiahui Sheng et.al. | 2601.12337 | null |
| 2026-01-16 | Anisotropic Tensor Deconvolution of Hyperspectral Images | Xinjue Wang et.al. | 2601.11694 | null |
| 2026-01-13 | MMLGNet: Cross-Modal Alignment of Remote Sensing Data using CLIP | Aditya Chaudhary et.al. | 2601.08420 | null |
| 2026-01-12 | SDHSI-Net: Learning Better Representations for Hyperspectral Images via Self-Distillation | Prachet Dev Singh et.al. | 2601.07416 | null |
| 2026-01-11 | Adversarial Attacks on Medical Hyperspectral Imaging Exploiting Spectral-Spatial Dependencies and Multiscale Features | Yunrui Gu et.al. | 2601.07056 | null |
| 2026-01-08 | EdgeLDR: Quaternion Low-Displacement Rank Neural Networks for Edge-Efficient Deep Learning | Vladimir Frants et.al. | 2601.05379 | null |
| 2026-01-07 | HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object Detection | Shuyan Bai et.al. | 2601.03736 | null |
| 2026-01-03 | Efficient Hyperspectral Image Reconstruction Using Lightweight Separate Spectral Transformers | Jianan Li et.al. | 2601.01064 | null |
| 2025-12-30 | Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges | Yu-Tang Chang et.al. | 2512.24172 | null |
| 2025-12-25 | Degradation-Aware Metric Prompting for Hyperspectral Image Restoration | Binfeng Wang et.al. | 2512.20251 | null |
| 2025-12-22 | Rethinking Coupled Tensor Analysis for Hyperspectral Super-Resolution: Recoverable Modeling Under Endmember Variability | Meng Ding et.al. | 2512.19489 | null |
| 2026-01-21 | Constraints on gravitational waves from the 2024 Vela pulsar glitch | The LIGO Scientific Collaboration et.al. | 2512.17990 | null |
| 2025-12-22 | A Parametric Framework for Anticipatory Flashflood Warning: Integrating Landscape Vulnerability with Precipitation Forecasts | Xiangpeng Li et.al. | 2512.17785 | null |
| 2025-12-16 | Bridging the Gap Between Modern UX Design and Particle Accelerator Control Room Interfaces | Rachael Hill et.al. | 2512.14872 | null |
| 2025-12-11 | Perception-Inspired Color Space Design for Photo White Balance Editing | Yang Cheng et.al. | 2512.09383 | null |
| 2025-12-08 | Agreement Disagreement Guided Knowledge Transfer for Cross-Scene Hyperspectral Imaging | Lu Huo et.al. | 2512.08990 | null |
| 2025-12-08 | Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration | Lu Huo et.al. | 2512.08989 | null |
| 2025-12-05 | Hyperspectral Unmixing with 3D Convolutional Sparse Coding and Projected Simplex Volume Maximization | Gargi Panda et.al. | 2512.05674 | null |
| 2025-12-03 | Label-Efficient Hyperspectral Image Classification via Spectral FiLM Modulation of Low-Level Pretrained Diffusion Features | Yuzhen Hu et.al. | 2512.03430 | null |
| 2025-12-02 | PyroFocus: A Deep Learning Approach to Real-Time Wildfire Detection in Multispectral Remote Sensing Imagery | Mark Moussa et.al. | 2512.03257 | null |
| 2025-11-29 | UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations | Yuzhen Hu et.al. | 2512.00261 | null |
| 2025-12-05 | Search for planetary-mass ultra-compact binaries using data from the first part of the LIGO–Virgo–KAGRA fourth observing run | The LIGO Scientific Collaboration et.al. | 2511.19911 | null |
| 2025-11-23 | LRDUN: A Low-Rank Deep Unfolding Network for Efficient Spectral Compressive Imaging | He Huang et.al. | 2511.18513 | null |
| 2025-11-23 | Uncertainty Quantification in HSI Reconstruction using Physics-Aware Diffusion Priors and Optics-Encoded Measurements | Juan Romero et.al. | 2511.18473 | null |
| 2025-11-22 | Spectral Super-Resolution Neural Operator with Atmospheric Radiative Transfer Prior | Ziye Zhang et.al. | 2511.17895 | null |
| 2025-11-21 | REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing | Binger Chen et.al. | 2511.17442 | null |
| 2025-11-21 | Real Noise Decoupling for Hyperspectral Image Denoising | Yingkai Zhang et.al. | 2511.17196 | null |
| 2025-12-04 | All-sky search for continuous gravitational-wave signals from unknown neutron stars in binary systems in the first part of the fourth LIGO-Virgo-KAGRA observing run | The LIGO Scientific Collaboration et.al. | 2511.16863 | null |
| 2025-11-20 | SpectralTrain: A Universal Framework for Hyperspectral Image Classification | Meihua Zhou et.al. | 2511.16084 | null |
| 2025-11-19 | Hyperspectral Image Classification using Spectral-Spatial Mixer Network | Mohammed Q. Alkhatib et.al. | 2511.15692 | null |
| 2025-11-24 | Multimodal Optical Imaging Platform for Quantitative Burn Assessment | Nathaniel Hanson et.al. | 2511.15509 | null |
| 2025-11-19 | Hyperspectral Super-Resolution with Inter-Image Variability via Degradation-based Low-Rank and Residual Fusion Method | Yue Wen et.al. | 2511.15052 | null |
| 2025-11-17 | Human-centric Maintenance Process Through Integration of AI, Speech, and AR | Parul Khanna et.al. | 2511.13918 | null |
| 2025-11-17 | SpectralAdapt: Semi-Supervised Domain Adaptation with Spectral Priors for Human-Centered Hyperspectral Image Reconstruction | Yufei Wen et.al. | 2511.13020 | null |
| 2025-12-19 | CLAReSNet: When Convolution Meets Latent Attention for Hyperspectral Image Classification | Asmit Bandyopadhyay et.al. | 2511.12346 | null |
| 2025-11-15 | Multimodal RGB-HSI Feature Fusion with Patient-Aware Incremental Heuristic Meta-Learning for Oral Lesion Classification | Rupam Mukherjee et.al. | 2511.12268 | null |
| 2025-11-13 | Exposing DeepFakes via Hyperspectral Domain Mapping | Aditya Mehta et.al. | 2511.11732 | null |
| 2025-11-13 | Perceive, Act and Correct: Confidence Is Not Enough for Hyperspectral Classification | Muzhou Yang et.al. | 2511.10068 | null |
| 2025-11-11 | HyperScout-H: the hyperspectral imager for the ESA Hera mission | Marcel M. Popescu et.al. | 2511.08047 | null |
| 2025-11-10 | GEWDiff: Geometric Enhanced Wavelet-based Diffusion Model for Hyperspectral Image Super-resolution | Sirui Wang et.al. | 2511.07103 | null |
| 2025-10-31 | SpecAware: A Spectral-Content Aware Foundation Model for Unifying Multi-Sensor Learning in Hyperspectral Remote Sensing Mapping | Renjie Ji et.al. | 2510.27219 | null |
| 2025-11-13 | Direct multi-model dark-matter search with gravitational-wave interferometers using data from the first part of the fourth LIGO-Virgo-KAGRA observing run | The LIGO Scientific Collaboration et.al. | 2510.27022 | null |
| 2025-10-30 | GW241011 and GW241110: Exploring Binary Formation and Fundamental Physics with Asymmetric, High-Spin Black Hole Coalescence | The LIGO Scientific Collaboration et.al. | 2510.26931 | null |
| 2025-11-07 | Cosmological and High Energy Physics implications from gravitational-wave background searches in LIGO-Virgo-KAGRA’s O1-O4a runs | The LIGO Scientific Collaboration et.al. | 2510.26848 | null |
| 2025-10-23 | SpectraMorph: Structured Latent Learning for Self-Supervised Hyperspectral Super-Resolution | Ritik Shah et.al. | 2510.20814 | null |
| 2025-10-20 | Directional Search for Persistent Gravitational Waves: Results from the First Part of LIGO-Virgo-KAGRA’s Fourth Observing Run | The LIGO Scientific Collaboration et.al. | 2510.17487 | null |
| 2025-10-18 | HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications | Christopher Thirgood et.al. | 2510.16664 | null |
| 2025-10-15 | Near-Infrared Hyperspectral Imaging Applications in Food Analysis – Improving Algorithms and Methodologies | Ole-Christian Galbo Engstrøm et.al. | 2510.13452 | null |
| 2025-10-14 | Benchmarking foundation models for hyperspectral image classification: Application to cereal crop type mapping | Walid Elbarz et.al. | 2510.11576 | null |
| 2025-10-13 | Directly Mapping Interacting Components to Complex Systems’ Emergent Properties | Lina Yan et.al. | 2510.10881 | null |
| 2025-10-10 | SpectralCA: Bi-Directional Cross-Attention for Next-Generation UAV Hyperspectral Vision | D. V. Brovko et.al. | 2510.09912 | null |
| 2025-10-09 | Hyperspectral data augmentation with transformer-based diffusion models | Mattia Ferrari et.al. | 2510.08363 | null |
| 2025-10-08 | Label Semantics for Robust Hyperspectral Image Classification | Rafin Hassan et.al. | 2510.07556 | null |
| 2025-10-06 | In-Field Mapping of Grape Yield and Quality with Illumination-Invariant Deep Learning | Ciem Cornelissen et.al. | 2510.04864 | null |
| 2025-10-02 | Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction | Yi Ai et.al. | 2510.01912 | null |
| 2025-10-01 | Towards Adversarial Training under Hyperspectral Images | Weihua Zhang et.al. | 2510.01014 | null |
| 2025-09-28 | Joint Superpixel and Self-Representation Learning for Scalable Hyperspectral Image Clustering | Xianlu Li et.al. | 2509.24027 | null |
| 2025-09-28 | Generalized Category Discovery in Hyperspectral Images via Prototype Subspace Modeling | Xianlu Li et.al. | 2509.24017 | null |
| 2025-09-20 | Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment | Abhiroop Chatterjee et.al. | 2509.22697 | null |
| 2025-09-25 | Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models | Juana Valeria Hurtado et.al. | 2509.20107 | null |
| 2025-09-21 | SwarmChat: An LLM-Based, Context-Aware Multimodal Interaction System for Robotic Swarms | Ettilla Mohiuddin Eumi et.al. | 2509.16920 | null |
| 2025-09-20 | Spectral Compressive Imaging via Chromaticity-Intensity Decomposition | Xiaodong Wang et.al. | 2509.16690 | null |
| 2025-09-16 | Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation | Hugo Carlesso et.al. | 2509.13229 | null |
| 2025-09-15 | Progressive Flow-inspired Unfolding for Spectral Compressive Imaging | Xiaodong Wang et.al. | 2509.12079 | null |
| 2025-09-19 | USCTNet: A deep unfolding nuclear-norm optimization solver for physically consistent HSI reconstruction | Xiaoyang Ma et.al. | 2509.10651 | null |
| 2025-09-12 | Nanosculpting lateral weak link junctions in superconducting Fe(Te,Se)/Bi2Te3 with focused Si++ ions and implications on vortex pinning | Debarghya Mallick et.al. | 2509.10606 | null |
| 2025-09-11 | CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution | Yulin Tong et.al. | 2509.09163 | null |
| 2025-09-22 | HyperTTA: Test-Time Adaptation for Hyperspectral Image Classification under Distribution Shifts | Xia Yue et.al. | 2509.08436 | null |
| 2025-09-09 | GW250114: testing Hawking’s area law and the Kerr nature of black holes | The LIGO Scientific Collaboration et.al. | 2509.08054 | null |
| 2025-09-15 | Directed searches for gravitational waves from ultralight vector boson clouds around merger remnant and galactic black holes during the first part of the fourth LIGO-Virgo-KAGRA observing run | The LIGO Scientific Collaboration et.al. | 2509.07352 | null |
| 2025-10-07 | GWTC-4.0: Constraints on the Cosmic Expansion Rate and Modified Gravitational-wave Propagation | The LIGO Scientific Collaboration et.al. | 2509.04348 | null |
| 2025-09-02 | Explainability-Driven Dimensionality Reduction for Hyperspectral Imaging | Salma Haidar et.al. | 2509.02340 | null |
| 2025-09-01 | FantasyHSI: Video-Generation-Centric 4D Human Synthesis In Any Scene through A Graph-based Multi-Agent Framework | Lingzhou Mu et.al. | 2509.01232 | null |
| 2025-08-31 | CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification | Qingyu Wang et.al. | 2509.00677 | null |
| 2025-08-30 | Iterative Low-rank Network for Hyperspectral Image Denoising | Jin Ye et.al. | 2509.00356 | null |
| 2025-08-28 | Upper Limits on the Isotropic Gravitational-Wave Background from the first part of LIGO, Virgo, and KAGRA’s fourth Observing Run | The LIGO Scientific Collaboration et.al. | 2508.20721 | null |
| 2025-08-27 | Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities | Imad Ali Shah et.al. | 2508.19905 | null |
| 2025-09-08 | GWTC-4.0: Updating the Gravitational-Wave Transient Catalog with Observations from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run | The LIGO Scientific Collaboration et.al. | 2508.18082 | null |
| 2025-09-03 | Open Data from LIGO, Virgo, and KAGRA through the First Part of the Fourth Observing Run | The LIGO Scientific Collaboration et.al. | 2508.18079 | null |
| 2025-08-25 | Few-shot Unknown Class Discovery of Hyperspectral Images with Prototype Learning and Clustering | Chun Liu et.al. | 2508.18075 | null |
| 2025-08-21 | Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising | Jin Ye et.al. | 2508.15553 | link |
| 2025-08-15 | Hyperspectral vs. RGB for Pedestrian Segmentation in Urban Driving Scenes: A Comparative Study | Jiarong Li et.al. | 2508.11301 | null |
| 2025-08-14 | CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving | Jiarong Li et.al. | 2508.10962 | null |
| 2025-08-13 | Probabilistic Emissivity Retrieval from Hyperspectral Data via Physics-Guided Variational Inference | Joshua R. Tempelman et.al. | 2508.08291 | null |
| 2025-08-11 | Hyperspectral Imaging | Danfeng Hong et.al. | 2508.08107 | link |
| 2025-08-11 | DETACH: Cross-domain Learning for Long-Horizon Tasks via Mixture of Disentangled Experts | Yutong Shen et.al. | 2508.07842 | null |
| 2025-08-09 | TerraMAE: Learning Spatial-Spectral Representations from Hyperspectral Earth Observation Data via Adaptive Masked Autoencoders | Tanjim Bin Faruk et.al. | 2508.07020 | null |
| 2025-08-05 | Low-rankness and Smoothness Meet Subspace: A Unified Tensor Regularization for Hyperspectral Image Super-resolution | Jun Zhang et.al. | 2508.03049 | null |
| 2025-08-02 | Hyperspectral Image Recovery Constrained by Multi-Granularity Non-Local Self-Similarity Priors | Zhuoran Peng et.al. | 2508.01435 | null |
| 2025-08-05 | Phase-Locked SNR Band Selection for Weak Mineral Signal Detection in Hyperspectral Imagery | Judy X Yang et.al. | 2508.00539 | null |
| 2025-08-01 | Honey Classification using Hyperspectral Imaging and Machine Learning | Mokhtar A. Al-Awadhi et.al. | 2508.00361 | null |
| 2025-07-31 | SAMSA: Segment Anything Model Enhanced with Spectral Angles for Hyperspectral Interactive Medical Image Segmentation | Alfie Roddan et.al. | 2507.23673 | link |
| 2025-03-28 | HSLiNets: Evaluating Band Ordering Strategies in Hyperspectral and LiDAR Fusion | Judy X Yang et.al. | 2503.21072 | null |
| 2025-03-11 | Dynamic Cross-Modal Feature Interaction Network for Hyperspectral and LiDAR Data Classification | Junyan Lin et.al. | 2503.06945 | link |
| 2024-12-04 | HSLiNets: Hyperspectral Image and LiDAR Data Fusion Using Efficient Dual Non-Linear Feature Learning Networks | Judy X Yang et.al. | 2412.00302 | null |
| 2024-04-09 | Unsupervised Band Selection Using Fused HSI and LiDAR Attention Integrating With Autoencoder | Judy X Yang et.al. | 2404.05258 | null |
| 2024-04-16 | LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification | Judy X Yang et.al. | 2404.03883 | null |
| 2023-04-04 | Multimodal Hyperspectral Image Classification via Interconnected Fusion | Lu Huo et.al. | 2304.00495 | null |
| 2023-03-24 | MMFormer: Multimodal Transformer Using Multiscale Self-Attention for Remote Sensing Image Classification | Bo Zhang et.al. | 2303.13101 | null |
| 2023-02-08 | Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data Classification | Meng Wang et.al. | 2301.03335 | null |
| 2022-11-01 | Hybridization of filter and wrapper approaches for the dimensionality reduction and classification of hyperspectral images | Asma Elmaizi et.al. | 2210.16496 | null |
| 2023-01-04 | A CNN with Noise Inclined Module and Denoise Framework for Hyperspectral Image Classification | Zhiqiang Gong et.al. | 2205.12459 | null |
| 2021-04-07 | Disentangled Non-Local Network for Hyperspectral and LiDAR Data Classification | Wenxia Liu et.al. | 2104.02302 | null |
| 2021-04-07 | Hyperspectral and LiDAR data classification based on linear self-attention | Min Feng et.al. | 2104.02301 | null |
| 2020-07-20 | Advances in Deep Learning for Hyperspectral Image Analysis–Addressing Challenges Arising in Practical Imaging Scenarios | Xiong Zhou et.al. | 2007.08592 | null |
| 2020-02-05 | Classification of Hyperspectral and LiDAR Data Using Coupled CNNs | Renlong Hang et.al. | 2002.01144 | null |
| 2019-12-09 | 3D CNN with Localized Residual Connections for Hyperspectral Image Classification | Shivangi Dwivedi et.al. | 1912.03000 | null |
| 2019-10-30 | Deep Learning for Hyperspectral Image Classification: An Overview | Shutao Li et.al. | 1910.12861 | null |
| 2021-06-08 | Multiscale Principle of Relevant Information for Hyperspectral Image Classification | Yantao Wei et.al. | 1907.06022 | null |
| 2018-03-01 | HSI-CNN: A Novel Convolution Neural Network for Hyperspectral Image | Yanan Luo et.al. | 1802.10478 | null |
| 2016-06-17 | Combining multiscale features for classification of hyperspectral images: a sequence based kernel approach | Yanwei Cui et.al. | 1606.04985 | null |
| 2015-04-30 | Robust hyperspectral image classification with rejection fields | Filipe Condessa et.al. | 1504.07918 | null |
(<a href=#updated-on-20260404>back to top</a>)