Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-09-30 | Video Object Segmentation-Aware Audio Generation | Ilpo Viertola et.al. | 2509.26604 | null |
2025-09-30 | SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval | Ren-Di Wu et.al. | 2509.26330 | null |
2025-09-30 | SETR: A Two-Stage Semantic-Enhanced Framework for Zero-Shot Composed Image Retrieval | Yuqi Xiao et.al. | 2509.26012 | null |
2025-09-30 | SAGE: Spatial-visual Adaptive Graph Exploration for Visual Place Recognition | Shunpeng Chen et.al. | 2509.25723 | null |
2025-09-29 | Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity | Tu-Hoa Pham et.al. | 2509.25520 | null |
2025-09-29 | Performance-Efficiency Trade-off for Fashion Image Retrieval | Julio Hurtado et.al. | 2509.24477 | null |
2025-09-28 | Prepare for Warp Speed: Sub-millisecond Visual Place Recognition Using Event Cameras | Vignesh Ramanathan et.al. | 2509.24094 | null |
2025-09-27 | Terrorism & Democracy in Burkina-Faso | P Carmel Marie Zagre et.al. | 2509.23046 | null |
2025-09-26 | Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation | Jinpeng Lu et.al. | 2509.22307 | null |
2025-09-25 | Enhancing Contrastive Learning for Geolocalization by Discovering Hard Negatives on Semivariograms | Boyi Chen et.al. | 2509.21573 | null |
2025-09-23 | SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment | Binod Singh et.al. | 2509.20401 | null |
2025-09-24 | A Versatile Foundation Model for AI-enabled Mammogram Interpretation | Fuxiang Huang et.al. | 2509.20271 | null |
2025-09-23 | Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions | Ioanna Ntinou et.al. | 2509.19203 | null |
2025-09-30 | OrthoLoC: UAV 6-DoF Localization and Calibration Using Orthographic Geodata | Oussema Dhaouadi et.al. | 2509.18350 | null |
2025-09-21 | Learning Attribute-Aware Hash Codes for Fine-Grained Image Retrieval via Query Optimization | Peng Wang et.al. | 2509.17049 | null |
2025-09-20 | PM25Vision: A Large-Scale Benchmark Dataset for Visual Estimation of Air Quality | Yang Han et.al. | 2509.16519 | null |
2025-09-25 | Efficient Multimodal Dataset Distillation via Generative Models | Zhenghao Zhao et.al. | 2509.15472 | null |
2025-09-18 | SERVAL: Surprisingly Effective Zero-Shot Visual Document Retrieval Powered by Large Vision and Language Models | Thong Nguyen et.al. | 2509.15432 | null |
2025-09-18 | Assessing metadata privacy in neuroimaging | Emilie Kibsgaard et.al. | 2509.15278 | null |
2025-09-18 | PRISM: Product Retrieval In Shopping Carts using Hybrid Matching | Arda Kabadayi et.al. | 2509.14985 | null |
2025-09-18 | Chain-of-Thought Re-ranking for Image Retrieval Tasks | Shangrong Wu et.al. | 2509.14746 | null |
2025-09-18 | DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising | Li Gao et.al. | 2509.14565 | null |
2025-09-18 | Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods | Adam D. Hines et.al. | 2509.14516 | null |
2025-09-17 | Hashing-Baseline: Rethinking Hashing in the Age of Pretrained Models | Ilyass Moummad et.al. | 2509.14427 | null |
2025-09-17 | CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts | Leonard Hackel et.al. | 2509.14104 | null |
2025-09-16 | Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization | Yujia Lin et.al. | 2509.13474 | null |
2025-09-18 | MapAnything: Universal Feed-Forward Metric 3D Reconstruction | Nikhil Keetha et.al. | 2509.13414 | null |
2025-09-17 | DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval | Zechao Liu et.al. | 2509.12824 | null |
2025-09-16 | Ketto and the Science of Giving: A Data-Driven Investigation of Crowdfunding for India | Karuna Chandra et.al. | 2509.12616 | null |
2025-09-15 | Bridging Vision Language Models and Symbolic Grounding for Video Question Answering | Haodi Ma et.al. | 2509.11862 | null |
2025-09-14 | UnLoc: Leveraging Depth Uncertainties for Floorplan Localization | Matthias Wüest et.al. | 2509.11301 | null |
2025-09-12 | A Stochastic Birth-and-Death Approach for Street Furniture Geolocation in Urban Environments | Evan Murphy et.al. | 2509.10310 | null |
2025-09-11 | Listening for “You”: Enhancing Speech Image Retrieval via Target Speaker Extraction | Wenhao Yang et.al. | 2509.09306 | null |
2025-09-09 | Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark | Yandi Yang et.al. | 2509.07362 | null |
2025-09-08 | Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval | Emil Demić et.al. | 2509.06566 | null |
2025-09-06 | Augmenting Human-Centered Racial Covenant Detection and Georeferencing with Plug-and-Play NLP Pipelines | Jiyoon Pyo et.al. | 2509.05829 | null |
2025-09-05 | Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots) | Emanuela Boros et.al. | 2509.04948 | null |
2025-09-05 | FloodVision: Urban Flood Depth Estimation Using Foundation Vision-Language Models and Domain Knowledge Graph | Zhangding Liu et.al. | 2509.04772 | null |
2025-09-05 | Global-to-Local or Local-to-Global? Enhancing Image Retrieval with Efficient Local Search and Effective Global Re-ranking | Dror Aiger et.al. | 2509.04351 | null |
2025-09-05 | GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization | Pengyue Jia et.al. | 2509.04334 | null |
2025-09-04 | DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval | Ruohong Yang et.al. | 2509.04193 | null |
2025-09-04 | A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning | Qika Lin et.al. | 2509.03906 | null |
2025-09-02 | Scale, Don’t Fine-tune: Guiding Multimodal LLMs for Efficient Visual Place Recognition at Test-Time | Jintao Cheng et.al. | 2509.02129 | null |
2025-09-02 | Ensemble-Based Event Camera Place Recognition Under Varying Illumination | Therese Joseph et.al. | 2509.01968 | null |
2025-09-01 | ConamArray: A 32-Element Broadband MEMS Ultrasound Transducer Array | Dennis Laurijssen et.al. | 2509.01372 | null |
2025-09-01 | M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision | Che Liu et.al. | 2509.01360 | null |
2025-09-01 | Street-Level Geolocalization Using Multimodal Large Language Models and Retrieval-Augmented Generation | Yunus Serhat Bicakci et.al. | 2509.01341 | null |
2025-09-01 | ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization | Thinh-Phuc Nguyen et.al. | 2509.01259 | null |
2025-09-03 | Multimodal Iterative RAG for Knowledge Visual Question Answering | Changin Choi et.al. | 2509.00798 | null |
2025-08-31 | Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification | Y Hop Nguyen et.al. | 2509.00752 | null |
2025-08-31 | EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions | Dinh-Khoi Vo et.al. | 2509.00751 | null |
2025-08-29 | Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders | Faizan Farooq Khan et.al. | 2509.00177 | null |
2025-08-29 | HCCM: Hierarchical Cross-Granularity Contrastive and Matching Learning for Natural Language-Guided Drones | Hao Ruan et.al. | 2508.21539 | null |
2025-08-27 | Disentangling Latent Embeddings with Sparse Linear Concept Subspaces (SLiCS) | Zhi Li et.al. | 2508.20322 | null |
2025-08-27 | Low-exposure, high-quality multimodal speckle X-ray imaging via an intrinsic gradient-flow approach | Jayvan Liu et.al. | 2508.20209 | null |
2025-08-27 | Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study | Max Torop et.al. | 2508.20188 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-09 | LifelongPR: Lifelong point cloud place recognition based on sample replay and prompt learning | Xianghong Zou et.al. | 2507.10034 | null |
2025-08-08 | ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models | Minwoo Jung et.al. | 2505.18364 | null |
2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
2025-08-27 | OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion | Shuhao Kang et.al. | 2504.19258 | null |
2025-06-19 | An Iterative Task-Driven Framework for Resilient LiDAR Place Recognition in Adverse Weather | Xiongwei Zhao et.al. | 2504.14806 | null |
2025-04-16 | Diffusion Based Robust LiDAR Place Recognition | Benjamin Krummenacher et.al. | 2504.12412 | null |
2025-04-15 | Text-Driven 3D Lidar Place Recognition for Autonomous Driving | Tianyi Shang et.al. | 2503.18035 | null |
2025-05-24 | L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery | Ziwei Shi et.al. | 2503.11245 | null |
2025-03-21 | HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views | Ethan Griffiths et.al. | 2503.08140 | null |
2025-03-06 | ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images | Yanqing Shen et.al. | 2503.04475 | null |
2025-03-20 | CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework | Yanlong Xu et.al. | 2503.02593 | null |
2025-02-07 | HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer | Minwoo Jung et.al. | 2501.18943 | null |
2024-12-20 | SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning | Yuhao Li et.al. | 2412.15577 | null |
2025-04-04 | PerLA: Perceptive 3D Language Assistant | Guofeng Mei et.al. | 2411.19774 | null |
2025-05-19 | A Deeper Look into Second-Order Feature Aggregation for LiDAR Place Recognition | Saimunur Rahman et.al. | 2409.15919 | null |
2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
2024-10-02 | Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition | Hogyun Kim et.al. | 2408.07330 | null |
2024-07-30 | SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition | Raktim Gautam Goswami et.al. | 2407.08260 | null |
2024-06-21 | Voxel-Based Point Cloud Localization for Smart Spaces Management | F. S. Mortazavi et.al. | 2406.15110 | null |
2024-10-09 | PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture | T. Barros et.al. | 2405.19038 | null |
2025-03-14 | VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition | Yun-Jin Li et.al. | 2403.14594 | null |
2024-08-30 | Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests | Haedam Oh et.al. | 2403.14326 | null |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | null |
2024-03-19 | HeLiPR: Heterogeneous LiDAR Dataset for inter-LiDAR Place Recognition under Spatiotemporal Variations | Minwoo Jung et.al. | 2309.14590 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-09-30 | HART: Human Aligned Reconstruction Transformer | Xiyi Chen et.al. | 2509.26621 | null |
2025-09-30 | Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting | Hanzhou Liu et.al. | 2509.26455 | null |
2025-09-30 | GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts | Zhenyu Shu et.al. | 2509.26055 | null |
2025-09-30 | PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion | Zhiwei Zhang et.al. | 2509.26008 | null |
2025-09-30 | LLM-Powered Code Analysis and Optimization for Gaussian Splatting Kernels | Yi Hu et.al. | 2509.25626 | null |
2025-09-29 | GaussianLens: Localized High-Resolution Reconstruction via On-Demand Gaussian Densification | Yijia Weng et.al. | 2509.25603 | null |
2025-09-29 | Triangle Splatting+: Differentiable Rendering with Opaque Triangles | Jan Held et.al. | 2509.25122 | null |
2025-09-29 | GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction | Huaizhi Qu et.al. | 2509.25075 | null |
2025-09-29 | LVT: Large-Scale Scene Reconstruction via Local View Transformers | Tooba Imtiaz et.al. | 2509.25001 | null |
2025-09-29 | DWGS: Enhancing Sparse-View Gaussian Splatting with Hybrid-Loss Depth Estimation and Bidirectional Warping | Yu Ma et.al. | 2509.24893 | null |
2025-09-29 | ExGS: Extreme 3D Gaussian Compression with Diffusion Priors | Jiaqi Chen et.al. | 2509.24758 | null |
2025-09-29 | Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh | Yuanyuan Gao et.al. | 2509.24421 | null |
2025-09-29 | OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction | Yuhang Cao et.al. | 2509.24308 | null |
2025-09-28 | CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting | Dragoş-Andrei Chileban et.al. | 2509.23947 | null |
2025-09-28 | From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations | Javed Ahmad et.al. | 2509.23555 | null |
2025-09-27 | Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos | Junyi Wu et.al. | 2509.23492 | null |
2025-09-27 | OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting | Atakan Topaloglu et.al. | 2509.23258 | null |
2025-09-26 | Learning Unified Representation of 3D Gaussian Splatting | Yuelin Xin et.al. | 2509.22917 | null |
2025-09-26 | Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting | Yasmine Omri et.al. | 2509.22615 | null |
2025-09-26 | GS-2M: Gaussian Splatting for Joint Mesh Reconstruction and Material Decomposition | Dinh Minh Nguyen et.al. | 2509.22276 | null |
2025-09-26 | Polysemous Language Gaussian Splatting via Matching-based Mask Lifting | Jiayu Ding et.al. | 2509.22225 | null |
2025-09-26 | Large Material Gaussian Model for Relightable 3D Generation | Jingrui Ye et.al. | 2509.22112 | null |
2025-09-26 | Drag4D: Align Your Motion with Text-Driven 3D Scene Generation | Minjun Kang et.al. | 2509.21888 | null |
2025-09-30 | Dynamic Novel View Synthesis in High Dynamic Range | Kaixuan Zhang et.al. | 2509.21853 | null |
2025-09-25 | PowerGS: Display-Rendering Power Co-Optimization for Neural Rendering in Power-Constrained XR Systems | Weikai Lin et.al. | 2509.21702 | null |
2025-09-25 | Gaussian splatting holography | Shuhe Zhang et.al. | 2509.20774 | null |
2025-09-25 | FreeInsert: Personalized Object Insertion with Geometric and Style Control | Yuhong Zhang et.al. | 2509.20756 | null |
2025-09-23 | SeHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing | Yiyu Li et.al. | 2509.20400 | null |
2025-09-24 | 4D Driving Scene Generation With Stereo Forcing | Hao Lu et.al. | 2509.20251 | null |
2025-09-24 | GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes | Guo Chen et.al. | 2509.19937 | null |
2025-09-24 | Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering | Jiangxue Yu et.al. | 2509.19898 | null |
2025-09-24 | BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting | Yixun Zhang et.al. | 2509.19793 | null |
2025-09-24 | PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction | Yufei Han et.al. | 2509.19726 | null |
2025-09-23 | VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction | Weijie Wang et.al. | 2509.19297 | null |
2025-09-23 | Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation | Sherwin Bahmani et.al. | 2509.19296 | null |
2025-09-23 | WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction | Hung Nguyen et.al. | 2509.19073 | null |
2025-09-23 | Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting | Zijing Guo et.al. | 2509.18956 | null |
2025-09-23 | DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring | Pengteng Li et.al. | 2509.18898 | null |
2025-09-23 | FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation | Zhaorui Wang et.al. | 2509.18759 | null |
2025-09-23 | SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones | Maximilian Adang et.al. | 2509.18610 | null |
2025-09-23 | Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction | Xiaoting Yin et.al. | 2509.18566 | null |
2025-09-23 | BridgeSplat: Bidirectionally Coupled CT and Non-Rigid Gaussian Splatting for Deformable Intraoperative Surgical Navigation | Maximilian Fehrentz et.al. | 2509.18501 | null |
2025-09-23 | Differentiable Light Transport with Gaussian Surfels via Adapted Radiosity for Efficient Relighting and Geometry Reconstruction | Kaiwen Jiang et.al. | 2509.18497 | null |
2025-09-22 | GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction | Jiahe Li et.al. | 2509.18090 | null |
2025-09-22 | GaussianPSL: A novel framework based on Gaussian Splatting for exploring the Pareto frontier in multi-criteria optimization | Phuong Mai Dinh et.al. | 2509.17889 | null |
2025-09-22 | ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos | Shi Chen et.al. | 2509.17864 | null |
2025-09-22 | From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes | Guoxi Huang et.al. | 2509.17789 | null |
2025-09-22 | Neural-MMGS: Multi-modal Neural Gaussian Splats for Large-Scale Scene Reconstruction | Sitian Shen et.al. | 2509.17762 | null |
2025-09-23 | EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device | Gunjan Chhablani et.al. | 2509.17430 | null |
2025-09-22 | FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR | Junzhe Wu et.al. | 2509.17390 | null |
2025-09-22 | SmokeSeer: 3D Gaussian Splatting for Smoke Removal and Scene Reconstruction | Neham Jain et.al. | 2509.17329 | null |
2025-09-21 | SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2509.17246 | null |
2025-09-23 | HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis | Zipeng Wang et.al. | 2509.17083 | null |
2025-09-21 | Efficient 3D Scene Reconstruction and Simulation from Sparse Endoscopic Views | Zhenya Yang et.al. | 2509.17027 | null |
2025-09-21 | PGSTalker: Real-Time Audio-Driven Talking Head Generation via 3D Gaussian Splatting with Pixel-Aware Density Control | Tianheng Zhu et.al. | 2509.16922 | null |
2025-09-21 | ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM | Amanuel T. Dufera et.al. | 2509.16863 | null |
2025-09-20 | MedGS: Gaussian Splatting for Multi-Modal 3D Medical Imaging | Kacper Marzol et.al. | 2509.16806 | null |
2025-09-20 | ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting | Xiaoyang Yan et.al. | 2509.16552 | null |
2025-09-19 | RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars | Weiyi Xiong et.al. | 2509.16119 | null |
2025-09-19 | Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval | Liwei Liao et.al. | 2509.15871 | null |
2025-09-19 | Camera Splatting for Continuous View Optimization | Gahye Lee et.al. | 2509.15677 | null |
2025-09-19 | FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting | Yuwei Jia et.al. | 2509.15648 | null |
2025-09-19 | GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading | Donghyun Lee et.al. | 2509.15645 | null |
2025-09-19 | MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild | Deming Li et.al. | 2509.15548 | null |
2025-09-18 | Causal Reasoning Elicits Controllable 3D Scene Generation | Shen Chen et.al. | 2509.15249 | null |
2025-09-18 | FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction | Jinlong Fan et.al. | 2509.14739 | null |
2025-09-18 | RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI | Cong Tai et.al. | 2509.14687 | null |
2025-09-17 | Perception-Integrated Safety Critical Control via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting | Dario Tscholl et.al. | 2509.14421 | null |
2025-09-17 | MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping | Zhihao Cao et.al. | 2509.14191 | null |
2025-09-17 | Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction | Yifan Mo et.al. | 2509.13938 | null |
2025-09-17 | LamiGauss: Pitching Radiative Gaussian for Sparse-View X-ray Laminography Reconstruction | Chu Chen et.al. | 2509.13863 | null |
2025-09-16 | MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM | Yinlong Bai et.al. | 2509.13536 | null |
2025-09-16 | Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization | Hao Xu et.al. | 2509.13482 | null |
2025-09-16 | Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image | Gaofeng Liu et.al. | 2509.13013 | null |
2025-09-16 | Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings | Abdalla Arafa et.al. | 2509.12938 | null |
2025-09-16 | Effective Gaussian Management for High-fidelity Object Reconstruction | Jiateng Liu et.al. | 2509.12742 | null |
2025-09-15 | Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization | Mengjiao Han et.al. | 2509.12138 | null |
2025-09-15 | Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting | Yi-Hsin Li et.al. | 2509.11853 | null |
2025-09-15 | A Controllable 3D Deepfake Generation Framework with Gaussian Splatting | Wending Liu et.al. | 2509.11624 | null |
2025-09-14 | On the Skinning of Gaussian Avatars | Nikolaos Zioulis et.al. | 2509.11411 | null |
2025-09-14 | ROSGS: Relightable Outdoor Scenes With Gaussian Splatting | Lianjun Liao et.al. | 2509.11275 | null |
2025-09-14 | SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting | Ashkan Taghipour et.al. | 2509.11116 | null |
2025-09-13 | AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting | Gurutva Patle et.al. | 2509.11003 | null |
2025-09-13 | Every Camera Effect, Every Time, All at Once: 4D Gaussian Ray Tracing for Physics-based Camera Effect Data Generation | Yi-Ruei Liu et.al. | 2509.10759 | null |
2025-09-12 | T2Bs: Text-to-Character Blendshapes via Video Generation | Jiahao Luo et.al. | 2509.10678 | null |
2025-09-15 | On the Geometric Accuracy of Implicit and Primitive-based Representations Derived from View Rendering Constraints | Elias De Smijter et.al. | 2509.10241 | null |
2025-09-09 | SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting | Mahtab Dahaghin et.al. | 2509.07809 | null |
2025-09-09 | HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting | Yimin Pan et.al. | 2509.07774 | null |
2025-09-09 | DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning | Wenzhi Guo et.al. | 2509.07493 | null |
2025-09-09 | DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation | Ze-Xin Yin et.al. | 2509.07435 | null |
2025-09-07 | MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning | Jiarui Chen et.al. | 2509.07021 | null |
2025-09-10 | VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes | Shengkai Zhang et.al. | 2509.06685 | null |
2025-09-15 | Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation | Ian Page et.al. | 2509.06433 | null |
2025-09-08 | 3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom | Matthieu Gendrin et.al. | 2509.06400 | null |
2025-09-05 | Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting | Sen Wang et.al. | 2509.05515 | null |
2025-09-05 | Toward Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization | Mengjiao Han et.al. | 2509.05216 | null |
2025-09-05 | Symbolic Graphics Programming with Large Language Models | Yamei Chen et.al. | 2509.05208 | null |
2025-09-05 | GeoSplat: A Deep Dive into Geometry-Constrained Gaussian Splatting | Yangming Li et.al. | 2509.05075 | null |
2025-09-05 | CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus | Hannah Schieber et.al. | 2509.04859 | null |
2025-09-04 | SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer | Jimin Xu et.al. | 2509.04379 | null |
2025-09-03 | ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction | Sankeerth Durvasula et.al. | 2509.03775 | null |
2025-09-02 | Efficient Geometry Compression and Communication for 3D Gaussian Splatting Point Clouds | Liang Xie et.al. | 2509.02232 | null |
2025-09-02 | GRMM: Real-Time High-Fidelity Gaussian Morphable Head Model with Learned Residuals | Mohit Mendiratta et.al. | 2509.02141 | null |
2025-09-02 | 2D Gaussian Splatting with Semantic Alignment for Image Inpainting | Hongyu Li et.al. | 2509.01964 | null |
2025-09-01 | GaussianGAN: Real-Time Photorealistic controllable Human Avatars | Mohamed Ilyes Lakhal et.al. | 2509.01681 | null |
2025-09-01 | FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field | Fan Zhu et.al. | 2509.01547 | null |
2025-09-01 | Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars | Vanessa Sklyarova et.al. | 2509.01469 | null |
2025-08-31 | Towards Integrating Multi-Spectral Imaging with Gaussian Splatting | Josef Grün et.al. | 2509.00989 | null |
2025-09-03 | GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency | Joongho Jo et.al. | 2509.00911 | null |
2025-09-03 | UPGS: Unified Pose-aware Gaussian Splatting for Dynamic Scene Deblurring | Zhijing Wu et.al. | 2509.00831 | null |
2025-08-31 | SWAGSplatting: Semantic-guided Water-scene Augmented Gaussian Splatting | Zhuodong Jiang et.al. | 2509.00800 | null |
2025-08-31 | MarkSplatter: Generalizable Watermarking for 3D Gaussian Splatting Model via Splatter Image Structure | Xiufeng Huang et.al. | 2509.00757 | null |
2025-08-31 | DyPho-SLAM: Real-time Photorealistic SLAM in Dynamic Environments | Yi Liu et.al. | 2509.00741 | null |
2025-08-30 | AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection | Houshu He et.al. | 2509.00433 | null |
2025-08-29 | Complete Gaussian Splats from a Single Image with Denoising Diffusion Models | Ziwei Liao et.al. | 2508.21542 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-09-30 | PRISM: Progressive Rain removal with Integrated State-space Modeling | Pengze Xue et.al. | 2509.26413 | null |
2025-09-30 | Beyond Overall Accuracy: Pose- and Occlusion-driven Fairness Analysis in Pedestrian Detection for Autonomous Driving | Mohammad Khoshkdahan et.al. | 2509.26166 | null |
2025-09-30 | NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving | Yuan Gao et.al. | 2509.25944 | null |
2025-09-30 | Preemptive Spatiotemporal Trajectory Adjustment for Heterogeneous Vehicles in Highway Merging Zones | Yuan Li et.al. | 2509.25929 | null |
2025-09-30 | MuSLR: Multimodal Symbolic Logical Reasoning | Jundong Xu et.al. | 2509.25851 | null |
2025-09-30 | Cooperative Autonomous Driving in Diverse Behavioral Traffic: A Heterogeneous Graph Reinforcement Learning Approach | Qi Liu et.al. | 2509.25751 | null |
2025-09-29 | Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments | Zihan Zhang et.al. | 2509.25542 | null |
2025-09-29 | StreamForest: Efficient Online Video Understanding with Persistent Event Memory | Xiangyu Zeng et.al. | 2509.24871 | null |
2025-09-29 | TACO-Net: Topological Signatures Triumph in 3D Object Classification | Anirban Ghosh et.al. | 2509.24802 | null |
2025-09-29 | FuncPoison: Poisoning Function Library to Hijack Multi-agent Autonomous Driving Systems | Yuzhen Long et.al. | 2509.24408 | null |
2025-09-29 | Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning | Korbinian Moller et.al. | 2509.24313 | null |
2025-09-29 | Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds | Yongqiang Wang et.al. | 2509.24273 | null |
2025-09-28 | Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning | Muleilan Pei et.al. | 2509.23993 | null |
2025-09-28 | AutoPrune: Each Complexity Deserves a Pruning Policy | Hanshi Wang et.al. | 2509.23931 | null |
2025-09-28 | DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation | Haibao Yu et.al. | 2509.23922 | null |
2025-09-28 | Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios | Jinghan Xu et.al. | 2509.23895 | null |
2025-09-28 | From Static to Dynamic: a Survey of Topology-Aware Perception in Autonomous Driving | Yixiao Chen et.al. | 2509.23641 | null |
2025-09-28 | Foundation Model-Based Adaptive Semantic Image Transmission for Dynamic Wireless Environments | Fangyu Liu et.al. | 2509.23590 | null |
2025-09-28 | BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving | Shu Liu et.al. | 2509.23589 | null |
2025-09-27 | WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving | Ziyue Zhu et.al. | 2509.23402 | null |
2025-09-27 | Preventing Robotic Jailbreaking via Multimodal Domain Adaptation | Francesco Marchiori et.al. | 2509.23281 | null |
2025-09-26 | Persistent Autoregressive Mapping with Traffic Rules for Autonomous Driving | Shiyi Liang et.al. | 2509.22756 | null |
2025-09-26 | Self-driving cars: Are we there yet? | Merve Atasever et.al. | 2509.22754 | null |
2025-09-26 | An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment | Xiaoyun Qiu et.al. | 2509.22550 | null |
2025-09-26 | EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model | Andrii Litvynchuk et.al. | 2509.22527 | null |
2025-09-29 | A Multi-Modality Evaluation of the Reality Gap in Autonomous Driving Systems | Stefano Carlo Lambertenghi et.al. | 2509.22379 | null |
2025-09-26 | UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data | Yujian Yuan et.al. | 2509.22262 | null |
2025-09-26 | An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose | Qifeng Wang et.al. | 2509.22058 | null |
2025-09-25 | PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines | Zhixin Zhang et.al. | 2509.21563 | null |
2025-09-25 | Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement | Jianbo Zhao et.al. | 2509.20938 | null |
2025-09-25 | MTRDrive: Memory-Tool Synergistic Reasoning for Robust Autonomous Driving in Corner Cases | Ziang Luo et.al. | 2509.20843 | null |
2025-09-25 | DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation | Ved Umrajkar et.al. | 2509.20792 | null |
2025-09-25 | MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM | Yuxuan Zhou et.al. | 2509.20757 | null |
2025-09-25 | Cyber Racing Coach: A Haptic Shared Control Framework for Teaching Advanced Driving Skills | Congkai Shen et.al. | 2509.20653 | null |
2025-09-26 | AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving | Jinhao Chai et.al. | 2509.20253 | null |
2025-09-24 | Universal Camouflage Attack on Vision-Language Models for Autonomous Driving | Dehong Kong et.al. | 2509.20196 | null |
2025-09-24 | Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving | Pengxiang Li et.al. | 2509.20109 | null |
2025-09-25 | Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models | Juana Valeria Hurtado et.al. | 2509.20107 | null |
2025-09-24 | Steerable Adversarial Scenario Generation through Test-Time Preference Alignment | Tong Nie et.al. | 2509.20102 | null |
2025-09-25 | OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving | Pei Liu et.al. | 2509.19973 | null |
2025-09-24 | BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting | Yixun Zhang et.al. | 2509.19793 | null |
2025-09-24 | RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving | Carlo Bosio et.al. | 2509.19789 | null |
2025-09-24 | EfficienT-HDR: An Efficient Transformer-Based Framework via Multi-Exposure Fusion for HDR Reconstruction | Yu-Shen Huang et.al. | 2509.19779 | null |
2025-09-23 | The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar | William L. Muckelroy III et.al. | 2509.19644 | null |
2025-09-23 | Coordinated PSO-PID based longitudinal control with LPV-MPC based lateral control for autonomous vehicles | Yassine Kebbati et.al. | 2509.19529 | null |
2025-09-23 | Autonomous driving using an optimized neural network based adaptive LPV-MPC controller | Yassine Kebbati et.al. | 2509.19523 | null |
2025-09-23 | Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation | Sherwin Bahmani et.al. | 2509.19296 | null |
2025-09-24 | An on-chip Pixel Processing Approach with 2.4μs latency for Asynchronous Read-out of SPAD-based dToF Flash LiDARs | Yiyang Liu et.al. | 2509.19192 | null |
2025-09-23 | TriFusion-AE: Language-Guided Depth and LiDAR Fusion for Robust Point Cloud Processing | Susmit Neogi et.al. | 2509.18743 | null |
2025-09-23 | The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving | Jay Patrikar et.al. | 2509.18626 | null |
2025-09-23 | MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving | Yuzhi Wu et.al. | 2509.18613 | null |
2025-09-23 | PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving | Chengran Yuan et.al. | 2509.18609 | null |
2025-09-23 | Spatial Envelope MPC: High Performance Driving without a Reference | Siyuan Yu et.al. | 2509.18506 | null |
2025-09-22 | AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback | Yunhao Yang et.al. | 2509.18384 | null |
2025-09-23 | V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts | Hsu-kuang Chiu et.al. | 2509.18053 | null |
2025-09-22 | DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving | Shuyao Shang et.al. | 2509.17940 | null |
2025-09-22 | SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model | Xiao Zhou et.al. | 2509.17850 | null |
2025-09-22 | RSU-Assisted Resource Allocation for Collaborative Perception | Guowei Liu et.al. | 2509.17691 | null |
2025-09-22 | Predicting Depth Maps from Single RGB Images and Addressing Missing Information in Depth Estimation | Mohamad Mofeed Chaar et.al. | 2509.17686 | null |
2025-09-22 | Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method | Gregory Schroeder et.al. | 2509.17620 | null |
2025-09-22 | Vision-Based Driver Drowsiness Monitoring: Comparative Analysis of YOLOv5-v11 Models | Dilshara Herath et.al. | 2509.17498 | null |
2025-09-22 | FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR | Junzhe Wu et.al. | 2509.17390 | null |
2025-09-22 | Multi-Scenario Highway Lane-Change Intention Prediction: A Physics-Informed AI Framework for Three-Class Classification | Jiazhao Shi et.al. | 2509.17354 | null |
2025-09-21 | Optimized adaptive MPC for lateral control of autonomous vehicles | Yassine Kebbati et.al. | 2509.17215 | null |
2025-09-21 | CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving | Ruiguo Zhong et.al. | 2509.17080 | null |
2025-09-21 | Orchestrate, Generate, Reflect: A VLM-Based Multi-Agent Collaboration Framework for Automated Driving Policy Learning | Zengqi Peng et.al. | 2509.17042 | null |
2025-09-21 | Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving | Xuan Chen et.al. | 2509.16950 | null |
2025-09-21 | End2Race: Efficient End-to-End Imitation Learning for Real-Time F1Tenth Racing | Zhijie Qiao et.al. | 2509.16894 | null |
2025-09-20 | Improve bounding box in Carla Simulator | Mohamad Mofeed Chaar et.al. | 2509.16773 | null |
2025-09-20 | Are VLMs Ready for Lane Topology Awareness in Autonomous Driving? | Xin Chen et.al. | 2509.16654 | null |
2025-09-20 | ADVEDM:Fine-grained Adversarial Attack against VLM-based Embodied Agents | Yichen Wang et.al. | 2509.16645 | null |
2025-09-20 | SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving | Haiming Zhang et.al. | 2509.16588 | null |
2025-09-20 | ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting | Xiaoyang Yan et.al. | 2509.16552 | null |
2025-09-20 | RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation | Tianyi Yan et.al. | 2509.16500 | null |
2025-09-19 | RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars | Weiyi Xiong et.al. | 2509.16119 | null |
2025-09-19 | CoPAD : Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios | Kangyu Wu et.al. | 2509.15984 | null |
2025-09-19 | CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine | Shiyu Fang et.al. | 2509.15968 | null |
2025-09-19 | RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation | Paul Julius Kühn et.al. | 2509.15886 | null |
2025-09-19 | ThermalGuardian: Temperature-Aware Testing of Automotive Deep Learning Frameworks | Yinglong Zou et.al. | 2509.15815 | null |
2025-09-19 | CBPNet: A Continual Backpropagation Prompt Network for Alleviating Plasticity Loss on Edge Devices | Runjie Shao et.al. | 2509.15785 | null |
2025-09-19 | Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution | Chang Soo Lim et.al. | 2509.15781 | null |
2025-09-18 | Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing | Christopher Oeltjen et.al. | 2509.15423 | null |
2025-09-18 | Out-of-Sight Trajectories: Tracking, Fusion, and Prediction | Haichao Zhang et.al. | 2509.15219 | null |
2025-09-18 | Digital Twin-based Cooperative Autonomous Driving in Smart Intersections: A Multi-Agent Reinforcement Learning Approach | Taoyuan Yu et.al. | 2509.15099 | null |
2025-09-18 | Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression | Xuan Deng et.al. | 2509.14591 | null |
2025-09-18 | DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising | Li Gao et.al. | 2509.14565 | null |
2025-09-17 | FlowDrive: Energy Flow Field for End-to-End Autonomous Driving | Hao Jiang et.al. | 2509.14303 | null |
2025-09-17 | MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping | Zhihao Cao et.al. | 2509.14191 | null |
2025-09-17 | BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection | Rongyu Zhang et.al. | 2509.14151 | null |
2025-09-17 | SEG-Parking: Towards Safe, Efficient, and Generalizable Autonomous Parking via End-to-End Offline Reinforcement Learning | Zewei Yang et.al. | 2509.13956 | null |
2025-09-17 | MAP: End-to-End Autonomous Driving with Map-Assisted Planning | Huilin Yin et.al. | 2509.13926 | null |
2025-09-17 | Ensemble of Pre-Trained Models for Long-Tailed Trajectory Prediction | Divya Thuremella et.al. | 2509.13914 | null |
2025-09-17 | Data-Efficient Spectral Classification of Hyperspectral Data Using MiniROCKET and HDC-MiniROCKET | Nick Theisen et.al. | 2509.13809 | null |
2025-09-17 | AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving | Yuechen Luo et.al. | 2509.13769 | null |
2025-09-17 | UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry | Tae-Wook Um et.al. | 2509.13713 | null |
2025-09-17 | FishBEV: Distortion-Resilient Bird’s Eye View Segmentation with Surround-View Fisheye Cameras | Hang Li et.al. | 2509.13681 | null |
2025-09-16 | TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning | Momchil S. Tomov et.al. | 2509.13579 | null |
2025-09-16 | Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving | Artem Savkin et.al. | 2509.13507 | null |
2025-09-16 | Road Obstacle Video Segmentation | Shyam Nandan Rai et.al. | 2509.13181 | null |
2025-09-17 | TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving | Jiawei Wang et.al. | 2509.13164 | null |
2025-09-16 | An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios | Zhihao Zhang et.al. | 2509.13132 | null |
2025-09-16 | Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving | Ruibo Li et.al. | 2509.13116 | null |
2025-09-16 | 4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar | Xiao Tang et.al. | 2509.12931 | null |
2025-09-16 | StereoCarla: A High-Fidelity Driving Dataset for Generalizable Stereo | Xianda Guo et.al. | 2509.12683 | null |
2025-09-16 | Maps for Autonomous Driving: Full-process Survey and Frontiers | Pengxin Chen et.al. | 2509.12632 | null |
2025-09-16 | DisorientLiDAR: Physical Attacks on LiDAR-based Localization | Yizhen Lao et.al. | 2509.12595 | null |
2025-09-15 | Approaches to Analysis and Design of AI-Based Autonomous Vehicles | Tao Yan et.al. | 2509.12169 | null |
2025-09-16 | Embodied Navigation Foundation Model | Jiazhao Zhang et.al. | 2509.12129 | null |
2025-09-15 | Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network | Navid Hashemi et.al. | 2509.11838 | null |
2025-09-15 | HeLoFusion: An Efficient and Scalable Encoder for Modeling Heterogeneous and Multi-Scale Interactions in Trajectory Prediction | Bingqing Wei et.al. | 2509.11719 | null |
2025-09-14 | SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion | Zhiwen Yang et.al. | 2509.11171 | null |
2025-09-13 | Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios | Simone Mosco et.al. | 2509.10841 | null |
2025-09-11 | Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey | Wei Dai et.al. | 2509.10570 | null |
2025-09-17 | DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training | Jianxin Shi et.al. | 2509.10426 | null |
2025-09-12 | Multimodal SAM-adapter for Semantic Segmentation | Iacopo Curti et.al. | 2509.10408 | null |
2025-09-12 | CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion | Santiago Montiel-Marín et.al. | 2509.10139 | null |
2025-09-12 | BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals | Minsang Kong et.al. | 2509.10080 | null |
2025-09-11 | MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network | Ge Sun et.al. | 2509.09200 | null |
2025-09-10 | LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations | Payal Varshney et.al. | 2509.08422 | null |
2025-09-10 | Sparse BEV Fusion with Self-View Consistency for Multi-View Detection and Tracking | Keisuke Toida et.al. | 2509.08421 | null |
2025-09-10 | InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection | Zhongyu Xia et.al. | 2509.08374 | null |
2025-09-10 | Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities | Rajendramayavan Sathyam et.al. | 2509.08302 | null |
2025-09-10 | A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator | Elahe Delavari et.al. | 2509.08221 | null |
2025-09-09 | Mean Field Game-Based Interactive Trajectory Planning Using Physics-Inspired Unified Potential Fields | Zhen Tian et.al. | 2509.08147 | null |
2025-09-09 | TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models | Zongzheng Zhang et.al. | 2509.07962 | null |
2025-09-09 | Bias-Aware Machine Unlearning: Towards Fairer Vision Models via Controllable Forgetting | Sai Siddhartha Chary Aylapuram et.al. | 2509.07456 | null |
2025-09-09 | Attention and Risk-Aware Decision Framework for Safe Autonomous Driving | Zhen Tian et.al. | 2509.07412 | null |
2025-09-09 | TEGRA: A Flexible & Scalable NextGen Mobile Core | Bilal Saleem et.al. | 2509.07410 | null |
2025-09-08 | SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis | Zhengqing Chen et.al. | 2509.06798 | null |
2025-09-08 | Adaptive Evolution Factor Risk Ellipse Framework for Reliable and Safe Autonomous Driving | Fujiang Yuan et.al. | 2509.06375 | null |
2025-09-07 | Asymmetry Vulnerability and Physical Attacks on Online Map Construction for Autonomous Driving | Yang Lou et.al. | 2509.06071 | null |
2025-09-06 | Scenario-based Decision-making Using Game Theory for Interactive Autonomous Driving: A Survey | Zhihao Lin et.al. | 2509.05777 | null |
2025-09-06 | Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh | Ha Meem Hossain et.al. | 2509.05652 | null |
2025-09-06 | OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision | Ruixun Liu et.al. | 2509.05578 | null |
2025-09-08 | LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation | Yinglin Duan et.al. | 2509.05263 | null |
2025-09-05 | Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet | Mohammad Saeid et.al. | 2509.05198 | null |
2025-09-05 | A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing | Chengkai Xu et.al. | 2509.04853 | null |
2025-09-05 | Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization | Dharsan Ravindran et.al. | 2509.04735 | null |
2025-09-04 | Bootstrapping Reinforcement Learning with Sub-optimal Policies for Autonomous Driving | Zhihao Zhang et.al. | 2509.04712 | null |
2025-09-04 | Domain Adaptation for Different Sensor Configurations in 3D Object Detection | Satoshi Tanaka et.al. | 2509.04711 | null |
2025-09-04 | In-Context Policy Adaptation via Cross-Domain Skill Diffusion | Minjong Yoo et.al. | 2509.04535 | null |
2025-09-09 | One Flight Over the Gap: A Survey from Perspective to Panoramic Vision | Xin Lin et.al. | 2509.04444 | null |
2025-09-04 | TriLiteNet: Lightweight Model for Multi-Task Visual Perception | Quang-Huy Che et.al. | 2509.04092 | null |
2025-09-04 | SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation | Han Huang et.al. | 2509.03999 | null |
2025-09-03 | sam-llm: interpretable lane change trajectory prediction via parametric finetuning | Zhuo Cao et.al. | 2509.03462 | null |
2025-09-03 | Rashomon in the Streets: Explanation Ambiguity in Scene Understanding | Helge Spieker et.al. | 2509.03169 | null |
2025-09-03 | Automatically Generating High-Precision Simulated Road Networking in Traffic Scenario | Liang Xie et.al. | 2509.02990 | null |
2025-09-03 | KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models | Yujin Wang et.al. | 2509.02966 | null |
2025-09-02 | Do LLM Modules Generalize? A Study on Motion Generation for Autonomous Driving | Mingyi Wang et.al. | 2509.02754 | null |
2025-09-02 | 2nd Place Solution for CVPR2024 E2E Challenge: End-to-End Autonomous Driving Using Vision Language Model | Zilong Guo et.al. | 2509.02659 | null |
2025-09-02 | Omnidirectional Spatial Modeling from Correlated Panoramas | Xinshen Zhang et.al. | 2509.02164 | null |
2025-09-02 | Txt2Sce: Scenario Generation for Autonomous Driving System Testing Based on Textual Reports | Pin Ji et.al. | 2509.02150 | null |
2025-09-02 | Curiosity-Driven Testing for Sequential Decision-Making Process | Junda He et.al. | 2509.02025 | null |
2025-09-02 | Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions | Beibei Zhou et.al. | 2509.02011 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-09-30 | Memory-Efficient 2D/3D Shape Assembly of Robot Swarms | Shuoyu Yue et.al. | 2509.26518 | null |
2025-09-30 | Classical feature map surrogates and metrics for quantum control landscapes | Martino Calzavara et.al. | 2509.25930 | null |
2025-09-25 | Neural Integrated Sensing and Communication for the MIMO-OFDM Downlink | Ziyi Wang et.al. | 2509.21118 | null |
2025-09-18 | Semantic-LiDAR-Inertial-Wheel Odometry Fusion for Robust Localization in Large-Scale Dynamic Environments | Haoxuan Jiang et.al. | 2509.14999 | null |
2025-09-17 | Charting trajectories of human thought using large language models | Matthew M Nour et.al. | 2509.14455 | null |
2025-09-17 | FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph | Xiaolin Zhou et.al. | 2509.13733 | null |
2025-09-16 | Maps for Autonomous Driving: Full-process Survey and Frontiers | Pengxin Chen et.al. | 2509.12632 | null |
2025-09-15 | Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing | Bingyu Li et.al. | 2509.12040 | null |
2025-09-11 | ObjectReact: Learning Object-Relative Control for Visual Navigation | Sourav Garg et.al. | 2509.09594 | null |
2025-09-01 | Hierarchical Motion Captioning Utilizing External Text Data Source | Clayton Leite et.al. | 2509.01471 | null |
2025-08-19 | MMIS-Net for Retinal Fluid Segmentation and Detection | Nchongmaje Ndipenocha et.al. | 2508.13936 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-07-29 | MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving | Thomas Monninger et.al. | 2507.21423 | null |
2025-09-15 | RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction | Yuqing Lan et.al. | 2507.17594 | null |
2025-07-15 | Mapping Fusion: Improving FPGA Technology Mapping with ASIC Mapper | Cunxi Yu et.al. | 2507.10912 | null |
2025-07-07 | Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR | Tao Du et.al. | 2507.04662 | null |
2025-07-11 | Learning to Generate Vectorized Maps at Intersections with Multiple Roadside Cameras | Quanxin Zheng et.al. | 2507.02899 | null |
2025-06-27 | Norm-dependent Lamperti-type MAP representations of stable processes and Brownian motions in the orthant | Andreas E. Kyprianou et.al. | 2506.22020 | null |
2025-06-26 | CURL-SLAM: Continuous and Compact LiDAR Mapping | Kaicheng Zhang et.al. | 2506.21077 | null |
2025-06-25 | Communication-Aware Map Compression for Online Path-Planning: A Rate-Distortion Approach | Ali Reza Pedram et.al. | 2506.20579 | null |
2025-07-16 | Cross-Layer Discrete Concept Discovery for Interpreting Language Models | Ankur Garg et.al. | 2506.20040 | null |
2025-06-17 | TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping | Jeewon Kim et.al. | 2506.14178 | null |
2025-06-16 | Complexity of Coexistence Regions in the GRHT Map | Sishu Shankar Muni et.al. | 2506.13515 | null |
2025-06-09 | ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving | Yongkang Li et.al. | 2506.08052 | null |
2025-06-07 | Multimodal Spatial Language Maps for Robot Navigation and Manipulation | Chenguang Huang et.al. | 2506.06862 | null |
2025-08-19 | Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU | Wenhao Dai et.al. | 2506.06095 | null |
2025-07-31 | X-ray Polarization Detection of the Pulsar Wind Nebula in G21.5-0.9 with IXPE | Niccolò Di Lalla et.al. | 2506.05630 | null |
2025-08-13 | DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes | Jiajun Jiang et.al. | 2506.01950 | null |
2025-06-05 | ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real | Youwei Yu et.al. | 2506.01759 | null |
2025-06-01 | Globally Consistent RGB-D SLAM with 2D Gaussian Splatting | Xingguang Zhong et.al. | 2506.00970 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-10 | A Steerable Deep Network for Model-Free Diffusion MRI Registration | Gianfranco Cortes et.al. | 2501.04794 | null |
2024-04-19 | DeeperHistReg: Robust Whole Slide Images Registration Framework | Marek Wodzinski et.al. | 2404.14434 | null |
2024-04-26 | RegWSI: Whole Slide Image Registration using Combined Deep Feature- and Intensity-Based Methods: Winner of the ACROBAT 2023 Challenge | Marek Wodzinski et.al. | 2404.13108 | null |
2024-01-05 | Partition-based Nonrigid Registration for 3D Face Model | Yuping Ye et.al. | 2401.02607 | null |
2022-05-21 | Myocardial Segmentation of Late Gadolinium Enhanced MR Images by Propagation of Contours from Cine MR Images | Dong Wei et.al. | 2205.10595 | null |
2020-06-11 | Nonrigid registration using Gaussian processes and local likelihood estimation | Ashton Wiens et.al. | 2006.06864 | null |
2019-04-01 | Automatic Nonrigid Histological Image Registration with Adaptive Multistep Algorithm | Marek Wodzinski et.al. | 1904.00982 | null |
2019-04-07 | Symmetry-guided nonrigid registration: the case for distortion correction in multidimensional photoemission spectroscopy | Rui Patrick Xian et.al. | 1901.00312 | null |
2018-12-25 | A Survey on Non-rigid 3D Shape Analysis | Hamid Laga et.al. | 1812.10111 | null |
2015-04-14 | A Multicomponent Approach to Nonrigid Registration of Diffusion Tensor Images | Mohammed Khader et.al. | 1504.01800 | null |
2013-04-03 | Scale Selection of Adaptive Kernel Regression by Joint Saliency Map for Nonrigid Image Registration | Zhuangming Shen et.al. | 1303.0479 | null |
2013-04-15 | Local Structure Matching Driven by Joint-Saliency-Structure Adaptive Kernel Regression | Binjie Qin et.al. | 1302.0494 | null |
2011-04-21 | A Meshless Method for Variational Nonrigid 2-D Shape Registration | Wei Liu et.al. | 1104.4168 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-09-30 | Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization | Yaoxiang Wang et.al. | 2509.26520 | null |
2025-09-30 | Nephrobase Cell+: Multimodal Single-Cell Foundation Model for Decoding Kidney Biology | Chenyu Li et.al. | 2509.26223 | null |
2025-09-30 | Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline | Haiyang Li et.al. | 2509.25991 | null |
2025-09-30 | UniMMAD: Unified Multi-Modal and Multi-Class Anomaly Detection via MoE-Driven Feature Decompression | Yuan Zhao et.al. | 2509.25934 | null |
2025-09-30 | Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel | Chuanyang Zheng et.al. | 2509.25913 | null |
2025-09-30 | A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI | Arvind Murari Vepa et.al. | 2509.25889 | null |
2025-09-30 | Collaborative Compression for Large-Scale MoE Deployment on Edge | Yixiao Chen et.al. | 2509.25689 | null |
2025-09-30 | LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts | Yuan Zhuang et.al. | 2509.25684 | null |
2025-09-30 | Guiding Mixture-of-Experts with Temporal Multimodal Interactions | Xing Han et.al. | 2509.25678 | null |
2025-09-29 | K-Prism: A Knowledge-Guided and Prompt Integrated Universal Medical Image Segmentation Model | Bangwei Guo et.al. | 2509.25594 | null |
2025-09-29 | MAESTRO : Adaptive Sparse Attention and Robust Learning for Multimodal Dynamic Time Series | Payal Mohapatra et.al. | 2509.25278 | null |
2025-09-29 | GRACE-MoE: Grouping and Replication with Locality-Aware Routing for Efficient Distributed MoE Inference | Yu Han et.al. | 2509.25041 | null |
2025-09-29 | LEAF: A Robust Expert-Based Framework for Few-Shot Continual Event Detection | Bao-Ngoc Dao et.al. | 2509.24547 | null |
2025-09-29 | One-Prompt Strikes Back: Sparse Mixture of Experts for Prompt-based Continual Learning | Minh Le et.al. | 2509.24483 | null |
2025-09-29 | Muon: Training and Trade-offs with Latent Attention and MoE | Sushant Mehta et.al. | 2509.24406 | null |
2025-09-29 | LLaDA-MoE: A Sparse MoE Diffusion Language Model | Fengqi Zhu et.al. | 2509.24389 | null |
2025-09-29 | Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning | Zhisheng Chen et.al. | 2509.24222 | null |
2025-09-28 | HunyuanImage 3.0 Technical Report | Siyu Cao et.al. | 2509.23951 | null |
2025-09-28 | Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms | Jiahao Ying et.al. | 2509.23933 | null |
2025-09-28 | Bayesian Mixture-of-Experts: Towards Making LLMs Know What They Don’t Know | Albus Yizhuo Li et.al. | 2509.23830 | null |
2025-09-28 | A Modality-Tailored Graph Modeling Framework for Urban Region Representation via Contrastive Learning | Yaya Zhao et.al. | 2509.23772 | null |
2025-09-28 | Towards a Comprehensive Scaling Law of Mixture-of-Experts | Guoliang Zhao et.al. | 2509.23678 | null |
2025-09-28 | PreScope: Unleashing the Power of Prefetching for Resource-Constrained MoE Inference | Enda Yu et.al. | 2509.23638 | null |
2025-09-27 | Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions | Mingyi Luo et.al. | 2509.23248 | null |
2025-09-27 | MoE-PHDS: One MoE checkpoint for flexible runtime sparsity | Lauren A. Hannah et.al. | 2509.23012 | null |
2025-09-26 | Tiny-QMoE | Jack Cashman et.al. | 2509.22951 | null |
2025-09-26 | Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time | Yixuan Han et.al. | 2509.22572 | null |
2025-09-26 | Learning to Ball: Composing Policies for Long-Horizon Basketball Moves | Pei Xu et.al. | 2509.22442 | null |
2025-09-26 | Role-Aware Multi-modal federated learning system for detecting phishing webpages | Bo Wang et.al. | 2509.22369 | null |
2025-09-26 | HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space | Ke Li et.al. | 2509.22299 | null |
2025-09-26 | Unlocking the Power of Mixture-of-Experts for Task-Aware Time Series Analytics | Xingjian Wu et.al. | 2509.22279 | null |
2025-09-26 | MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning | Tao Wu et.al. | 2509.21953 | null |
2025-09-26 | Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts | Naibin Gu et.al. | 2509.21892 | null |
2025-09-26 | ChaosNexus: A Foundation Model for Universal Chaotic System Forecasting with Multi-scale Representations | Chang Liu et.al. | 2509.21802 | null |
2025-09-26 | LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE | Yu Shang et.al. | 2509.21790 | null |
2025-09-24 | MIXRAG : Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering | Lihui Liu et.al. | 2509.21391 | null |
2025-09-25 | Distributed Specialization: Rare-Token Neurons in Large Language Models | Jing Liu et.al. | 2509.21163 | null |
2025-09-26 | Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns | Xuemiao Zhang et.al. | 2509.21124 | null |
2025-09-25 | Physics Informed Neural Networks for design optimisation of diamond particle detectors for charged particle fast-tracking at high luminosity hadron colliders | Alessandro Bombini et.al. | 2509.21123 | null |
2025-09-24 | Dynamic Reasoning Chains through Depth-Specialized Mixture-of-Experts in Transformer Architectures | Sampurna Roy et.al. | 2509.20577 | null |
2025-09-24 | Developer Productivity With and Without GitHub Copilot: A Longitudinal Mixed-Methods Case Study | Viktoria Stray et.al. | 2509.20353 | null |
2025-09-24 | SHMoAReg: Spark Deformable Image Registration via Spatial Heterogeneous Mixture of Experts and Attention Heads | Yuxi Zheng et.al. | 2509.20073 | null |
2025-09-24 | Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference | Ziyi Han et.al. | 2509.19781 | null |
2025-09-23 | Human-AI Narrative Synthesis to Foster Shared Understanding in Civic Decision-Making | Cassandra Overney et.al. | 2509.19643 | null |
2025-09-21 | A Statistical Mixture-of-Experts Framework for EMG Artifact Removal in EEG: Empirical Insights and a Proof-of-Concept Application | Benjamin J. Choi et.al. | 2509.19385 | null |
2025-09-23 | DevFD: Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces | Tianshuo Zhang et.al. | 2509.19230 | null |
2025-09-23 | Frequency-Domain Decomposition and Recomposition for Robust Audio-Visual Segmentation | Yunzhe Shen et.al. | 2509.18912 | null |
2025-09-23 | LongCat-Flash-Thinking Technical Report | Meituan LongCat Team et.al. | 2509.18883 | null |
2025-09-23 | PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving | Chengran Yuan et.al. | 2509.18609 | null |
2025-09-23 | Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts | Qi Wang et.al. | 2509.18542 | null |
2025-09-23 | StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models | Haoxin Yang et.al. | 2509.17993 | null |
2025-09-23 | Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark | Siu Hang Ho et.al. | 2509.17894 | null |
2025-09-22 | Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving | Ziming Liu et.al. | 2509.17863 | null |
2025-09-22 | SSNet: Flexible and robust channel extrapolation for fluid antenna systems enabled by an self-supervised learning framework | Yuan Gao et.al. | 2509.17797 | null |
2025-09-22 | Qwen3-Omni Technical Report | Jin Xu et.al. | 2509.17765 | null |
2025-09-22 | Attention-based Mixture of Experts for Robust Speech Deepfake Detection | Viola Negroni et.al. | 2509.17585 | null |
2025-09-22 | Robust Mixture Models for Algorithmic Fairness Under Latent Heterogeneity | Siqi Li et.al. | 2509.17411 | null |
2025-09-21 | MoEs Are Stronger than You Think: Hyper-Parallel Inference Scaling with RoE | Soheil Zibakhsh et.al. | 2509.17238 | null |
2025-09-21 | A community-driven optimization framework for redrawing school attendance boundaries | Hongzhao Guan et.al. | 2509.17130 | null |
2025-09-21 | CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception | Lingzhao Kong et.al. | 2509.17107 | null |
2025-09-21 | Dynamic Expert Specialization: Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation | Junzhuo Li et.al. | 2509.16882 | null |
2025-09-20 | KungfuBot2: Learning Versatile Motion Skills for Humanoid Whole-Body Control | Jinrui Han et.al. | 2509.16638 | null |
2025-09-19 | DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning | Sikai Bai et.al. | 2509.16105 | null |
2025-09-19 | MoE-CE: Enhancing Generalization for Deep Learning based Channel Estimation via a Mixture-of-Experts Framework | Tianyu Li et.al. | 2509.15964 | null |
2025-09-19 | pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation | Tong Wang et.al. | 2509.15638 | null |
2025-09-19 | MEC-Quant: Maximum Entropy Coding for Extremely Low Bit Quantization-Aware Training | Junbiao Pang et.al. | 2509.15514 | null |
2025-09-18 | SPH-Net: A Co-Attention Hybrid Model for Accurate Stock Price Prediction | Yiyang Wu et.al. | 2509.15414 | null |
2025-09-18 | Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing | Zichen Wu et.al. | 2509.15361 | null |
2025-09-18 | Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting | Liran Nochumsohn et.al. | 2509.15105 | null |
2025-09-18 | Adaptive LoRA Experts Allocation and Selection for Federated Fine-Tuning | Lei Wang et.al. | 2509.15087 | null |
2025-09-18 | EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence | Chaoyin She et.al. | 2509.14977 | null |
2025-09-18 | FURINA: Free from Unmergeable Router via LINear Aggregation of mixed experts | Jiayi Han et.al. | 2509.14900 | null |
2025-09-18 | CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human | Nan Sun et.al. | 2509.14889 | null |
2025-09-15 | SparseDoctor: Towards Efficient Chat Doctor with Mixture of Experts Enhanced Large Language Models | Zhang Jianbin et.al. | 2509.14269 | null |
2025-09-17 | CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts | Leonard Hackel et.al. | 2509.14104 | null |
2025-09-18 | SAIL-VL2 Technical Report | Weijie Yin et.al. | 2509.14033 | null |
2025-09-17 | Mixture of Low-Rank Adapter Experts in Generalizable Audio Deepfake Detection | Janne Laakkonen et.al. | 2509.13878 | null |
2025-09-17 | Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation | Nguyen Lan Vi Vu et.al. | 2509.13834 | null |
2025-09-18 | Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers | Manan Mittal et.al. | 2509.13548 | null |
2025-09-18 | GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR | Yujie Guo et.al. | 2509.13093 | null |
2025-09-16 | Dual-Stage Reweighted MoE for Long-Tailed Egocentric Mistake Detection | Boyu Han et.al. | 2509.12990 | null |
2025-09-16 | Bridging Perception and Planning: Towards End-to-End Planning for Signal Temporal Logic Tasks | Bowen Ye et.al. | 2509.12813 | null |
2025-09-16 | MEGAN: Mixture of Experts for Robust Uncertainty Estimation in Endoscopy Videos | Damola Agbelese et.al. | 2509.12772 | null |
2025-09-17 | NavMoE: Hybrid Model- and Learning-based Traversability Estimation for Local Navigation via Mixture of Experts | Botao He et.al. | 2509.12747 | null |
2025-09-16 | AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models | Heng Zhang et.al. | 2509.12715 | null |
2025-09-18 | Ensembling Large Language Models for Code Vulnerability Detection: An Empirical Evaluation | Zhihong Sun et.al. | 2509.12629 | null |
2025-09-15 | A high fraction of close massive binary stars at low metallicity | H. Sana et.al. | 2509.12488 | null |
2025-09-16 | When MoE Meets Blockchain: A Trustworthy Distributed Framework of Large Models | Weihao Zhu et.al. | 2509.12141 | null |
2025-09-15 | Dynamic Adaptive Parsing of Temporal and Cross-Variable Patterns for Network State Classification | Yuan Gao et.al. | 2509.11601 | null |
2025-09-15 | RadioLAM: A Large AI Model for Fine-Grained 3D Radio Map Estimation | Zhiyuan Liu et.al. | 2509.11571 | null |
2025-09-14 | Knowledge-Guided Adaptive Mixture of Experts for Precipitation Prediction | Chen Jiang et.al. | 2509.11459 | null |
2025-09-14 | MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation | Syed Talal Wasim et.al. | 2509.11394 | null |
2025-09-14 | On Linear Mode Connectivity of Mixture-of-Experts Architectures | Viet-Hoang Tran et.al. | 2509.11348 | null |
2025-09-13 | Lightweight Metadata-Aware Mixture-of-Experts Masked Autoencoder for Earth Observation | Mohanad Albughdadi et.al. | 2509.10919 | null |
2025-09-12 | RefactorCoderQA: Benchmarking LLMs for Multi-Domain Coding Question Solutions in Cloud and Edge Deployment | Shadikur Rahman et.al. | 2509.10436 | null |
2025-09-12 | Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs | Yixiao Zhou et.al. | 2509.10377 | null |
2025-09-12 | Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts | Strahinja Nikolic et.al. | 2509.10025 | null |
2025-09-11 | Combining Textual and Spectral Features for Robust Classification of Pilot Communications | Abdullah All Tanvir et.al. | 2509.09752 | null |
2025-09-11 | Steering MoE LLMs via Expert (De)Activation | Mohsen Fayyaz et.al. | 2509.09660 | null |
2025-09-11 | HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing | Haochen Huang et.al. | 2509.09420 | null |
2025-09-11 | MoLEx: Mixture of LoRA Experts in Speech Self-Supervised Models for Audio Deepfake Detection | Zihan Pan et.al. | 2509.09175 | null |
2025-09-11 | Compass-v3: Scaling Domain-Specific LLMs for Multilingual E-Commerce in Southeast Asia | Sophia Maria et.al. | 2509.09121 | null |
2025-09-10 | MoWE : A Mixture of Weather Experts | Dibyajyoti Chakraborty et.al. | 2509.09052 | null |
2025-09-15 | Too Helpful, Too Harmless, Too Honest or Just Right? | Gautam Siddharth Kashyap et.al. | 2509.08486 | null |
2025-09-10 | Joint Learning using Mixture-of-Expert-Based Representation for Enhanced Speech Generation and Robust Emotion Recognition | Jing-Tong Tzeng et.al. | 2509.08470 | null |
2025-09-10 | Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism | Jiaming Yan et.al. | 2509.08342 | null |
2025-09-09 | SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery | Fengyu She et.al. | 2509.08032 | null |
2025-09-09 | One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning | Yuan Pu et.al. | 2509.07945 | null |
2025-09-09 | MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model? | Songkai Ma et.al. | 2509.07727 | null |
2025-09-09 | DuoServe-MoE: Dual-Phase Expert Prefetch and Cache Scheduling for Efficient MoE LLM Inference | Yuning Zhang et.al. | 2509.07379 | null |
2025-09-11 | PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions | Yixuan Tang et.al. | 2509.07370 | null |
2025-09-11 | CAME-AB: Cross-Modality Attention with Mixture-of-Experts for Antibody Binding Site Prediction | Hongzong Li et.al. | 2509.06465 | null |
2025-09-08 | Ban&Pick: Achieving Free Performance Gains and Inference Speedup via Smarter Routing in MoE-LLMs | Yuanteng Chen et.al. | 2509.06346 | null |
2025-09-08 | MCTuner: Spatial Decomposition-Enhanced Database Tuning via LLM-Guided Exploration | Zihan Yan et.al. | 2509.06298 | null |
2025-09-05 | SpikingBrain Technical Report: Spiking Brain-inspired Large Models | Yuqi Pan et.al. | 2509.05276 | null |
2025-09-05 | Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers | Svetlana Pavlitska et.al. | 2509.05086 | null |
2025-09-05 | Phase-field and lip-field approaches for fracture with extreme mesh deformation (X-Mesh): a one-dimensional study | Nicolas Moës et.al. | 2509.04971 | null |
2025-09-05 | A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing | Chengkai Xu et.al. | 2509.04853 | null |
2025-09-05 | REMOTE: A Unified Multimodal Relation Extraction Framework with Multilevel Optimal Transport and Mixture-of-Experts | Xinkui Lin et.al. | 2509.04844 | null |
2025-09-05 | Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation | Svetlana Pavlitska et.al. | 2509.04816 | null |
2025-09-04 | Wav2DF-TSL: Two-stage Learning with Efficient Pre-training and Hierarchical Experts Fusion for Robust Audio Deepfake Detection | Yunqi Hao et.al. | 2509.04161 | null |
2025-09-03 | Hierarchical Federated Foundation Models over Wireless Networks for Multi-Modal Multi-Task Intelligence: Integration of Edge Learning with D2D/P2P-Enabled Fog Learning Architectures | Payam Abdisarabshali et.al. | 2509.03695 | null |
2025-09-03 | OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation | Han Li et.al. | 2509.03498 | null |
2025-09-02 | LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference | Krishna Teja Chitty-Venkata et.al. | 2509.02753 | null |
2025-09-02 | Acrobotics: A Generalist Approach To Quadrupedal Robots’ Parkour | Guillaume Gagné-Labelle et.al. | 2509.02727 | null |
2025-09-02 | MoPEQ: Mixture of Mixed Precision Quantized Experts | Krishna Teja Chitty-Venkata et.al. | 2509.02512 | null |
2025-09-02 | Cache Management for Mixture-of-Experts LLMs – extended version | Spyros Angelopoulos et.al. | 2509.02408 | null |
2025-09-02 | OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds | Longrong Yang et.al. | 2509.02322 | null |
2025-09-01 | Automatic Screening of Parkinson’s Disease from Visual Explorations | Maria F. Alcala-Durand et.al. | 2509.01326 | null |
2025-09-01 | LongCat-Flash Technical Report | Meituan LongCat Team et.al. | 2509.01322 | null |
2025-09-01 | SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation | Chenyang Le et.al. | 2509.01200 | null |
2025-09-06 | Joint Information Extraction Across Classical and Modern Chinese with Tea-MOELoRA | Xuemei Tang et.al. | 2509.01158 | null |
2025-08-31 | MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper | Runjia Zeng et.al. | 2509.00996 | null |
2025-08-31 | Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling | Junfeng Ran et.al. | 2509.00679 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-09-30 | PRISM: Progressive Rain removal with Integrated State-space Modeling | Pengze Xue et.al. | 2509.26413 | null |
2025-09-30 | Neural Network State-Space Estimators | Minxing Sun et.al. | 2509.25959 | null |
2025-09-30 | Bringing Emerging Architectures to Sequence Labeling in NLP | Ana Ezquerro et.al. | 2509.25918 | null |
2025-09-29 | Benchmarking ECG Foundational Models: A Reality Check Across Clinical Tasks | M A Al-Masud et.al. | 2509.25095 | null |
2025-09-29 | DyMoDreamer: World Modeling with Dynamic Modulation | Boxuan Zhang et.al. | 2509.24804 | null |
2025-09-29 | Q-Net: Transferable Queue Length Estimation via Kalman-based Neural Networks | Ting Gao et.al. | 2509.24725 | null |
2025-09-29 | Wavelet-Assisted Mamba for Satellite-Derived Sea Surface Temperature Super-Resolution | Wankun Chen et.al. | 2509.24334 | null |
2025-09-29 | Similarity-Aware Selective State-Space Modeling for Semantic Correspondence | Seungwook Kim et.al. | 2509.24318 | null |
2025-09-28 | HyMaTE: A Hybrid Mamba and Transformer Model for EHR Representation Learning | Md Mozaharul Mottalib et.al. | 2509.24118 | null |
2025-09-28 | Hazy Pedestrian Trajectory Prediction via Physical Priors and Graph-Mamba | Jian Chen et.al. | 2509.24020 | null |
2025-09-28 | Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression | Jiarui Jiang et.al. | 2509.23779 | null |
2025-09-28 | EfficientMIL: Efficient Linear-Complexity MIL Method for WSI Classification | Chengying She et.al. | 2509.23640 | null |
2025-09-26 | TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses | Sahar Dastani et.al. | 2509.22813 | null |
2025-09-26 | StateX: Enhancing RNN Recall via Post-training State Expansion | Xingyu Shen et.al. | 2509.22630 | null |
2025-09-26 | Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models | Aleksandar Terzić et.al. | 2509.22284 | null |
2025-09-25 | MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation | Xinyu Liu et.al. | 2509.21265 | null |
2025-09-26 | Aligning Inductive Bias for Data-Efficient Generalization in State Space Models | Qiyu Chen et.al. | 2509.20789 | null |
2025-09-24 | SpecMamba: Accelerating Mamba Inference on FPGA with Speculative Decoding | Linfeng Zhong et.al. | 2509.19873 | null |
2025-09-24 | RoboSSM: Scalable In-context Imitation Learning via State-Space Models | Youngju Yoo et.al. | 2509.19658 | null |
2025-09-23 | Mamba Modulation: On the Length Generalization of Mamba | Peng Lu et.al. | 2509.19633 | null |
2025-09-23 | Tractable Approximation of Labeled Multi-Object Posterior Densities | Thi Hong Thai Nguyen et.al. | 2509.18780 | null |
2025-09-23 | An overview of neural architectures for self-supervised audio representation learning from masked spectrograms | Sarthak Yadav et.al. | 2509.18691 | null |
2025-09-23 | LEAF-Mamba: Local Emphatic and Adaptive Fusion State Space Model for RGB-D Salient Object Detection | Lanhu Wu et.al. | 2509.18683 | null |
2025-09-23 | LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA | Zeyi Kang et.al. | 2509.18576 | null |
2025-09-22 | Bayesian Nonhomogeneous hidden Markov models to leverage routine in physical activity monitoring with informative wear time | Beatrice Cantoni et.al. | 2509.17806 | null |
2025-09-22 | DA-Mamba: Dialogue-aware selective state-space model for multimodal engagement estimation | Shenwei Kang et.al. | 2509.17711 | null |
2025-09-22 | Achilles’ Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data | Tianyi Chen et.al. | 2509.17514 | null |
2025-09-21 | SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction | Djamel Eddine Boukhari et.al. | 2509.17172 | null |
2025-09-21 | Communication over LQG Control Systems: A Convex Optimization Approach to Capacity | Aharon Rips et.al. | 2509.17002 | null |
2025-09-19 | Estimating Clinical Lab Test Result Trajectories from PPG using Physiological Foundation Model and Patient-Aware State Space Model – a UNIPHY+ Approach | Minxiao Wang et.al. | 2509.16345 | null |
2025-09-19 | Mamba-2 audio captioning: design space exploration and analysis | Taehan Lee et.al. | 2509.15680 | null |
2025-09-19 | De-crackling Virtual Analog Controls with Asymptotically Stable Recurrent Neural Networks | Valtteri Kallinen et.al. | 2509.15622 | null |
2025-09-19 | DC-Mamba: Bi-temporal deformable alignment and scale-sparse enhancement for remote sensing change detection | Min Sun et.al. | 2509.15563 | null |
2025-09-17 | Classification Filtering | Ilker Bayram et.al. | 2509.13975 | null |
2025-09-17 | Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models | Motonari Kambara et.al. | 2509.13839 | null |
2025-09-17 | CETUS: Causal Event-Driven Temporal Modeling With Unified Variable-Rate Scheduling | Hanfang Liang et.al. | 2509.13784 | null |
2025-09-17 | State Space Models over Directed Graphs | Junzhi She et.al. | 2509.13735 | null |
2025-09-16 | Multivariate Low-Rank State-Space Model with SPDE Approach for High-Dimensional Data | Jacopo Rodeschini et.al. | 2509.12825 | null |
2025-09-15 | U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT | Zhi Qin Tan et.al. | 2509.12069 | null |
2025-09-15 | AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective | Yuchen Deng et.al. | 2509.12052 | null |
2025-09-15 | Joint-octamamba:an octa joint segmentation network based on feature enhanced mamba | Chuang Liu et.al. | 2509.11649 | null |
2025-09-14 | MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation | Syed Talal Wasim et.al. | 2509.11394 | null |
2025-09-14 | MEMBOT: Memory-Based Robot in Intermittent POMDP | Youzhi Liang et.al. | 2509.11225 | null |
2025-09-12 | FLARE-SSM: Deep State Space Models with Influence-Balanced Loss for 72-Hour Solar Flare Prediction | Yusuke Takagi et.al. | 2509.09988 | null |
2025-09-12 | MAESTRO: Multi-modal Adaptive Estimation for Temporal Respiratory Disease Outbreak | Hong Liu et.al. | 2509.08578 | null |
2025-09-10 | First-order State Space Model for Lightweight Image Super-resolution | Yujie Zhu et.al. | 2509.08458 | null |
2025-09-09 | A kernel-based approach to physics-informed nonlinear system identification | Cesare Donati et.al. | 2509.07634 | null |
2025-09-07 | Recursive State Inference for Linear PASFA | Vishal Rishi et.al. | 2509.07028 | null |
2025-09-06 | Hyperbolic Large Language Models | Sarang Patil et.al. | 2509.05757 | null |
2025-09-05 | A Bayesian Gaussian Process Dynamic Factor Model | Tony Chernis et.al. | 2509.04928 | null |
2025-09-05 | CD-Mamba: Cloud detection with long-range spatial dependency modeling | Tianxiang Xue et.al. | 2509.04729 | null |
2025-09-04 | VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation | Mustafa Munir et.al. | 2509.04669 | null |
2025-09-04 | Echo State Networks as State-Space Models: A Systems Perspective | Pradeep Singh et.al. | 2509.04422 | null |
2025-09-04 | Rethinking the long-range dependency in Mamba/SSM and transformer models | Cong Ma et.al. | 2509.04226 | null |
2025-09-03 | Time-Scaling State-Space Models for Dense Video Captioning | AJ Piergiovanni et.al. | 2509.03426 | null |
2025-09-03 | S2M2ECG: Spatio-temporal bi-directional State Space Model Enabled Multi-branch Mamba for ECG | Huaicheng Zhang et.al. | 2509.03066 | null |
2025-09-02 | Mentality: A Mamba-based Approach towards Foundation Models for EEG | Saarang Panchavati et.al. | 2509.02746 | null |
2025-09-02 | ESTM: An Enhanced Dual-Branch Spectral-Temporal Mamba for Anomalous Sound Detection | Chengyuan Ma et.al. | 2509.02471 | null |
2025-09-02 | AudioRWKV: Efficient and Stable Bidirectional RWKV for Audio Pattern Recognition | Jiayu Xiong et.al. | 2509.02167 | null |
2025-09-01 | A Mathematical Model of Hybrid Microgrid With Pole Placement Controller Using State Feedback For Stability Improvement | Yangyadatta Tripathy et.al. | 2509.01749 | null |
2025-09-01 | Mamba-CNN: A Hybrid Architecture for Efficient and Accurate Facial Beauty Prediction | Djamel Eddine Boukhari et.al. | 2509.01431 | null |
2025-09-01 | StoxLSTM: A Stochastic Extended Long Short-Term Memory Network for Time Series Forecasting | Zihao Wang et.al. | 2509.01187 | null |
2025-09-01 | SpectMamba: Integrating Frequency and State Space Models for Enhanced Medical Image Detection | Yao Wang et.al. | 2509.01080 | null |
2025-08-31 | Prospects of Imitating Trading Agents in the Stock Market | Mateusz Wilinski et.al. | 2509.00982 | null |
2025-08-31 | CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification | Qingyu Wang et.al. | 2509.00677 | null |
2025-08-31 | MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation | Aviral Chharia et.al. | 2509.00649 | null |
2025-08-30 | COMET: A Framework for Modeling Compound Operation Dataflows with Explicit Collectives | Shubham Negi et.al. | 2509.00599 | null |
2025-08-30 | SemaMIL: Semantic Reordering with Retrieval-Guided State Space Modeling for Whole Slide Image Classification | Lubin Gan et.al. | 2509.00442 | null |
2025-08-29 | Quantum-Optimized Selective State Space Model for Efficient Time Series Prediction | Stefan-Alexandru Jura et.al. | 2509.00259 | null |
(<a href=#updated-on-20251001>back to top</a>)
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-09-28 | Joint Superpixel and Self-Representation Learning for Scalable Hyperspectral Image Clustering | Xianlu Li et.al. | 2509.24027 | null |
2025-09-28 | Generalized Category Discovery in Hyperspectral Images via Prototype Subspace Modeling | Xianlu Li et.al. | 2509.24017 | null |
2025-09-20 | Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment | Abhiroop Chatterjee et.al. | 2509.22697 | null |
2025-09-25 | Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models | Juana Valeria Hurtado et.al. | 2509.20107 | null |
2025-09-21 | SwarmChat: An LLM-Based, Context-Aware Multimodal Interaction System for Robotic Swarms | Ettilla Mohiuddin Eumi et.al. | 2509.16920 | null |
2025-09-20 | Spectral Compressive Imaging via Chromaticity-Intensity Decomposition | Xiaodong Wang et.al. | 2509.16690 | null |
2025-09-16 | Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation | Hugo Carlesso et.al. | 2509.13229 | null |
2025-09-15 | Progressive Flow-inspired Unfolding for Spectral Compressive Imaging | Xiaodong Wang et.al. | 2509.12079 | null |
2025-09-19 | USCTNet: A deep unfolding nuclear-norm optimization solver for physically consistent HSI reconstruction | Xiaoyang Ma et.al. | 2509.10651 | null |
2025-09-12 | Nanosculpting lateral weak link junctions in superconducting Fe(Te,Se)/Bi2Te3 with focused Si++ ions and implications on vortex pinning | Debarghya Mallick et.al. | 2509.10606 | null |
2025-09-11 | CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution | Yulin Tong et.al. | 2509.09163 | null |
2025-09-22 | HyperTTA: Test-Time Adaptation for Hyperspectral Image Classification under Distribution Shifts | Xia Yue et.al. | 2509.08436 | null |
2025-09-09 | GW250114: testing Hawking’s area law and the Kerr nature of black holes | The LIGO Scientific Collaboration et.al. | 2509.08054 | null |
2025-09-15 | Directed searches for gravitational waves from ultralight vector boson clouds around merger remnant and galactic black holes during the first part of the fourth LIGO-Virgo-KAGRA observing run | The LIGO Scientific Collaboration et.al. | 2509.07352 | null |
2025-09-02 | Explainability-Driven Dimensionality Reduction for Hyperspectral Imaging | Salma Haidar et.al. | 2509.02340 | null |
2025-09-01 | FantasyHSI: Video-Generation-Centric 4D Human Synthesis In Any Scene through A Graph-based Multi-Agent Framework | Lingzhou Mu et.al. | 2509.01232 | null |
2025-08-31 | CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification | Qingyu Wang et.al. | 2509.00677 | null |
2025-08-30 | Iterative Low-rank Network for Hyperspectral Image Denoising | Jin Ye et.al. | 2509.00356 | null |
2025-08-28 | Upper Limits on the Isotropic Gravitational-Wave Background from the first part of LIGO, Virgo, and KAGRA’s fourth Observing Run | The LIGO Scientific Collaboration et.al. | 2508.20721 | null |
2025-08-27 | Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities | Imad Ali Shah et.al. | 2508.19905 | null |
2025-09-08 | GWTC-4.0: Updating the Gravitational-Wave Transient Catalog with Observations from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run | The LIGO Scientific Collaboration et.al. | 2508.18082 | null |
2025-09-03 | Open Data from LIGO, Virgo, and KAGRA through the First Part of the Fourth Observing Run | The LIGO Scientific Collaboration et.al. | 2508.18079 | null |
2025-08-25 | Few-shot Unknown Class Discovery of Hyperspectral Images with Prototype Learning and Clustering | Chun Liu et.al. | 2508.18075 | null |
2025-08-21 | Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising | Jin Ye et.al. | 2508.15553 | null |
2025-08-15 | Hyperspectral vs. RGB for Pedestrian Segmentation in Urban Driving Scenes: A Comparative Study | Jiarong Li et.al. | 2508.11301 | null |
2025-08-14 | CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving | Jiarong Li et.al. | 2508.10962 | null |
2025-08-13 | Probabilistic Emissivity Retrieval from Hyperspectral Data via Physics-Guided Variational Inference | Joshua R. Tempelman et.al. | 2508.08291 | null |
2025-08-11 | Hyperspectral Imaging | Danfeng Hong et.al. | 2508.08107 | null |
2025-08-11 | DETACH: Cross-domain Learning for Long-Horizon Tasks via Mixture of Disentangled Experts | Yutong Shen et.al. | 2508.07842 | null |
2025-08-09 | TerraMAE: Learning Spatial-Spectral Representations from Hyperspectral Earth Observation Data via Adaptive Masked Autoencoders | Tanjim Bin Faruk et.al. | 2508.07020 | null |
2025-08-05 | Low-rankness and Smoothness Meet Subspace: A Unified Tensor Regularization for Hyperspectral Image Super-resolution | Jun Zhang et.al. | 2508.03049 | null |
2025-08-02 | Hyperspectral Image Recovery Constrained by Multi-Granularity Non-Local Self-Similarity Priors | Zhuoran Peng et.al. | 2508.01435 | null |
2025-08-05 | Phase-Locked SNR Band Selection for Weak Mineral Signal Detection in Hyperspectral Imagery | Judy X Yang et.al. | 2508.00539 | null |
2025-08-01 | Honey Classification using Hyperspectral Imaging and Machine Learning | Mokhtar A. Al-Awadhi et.al. | 2508.00361 | null |
2025-07-31 | SAMSA: Segment Anything Model Enhanced with Spectral Angles for Hyperspectral Interactive Medical Image Segmentation | Alfie Roddan et.al. | 2507.23673 | null |
(<a href=#updated-on-20251001>back to top</a>)