🎓 Automatically Update CV Papers Daily using GitHub Actions (Updates Every 12 Hours)

agipro/cv-arxiv-daily

 
 

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2025.03.09

Table of Contents
  1. SLAM
  2. SFM
  3. Visual Localization
  4. Keypoint Detection
  5. Image Matching
  6. NeRF

SLAM

Publish Date Title Authors PDF Code
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235v1 null
2025-03-05 Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments Jie Deng et.al. 2503.03373v1 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661v1 null
2025-03-04 DnD Filter: Differentiable State Estimation for Dynamic Systems using Diffusion Models Ziyu Wan et.al. 2503.01274v2 null
2025-02-27 BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground Yufei Wei et.al. 2502.20078v1 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932v1 null
2025-02-26 Efficient and Distributed Large-Scale Point Cloud Bundle Adjustment via Majorization-Minimization Rundong Li et.al. 2502.18801v1 null
2025-02-23 Improving Monocular Visual-Inertial Initialization with Structureless Visual-Inertial Bundle Adjustment Junlin Song et.al. 2502.16598v1 null
2025-02-19 Active Illumination for Visual Ego-Motion Estimation in the Dark Francesco Crocetti et.al. 2502.13708v1 null
2025-02-19 pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM Luigi Freda et.al. 2502.11955v2 link
2025-03-05 Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions Dario Pisanti et.al. 2502.09795v2 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111v1 null
2025-02-13 PTZ-Calib: Robust Pan-Tilt-Zoom Camera Calibration Jinhui Guo et.al. 2502.09075v1 link
2025-02-12 LIR-LIVO: A Lightweight, Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features Shujie Zhou et.al. 2502.08676v1 link
2025-02-10 Building Rome with Convex Optimization Haoyu Han et.al. 2502.04640v2 null
2025-01-31 Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping Yiming Huang et.al. 2501.19319v1 link
2025-01-23 FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation Bingyang Zhou et.al. 2501.13876v1 null
2025-02-14 DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM Jesse Morris et.al. 2501.11893v2 link
2025-01-19 Tracking Mouse from Incomplete Body-Part Observations and Deep-Learned Deformable-Mouse Model Motion-Track Constraint for Behavior Analysis Olaf Hellwich et.al. 2501.11030v1 null
2025-01-15 SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM Yuhang Ming et.al. 2501.08880v1 null
2025-01-16 BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module Dongzhihan Wang et.al. 2501.08659v2 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286v1 null
2025-01-07 MAD-BA: 3D LiDAR Bundle Adjustment -- from Uncertainty Modelling to Structure Optimization Krzysztof Ćwian et.al. 2501.03972v1 null
2025-01-06 Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation Yuezhang Lv et.al. 2501.02821v1 null
2024-12-28 MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing Shuo Wang et.al. 2412.20082v1 null
2025-01-18 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry Zhaoxing Zhang et.al. 2412.16923v3 null
2024-12-18 Event-based Photometric Bundle Adjustment Shuang Guo et.al. 2412.14111v1 link
2024-12-18 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching Fernando Amodeo et.al. 2412.13639v1 link
2024-12-17 NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment Andrea Dunn Beltran et.al. 2412.13176v1 null
2024-12-16 Efficient LiDAR Bundle Adjustment for Multi-Scan Alignment Utilizing Continuous-Time Trajectories Louis Wiesmann et.al. 2412.11760v1 null
2024-12-19 RoMeO: Robust Metric Visual Odometry Junda Cheng et.al. 2412.11530v2 null
2024-12-12 eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction Jad Mansour et.al. 2412.09209v1 link
2024-12-08 GBR: Generative Bundle Refinement for High-fidelity Gaussian Splatting and Meshing Jianing Zhang et.al. 2412.05908v1 null
2024-12-04 BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement Miguel Arturo Vega Torres et.al. 2412.03434v1 link
2024-12-04 MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras Huai Yu et.al. 2412.03146v1 link
2024-12-04 An indoor DSO-based ceiling-vision odometry system for indoor industrial environments Abdelhak Bougouffa et.al. 2412.02950v1 null
2024-12-13 SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure Frames Yuxuan Zhou et.al. 2412.01500v2 link
2024-12-01 DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair Weihang Li et.al. 2412.00851v1 null
2024-11-29 Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction Shaoxiang Wang et.al. 2412.00242v1 null
2024-11-27 SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images Yanyan Li et.al. 2411.18072v1 null
2024-11-27 HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction Wei Zhang et.al. 2411.17982v1 null
2024-11-24 Bundle Adjusted Gaussian Avatars Deblurring Muyao Niu et.al. 2411.16758v1 null
2024-11-21 InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation Marziyeh Bamdad et.al. 2411.14358v1 link
2024-11-20 Robust Monocular Visual Odometry using Curriculum Learning Assaf Lahiany et.al. 2411.13438v1 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291v1 null
2024-11-15 BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation Yufei Wei et.al. 2411.10195v1 null
2024-11-24 Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments Ankit Shaw et.al. 2411.08231v2 null
2024-11-10 A novel algorithm for optimizing bundle adjustment in image sequence alignment Hailin Xu et.al. 2411.06343v1 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796v1 null
2024-11-13 DEIO: Deep Event Inertial Odometry Weipeng Guan et.al. 2411.03928v3 link
2024-11-08 GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting Jilan Mei et.al. 2411.03807v3 null
2024-10-30 LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM Yucheng Huang et.al. 2410.23231v1 link
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213v1 null
2024-10-09 Very High-Resolution Bridge Deformation Monitoring Using UAV-based Photogrammetry Mehdi Maboudi et.al. 2410.18984v1 null
2024-10-22 EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting Bohao Liao et.al. 2410.15392v2 null
2024-10-18 Graph Optimality-Aware Stochastic LiDAR Bundle Adjustment with Progressive Spatial Smoothing Jianping Li et.al. 2410.14565v1 null
2024-10-17 Hybrid bundle-adjusting 3D Gaussians for view consistent rendering with pose optimization Yanan Guo et.al. 2410.13280v1 link
2024-10-12 ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras Junkai Niu et.al. 2410.09374v1 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935v1 link
2024-10-18 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107v2 link
2024-10-02 SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment Xingyu Ji et.al. 2410.01618v1 null
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111v1 null
2024-09-26 Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot Justin Yu et.al. 2409.18108v1 null
2024-09-20 Learning Visual Information Utility with PIXER Yash Turkar et.al. 2409.13151v1 null
2024-09-18 Bundle Adjustment in the Eager Mode Zitong Zhan et.al. 2409.12190v1 null
2024-09-18 Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments Lei Cheng et.al. 2409.11854v1 null
2024-09-18 ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation Yanlin Jin et.al. 2409.11692v1 null
2024-09-17 LVBA: LiDAR-Visual Bundle Adjustment for RGB Point Cloud Mapping Rundong Li et.al. 2409.10868v1 null
2024-09-14 MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry Yuheng Qiu et.al. 2409.09479v1 null
2024-09-14 GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians Dasong Gao et.al. 2409.09295v1 link
2024-09-14 Panoramic Direct LiDAR-assisted Visual Odometry Zikang Yuan et.al. 2409.09287v1 link
2024-09-13 SLIM: Scalable and Lightweight LiDAR Mapping in Urban Environments Zehuan Yu et.al. 2409.08681v1 link
2024-09-11 Event-based Mosaicing Bundle Adjustment Shuang Guo et.al. 2409.07365v1 link
2024-09-23 Robust Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric Tingchen Ma et.al. 2409.01856v2 null
2024-09-02 Robust Vehicle Localization and Tracking in Rain using Street Maps Yu Xiang Tan et.al. 2409.01038v1 link
2024-09-05 EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System Bonan Liu et.al. 2409.00343v2 null
2024-08-30 Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning Shuyang Zhang et.al. 2408.17005v1 link
2024-08-29 Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry Michael Adlerstein et.al. 2408.16472v1 null
2024-08-28 Single-Photon 3D Imaging with Equi-Depth Photon Histograms Kaustubh Sadekar et.al. 2408.16150v1 null
2024-08-28 ES-PTAM: Event-based Stereo Parallel Tracking and Mapping Suman Ghosh et.al. 2408.15605v1 link
2024-08-28 FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2408.14035v2 link
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682v1 null
2024-08-20 TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks Jinjie Mai et.al. 2408.10739v1 null
2024-08-20 LoopSplat: Loop Closure by Registering 3D Gaussian Splats Liyuan Zhu et.al. 2408.10154v2 link
2024-08-10 RSL-BA: Rolling Shutter Line Bundle Adjustment Yongcong Zhang et.al. 2408.05409v1 null
2024-08-07 Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR Racquel Fygenson et.al. 2408.03503v1 link
2024-08-03 FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields Yifan Wu et.al. 2408.01878v1 null
2024-08-03 Deep Patch Visual SLAM Lahav Lipson et.al. 2408.01654v1 link
2024-07-25 CodedVO: Coded Visual Odometry Sachin Shah et.al. 2407.18240v1 null
2024-07-25 PGD-VIO: An Accurate Plane-Aided Visual-Inertial Odometry with Graph-Based Drift Suppression Yidi Zhang et.al. 2407.17709v1 null
2024-07-22 Reinforcement Learning Meets Visual Odometry Nico Messikommer et.al. 2407.15626v1 link
2024-07-21 Semi-Supervised Pipe Video Temporal Defect Interval Localization Zhu Huang et.al. 2407.15170v1 null
2024-07-18 Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain Bach Nguyen Gia et.al. 2407.13159v1 link
2024-07-17 Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge Andrea Albanese et.al. 2407.12663v1 null
2024-07-15 LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning Zhuozhu Jian et.al. 2407.10782v1 null
2024-07-06 Incremental Multiview Point Cloud Registration Xiaoya Cheng et.al. 2407.05021v1 link
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939v3 null
2024-07-01 Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation Lianjie Guo et.al. 2407.01292v1 link
2024-05-29 Rotation Averaging: A Primal-Dual Method and Closed-Forms in Cycle Graphs Gabriel Moreira et.al. 2406.18564v1 null
2024-07-25 Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy Chen Wang et.al. 2406.16087v3 null
2024-06-20 Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment Yunshan Qi et.al. 2406.14360v1 null
2024-06-16 Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry Boris Chidlovskii et.al. 2406.11019v1 null
2024-06-12 From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers Swaminathan Gurumurthy et.al. 2406.07785v1 link
2024-06-03 The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry Paolo Cudrano et.al. 2406.01797v1 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929v1 null
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614v1 null
2024-05-27 Adaptive VIO: Deep Visual-Inertial Odometry with Online Continual Learning Youqi Pan et.al. 2405.16754v1 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599v1 null
2024-06-20 Advancements in Translation Accuracy for Stereo Visual-Inertial Initialization Han Song et.al. 2405.15082v3 null
2024-06-08 EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving Boyi Liu et.al. 2405.12120v2 null
2024-05-13 SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling Yijun Yuan et.al. 2405.07847v1 null
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241v1 null
2024-05-09 Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment Simon Weber et.al. 2405.05079v2 link
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290v1 null
2024-05-07 IMU-Aided Event-based Stereo Visual Odometry Junkai Niu et.al. 2405.04071v1 link
2024-05-05 Blending Distributed NeRFs with Tri-stage Robust Pose Optimization Baijun Ye et.al. 2405.02880v1 null
2024-04-29 $\nu$-DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction Yunxuan Mao et.al. 2404.18439v1 null
2024-04-28 S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM Zhiyao Zhang et.al. 2404.18284v1 null
2024-04-27 An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation Olivier Brochu Dufour et.al. 2404.17745v1 null
2024-04-26 Camera Motion Estimation from RGB-D-Inertial Scene Flow Samuel Cerezo et.al. 2404.17251v1 null
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263v1 link
2024-04-23 FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent Cameron Smith et.al. 2404.15259v1 link
2024-04-22 RESFM: Robust Equivariant Multiview Structure from Motion Fadi Khatib et.al. 2404.14280v1 null
2024-04-23 CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory Yunlong Ran et.al. 2404.13896v2 null
2024-04-20 EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment Guanghao Li et.al. 2404.13346v1 link
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339v1 null
2024-04-17 SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping Vincent Cartillier et.al. 2404.11419v1 null
2024-04-17 VBR: A Vision Benchmark in Rome Leonardo Brizi et.al. 2404.11322v1 link
2024-04-14 Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration Yanhao Zhang et.al. 2404.09169v1 link
2024-04-09 Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes Tianchen Deng et.al. 2404.06050v1 null
2024-04-06 Salient Sparse Visual Odometry With Pose-Only Supervision Siyu Chen et.al. 2404.04677v1 null
2024-04-01 Visual-inertial state estimation based on Chebyshev polynomial optimization Hongyu Zhang et.al. 2404.01150v1 null
2024-04-01 BundledSLAM: An Accurate Visual SLAM System Using Multiple Cameras Han Song et.al. 2403.19886v2 null
2024-03-30 GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM Ganlin Zhang et.al. 2403.19549v2 link
2024-03-25 A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments Gianluca D'Amico et.al. 2403.17084v1 null
2024-03-20 DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping Yuxuan Zhou et.al. 2403.13714v1 link
2024-03-19 On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine Jagatpreet Singh Nir et.al. 2403.13170v1 null
2024-03-18 The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions Margaret Hansen et.al. 2403.12194v1 null
2024-03-19 BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting Lingzhe Zhao et.al. 2403.11831v2 link
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639v1 null
2024-03-17 Compact 3D Gaussian Splatting For Dense Visual SLAM Tianchen Deng et.al. 2403.11247v1 link
2024-03-16 Efficient Domain Adaptation for Endoscopic Visual Odometry Junyang Wu et.al. 2403.10860v1 null
2024-03-25 URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields Bo Xu et.al. 2403.10119v2 null
2024-03-14 Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) Matthew Lisondra et.al. 2403.09882v1 null
2024-03-12 CMax-SLAM: Event-based Rotational-Motion Bundle Adjustment and SLAM System using Contrast Maximization Shuang Guo et.al. 2403.08119v1 link
2024-03-12 SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM Siting Zhu et.al. 2403.07494v1 link
2024-03-12 Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints Weihan Wang et.al. 2403.07225v1 link
2024-03-10 PSS-BA: LiDAR Bundle Adjustment with Progressive Spatial Smoothing Jianping Li et.al. 2403.06124v1 null
2024-03-02 RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking Ray Zhang et.al. 2403.01254v1 link
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110v1 null
2024-02-27 Differentiable Biomechanics Unlocks Opportunities for Markerless Motion Capture R. James Cotton et.al. 2402.17192v1 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961v1 link
2024-02-22 Secure Navigation using Landmark-based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2402.14280v1 null
2024-02-26 VOOM: Robust Visual Object Odometry and Mapping using Hierarchical Landmarks Yutong Wang et.al. 2402.13609v2 link
2024-02-19 Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment Ganesh Sapkota et.al. 2402.12551v1 null
2024-02-07 Online and Certifiably Correct Visual Odometry and Mapping Devansh R Agrawal et.al. 2402.05254v1 null
2024-02-06 YOLOPoint Joint Keypoint and Object Detection Anton Backhaus et.al. 2402.03989v1 link
2024-02-11 BA-LINS: A Frame-to-Frame Bundle Adjustment for LiDAR-Inertial Navigation Hailiang Tang et.al. 2401.11491v2 null
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857v1 null
2024-01-17 Event-Based Visual Odometry on Non-Holonomic Ground Vehicles Wanting Xu et.al. 2401.09331v1 link
2024-01-11 On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering Feng Zhu et.al. 2401.05836v1 null
2023-12-19 Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry Olaya Álvarez-Tuñón et.al. 2401.05396v1 link
2024-01-07 Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people Ali Samadzadeh et.al. 2401.03604v1 link
2024-01-03 LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry Weirong Chen et.al. 2401.01887v1 null
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800v1 link
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471v1 null
2023-12-22 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332v2 null
2023-12-20 Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach Habib Boloorchi Tabrizi et.al. 2312.13162v1 link
2023-12-20 Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera Abdulkadhem A. Abdulkadhem et.al. 2312.12680v1 null
2023-12-15 PLGSLAM: Progressive Neural Scene Represenation with Local to Global Bundle Adjustment Tianchen Deng et.al. 2312.09866v1 null
2023-12-15 Deep Event Visual Odometry Simon Klenk et.al. 2312.09800v1 link
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889v1 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563v1 null
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141v1 link
2023-12-04 Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks Yan Xu et.al. 2312.01561v1 null
2023-11-30 Event-based Visual Inertial Velometer Xiuyuan Lu et.al. 2311.18189v1 null
2023-11-21 CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems Young-Hee Lee et.al. 2311.12580v1 null
2023-11-21 Implicit Event-RGBD Neural SLAM Delin Qu et.al. 2311.11013v2 null
2023-11-14 CP-SLAM: Collaborative Neural Point-based SLAM System Jiarui Hu et.al. 2311.08013v1 null
2023-11-10 Dense Visual Odometry Using Genetic Algorithm Slimane Djema et.al. 2311.06149v1 null
2023-11-07 Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM Seongwook Yoon et.al. 2311.03722v1 null
2023-11-02 Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images Hermes McGriff et.al. 2311.01292v1 link
2023-10-29 3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets Ta-Ying Cheng et.al. 2310.19188v1 null
2023-10-23 RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments Jinyu Li et.al. 2310.15072v1 link
2023-10-23 Converting Depth Images and Point Clouds for Feature-based Pose Estimation Robert Lösch et.al. 2310.14924v1 link
2023-10-20 PACE: Human and Camera Motion Estimation from in-the-wild Videos Muhammed Kocabas et.al. 2310.13768v1 null
2023-10-17 Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms Yanyan Li et.al. 2310.10931v1 link
2023-10-15 CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses Hongyu Fu et.al. 2310.09776v1 null
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082v1 null
2023-10-10 l-dyno: framework to learn consistent visual features using robot's motion Kartikeya Singh et.al. 2310.06249v1 link
2023-10-07 HI-SLAM: Monocular Real-time Dense Mapping with Hybrid Implicit Fields Wei Zhang et.al. 2310.04787v1 null
2023-10-05 USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields Moyang Li et.al. 2310.02687v2 link
2023-10-08 XVO: Generalized Visual Odometry via Cross-Modal Self-Training Lei Lai et.al. 2309.16772v3 null
2023-09-27 Handbook on Leveraging Lines for Two-View Relative Pose Estimation Petr Hruby et.al. 2309.16040v1 null
2023-09-27 BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields Shreya Saha et.al. 2309.15329v1 null
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268v2 link
2023-09-23 Tag-based Visual Odometry Estimation for Indoor UAVs Localization Massimiliano Bertoni et.al. 2309.13311v1 null
2023-09-22 Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms Olivier Gamache et.al. 2309.13139v1 link
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883v1 link
2023-09-20 Conformalized Multimodal Uncertainty Regression and Reasoning Domenico Parente et.al. 2309.11018v1 null
2023-09-20 OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving Heng Li et.al. 2309.11011v1 link
2023-09-19 PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation Luigi Freda et.al. 2309.10896v1 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436v1 link
2023-09-21 Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration Hongbo Zhao et.al. 2309.10314v2 null
2023-09-18 End-to-End Learned Event- and Image-based Visual Odometry Roberto Pellerito et.al. 2309.09947v1 link
2023-09-18 DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach Chenghao Xu et.al. 2309.09879v1 null
2023-09-17 a critical analysis of internal reliability for uncertainty quantification of dense image matching in multi-view stereo Debao Huang et.al. 2309.09379v1 null
2023-09-14 MC-NeRF: Muti-Camera Neural Radiance Fields for Muti-Camera Image Acquisition Systems Yu Gao et.al. 2309.07846v1 null
2023-09-14 An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments Yehao Liu et.al. 2309.07408v1 null
2023-09-11 Evaluating Visual Odometry Methods for Autonomous Driving in Rain Yu Xiang Tan et.al. 2309.05249v1 null
2023-09-11 SIM-Sync: From Certifiably Optimal Synchronization over the 3D Similarity Group to Scene Reconstruction with Learned Depth Xihang Yu et.al. 2309.05184v1 link
2023-09-08 Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147v1 null
2023-09-08 Depth Completion with Multiple Balanced Bases and Confidence for Dense Monocular SLAM Weijian Xie et.al. 2309.04145v1 null
2023-09-05 GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction Youmin Zhang et.al. 2309.02436v1 link
2023-09-04 EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity Zijie Jiang et.al. 2309.01296v1 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984v1 link
2023-08-28 R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras Aron Schmied et.al. 2308.14713v1 null
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039v1 null
2023-08-25 A Game of Bundle Adjustment -- Learning Efficient Convergence Amir Belder et.al. 2308.13270v1 null
2023-08-24 Joint Intrinsic and Extrinsic LiDAR-Camera Calibration in Targetless Environments Using Plane-Constrained Bundle Adjustment Liang Li et.al. 2308.12629v1 link
2023-08-19 Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters Xiao Liu et.al. 2308.09870v1 link
2023-08-24 MIPS-Fusion: Multi-Implicit-Submaps for Scalable and Robust Online Neural RGB-D Reconstruction Yijie Tang et.al. 2308.08741v2 null
2023-08-12 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion Guirong Zhuo et.al. 2308.06573v1 null
2023-08-10 Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU U. V. B. L. Udugama et.al. 2308.05515v1 null
2023-08-01 NR-SLAM: Non-Rigid Monocular SLAM Juan J. Gomez Rodriguez et.al. 2308.04036v1 null
2023-08-02 A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry Cora A. Dimmig et.al. 2308.01398v1 null
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125v1 null
2023-08-02 Preliminary Design of the Dragonfly Navigation Filter Ben Schilling et.al. 2307.13513v2 null
2023-07-19 Optimizing the extended Fourier Mellin Transformation Algorithm Wenqing Jiang et.al. 2307.10015v1 link
2023-08-13 Distributed bundle adjustment with block-based sparse matrix compression for super large scale datasets Maoteng Zheng et.al. 2307.08383v2 link
2023-07-15 Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents Ke Cao et.al. 2307.07763v1 null
2023-07-14 Multi-Session, Localization-oriented and Lightweight LiDAR Mapping Using Semantic Lines and Planes Zehuan Yu et.al. 2307.07126v1 null
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770v2 link
2023-06-08 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction Jiawei He et.al. 2306.05418v1 null
2023-06-09 BAA-NGP: Bundle-Adjusting Accelerated Neural Graphics Primitives Sainan Liu et.al. 2306.04166v2 link
2023-07-26 Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression Jianeng Wang et.al. 2306.01188v2 null
2023-06-14 BAMF-SLAM: Bundle Adjusted Multi-Fisheye Visual-Inertial SLAM Using Recurrent Field Transforms Wei Zhang et.al. 2306.01173v2 null
2023-07-06 OSPC: Online Sequential Photometric Calibration Jawad Haidar et.al. 2305.17673v2 null
2023-05-20 DAC: Detector-Agnostic Spatial Covariances for Deep Local Features Javier Tirado-Garín et.al. 2305.12250v1 link
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036v1 link
2023-05-15 Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface Shifan Zhu et.al. 2305.08962v1 null
2023-05-15 Decentralization and Acceleration Enables Large-Scale Bundle Adjustment Taosha Fan et.al. 2305.07026v2 link
2023-05-10 Transformer-based model for monocular visual odometry: a video understanding approach André O. Françani et.al. 2305.06121v1 link
2023-04-29 Modality-invariant Visual Odometry for Embodied Vision Marius Memmel et.al. 2305.00348v1 link
2023-04-29 An Efficient Plane Extraction Approach for Bundle Adjustment on LiDAR Point clouds Zheng Liu et.al. 2305.00287v1 null
2023-04-27 Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM Hengyi Wang et.al. 2304.14377v1 link
2023-04-23 IDLL: Inverse Depth Line based Visual Localization in Challenging Environments Wanting Li et.al. 2304.11748v1 null
2023-04-21 FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving Yuxuan Liu et.al. 2304.10719v1 null
2023-04-18 Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Motion Compensation Hanyu Cai et.al. 2304.08978v1 null
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194v1 link
2023-04-12 SGL: Structure Guidance Learning for Camera Localization Xudong Zhang et.al. 2304.05571v1 null
2023-04-14 Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency Xingwu Ji et.al. 2304.05146v2 link
2023-04-11 Pointless Global Bundle Adjustment With Relative Motions Hessians Ewelina Rupnik et.al. 2304.05118v1 link
2023-04-11 ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster Yifei Dong et.al. 2304.04943v1 null
2023-04-04 Distributed Block Coordinate Moving Horizon Estimation for 2D Visual-Inertial-Odometry SLAM Emilien Flayac et.al. 2304.01613v1 null
2023-03-31 LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses Noah Stier et.al. 2304.00054v1 link
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504v1 link
2023-03-29 Photometric LiDAR and RGB-D Bundle Adjustment Luca Di Giammarino et.al. 2303.16878v1 link
2023-03-27 3D Video Object Detection with Learnable Object-Centric Global Optimization Jiawei He et.al. 2303.15416v1 link
2023-03-25 DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields Yu Chen et.al. 2303.14478v1 null
2023-03-23 RGB-D-Inertial SLAM in Indoor Dynamic Environments with Long-term Large Occlusion Ran Long et.al. 2303.13316v1 null
2023-03-21 Learning a Depth Covariance Function Eric Dexheimer et.al. 2303.12157v1 null
2023-03-21 Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network Alessandro Navone et.al. 2303.11725v1 null
2023-03-20 VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors Thien Hoang Nguyen et.al. 2303.10903v1 null
2023-03-17 CoVIO: Online Continual Learning for Visual-Inertial Odometry Niclas Vödisch et.al. 2303.10149v1 link
2023-03-15 UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry Chaoyang Jiang et.al. 2303.08550v1 null
2023-03-13 Discovering Multiple Algorithm Configurations Leonid Keselman et.al. 2303.07434v1 null
2023-03-09 Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation Masahiro Hirano et.al. 2303.05192v1 null
2023-03-16 Stereo Event-based Visual-Inertial Odometry Kunfeng Wang et.al. 2303.05086v2 link
2023-03-07 Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor Eduardo Gallo et.al. 2303.03804v1 null
2023-03-03 Lightweight, Uncertainty-Aware Conformalized Visual Odometry Alex C. Stutts et.al. 2303.02207v1 null
2023-02-28 LIW-OAM: Lidar-Inertial-Wheel Odometry and Mapping Zikang Yuan et.al. 2302.14298v1 link
2023-02-24 FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets Yelena Randall et.al. 2302.12772v1 null
2023-02-27 CP+: Camera Poses Augmentation with Large-scale LiDAR Maps Jiadi Cui et.al. 2302.12198v2 null
2023-02-19 EdgeVO: An Efficient and Accurate Edge-based Visual Odometry Hui Zhao et.al. 2302.09493v1 null
2023-02-12 Uncertainty-Driven Dense Two-View Structure from Motion Weirong Chen et.al. 2302.00523v2 null
2023-01-31 Design and Implementation of A Soccer Ball Detection System with Multiple Cameras Lei Li et.al. 2302.00123v1 null
2023-01-27 HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera Mostafa Ahmadi et.al. 2301.11823v1 null
2023-01-26 Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial Ola Shorinwa et.al. 2301.11313v1 null
2023-01-24 Generalized Object Search Kaiyu Zheng et.al. 2301.10121v1 null
2023-01-22 Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories Hanlin Chen et.al. 2301.09194v1 null
2023-01-21 Dense RGB SLAM with Neural Implicit Maps Heng Li et.al. 2301.08930v1 null
2023-01-18 Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information Junshi Chen et.al. 2301.07560v1 null
2023-01-17 COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM Manthan Patel et.al. 2301.07147v1 link
2023-01-31 Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems Pierre-Yves Lajoie et.al. 2301.06230v2 link
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604v1 null
2023-01-11 AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization Ying Chen et.al. 2301.04620v1 link
2023-01-12 TBV Radar SLAM -- trust but verify loop candidates Daniel Adolfsson et.al. 2301.04397v2 link
2022-12-31 Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges Maxwell McManus et.al. 2301.03359v1 null
2023-01-09 Motion Addition and Motion Optimization Liqun Qi et.al. 2301.03174v1 null
2023-01-08 Towards Open World NeRF-Based SLAM Daniil Lisus et.al. 2301.03102v1 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403v1 null
2023-01-03 LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation Shreyansh Daftry et.al. 2301.01350v1 null
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147v1 null
2023-01-03 BS3D: Building-scale 3D Reconstruction from RGB-D Images Janne Mustaniemi et.al. 2301.01057v1 null
2023-01-10 An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping Masoud Dayani Najafabadi et.al. 2301.00618v2 link
2022-12-25 A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion Nadia Figueroa et.al. 2212.14772v1 null
2022-12-29 An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping Kangcheng Liu et.al. 2212.14209v1 link
2022-12-27 Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands Felipe Gómez-Cuba et.al. 2212.13477v1 link
2022-12-26 ESVIO: Event-based Stereo Visual Inertial Odometry Peiyu Chen et.al. 2212.13184v1 link
2022-12-24 A Comprehensive Review on Autonomous Navigation Saeid Nahavandi et.al. 2212.12808v1 null
2022-12-23 Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation Marina Lotti et.al. 2212.12388v1 null
2022-12-23 Implementation of a Blind navigation method in outdoors/indoors areas Mohammad Javadian Farzaneh et.al. 2212.12185v1 null
2022-12-22 S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations Hriday Bavle et.al. 2212.11770v1 link
2022-12-22 Active SLAM: A Review On Last Decade Muhammad Farhan Ahmed et.al. 2212.11654v1 null
2022-12-27 Motion, Unit Dual Quaternion and Motion Optimization Liqun Qi et.al. 2212.11593v2 null
2022-12-22 Vision-Based Environmental Perception for Autonomous Driving Fei Liu et.al. 2212.11453v1 null
2022-12-19 Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models Yong Cheng et.al. 2212.09553v1 null
2022-12-16 Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments Lasitha Weerakoon et.al. 2212.08633v1 null
2022-12-16 rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments Bo Wei et.al. 2212.08418v1 null
2022-12-15 AirVO: An Illumination-Robust Point-Line Visual Odometry Kuan Xu et.al. 2212.07595v1 link
2022-12-14 Autonomous Vehicle Navigation with LIDAR using Path Planning Rahul M K et.al. 2212.07155v1 null
2022-12-14 RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping Hyowon Kim et.al. 2212.07141v1 null
2022-12-13 Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) Daniil Lisus et.al. 2212.06923v1 null
2022-12-13 SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance Chenyangguang Zhang et.al. 2212.06524v1 null
2022-12-13 Localization and Navigation System for Indoor Mobile Robot Yanbaihui Liu et.al. 2212.06391v1 null
2022-12-12 Evaluation of RGB-D SLAM in Large Indoor Environments Kirill Muravyev et.al. 2212.05980v1 null
2022-12-19 A Light-Weight LiDAR-Inertial SLAM System with Loop Closing Kangcheng Liu et.al. 2212.05743v2 link
2022-12-12 An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds Kangcheng Liu et.al. 2212.05705v1 link
2022-12-09 SLAM for Visually Impaired People: A Survey Marziyeh Bamdad et.al. 2212.04745v1 null
2022-12-09 Ego-Body Pose Estimation via Ego-Head Pose Estimation Jiaman Li et.al. 2212.04636v1 null
2022-12-06 Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles Sushant Veer et.al. 2212.03323v1 link
2022-12-06 PRISM: Probabilistic Real-Time Inference in Spatial World Models Atanas Mirchev et.al. 2212.02988v1 null
2022-12-06 RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps Florian Sauerbeck et.al. 2212.02085v2 link
2022-12-05 DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization Xuebo Tian et.al. 2212.02077v1 null
2022-12-05 ObjectMatch: Robust Registration using Canonical Object Correspondences Can Gümeli et.al. 2212.01985v1 null
2022-12-02 Sparse SPN: Depth Completion from Sparse Keypoints Yuqun Wu et.al. 2212.00987v1 null
2022-12-01 maplab 2.0 -- A Modular and Multi-Modal Mapping Framework Andrei Cramariuc et.al. 2212.00654v1 link
2022-12-01 AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments Mehregan Dor et.al. 2212.00350v1 null
2022-11-30 MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves Pranjali Pathre et.al. 2211.16882v1 null
2022-11-29 PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images Hartmut Surmann et.al. 2211.16266v1 link
2022-11-29 MmWave Mapping and SLAM for 5G and Beyond Yu Ge et.al. 2211.16024v1 null
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127v1 null
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731v2 null
2022-11-27 Development of a Modular Real-time Shared-control System for a Smart Wheelchair Vaishanth Ramaraj et.al. 2211.14711v1 null
2022-11-26 A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors Jerred Chen et.al. 2211.14432v1 link
2022-11-23 ActiveRMAP: Radiance Field for Active Mapping And Planning Huangying Zhan et.al. 2211.12656v1 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988v1 null
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836v1 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704v1 null
2022-11-24 Data Fusion for Multipath-Based SLAM: Combining Information from Multiple Propagation Paths Erik Leitinger et.al. 2211.09241v2 null
2022-11-16 Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery Hao Qu et.al. 2211.08904v1 null
2022-11-20 Detecting Line Segments in Motion-blurred Images with Events Huai Yu et.al. 2211.07365v2 link
2022-11-13 Automatic Eye-in-Hand Calibration using EKF Aditya Ramakrishnan et.al. 2211.06881v1 null
2022-11-12 Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling Zhihao Wang et.al. 2211.06557v1 link
2022-11-11 Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications Jie Yang et.al. 2211.05982v1 null
2022-11-10 Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time Ignacio Torroba et.al. 2211.05601v1 link
2022-11-07 When Geometry is not Enough: Using Reflector Markers in Lidar SLAM Gerhard Kurz et.al. 2211.03484v1 null
2022-11-07 Detecting Invalid Map Merges in Lifelong SLAM Matthias Holoch et.al. 2211.03423v1 null
2022-11-06 Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU Yibin Wu et.al. 2211.03174v1 link
2022-11-07 Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments Daniel Adolfsson et.al. 2211.02445v2 link
2022-11-03 DyOb-SLAM : Dynamic Object Tracking SLAM System Rushmian Annoy Wadud et.al. 2211.01941v1 null
2022-11-03 Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM Yang Chen et.al. 2211.01749v1 null
2022-11-04 $D^2$SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm Hao Xu et.al. 2211.01538v2 link
2022-11-02 Semantic SuperPoint: A Deep Semantic Descriptor Gabriel S. Gama et.al. 2211.01098v1 link
2022-11-02 Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation Myung-Hwan Jeon et.al. 2211.00960v1 link
2022-10-31 Mapping Extended Landmarks for Radar SLAM Shuai Sun et.al. 2210.17207v1 null
2022-10-25 MAROAM: Map-based Radar SLAM through Two-step Feature Selection Dequan Wang et.al. 2210.13797v1 null
2022-10-25 S3E: A Large-scale Multimodal Dataset for Collaborative SLAM Dapeng Feng et.al. 2210.13723v1 link
2022-10-24 NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields Antoni Rosinol et.al. 2210.13641v1 link
2022-10-24 Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging Geng Wang et.al. 2210.13556v1 null
2022-10-28 VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points Andreas Georgis et.al. 2210.12756v2 null
2022-10-22 SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation Junliang Chen et.al. 2210.12417v1 null
2022-10-21 DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm Shipeng Zhong et.al. 2210.11978v1 link
2022-10-21 Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments Shubham Kedia et.al. 2210.11652v1 null
2022-10-22 Visual SLAM: What are the Current Trends and What to Expect? Ali Tourani et.al. 2210.10491v2 null
2022-10-18 Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM Geon Choi et.al. 2210.09636v1 null
2022-10-16 D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments Ayman Beghdadi et.al. 2210.08647v1 null
2022-10-16 Indoor Smartphone SLAM with Learned Echoic Location Features Wenjie Luo et.al. 2210.08493v1 null
2022-10-15 Self-Improving SLAM in Dynamic Environments: Learning When to Mask Adrian Bojko et.al. 2210.08350v1 link
2022-10-13 Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems Pushyami Kaveti et.al. 2210.07315v1 link
2022-10-12 RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map Xuecheng Xu et.al. 2210.05984v1 link
2022-10-11 Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization Yuanzheng He et.al. 2210.05600v1 null
2022-10-11 Autonomous Asteroid Characterization Through Nanosatellite Swarming Kaitlin Dennison et.al. 2210.05518v1 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517v1 null
2022-10-11 Multi-Object Navigation with dynamically learned neural implicit representations Pierre Marza et.al. 2210.05129v1 link
2022-10-12 Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation Yulun Tian et.al. 2210.05020v2 null
2022-10-10 Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios Xingyu Chen et.al. 2210.04562v1 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236v1 null
2022-10-06 SCORE: A Second-Order Conic Initialization for Range-Aided SLAM Alan Papalia et.al. 2210.03177v1 link
2022-10-06 Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding Kirill Mazur et.al. 2210.03043v1 null
2022-10-06 Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence Osian Morgan et.al. 2210.02642v1 null
2022-10-05 MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation Hanwei Zhang et.al. 2210.02038v1 null
2022-10-04 O2S: Open-source open shuttle Nwankwo Linus et.al. 2210.01627v1 null
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320v1 null
2022-10-03 Probabilistic Volumetric Fusion for Dense Monocular SLAM Antoni Rosinol et.al. 2210.01276v1 null
2022-10-03 DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams John McConnell et.al. 2210.00867v1 link
2022-10-03 A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments Ha Sier et.al. 2210.00812v1 link
2022-10-01 Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 Ali Eslamian et.al. 2210.00278v1 null
2022-09-30 PyPose: A Library for Robot Learning with Physics-based Optimization Chen Wang et.al. 2209.15428v1 link
2022-09-29 DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment Mariia Gladkova et.al. 2209.14965v1 null
2022-09-28 Robust Incremental Smoothing and Mapping (riSAM) Daniel McGann et.al. 2209.14359v1 null
2022-09-27 Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping Chi-Ming Chung et.al. 2209.13274v1 link
2022-09-24 Graph Neural Networks for Multi-Robot Active Information Acquisition Mariliza Tzes et.al. 2209.12091v1 null
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894v1 null
2022-09-23 involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs Gilad Rotman et.al. 2209.11591v1 null
2022-09-23 Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot David Balaban et.al. 2209.11432v1 null
2022-09-22 SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation Xiao Han et.al. 2209.10817v1 null
2022-09-22 Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio Wenhao Qiu et.al. 2209.10726v1 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710v1 null
2022-09-20 Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM Sabir Hossain et.al. 2209.10047v1 null
2022-09-20 WGICP: Differentiable Weighted GICP-Based Lidar Odometry Sanghyun Son et.al. 2209.09777v1 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699v1 link
2022-09-19 MeSLAM: Memory Efficient SLAM based on Neural Fields Evgenii Kruzhkov et.al. 2209.09357v1 null
2022-09-19 LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM Letian Zhang et.al. 2209.08810v1 null
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608v1 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578v1 link
2022-09-17 DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments Shihao Shen et.al. 2209.08430v1 link
2022-09-17 OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM Matthieu Zins et.al. 2209.08338v1 null
2022-09-17 PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments Adam Dai et.al. 2209.08248v1 link
2022-09-16 ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM Aditya Arun et.al. 2209.08091v1 null
2022-09-16 iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking Yuhang Ming et.al. 2209.07919v1 null
2022-09-16 TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM Mathieu Gonzalez et.al. 2209.07888v1 null
2022-09-15 Landmark Management in the Application of Radar SLAM Shuai Sun et.al. 2209.07199v1 link
2022-09-15 PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization Xianwei Meng et.al. 2209.07061v1 null
2022-09-14 Semantic Visual Simultaneous Localization and Mapping: A Survey Kaiqi Chen et.al. 2209.06428v1 null
2022-09-13 Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets Islam Ali et.al. 2209.06316v1 null
2022-09-12 A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding Tin Lai et.al. 2209.05222v1 null
2022-09-12 Attitude-Guided Loop Closure for Cameras with Negative Plane Ze Wang et.al. 2209.05167v1 link
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497v1 link
2022-09-08 ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology Julio A. Placed et.al. 2209.03693v1 link
2022-09-08 R$^3$LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator Jiarong Lin et.al. 2209.03666v1 link
2022-09-06 Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection Brendon Forsgren et.al. 2209.02658v1 link
2022-09-05 Neuromorphic Visual Odometry with Resonator Networks Alpha Renner et.al. 2209.02000v1 null
2022-09-05 MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM Pavel Karpyshev et.al. 2209.01936v1 null
2022-09-05 ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics Boyi Liu et.al. 2209.01774v1 null
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605v1 null
2022-08-31 PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM Yifan Duan et.al. 2208.14848v1 null
2022-08-30 BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition Peng Yin et.al. 2208.14543v1 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997v1 null
2022-08-25 FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms Jianhao Jiao et.al. 2208.11865v1 null
2022-08-25 Lidar SLAM for Autonomous Driving Vehicles Farhad Aghili et.al. 2208.11855v1 null
2022-08-24 DynaVINS: A Visual-Inertial SLAM for Dynamic Environments Seungwon Song et.al. 2208.11500v1 link
2022-08-22 Doppler Exploitation in Bistatic mmWave Radio SLAM Yu Ge et.al. 2208.10204v1 null
2022-08-21 Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping Lintong Zhang et.al. 2208.09825v1 link
2022-08-26 JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario Longrui Dong et.al. 2208.09777v2 null
2022-08-15 BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM Yunge Cui et.al. 2208.07473v1 link
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325v1 null
2022-08-11 RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild Jason Y. Zhang et.al. 2208.05963v1 null
2022-08-08 Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation Yifei Ren et.al. 2208.04274v1 link
2022-08-08 SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty Shuai Zhang et.al. 2208.03945v1 link
2022-08-05 A Survey on Visual Map Localization Using LiDARs and Cameras Elhousni Mahdi et.al. 2208.03376v1 null
2022-08-04 SROS2: Usable Cyber Security Tools for ROS 2 Victor Mayoral Vilches et.al. 2208.02615v1 link
2022-08-03 Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms Bharath Garigipati et.al. 2208.02063v1 null
2022-08-02 Present and Future of SLAM in Extreme Underground Environments Kamak Ebadi et.al. 2208.01787v1 null
2022-08-01 Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion Simon Boche et.al. 2208.00709v1 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455v1 link
2022-07-25 DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions Tristan Laidlow et.al. 2207.12244v1 null
2022-07-25 Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration Kenji Koide et.al. 2207.11942v1 null
2022-07-22 NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction Yunlong Ran et.al. 2207.10985v1 null
2022-07-22 Dense RGB-D-Inertial SLAM with Map Deformations Tristan Laidlow et.al. 2207.10940v1 null
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916v1 null
2022-07-21 Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion Suman Ghosh et.al. 2207.10494v1 link
2022-07-21 Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions Quentin Serdel et.al. 2207.10489v1 link
2022-07-21 On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity Yujin Lu et.al. 2207.10413v1 null
2022-07-19 Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM Tuvy Lemberg et.al. 2207.09103v1 null
2022-07-18 DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM Weicai Ye et.al. 2207.08794v1 link
2022-07-18 Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction Marco Orsingher et.al. 2207.08439v1 null
2022-07-18 ORB-based SLAM accelerator on SoC FPGA Vibhakar Vemulapati et.al. 2207.08405v1 null
2022-07-14 Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset Riccardo Giubilato et.al. 2207.06815v1 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738v1 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732v1 null
2022-07-13 SLAM: SLO-Aware Memory Optimization for Serverless Applications Gor Safaryan et.al. 2207.06183v1 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058v2 link
2022-07-12 Accelerating Certifiable Estimation with Preconditioned Eigensolvers David M. Rosen et.al. 2207.05257v1 null
2022-07-12 Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features Meiyu Zhi et.al. 2207.05244v1 null
2022-07-14 SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial Chih-Yuan Chiu et.al. 2207.05043v2 null
2022-07-08 BlindSpotNet: Seeing Where We Cannot See Taichi Fukuda et.al. 2207.03870v1 null
2022-07-08 Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints Philipp Glira et.al. 2207.03785v1 null
2022-07-08 Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements Ran Liu et.al. 2207.03700v1 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539v1 null
2022-07-06 VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization Marius Laska et.al. 2207.02668v1 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396v1 null
2022-07-04 VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM Ling Gao et.al. 2207.01404v1 null
2022-07-04 VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM Danpeng Chen et.al. 2207.01158v1 null
2022-07-03 Wireless Channel Prediction in Partially Observed Environments Mingsheng Yin et.al. 2207.00934v1 null
2022-07-01 A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers Julio A. Placed et.al. 2207.00254v1 null
2022-07-01 Keeping Less is More: Point Sparsification for Visual SLAM Yeonsoo Park et.al. 2207.00225v1 null
2022-06-30 Controlled and impulsive compression of an entrapped air bubble during impact Utkarsh Jain et.al. 2206.15297v1 null
2022-06-30 Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery Yuehao Wang et.al. 2206.15255v1 link
2022-06-27 IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments Abanob Soliman et.al. 2206.13455v1 link
2022-06-26 An Efficient Global Optimality Certificate for Landmark-Based SLAM Connor Holmes et.al. 2206.12961v1 link
2022-06-21 Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping Davide Tateo et.al. 2206.10263v1 link
2022-06-20 Data Fusion for Radio Frequency SLAM with Robust Sampling Erik Leitinger et.al. 2206.09746v1 null
2022-06-19 RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments Chenglong Qian et.al. 2206.09463v1 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733v1 null
2022-06-17 An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions Yijun Yuan et.al. 2206.08712v1 link
2022-06-13 ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy Hao Bai et.al. 2206.06435v1 null
2022-06-10 Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming Javier Cremona et.al. 2206.05066v1 link
2022-06-09 SparseFormer: Attention-based Depth Completion Network Frederik Warburg et.al. 2206.04557v1 null
2022-06-07 Robot Self-Calibration Using Actuated 3D Sensors Arne Peters et.al. 2206.03430v1 null
2022-06-07 Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map Haodong Yuan et.al. 2206.03062v1 null
2022-06-05 DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions Alena Savinykh et.al. 2206.02199v1 null
2022-06-04 C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy Erez Posner et.al. 2206.01961v1 null
2022-06-01 PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry Dong-Uk Seo et.al. 2206.00266v1 link
2022-05-27 A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching Arno Solin et.al. 2205.13821v1 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135v2 link
2022-05-25 Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM Milad Ramezani et.al. 2205.12595v1 null
2022-05-24 Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM Christopher E. Denniston et.al. 2205.12402v1 link
2022-05-22 ALITA: A Large-scale Incremental Dataset for Long-term Autonomy Peng Yin et.al. 2205.10737v1 link
2022-05-19 FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 Jeffrey Ichnowski et.al. 2205.09778v1 link
2022-05-17 Global Data Association for SLAM with 3D Grassmannian Manifold Objects Parker C. Lusk et.al. 2205.08556v1 null
2022-05-19 Cluster on Wheels Yuanyuan Yang et.al. 2205.08151v2 null
2022-05-12 Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry Shihao Shen et.al. 2205.05916v1 link
2022-05-12 S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization Ran Cheng et.al. 2205.05861v1 null
2022-05-14 Multi-modal Semantic SLAM for Complex Dynamic Environments Han Wang et.al. 2205.04300v2 link
2022-05-06 OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations Carmen Delgado et.al. 2205.03256v1 null
2022-05-05 CNN-Augmented Visual-Inertial SLAM with Planar Constraints Pan Ji et.al. 2205.02940v1 null
2022-05-05 PMBM-based SLAM Filters in 5G mmWave Vehicular Networks Hyowon Kim et.al. 2205.02502v1 null
2022-05-04 BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking Dorian Henning et.al. 2205.02301v1 null
2022-05-04 A Global Asymptotic Convergent Observer for SLAM Seyed Hamed Hashemi et.al. 2205.01953v1 null
2022-05-04 Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation Nathaniel Merrill et.al. 2205.01823v1 link
2022-05-03 GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping Pan Ji et.al. 2205.01656v1 null
2022-04-29 Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM Jinwoo Jeon et.al. 2204.13877v1 link
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831v1 null
2022-04-27 Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment Wenyu Li et.al. 2204.12769v1 null
2022-04-29 MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment Tingchen Ma et.al. 2204.11621v2 null
2022-04-23 Indoor simultaneous localization and mapping based on fringe projection profilometry Yang Zhao et.al. 2204.11020v1 null
2022-04-22 Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria Julio A. Placed et.al. 2204.10631v1 null
2022-04-22 Fast Autonomous Robotic Exploration Using the Underlying Graph Structure Julio A. Placed et.al. 2204.10610v1 null
2022-04-22 Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions Yutong Hu et.al. 2204.10552v1 null
2022-04-22 Implicit Object Mapping With Noisy Data Jad Abou-Chakra et.al. 2204.10516v1 link
2022-04-19 Photometric single-view dense 3D reconstruction in endoscopy Victor M. Batlle et.al. 2204.09083v1 null
2022-04-18 Pulsar skips: Understanding variations in the regular periods of rotating neutron stars Clayton Miller et.al. 2204.08449v1 null
2022-04-18 Tracking monocular camera pose and deformation for SLAM inside the human body Juan J. Gomez Rodriguez et.al. 2204.08309v1 null
2022-04-18 Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker Hanjing Ye et.al. 2204.08163v1 null
2022-04-14 ViViD++: Vision for Visibility Dataset Alex Junho Lee et.al. 2204.06183v2 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481v1 null
2022-04-12 RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room Cong Gao et.al. 2204.05467v1 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932v1 link
2022-04-04 Monitoring social distancing with single image depth estimation Alessio Mingozzi et.al. 2204.01693v1 null
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524v1 null
2022-04-04 IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers Lei Sun et.al. 2204.01324v1 link
2022-04-03 Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor Wenyan Ou et.al. 2204.01154v1 null
2022-04-02 UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps Ayyappa Swamy Thatavarthy et.al. 2204.00865v1 link
2022-03-31 Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects Yujie Lu et.al. 2204.00035v1 null
2022-03-30 GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios Chih-Yuan Chiu et.al. 2203.16690v1 null
2022-03-29 Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field Mostafa Osman et.al. 2203.15866v1 null
2022-03-29 Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform Mingjun Li et.al. 2203.15439v1 null
2022-03-29 Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots Pranay Mathur et.al. 2203.15272v1 null
2022-03-28 Are High-Resolution Event Cameras Really Needed? Daniel Gehrig et.al. 2203.14672v1 null
2022-03-25 Spectral Measurement Sparsification for Pose-Graph SLAM Kevin J. Doherty et.al. 2203.13897v1 link
2022-03-25 FD-SLAM: 3-D Reconstruction Using Features and Dense Matching Xingrui Yang et.al. 2203.13861v1 null
2022-03-25 Gravity-constrained point cloud registration Vladimír Kubelka et.al. 2203.13799v1 null
2022-03-24 MD-SLAM: Multi-cue Direct SLAM Luca Di Giammarino et.al. 2203.13237v1 link
2022-03-24 Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video Shun Taguchi et.al. 2203.12804v1 null
2022-03-19 Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems Jie Yang et.al. 2203.10267v1 null
2022-03-16 Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR Ian D. Miller et.al. 2203.08925v1 link
2022-03-15 Neural RF SLAM for unsupervised positioning and mapping with channel state information Shreya Kadambi et.al. 2203.08264v1 null
2022-03-15 Simultaneous Localisation and Mapping with Quadric Surfaces Tristan Laidlow et.al. 2203.08040v1 null
2022-03-14 Drift Reduced Navigation with Deep Explainable Features Mohd Omama et.al. 2203.06897v1 link
2022-03-11 An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs Keisuke Sugiura et.al. 2203.05763v1 null
2022-03-10 High Definition, Inexpensive, Underwater Mapping Bharat Joshi et.al. 2203.05640v1 link
2022-03-10 SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning Jaehoon Choi et.al. 2203.05332v1 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446v1 link
2022-03-08 SLAM-Supported Self-Training for 6D Object Pose Estimation Ziqi Lu et.al. 2203.04424v1 link
2022-03-08 An Online Semantic Mapping System for Extending and Enhancing Visual SLAM Thorsten Hempel et.al. 2203.03944v1 null
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454v1 link
2022-03-07 OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition Junyi Ma et.al. 2203.03397v1 link
2022-03-06 Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM Kazushi Aiba et.al. 2203.02887v1 null
2022-03-06 RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects Ran Long et.al. 2203.02882v1 null
2022-03-03 STUN: Self-Teaching Uncertainty Estimation for Place Recognition Kaiwen Cai et.al. 2203.01851v1 link
2022-03-03 Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning Niclas Vödisch et.al. 2203.01578v1 link
2022-03-02 FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2203.00893v1 link
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851v1 null
2022-03-01 Descriptellation: Deep Learned Constellation Descriptors for SLAM Chunwei Xing et.al. 2203.00567v1 null
2022-03-01 Collaborative Robot Mapping using Spectral Graph Analysis Lukas Bernreiter et.al. 2203.00308v1 null
2022-02-26 RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization Nikolaos Kourtzanidis et.al. 2202.13221v1 link
2022-02-25 Probabilistic Data Association for Semantic SLAM at Scale Elad Michael et.al. 2202.12802v1 link
2022-02-24 TwistSLAM: Constrained SLAM in Dynamic Environment Mathieu Gonzalez et.al. 2202.12384v1 null
2022-02-24 Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion Hyeonsoo Jang et.al. 2202.12108v1 null
2022-02-23 MITI: SLAM Benchmark for Laparoscopic Surgery Regine Hartwig et.al. 2202.11496v1 null
2022-02-23 DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization Xuebo Tian et.al. 2202.11431v1 null
2022-02-23 Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets Islam Ali et.al. 2202.11312v1 null
2022-02-22 SAGE: SLAM with Appearance and Geometry Prior for Endoscopy Xingtong Liu et.al. 2202.09487v2 link
2022-02-18 OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure Stefan Leutenegger et.al. 2202.09199v1 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146v1 link
2022-02-18 An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems Qiang Liu et.al. 2202.08952v1 null
2022-02-17 Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study Giovanni Cioffi et.al. 2202.08894v1 link
2022-02-17 LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building Jiashi Zhang et.al. 2202.08487v1 null
2022-02-16 Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments Jinkun Wang et.al. 2202.08359v1 null
2022-02-11 Overhead Image Factors for Underwater Sonar-based SLAM John McConnell et.al. 2202.05811v1 null
2022-02-10 Scale Estimation with Dual Quadrics for Monocular Object SLAM Shuangfu Song et.al. 2202.04816v1 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677v1 null
2022-01-25 Autonomous Vehicles: Open-Source Technologies, Considerations, and Development Oussama Saoudi et.al. 2202.03148v1 null
2022-02-07 Temporal Point Cloud Completion with Pose Disturbance Jieqi Shi et.al. 2202.03084v1 null
2022-02-04 DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938v1 null
2022-02-01 A Model for Multi-View Residual Covariances based on Perspective Deformation Alejandro Fontan et.al. 2202.00765v1 null
2022-01-30 Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM Xinghe Chu et.al. 2201.12726v1 null
2022-01-28 RGB-D SLAM Using Attention Guided Frame Association Ali Caglayan et.al. 2201.12047v1 null
2022-02-04 Learning to Act with Affordance-Aware Multimodal Neural SLAM Zhiwei Jia et.al. 2201.09862v2 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048v1 link
2022-01-17 SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System Giseop Kim et.al. 2201.06423v1 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386v1 link
2022-01-19 Multi-Hypothesis Scan Matching through Clustering Giorgio Iavicoli et.al. 2201.03814v2 null
2022-01-11 Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM Kevin J. Doherty et.al. 2201.03773v1 null
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364v1 link
2022-01-10 Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition M. Usman Maqbool Bhutta et.al. 2201.03212v1 link
2022-01-04 Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds Xueliang Wen et.al. 2201.00959v1 null
2021-12-29 Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic Khen Elimelech et.al. 2112.14428v1 null
2021-12-19 M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots Jie Yin et.al. 2112.13659v1 link
2021-12-27 UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping Hyunjun Lim et.al. 2112.13515v1 link
2021-12-25 Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs Yusheng Wang et.al. 2112.13224v1 null
2021-12-25 Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping Peng Huang et.al. 2112.13222v1 null
2021-12-24 3D Point Cloud Reconstruction and SLAM as an Input Ziyu Li et.al. 2112.12907v1 null
2021-12-22 NICE-SLAM: Neural Implicit Scalable Encoding for SLAM Zihan Zhu et.al. 2112.12130v1 link
2021-12-18 Fast and Robust Registration of Partially Overlapping Point Clouds Eduardo Arnold et.al. 2112.09922v1 link
2021-12-17 Symmetry-aware Neural Architecture for Embodied Visual Navigation Shuang Liu et.al. 2112.09515v1 null
2021-12-27 Homography Decomposition Networks for Planar Object Tracking Xinrui Zhan et.al. 2112.07909v3 link
2021-12-14 Autonomous Navigation System from Simultaneous Localization and Mapping Micheal Caracciolo et.al. 2112.07723v1 link
2021-12-12 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation Bolivar Solarte et.al. 2112.06180v2 link
2021-12-11 Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization Amay Saxena et.al. 2112.05921v1 null
2021-12-07 Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems Gideon Billings et.al. 2112.03826v1 link
2021-12-05 Iterated Posterior Linearization PMB Filter for 5G SLAM Yu Ge et.al. 2112.02575v1 null
2021-12-03 Fast Direct Stereo Visual SLAM Jiawei Mo et.al. 2112.01890v1 link
2021-12-02 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349v2 link
2021-12-01 Research on Event Accumulator Settings for Event-Based SLAM Kun Xiao et.al. 2112.00427v1 link
2021-11-29 An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments Assem Sadek et.al. 2111.14666v1 null
2021-11-29 Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report Hartmut Surmann et.al. 2111.14542v1 null
2021-11-24 Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment V. Ayala-Alfaro et.al. 2111.12690v1 null
2021-11-24 Autonomous bot with ML-based reactive navigation for indoor environment Yash Srivastava et.al. 2111.12542v1 null
2021-11-22 A General Framework for Lifelong Localization and Mapping in Changing Environment Min Zhao et.al. 2111.10946v1 link
2021-11-17 Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network Xiaoming Zhao et.al. 2111.09006v2 null
2021-11-10 Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models Bruno Santos et.al. 2111.05631v1 null
2021-11-10 TomoSLAM: factor graph optimization for rotation angle refinement in microtomography Mark Griguletskii et.al. 2111.05562v1 null
2021-11-07 Hierarchical Segment-based Optimization for SLAM Yuxin Tian et.al. 2111.04101v1 null
2021-11-07 Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM Shing Yan Loo et.al. 2111.04096v2 null
2021-11-05 MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry Joan P. Company-Corcoles et.al. 2111.03408v1 null
2021-10-31 Loop closure detection using local 3D deep descriptors Youjie Zhou et.al. 2111.00440v1 link
2021-10-27 Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification Mingsheng Yin et.al. 2110.14789v2 link
2021-10-27 Efficient Placard Discovery for Semantic Mapping During Frontier Exploration David Balaban et.al. 2110.14742v1 null
2021-10-26 Robust Multi-view Registration of Point Sets with Laplacian Mixture Model Jin Zhang et.al. 2110.13744v1 null
2021-10-25 WOLF: A modular estimation framework for robotics based on factor graphs Joan Sola et.al. 2110.12919v1 null
2021-10-21 Real-Time Ground-Plane Refined LiDAR SLAM Fan Yang et.al. 2110.11517v1 null
2021-10-21 SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words Jonathan J. Y. Kim et.al. 2110.11491v1 null
2021-10-21 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion Zhenkun Zhu et.al. 2110.11040v2 null
2021-10-20 SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training Ankur Bapna et.al. 2110.10329v1 null
2021-10-18 Enhancing exploration algorithms for navigation with visual SLAM Kirill Muravyev et.al. 2110.09156v1 null
2021-10-18 Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment Rui Tian et.al. 2110.08977v1 null
2021-10-16 Partial Hierarchical Pose Graph Optimization for SLAM Alexander Korovko et.al. 2110.08639v1 null
2021-10-14 Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach Shumon Koga et.al. 2110.07546v1 null
2021-10-13 Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity Ran Liu et.al. 2110.06541v2 null
2021-10-12 Learning Efficient Multi-Agent Cooperative Visual Exploration Chao Yu et.al. 2110.05734v1 null
2021-10-07 Self-Supervised Depth Completion for Active Stereo Frederik Warburg et.al. 2110.03234v1 null
2021-10-06 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes Zhenkun Zhu et.al. 2110.02593v1 null
2021-10-03 AEROS: Adaptive RObust least-Squares for Graph-Based SLAM Milad Ramezani et.al. 2110.02018v1 null
2021-10-04 Fast Uncertainty Quantification for Active Graph SLAM Julio A. Placed et.al. 2110.01289v1 link
2021-10-04 Geometry-based Graph Pruning for Lifelong SLAM Gerhard Kurz et.al. 2110.01286v1 null
2021-10-03 Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration Marcus Greiff et.al. 2110.01099v1 null
2021-10-02 Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows Qiangqiang Huang et.al. 2110.00876v1 link

(back to top)

SFM

Publish Date Title Authors PDF Code
2025-03-06 PLMP -- Point-Line Minimal Problems for Projective SfM Kim Kiehn et.al. 2503.04351v1 null
2025-03-03 ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization Anas Abdelkarim et.al. 2503.01311v1 null
2025-03-05 A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping Jialei He et.al. 2503.01202v3 null
2025-03-02 MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain Rui Yi Yong et.al. 2503.00853v1 null
2025-03-02 PSRGS: Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery BoCheng Li et.al. 2503.00848v1 null
2025-03-02 Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration Jinjiang You et.al. 2503.00737v1 link
2025-02-27 Best Foot Forward: Robust Foot Reconstruction in-the-wild Kyle Fogarty et.al. 2502.20511v1 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779v3 null
2025-02-20 CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting Qilin Zhang et.al. 2502.14684v1 link
2025-02-19 Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections Seong Jong Yoo et.al. 2502.13986v1 null
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras Dongki Jung et.al. 2502.12545v2 null
2025-02-10 FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences Oliver Boyne et.al. 2502.06367v1 link
2025-02-10 Building Rome with Convex Optimization Haoyu Han et.al. 2502.04640v2 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657v1 null
2025-03-02 GP-GS: Gaussian Processes for Enhanced Gaussian Splatting Zhihao Guo et.al. 2502.02283v3 link
2025-02-03 XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications Shangjin Zhai et.al. 2502.01297v1 null
2025-01-28 Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction Tim Flückiger et.al. 2501.16221v2 null
2025-01-25 Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos Zhen-Hui Dong et.al. 2501.15096v1 null
2025-01-24 MATCHA: Towards Matching Anything Fei Xue et.al. 2501.14945v1 null
2025-01-24 Light3R-SfM: Towards Feed-forward Structure-from-Motion Sven Elflein et.al. 2501.14914v1 null
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277v1 null
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015v2 null
2025-02-02 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927v2 link
2025-01-11 Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis Aditya Rauniyar et.al. 2501.06431v1 null
2025-01-06 Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation Yuezhang Lv et.al. 2501.02821v1 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409v1 null
2025-01-02 EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy Ao Gao et.al. 2501.01003v1 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767v1 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806v1 null
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103v1 null
2024-12-18 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Jongmin Park et.al. 2412.09982v2 null
2024-12-10 Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling Hui Deng et.al. 2412.07230v1 null
2024-12-08 Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features Yuanbo Xiangli et.al. 2412.05826v1 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463v2 null
2024-12-02 SfM-Free 3D Gaussian Splatting via Hierarchical Training Bo Ji et.al. 2412.01553v1 link
2024-12-02 MVImgNet2.0: A Larger-scale Dataset of Multi-view Images Xiaoguang Han et.al. 2412.01430v1 null
2024-12-02 Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM Alejandro Fontan et.al. 2412.01116v1 null
2024-11-27 RoMo: Robust Motion Segmentation Improves Structure from Motion Lily Goli et.al. 2411.18650v1 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779v1 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291v1 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592v1 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546v1 null
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879v1 null
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453v1 null
2024-11-08 From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS Haoran Zhang et.al. 2411.05362v1 link
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213v1 null
2024-10-25 A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint Changshi Mu et.al. 2410.19473v1 link
2024-10-30 Large Spatial Model: End-to-end Unposed Images to Semantic 3D Zhiwen Fan et.al. 2410.18956v2 link
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505v1 null
2024-10-20 Neural Active Structure-from-Motion in Dark and Textureless Environment Kazuto Ichimaru et.al. 2410.15378v1 null
2024-10-16 Gravity-aligned Rotation Averaging with Circular Regression Linfei Pan et.al. 2410.12763v1 link
2024-10-15 SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection Yizhe Liu et.al. 2410.12080v1 link
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505v1 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533v1 link
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434v1 null
2024-10-08 Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? Charalambos Tzamos et.al. 2410.05984v1 link
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861v1 null
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386v1 null
2024-09-29 Robust Incremental Structure-from-Motion with Hybrid Features Shaohui Liu et.al. 2409.19811v1 null
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152v1 null
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673v1 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981v1 null
2024-09-24 Frequency-based View Selection in Gaussian Splatting Reconstruction Monica M. Q. Li et.al. 2409.16470v1 null
2024-10-07 Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion Juan-Diego Florez et.al. 2409.16465v2 null
2024-09-24 Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research Vandita Shukla et.al. 2409.15914v1 null
2024-09-23 Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments Francisco Roza de Moraes et.al. 2409.15602v1 null
2024-09-17 GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module Yichen Zhang et.al. 2409.11307v1 null
2024-09-13 Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints Shan Chen et.al. 2409.08613v1 null
2024-09-09 KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction Davide Di Nucci et.al. 2409.05407v1 null
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581v1 null
2024-09-25 Geometry-aware Feature Matching for Large-Scale Structure from Motion Gonglin Chen et.al. 2409.02310v3 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373v3 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445v2 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966v1 null
2024-08-20 TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks Jinjie Mai et.al. 2408.10739v1 null
2024-08-16 Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS Wei Sun et.al. 2408.08723v1 null
2024-08-15 CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning Wei Zhu et.al. 2408.08134v1 link
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648v1 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825v1 null
2024-08-04 Birational geometry of critical loci in Algebraic Vision Marina Bertolini et.al. 2408.02067v1 null
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone Xin Yang et.al. 2408.02053v1 null
2024-08-02 Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris Kentaro Uno et.al. 2408.01035v1 null
2024-08-01 LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting Zhenyu Bao et.al. 2408.00254v1 null
2024-07-29 Global Structure-from-Motion Revisited Linfei Pan et.al. 2407.20219v1 link
2024-08-06 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166v2 null
2024-07-16 NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models Francesco Milano et.al. 2407.12207v1 link
2024-07-15 LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning Zhuozhu Jian et.al. 2407.10782v1 null
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406v1 null
2024-07-14 3DEgo: 3D Editing on the Go! Umar Khalid et.al. 2407.10102v1 null
2024-07-10 Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization Jinjie Mai et.al. 2407.08023v1 link
2024-07-09 Computer vision tasks for intelligent aerospace missions: An overview Huilin Chen et.al. 2407.06513v1 null
2024-07-08 Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views Jiawei Guo et.al. 2407.05666v1 null
2024-07-05 Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization Shaohan Li et.al. 2407.04260v1 null
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939v3 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918v1 link
2024-07-02 Indoor 3D Reconstruction with an Unknown Camera-Projector Pair Zhaoshuai Qi et.al. 2407.01945v1 null
2024-05-29 Rotation Averaging: A Primal-Dual Method and Closed-Forms in Cycle Graphs Gabriel Moreira et.al. 2406.18564v1 null
2024-06-26 VDG: Vision-Only Dynamic Gaussian for Driving Simulation Hao Li et.al. 2406.18198v1 null
2024-06-25 Consensus Learning with Deep Sets for Essential Matrix Estimation Dror Moran et.al. 2406.17414v1 link
2024-06-24 Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction Tong Qin et.al. 2406.16289v1 null
2024-06-19 MVSBoost: An Efficient Point Cloud-based 3D Reconstruction Umair Haroon et.al. 2406.13515v1 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819v1 link
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216v1 link
2024-06-13 Gaussian Splatting with Localized Points Management Haosen Yang et.al. 2406.04251v2 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509v1 null
2024-05-29 Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy Zijie Jiang et.al. 2405.18863v1 null
2024-05-29 3D Reconstruction with Fast Dipole Sums Hanyu Chen et.al. 2405.16788v3 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599v1 null
2024-05-09 Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment Simon Weber et.al. 2405.05079v2 link
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345v1 null
2024-05-07 Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling Jiawei Shi et.al. 2405.04309v1 null
2024-05-03 HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 Miriam Jäger et.al. 2405.02005v1 null
2024-04-22 Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer Eric Brachmann et.al. 2404.14351v1 null
2024-04-22 RESFM: Robust Equivariant Multiview Structure from Motion Fadi Khatib et.al. 2404.14280v1 null
2024-05-23 Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting Yalda Foroutan et.al. 2404.12547v3 null
2024-05-07 A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion Feng Yu et.al. 2404.11590v2 link
2024-04-18 DeblurGS: Gaussian Splatting for Camera Motion Blur Jeongtaek Oh et.al. 2404.11358v2 null
2024-05-21 LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives Jiadi Cui et.al. 2404.09748v2 null
2024-04-12 MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance Yuqun Wu et.al. 2404.08252v1 null
2024-04-11 Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation Keonhee Han et.al. 2404.07933v1 null
2024-04-07 NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization Peng Tu et.al. 2404.04875v1 null
2024-04-04 GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis Emmanouil Nikolakakis et.al. 2404.03126v1 null
2024-03-29 InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds Zhiwen Fan et.al. 2403.20309v1 link
2024-03-29 HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes Zhuopeng Li et.al. 2403.20032v1 null
2024-03-26 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation Jiahao Chen et.al. 2403.17537v1 null
2024-03-25 INPC: Implicit Neural Point Clouds for Radiance Field Rendering Florian Hahlbohm et.al. 2403.16862v1 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639v1 null
2024-03-14 Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting Jaewoo Jung et.al. 2403.09413v1 link
2024-03-13 Refractive COLMAP: Refractive Structure-from-Motion Revisited Mengkun She et.al. 2403.08640v1 null
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156v1 link
2024-03-11 SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection Yifu Tao et.al. 2403.06877v1 null
2024-03-24 BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling Cheng Peng et.al. 2403.04926v2 link
2024-02-22 GaussianPro: 3D Gaussian Splatting with Progressive Propagation Kai Cheng et.al. 2402.14650v1 null
2024-02-25 A Robust Error-Resistant View Selection Method for 3D Reconstruction Shaojie Zhang et.al. 2402.11431v2 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287v1 null
2024-03-11 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592v2 link
2024-01-22 HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs Zelin Gao et.al. 2401.11711v1 null
2024-01-19 SCENES: Subpixel Correspondence Estimation With Epipolar Supervision Dominik A. Kloepfer et.al. 2401.10886v1 null
2024-01-15 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data Mathilde Letard et.al. 2401.09481v1 link
2024-01-17 3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey Thiago Lopes Trugillo da Silveira et.al. 2401.09252v1 null
2024-01-17 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization Weiyao Wang et.al. 2401.08937v1 null
2024-01-16 Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions Yi-Fan Zuo et.al. 2401.08043v1 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236v1 link
2024-01-07 A Classification of Critical Configurations for any Number of Projective Views Martin Bråtelund et.al. 2401.03450v1 link
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471v1 null
2023-12-16 Transformers in Unsupervised Structure-from-Motion Hemang Chawla et.al. 2312.10529v1 link
2023-12-14 HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video Xueying Wang et.al. 2312.08863v1 null
2023-12-14 CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning Qingsong Yan et.al. 2312.08760v1 null
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865v1 link
2023-12-11 Gaussian Splatting SLAM Hidenobu Matsuki et.al. 2312.06741v1 null
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889v1 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563v1 null
2023-11-30 Distributed Global Structure-from-Motion with a Deep Front-End Ayush Baid et.al. 2311.18801v1 link
2023-11-21 Robot Hand-Eye Calibration using Structure-from-Motion Nicolas Andreff et.al. 2311.11808v2 null
2023-11-18 LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation Sébastien Henry et.al. 2311.11171v1 null
2023-11-10 MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty Rémi Marsal et.al. 2311.06137v1 link
2023-11-08 VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering Linus Franke et.al. 2311.04634v1 link
2023-10-22 A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Jan Emily Mangulabnan et.al. 2310.14364v1 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605v1 null
2023-10-09 Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration Chunge Bai et.al. 2310.05504v1 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134v1 null
2023-11-29 Pose-Free Generalizable Rendering Transformer Zhiwen Fan et.al. 2310.03704v2 link
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092v1 null
2023-10-01 Propagating Semantic Labels in Video Data David Balaban et.al. 2310.00783v1 null
2023-09-22 Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning Jonathan Sauder et.al. 2309.12804v1 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883v1 link
2023-09-19 Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water Jayesh Tripathi et.al. 2309.10269v1 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927v1 link
2023-09-08 Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147v1 null
2023-09-01 SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation Youhong Wang et.al. 2309.00526v1 null
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385v1 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984v1 link
2023-08-26 Disjoint Pose and Shape for 3D Face Reconstruction Raja Kumar et.al. 2308.13903v1 null
2023-08-30 CamP: Camera Preconditioning for Neural Radiance Fields Keunhong Park et.al. 2308.10902v2 null
2023-08-18 Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling Haorui Ji et.al. 2308.10705v1 null
2023-08-14 Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation Tao Liu et.al. 2308.07231v1 link
2023-08-11 Efficient Large-scale AUV-based Visual Seafloor Mapping Mengkun She et.al. 2308.06147v1 null
2023-08-04 EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems Weihan Wang et.al. 2308.02670v1 null
2023-08-15 Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites Jyotirmaya Shivottam et.al. 2308.01246v2 link
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125v1 null
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055v1 link
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702v2 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981v1 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520v1 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404v1 link
2023-06-29 The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes David Recasens et.al. 2306.16917v1 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669v1 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770v2 link
2023-06-15 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Varun Jampani et.al. 2306.09109v1 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012v1 link
2023-06-10 3D reconstruction using Structure for Motion Kshitij Karnawat et.al. 2306.06360v1 link
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938v1 null
2023-05-31 FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow Cameron Smith et.al. 2306.00180v1 null
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036v1 link
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301v1 link
2023-05-09 Rotation Synchronization via Deep Matrix Factorization Gk Tejus et.al. 2305.05268v1 link
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664v1 null
2023-04-14 Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments Felix Ott et.al. 2304.07250v1 null
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947v1 link
2023-04-08 Photometric Correction for Infrared Sensors Jincheng Zhang et.al. 2304.03930v1 null
2023-04-07 DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium Antyanta Bangunharcana et.al. 2304.03560v1 link
2023-04-05 Semantic Validation in Structure from Motion Joseph Rowell et.al. 2304.02420v1 link
2023-03-31 Learning Internal Representations of 3D Transformations from 2D Projected Inputs Marissa Connor et.al. 2303.17776v1 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504v1 link
2023-03-27 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering Jaehoon Choi et.al. 2303.15060v1 null
2023-03-26 On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks HyunJun Jung et.al. 2303.14840v1 link
2023-03-24 Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong et.al. 2303.13805v1 link
2023-03-24 Progressively Optimized Local Radiance Fields for Robust View Synthesis Andreas Meuleman et.al. 2303.13791v1 null
2023-03-15 RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters Shuja Khalid et.al. 2303.08695v1 null
2023-03-09 Revisiting Rotation Averaging: Uncertainties and Robust Losses Ganlin Zhang et.al. 2303.05195v1 link
2023-02-28 Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images Zhongli Fan et.al. 2302.14239v1 link
2023-03-25 BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling Sameera Ramasinghe et.al. 2302.13543v3 null
2023-02-21 EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images Zhichao Ye et.al. 2302.10544v1 link
2023-02-18 Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering Tatsuro Yamane et.al. 2302.09208v1 null
2023-02-12 Uncertainty-Driven Dense Two-View Structure from Motion Weirong Chen et.al. 2302.00523v2 null
2023-01-28 AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion Yu Chen et.al. 2301.12135v1 null
2023-01-20 A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles Zhefan Xu et.al. 2301.08422v1 link
2023-03-21 Robust Dynamic Radiance Fields Yu-Lun Liu et.al. 2301.02239v2 link
2022-12-24 Polarimetric Multi-View Inverse Rendering Jinyu Zhao et.al. 2212.12721v1 null
2022-12-13 Accidental Turntables: Learning 3D Pose by Watching Objects Turn Zezhou Cheng et.al. 2212.06300v1 null
2022-12-04 3D Object Aided Self-Supervised Monocular Depth Estimation Songlin Wei et.al. 2212.01768v1 null
2022-12-02 High-Res Facial Appearance Capture from Polarized Smartphone Images Dejan Azinović et.al. 2212.01160v1 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069v1 link
2022-11-24 JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models Sepidehsadat Hosseini et.al. 2211.13785v1 null
2022-11-24 SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks Sergio Izquierdo et.al. 2211.13551v1 link
2022-11-22 Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces Yuxi Xiao et.al. 2211.12018v1 link
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836v1 null
2022-11-14 Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion René Haas et.al. 2211.07195v1 null
2022-10-13 Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach Zhiang Chen et.al. 2210.07349v1 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517v1 null
2022-10-07 Leveraging Structure from Motion to Localize Inaccessible Bus Stops Indu Panigrahi et.al. 2210.03646v1 link
2022-10-01 Structure-Aware NeRF without Posed Camera via Epipolar Constraint Shu Chen et.al. 2210.00183v1 link
2022-10-05 FAST-LIO, Then Bayesian ICP, Then GTSFM Jerred Chen et.al. 2210.00146v2 null
2022-09-20 BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction Ahalya Ravendran et.al. 2209.09470v1 null
2022-09-19 A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion Gerry Chen et.al. 2209.08690v1 null
2022-09-14 End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes Qiao Chen et.al. 2209.06926v1 null
2022-09-07 Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 Hartmut Surmann et.al. 2209.03084v1 null
2022-08-27 Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data Thomas A. Ciarfuglia et.al. 2208.13001v1 null
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325v1 null
2022-08-04 Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training Yao-Chih Lee et.al. 2208.02709v1 link
2022-07-31 One Object at a Time: Accurate and Robust Structure From Motion for Robots Aravind Battaje et.al. 2208.00487v1 null
2022-07-23 Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks Daniel Posada et.al. 2207.11413v1 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762v2 link
2022-07-19 ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild Wang Zhao et.al. 2207.09137v1 link
2022-07-16 Organic Priors in Non-Rigid Structure from Motion Suryansh Kumar et.al. 2207.06262v3 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396v1 null
2022-06-24 Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set San Jiang et.al. 2206.11499v2 null
2022-06-13 TC-SfM: Robust Track-Community-Based Structure-from-Motion Lei Wang et.al. 2206.05866v1 null
2022-06-10 EigenFairing: 3D Model Fairing using Image Coherence Pragyana Mishra et.al. 2206.05309v1 null
2022-06-01 Semantic Room Wireframe Detection from a Single View David Gillsjö et.al. 2206.00491v1 link
2022-05-31 Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction Qiancheng Fu et.al. 2205.15848v1 null
2022-05-09 Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression HyunJun Jung et.al. 2205.04565v1 null
2022-05-07 Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs Pedro F. Proença et.al. 2205.03522v1 null
2022-05-06 EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms Levi Burner et.al. 2205.03467v1 null
2022-04-20 Learned Monocular Depth Priors in Visual-Inertial Initialization Yunwen Zhou et.al. 2204.09171v1 null
2022-04-10 Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective Hui Deng et.al. 2204.04730v1 null
2022-04-08 Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems Debao Huang et.al. 2204.04145v1 null
2022-04-07 SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation Yi Wei et.al. 2204.03636v1 link
2022-04-06 Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion Lukas Bommes et.al. 2204.02733v1 link
2022-04-05 Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows Sheng Liu et.al. 2204.02509v1 link
2022-03-31 Fast, Accurate and Memory-Efficient Partial Permutation Synchronization Shaohan Li et.al. 2203.16505v2 null
2022-03-28 Visual Odometry for RGB-D Cameras Afonso Fontes et.al. 2203.15119v1 null
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901v1 link
2022-03-23 Event-Based Dense Reconstruction Pipeline Kun Xiao et.al. 2203.12270v1 null
2022-03-21 DiffPoseNet: Direct Differentiable Camera Pose Estimation Chethan M. Parameshwara et.al. 2203.11174v1 null
2022-03-02 Asynchronous Optimisation for Event-based Visual Odometry Daqi Liu et.al. 2203.01037v1 null
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851v1 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146v1 link
2022-01-20 GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry Yunhan Zhao et.al. 2201.08131v1 null
2022-01-13 Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching Yunpeng Shi et.al. 2201.04797v1 link
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364v1 link
2022-01-06 De-rendering 3D Objects in the Wild Felix Wimbauer et.al. 2201.02279v1 link
2021-12-29 On the Instability of Relative Pose Estimation and RANSAC's Role Hongyi Fan et.al. 2112.14651v1 null
2021-12-16 Road-aware Monocular Structure from Motion and Homography Estimation Wei Sui et.al. 2112.08635v1 null
2021-12-10 Critical configurations for three projective views Martin Bråtelund et.al. 2112.05478v1 null
2021-12-09 Critical configurations for two projective views, a new approach Martin Bråtelund et.al. 2112.05074v1 null
2021-12-06 Dense Depth Priors for Neural Radiance Fields from Sparse Input Views Barbara Roessle et.al. 2112.03288v1 link
2021-12-10 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349v2 link
2021-11-11 Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft Pascal Schoppmann et.al. 2111.06271v1 null
2021-11-10 Damage Estimation and Localization from Sparse Aerial Imagery Rene Garcia Franceschini et.al. 2111.03708v2 null
2021-11-03 Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems Swarnabja Bhaumik et.al. 2111.02064v1 null
2021-10-14 Modeling dynamic target deformation in camera calibration Annika Hagemann et.al. 2110.07322v1 null
2021-10-13 Hyperspectral 3D Mapping of Underwater Environments Maxime Ferrera et.al. 2110.06571v1 null
2021-09-24 Automatic Map Update Using Dashcam Videos Aziza Zhanabatyrova et.al. 2109.12131v1 null
2021-09-16 Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs Gabriel Moreira et.al. 2109.08046v1 link
2021-09-06 Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications Tejas Mane et.al. 2109.02740v1 null
2021-09-02 Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency Beatrix-Emőke Fülöp-Balogh et.al. 2109.01018v1 null
2021-09-01 On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation Eric Brachmann et.al. 2109.00524v1 link
2021-08-31 DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension Roman Shapovalov et.al. 2109.00033v1 null
2021-08-29 Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration Seyed-Mahdi Nasiri et.al. 2108.12876v1 null
2021-08-23 Burst Imaging for Light-Constrained Structure-From-Motion Ahalya Ravendran et.al. 2108.09895v1 null

(back to top)

Visual Localization

Publish Date Title Authors PDF Code
2025-03-06 RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining Tengfei Zhang et.al. 2503.04653v1 null
2025-03-06 ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images Yanqing Shen et.al. 2503.04475v1 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235v1 null
2025-03-06 Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior Haitao Wu et.al. 2503.04207v1 null
2025-03-06 Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments Beverley Gorry et.al. 2503.04096v1 null
2025-03-04 TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition Oliver Grainge et.al. 2503.02511v1 null
2025-03-04 Introspective Loop Closure for SLAM with 4D Imaging Radar Maximilian Hilger et.al. 2503.02383v1 null
2025-03-04 Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models Kenta Tsukahara et.al. 2503.02256v1 null
2025-03-03 Composed Multi-modal Retrieval: A Survey of Approaches and Applications Kun Zhang et.al. 2503.01334v1 link
2025-03-03 AirRoom: Objects Matter in Room Reidentification Runmao Yao et.al. 2503.01130v1 null
2025-03-02 Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching Jinyu Miao et.al. 2503.00862v1 null
2025-03-01 Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning Songlin Dong et.al. 2503.00515v1 null
2025-02-28 EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration Kuangyi Chen et.al. 2503.00167v1 null
2025-02-28 CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Zelong Sun et.al. 2502.20826v1 null
2025-02-28 SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition Shanshan Wan et.al. 2502.20676v1 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036v1 link
2025-02-27 On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation Ruben T. Lucassen et.al. 2502.19285v2 null
2025-02-26 BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure Haoxin Cai et.al. 2502.19242v1 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932v1 null
2025-02-19 A Comprehensive Survey on Composed Image Retrieval Xuemeng Song et.al. 2502.18495v1 null
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237v2 link
2025-02-23 Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries Yin Wu et.al. 2502.16636v1 link
2025-02-23 SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition Feng Lu et.al. 2502.16601v1 link
2025-02-21 ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval Guanqi Zhan et.al. 2502.15682v1 null
2025-02-20 Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition Tianyi Shang et.al. 2502.14195v1 link
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803v1 null
2025-02-18 Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Shuo Xing et.al. 2502.13146v1 link
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras Dongki Jung et.al. 2502.12545v2 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303v1 null
2025-02-17 Descriminative-Generative Custom Tokens for Vision-Language Models Pramuditha Perera et.al. 2502.12095v1 null
2025-02-17 ILIAS: Instance-Level Image retrieval At Scale Giorgos Kordopatis-Zilos et.al. 2502.11748v1 null
2025-02-17 Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition Jianyi Peng et.al. 2502.11742v1 null
2025-02-17 Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics Francesco Croce et.al. 2502.11725v1 link
2025-02-17 Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization Yuanze Xu et.al. 2502.11408v1 null
2025-02-12 E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection Junjie Wu et.al. 2502.10455v1 null
2025-02-11 Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning Yuhang Dong et.al. 2502.09649v1 null
2025-02-13 ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Rotem Shalev-Arkushin et.al. 2502.09411v1 null
2025-02-12 SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization Artem Dementyev et.al. 2502.08848v1 null
2025-02-12 Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions Prajwal Gatti et.al. 2502.08438v1 null
2025-02-11 Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang et.al. 2502.07830v1 null
2025-02-11 Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields Petr Koutenský et.al. 2502.07338v1 null
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327v1 null
2025-02-11 PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval Osman Tursun et.al. 2502.07215v1 null
2025-02-10 AstroLoc: Robust Space to Ground Image Localizer Gabriele Berton et.al. 2502.07003v1 null
2025-02-09 Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education Yanhao Jia et.al. 2502.05863v1 null
2025-02-07 Learning Street View Representations with Spatiotemporal Contrast Yong Li et.al. 2502.04638v1 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263v1 link
2025-02-05 Human-Aligned Image Models Improve Visual Decoding from the Brain Nona Rajabi et.al. 2502.03081v1 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335v1 null
2025-01-31 LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks Liudi Yang et.al. 2501.19382v1 link
2025-01-27 Freestyle Sketch-in-the-Loop Image Segmentation Subhadeep Koley et.al. 2501.16022v1 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379v1 null
2025-01-24 Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection Viktor Kozák et.al. 2501.14587v1 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051v1 link
2025-01-22 Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation Kenta Uesugi et.al. 2501.13968v1 null
2025-01-19 Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection Zhipeng Yu et.al. 2501.11063v1 link
2025-01-18 A Resource-Efficient Training Framework for Remote Sensing Text--Image Retrieval Weihang Zhang et.al. 2501.10638v1 null
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887v1 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001v1 link
2025-01-12 SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval Bhavin Jawade et.al. 2501.08347v1 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286v1 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399v1 null
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749v1 null
2025-01-06 Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI Xujin Li et.al. 2501.02841v1 null
2025-01-03 A Minimal Subset Approach for Efficient and Scalable Loop Closure Nikolaos Stathoulopoulos et.al. 2501.01791v1 link
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642v1 null
2025-01-02 R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization Xudong Jiang et.al. 2501.01421v1 null
2025-01-02 Training Medical Large Vision-Language Models with Abnormal-Aware Feedback Yucheng Zhou et.al. 2501.01377v1 null
2025-01-02 Domain-invariant feature learning in brain MR imaging for content-based image retrieval Shuya Tobari et.al. 2501.01326v1 null
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056v1 link
2024-12-25 FOR: Finetuning for Object Level Open Vocabulary Image Retrieval Hila Levi et.al. 2412.18806v1 null
2024-12-24 ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval Le Dong et.al. 2412.18136v1 link
2024-12-22 Where am I? Cross-View Geo-localization with Natural Language Descriptions Junyan Ye et.al. 2412.17007v1 null
2024-12-22 Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process Shenghai Yuan et.al. 2412.16880v1 null
2024-12-24 Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling Daichi Yashima et.al. 2412.16576v2 link
2024-12-20 A New Method to Capturing Compositional Knowledge in Linguistic Space Jiahe Wan et.al. 2412.15632v1 null
2024-12-20 Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation Samantha J Alloo et.al. 2412.15513v1 null
2024-12-19 Learning Visual Composition through Improved Semantic Guidance Austin Stone et.al. 2412.15396v1 null
2024-12-19 MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Junjie Zhou et.al. 2412.14475v1 null
2024-12-18 Adversarial Hubness in Multi-Modal Retrieval Tingwei Zhang et.al. 2412.14113v1 link
2024-12-18 Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval Giacomo Pacini et.al. 2412.13834v1 null
2024-12-18 ConDo: Continual Domain Expansion for Absolute Pose Regression Zijun Li et.al. 2412.13452v1 link
2024-12-17 Three Things to Know about Deep Metric Learning Yash Patel et.al. 2412.12432v1 null
2024-12-15 Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval Zelong Sun et.al. 2412.11087v1 null
2024-12-20 Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2412.11077v3 null
2024-12-13 MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition Qiwen Gu et.al. 2412.09199v2 null
2024-12-12 A Flexible Plug-and-Play Module for Generating Variable-Length Liyang He et.al. 2412.08922v1 link
2024-12-11 Image Retrieval Methods in the Dissimilarity Space Madhu Kiran et.al. 2412.08618v1 null
2024-12-11 Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Siyan Dong et.al. 2412.08376v1 link
2024-12-11 Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin Benjamin D. Killeen et.al. 2412.08020v1 null
2024-12-10 On Motion Blur and Deblurring in Visual Place Recognition Timur Ismagilov et.al. 2412.07751v1 null
2024-12-10 Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance Wanwen Chen et.al. 2412.07741v1 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488v1 link
2024-12-09 A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition Connor Malone et.al. 2412.06153v1 null
2024-12-07 Compositional Image Retrieval via Instruction-Aware Contrastive Learning Wenliang Zhong et.al. 2412.05756v1 link
2024-12-06 DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification Ying Jin et.al. 2412.04828v1 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512v1 null
2024-12-04 Composed Image Retrieval for Training-Free Domain Conversion Nikos Efthymiadis et.al. 2412.03297v1 link
2024-12-03 A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration Thulio Amorim et.al. 2412.02881v1 null
2024-12-03 Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar et.al. 2412.02310v1 link
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039v1 link
2024-12-02 Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features MD Shaikh Rahman et.al. 2412.01555v1 null
2024-12-02 Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models Yi Liao et.al. 2412.01202v1 null
2024-12-01 EDTformer: An Efficient Decoder Transformer for Visual Place Recognition Tong Jin et.al. 2412.00784v1 null
2024-11-28 EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval Muhammad Huzaifa et.al. 2412.00139v1 null
2024-11-28 Unleashing the Power of Data Synthesis in Visual Localization Sihang Li et.al. 2412.00138v1 null
2024-11-28 Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval Yang Liu et.al. 2412.00120v1 null
2024-11-29 A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications Liqiang Zhang, Ye Tian, Dongyan Wei et.al. 2411.19845v1 null
2024-11-27 Optimizing Image Retrieval with an Extended b-Metric Space Abdelkader Belhenniche et.al. 2411.18800v1 null
2024-11-26 Learning Visual Hierarchies with Hyperbolic Embeddings Ziwei Wang et.al. 2411.17490v1 null
2024-12-02 Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy You Li et.al. 2411.16752v2 null
2024-12-02 AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks You Li et.al. 2411.16749v2 null
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171v1 link
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800v1 null
2024-11-22 Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval Zengbao Sun et.al. 2411.14704v1 null
2024-11-20 Globally Correlation-Aware Hard Negative Generation Wenjie Peng et.al. 2411.13145v1 link
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481v1 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665v1 link
2024-11-13 Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval Saul Santos et.al. 2411.08590v1 link
2024-11-22 Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments Ashkan Nejad et.al. 2411.08567v2 link
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279v1 link
2024-11-05 From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing Xintian Sun et.al. 2411.05826v1 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545v1 null
2024-11-11 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Edward Vendrow et.al. 2411.02537v3 link
2024-11-20 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925v2 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804v1 null
2024-11-03 Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification MD Shaikh Rahman et.al. 2411.01473v1 null
2024-11-01 Identifying Implicit Social Biases in Vision-Language Models Kimia Hamidieh et.al. 2411.00997v1 null
2024-10-31 Nearest Neighbor Normalization Improves Multimodal Retrieval Neil Chowdhury et.al. 2410.24114v1 link
2024-10-31 MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval Haiwen Li et.al. 2410.23736v1 null
2024-10-30 Decoupling Semantic Similarity from Spatial Alignment for Neural Networks Tassilo Wald et.al. 2410.23107v1 link
2024-10-29 Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications Monica Riedler et.al. 2410.21943v1 link
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615v1 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341v1 link
2024-10-24 ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval Zijia Zhao et.al. 2410.18715v1 link
2024-10-25 On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features Tomáš Pivoňka et.al. 2410.18573v2 null
2024-10-22 Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2410.17393v1 null
2024-10-20 GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Haiwen Diao et.al. 2410.15266v1 link
2024-10-19 Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection Marie Roald et.al. 2410.14969v1 link
2024-10-16 Development of Image Collection Method Using YOLO and Siamese Network Chan Young Shin et.al. 2410.12561v1 null
2024-10-16 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment Juelin Zhu et.al. 2410.12269v1 link
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240v1 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505v1 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187v1 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533v1 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935v1 link
2024-10-16 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469v2 null
2024-10-11 A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification Eugene P. W. Ang et.al. 2410.08456v1 null
2024-10-10 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Hoin Jung et.al. 2410.07593v1 link
2024-10-09 Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval Mohammad Omama et.al. 2410.07022v1 null
2024-10-09 Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers Stephen Hausler et.al. 2410.06614v1 link
2024-10-09 MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel C. F. Codella et.al. 2410.06542v1 null
2024-10-08 Temporal Image Caption Retrieval Competition -- Description and Results Jakub Pokrywka et.al. 2410.06314v1 null
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285v1 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165v1 null
2024-10-08 Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Ayush Singh et.al. 2410.05928v1 null
2024-10-08 RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps Minsoo Kim et.al. 2410.05621v1 null
2024-10-11 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249v3 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419v1 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544v1 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536v2 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411v2 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266v1 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597v1 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293v1 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152v1 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733v1 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049v1 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502v1 link
2024-09-23 CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis Xiang Zhang et.al. 2409.15169v1 null
2024-09-21 Combining Absolute and Semi-Generalized Relative Poses for Visual Localization Vojtech Panek et.al. 2409.14269v1 null
2024-09-21 SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality Hongjia Zhai et.al. 2409.14067v1 null
2024-09-20 Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval Morris Florek et.al. 2409.13513v1 link
2024-09-18 Towards Global Localization using Multi-Modal Object-Instance Re-Identification Aneesh Chavan et.al. 2409.12002v1 link
2024-09-17 Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching Kurran Singh et.al. 2409.11555v1 null
2024-09-17 Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information Kunal Chelani et.al. 2409.11536v1 null
2024-09-17 Improving the Efficiency of Visually Augmented Language Models Paula Ontalvilla et.al. 2409.11148v1 link
2024-09-21 HGSLoc: 3DGS-based Heuristic Camera Pose Refinement Zhongyan Niu et.al. 2409.10925v2 null
2024-09-16 SOLVR: Submap Oriented LiDAR-Visual Re-Localisation Joshua Knights et.al. 2409.10247v1 null
2024-09-16 Garment Attribute Manipulation with Multi-level Attention Vittorio Casula et.al. 2409.10206v1 null
2024-09-14 Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval Amirreza Mahbod et.al. 2409.09430v1 link
2024-09-12 Structured Pruning for Efficient Visual Place Recognition Oliver Grainge et.al. 2409.07834v1 null
2024-09-10 GeoCalib: Learning Single-image Calibration with Geometric Optimization Alexander Veicht et.al. 2409.06704v1 link
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471v1 link
2024-09-10 A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions Zhicong Wu et.al. 2409.06381v1 null
2024-09-09 Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding Bram Willemsen et.al. 2409.05721v1 link
2024-09-09 Open-World Dynamic Prompt and Continual Visual Representation Learning Youngeun Kim et.al. 2409.05312v1 null
2024-09-12 Training-free ZS-CIR via Weighted Modality Fusion and Similarity Ren-Di Wu et.al. 2409.04918v2 link
2024-09-12 Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models Saghir Alfasly et.al. 2409.04631v2 null
2024-09-06 Reprojection Errors as Prompts for Efficient Scene Coordinate Regression Ting-Ru Liu et.al. 2409.04178v1 null
2024-09-06 Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments Therese Joseph et.al. 2409.03998v1 null
2024-09-04 Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications Abby Stylianou et.al. 2409.03012v1 null
2024-09-04 NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval Sepanta Zeighami et.al. 2409.02343v1 link
2024-09-03 Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment Konstantin Schall et.al. 2409.01936v1 link
2024-09-02 A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches Kim Jinwoo et.al. 2409.01219v1 null
2024-09-02 Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection Manon Kok et.al. 2409.01091v1 null
2024-09-02 Evidential Transformers for Improved Image Retrieval Danilo Dordevic et.al. 2409.01082v1 null
2024-09-05 EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System Bonan Liu et.al. 2409.00343v2 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373v3 null
2024-09-02 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095v2 null
2024-08-29 A compact neuromorphic system for ultra energy-efficient, on-device robot localization Adam D. Hines et.al. 2408.16754v1 link
2024-08-29 Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models Kengo Nakata et.al. 2408.16296v1 null
2024-08-28 Temporal Attention for Cross-View Sequential Image Localization Dong Yuan et.al. 2408.15569v1 link
2024-08-27 Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild Tianqi Wei et.al. 2408.14723v1 null
2024-08-25 LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task Ali Asgarov et.al. 2408.13909v1 link
2024-08-15 Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval Lifeng Zhou et.al. 2408.13705v1 null
2024-08-15 Coarse-to-fine Alignment Makes Better Speech-image Retrieval Lifeng Zhou et.al. 2408.13119v1 null
2024-08-21 FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization Son Tung Nguyen et.al. 2408.12037v1 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966v1 null
2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Xiangyu Zhao et.al. 2408.11305v1 link
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085v1 link
2024-08-19 BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval Zhenyu Lu et.al. 2408.10383v1 null
2024-08-23 Fashion Image-to-Image Translation for Complementary Item Retrieval Matteo Attimonelli et.al. 2408.09847v2 link
2024-08-20 MambaLoc: Efficient Camera Localisation via State Space Model Jialu Wang et.al. 2408.09680v2 null
2024-08-15 DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions Ryosuke Korekata et.al. 2408.07910v1 null
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648v1 null
2024-08-10 Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network Junyan Ye et.al. 2408.05475v1 link
2024-08-09 Spherical World-Locking for Audio-Visual Localization in Egocentric Videos Heeseung Yun et.al. 2408.05364v1 null
2024-08-06 AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval Pavel Suma et.al. 2408.03282v1 link
2024-08-05 CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration Gongxin Yao et.al. 2408.02394v1 null
2024-08-09 BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles Lun Luo et.al. 2408.01841v2 link
2024-08-02 On Validation of Search & Retrieval of Tissue Images in Digital Pathology H. R. Tizhoosh et.al. 2408.01570v1 null
2024-07-31 VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning Yuhang Ming et.al. 2407.21416v1 null
2024-07-31 SuperVINS: A visual-inertial SLAM framework integrated deep learning features Hongkun Luo et.al. 2407.21348v1 link
2024-07-30 Re-localization acceleration with Medoid Silhouette Clustering Hongyi Zhang et.al. 2407.20749v1 null
2024-07-29 A flexible framework for accurate LiDAR odometry, map manipulation, and localization José Luis Blanco-Claraco et.al. 2407.20465v1 link
2024-07-26 From 2D to 3D: AISG-SLA Visual Localization Challenge Jialin Gao et.al. 2407.18590v1 null
2024-07-24 Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Yongqi Li et.al. 2407.17274v1 null
2024-07-24 Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments Wei Gao et.al. 2407.17078v1 null
2024-07-24 Pose Estimation from Camera Images for Underwater Inspection Luyuan Peng et.al. 2407.16961v1 null
2024-07-22 Memory Management for Real-Time Appearance-Based Loop Closure Detection Mathieu Labbé et.al. 2407.15890v1 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791v1 null
2024-07-22 Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM Mathieu Labbe et.al. 2407.15305v1 null
2024-07-22 Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation Mathieu Labbé et.al. 2407.15304v1 null
2024-07-19 Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization Yuehua Ding et.al. 2407.14643v1 null
2024-07-18 Visual Haystacks: Answering Harder Questions About Sets of Images Tsung-Han Wu et.al. 2407.13766v1 link
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408v1 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736v2 link
2024-07-16 EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis Ruijie Yang et.al. 2407.11401v1 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964v1 link
2024-07-15 DINO Pre-training for Vision-based End-to-end Autonomous Driving Shubham Juneja et.al. 2407.10803v1 null
2024-07-15 Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval Youngsun Lim et.al. 2407.10683v1 null
2024-07-15 An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots J. J. Cabrera et.al. 2407.10596v1 link
2024-07-15 An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments J. J. Cabrera et.al. 2407.10536v1 null
2024-07-12 Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval Vaibhav Balloli et.al. 2407.08908v1 link
2024-07-11 Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates Owen Claxton et.al. 2407.08162v1 link
2024-07-12 Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal Xinyu Zhu et.al. 2407.08153v2 link
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106v1 link
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730v1 null
2024-07-09 CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding Wenhao Xu et.al. 2407.06611v1 null
2024-07-08 Pseudo-triplet Guided Few-shot Composed Image Retrieval Bohan Hou et.al. 2407.06001v1 null
2024-07-09 HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels Yingying Jiang et.al. 2407.05795v2 null
2024-07-05 Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning Mainak Singha et.al. 2407.04207v1 link
2024-07-04 Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models Chang-Sheng Kao et.al. 2407.03615v1 link
2024-07-03 Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach Pronay Debnath et.al. 2407.03486v1 null
2024-07-02 Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition Sergio Izquierdo et.al. 2407.02422v1 link
2024-07-01 Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval Aneeshan Sain et.al. 2407.01810v1 null
2024-07-01 Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval Hanwen Su et.al. 2407.00979v1 null
2024-07-01 Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios Connor Malone et.al. 2407.00863v1 null
2024-06-27 PathAlign: A vision-language model for whole slide images in histopathology Faruk Ahmed et.al. 2406.19578v1 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898v2 null
2024-06-27 Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs Huaying Zhang et.al. 2406.18836v1 null
2024-06-26 WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images Yannik Glaser et.al. 2406.18765v1 null
2024-06-26 View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis Subin Varghese et.al. 2406.18012v1 null
2024-06-25 Tell Me Where You Are: Multimodal LLMs Meet Place Recognition Zonglin Lyu et.al. 2406.17520v1 null
2024-06-25 SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation Xu Liu et.al. 2406.17249v1 link
2024-06-23 Breaking the Frame: Image Retrieval by Visual Overlap Prediction Tong Wei et.al. 2406.16204v1 link
2024-06-19 Towards a multimodal framework for remote sensing image change retrieval and captioning Roger Ferrod et.al. 2406.13424v1 link
2024-06-19 CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval Christian Lülf et.al. 2406.13322v1 link
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766v1 null
2024-06-22 Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment Jianan Jiang et.al. 2406.11551v2 link
2024-06-17 They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias Salma Abdel Magid et.al. 2406.11331v1 null
2024-06-17 Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion Guoyuan An et.al. 2406.11242v1 null
2024-06-14 Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval Genc Hoxha et.al. 2406.10107v1 null
2024-06-14 BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval Imanol Miranda et.al. 2406.09952v1 link
2024-06-13 Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases Meng Wang et.al. 2406.09317v1 link
2024-06-13 Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval Jaeseok Byun et.al. 2406.09188v1 null
2024-06-13 DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification Zhengrui Xu et.al. 2406.08773v1 link
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463v1 null
2024-06-12 ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery Kam Woh Ng et.al. 2406.08457v1 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502v1 link
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450v1 link
2024-06-16 Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval Adrià Molina et.al. 2406.07315v2 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374v1 link
2024-06-09 Unified Text-to-Image Generation and Retrieval Leigang Qu et.al. 2406.05814v1 null
2024-06-07 The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better Scott Geng et.al. 2406.05184v1 link
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746v1 link
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340v1 link
2024-06-06 Monocular Localization with Semantics Map for Autonomous Vehicles Jixiang Wan et.al. 2406.03835v1 null
2024-06-05 Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Saehyung Lee et.al. 2406.03411v1 link
2024-06-04 MeshVPR: Citywide Visual Place Recognition Using 3D Meshes Gabriele Berton et.al. 2406.02776v1 null
2024-06-04 Can CLIP help CLIP in learning 3D? Cristian Sbrolli et.al. 2406.02202v1 null
2024-06-03 Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian et.al. 2406.01583v1 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315v1 link
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885v1 link
2024-06-01 NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization Wugang Meng et.al. 2406.00312v1 null
2024-05-31 DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models Linli Yao et.al. 2405.20985v1 link
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333v1 null
2024-05-29 ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions Honglin Lin et.al. 2405.19226v1 null
2024-05-30 CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval Xintong Jiang et.al. 2405.19149v2 link
2024-05-29 SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation Zhenbei Wu et.al. 2405.18801v1 null
2024-05-29 Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs Jialiang Xu et.al. 2405.18740v1 link
2024-05-28 EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Issar Tzachor et.al. 2405.18065v1 null
2024-05-28 AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval Sihe Zhang et.al. 2405.17718v1 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599v1 null
2024-05-29 Composed Image Retrieval for Remote Sensing Bill Psomas et.al. 2405.15587v2 link
2024-05-24 Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval Yiming Wu et.al. 2405.15451v1 null
2024-05-20 UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization Wenjia Xu et.al. 2405.11936v1 link
2024-05-19 Register assisted aggregation for Visual Place Recognition Xuan Yu et.al. 2405.11526v1 null
2024-05-26 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793v2 null
2024-05-16 FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models Adrian Bulat et.al. 2405.10286v1 null
2024-05-15 Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study Farnaz Khun Jush et.al. 2405.09334v1 null
2024-05-14 BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment Lihong Jin et.al. 2405.09001v1 null
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434v1 null
2024-05-13 OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition Qiuchi Xiang et.al. 2405.07966v1 link
2024-05-14 HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval Chao He et.al. 2405.07524v2 link
2024-05-13 JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation Xubo Luo et.al. 2405.07429v1 link
2024-05-12 BoQ: A Place is Worth a Bag of Learnable Queries Amar Ali-bey et.al. 2405.07364v1 link
2024-05-07 Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction Nematollah Saeidi et.al. 2405.04211v1 null
2024-05-06 A New Robust Partial $p$-Wasserstein-Based Metric for Comparing Distributions Sharath Raghvendra et.al. 2405.03664v1 null
2024-05-06 Knowledge-aware Text-Image Retrieval for Remote Sensing Images Li Mi et.al. 2405.03373v1 null
2024-05-06 Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval Jiacheng Cheng et.al. 2405.03190v1 null
2024-05-05 iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval Lorenzo Agnolucci et.al. 2405.02951v1 link
2024-05-01 Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval Young Kyun Jang et.al. 2405.00571v1 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360v1 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174v1 null
2024-04-29 Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Hongyi Zhu et.al. 2404.18746v1 null
2024-04-29 Dual-Modal Prompting for Sketch-Based Image Retrieval Liying Gao et.al. 2404.18695v1 null
2024-05-01 Semantic Line Combination Detector Jinwon Ko et.al. 2404.18399v2 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498v1 null
2024-04-25 CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching Samia Shafique et.al. 2404.16972v1 link
2024-04-29 Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval Ryoya Nara et.al. 2404.16398v2 null
2024-04-24 Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval Haokun Wen et.al. 2404.15875v1 link
2024-04-24 DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines Xin Jiang et.al. 2404.15771v1 null
2024-04-23 Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval Young Kyun Jang et.al. 2404.15516v1 null
2024-04-22 EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models Mathias Thorsager et.al. 2404.14236v1 null
2024-04-22 Hierarchical localization with panoramic views and triplet loss functions Marcos Alfaro et.al. 2404.14117v1 link
2024-04-20 High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces Baoru Huang et.al. 2404.13437v1 null
2024-04-20 Collaborative Visual Place Recognition through Federated Learning Mattia Dutto et.al. 2404.13324v1 null
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339v1 null
2024-04-17 Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives Zhangchi Feng et.al. 2404.11317v1 link
2024-04-17 Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing Sanggeon Yun et.al. 2404.11025v1 null
2024-04-16 SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments Niklas Gard et.al. 2404.10527v1 link
2024-04-20 CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning Haojian Huang et.al. 2404.09640v3 link
2024-04-11 PRAM: Place Recognition Anywhere Model for Efficient Visual Localization Fei Xue et.al. 2404.07785v1 null
2024-04-16 2DLIW-SLAM: 2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure Bin Zhang et.al. 2404.07644v4 link
2024-04-11 Semantically-correlated memories in a dense associative model Thomas F Burns et.al. 2404.07123v2 link
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542v1 null
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277v1 link
2024-04-07 Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval Jinpeng Wang et.al. 2404.04998v1 link
2024-04-06 Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning Juncheng Yang et.al. 2404.04538v1 link
2024-04-05 Towards introspective loop closure in 4D radar SLAM Maximilian Hilger et.al. 2404.03940v1 null
2024-04-02 TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation Yehui Shen et.al. 2404.01587v1 link
2024-04-01 On Train-Test Class Overlap and Detection for Image Retrieval Chull Hwan Song et.al. 2404.01524v1 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400v1 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546v1 null
2024-03-31 NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation Diwei Sheng et.al. 2404.00504v1 null
2024-03-30 SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs Yang Miao et.al. 2404.00469v1 null
2024-03-30 Do Vision-Language Models Understand Compound Nouns? Sonal Kumar et.al. 2404.00419v1 link
2024-04-05 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964v3 null
2024-03-28 JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition Gabriele Berton et.al. 2403.19787v1 link
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651v1 link
2024-03-27 AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation Changkun Liu et.al. 2403.18281v1 null
2024-03-26 Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge Dongjin Kim et.al. 2403.17420v1 link
2024-03-25 Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras Gokul B. Nair et.al. 2403.16425v1 link
2024-03-24 Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval Yucheng Suo et.al. 2403.16005v1 link
2024-03-24 BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Yinda Chen et.al. 2403.15992v1 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378v1 link
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152v1 null
2024-03-22 Piecewise-Linear Manifolds for Deep Metric Learning Shubhang Bhatnagar et.al. 2403.14977v1 null
2024-03-21 Enhancing Historical Image Retrieval with Compositional Cues Tingyu Lin et.al. 2403.14287v1 link
2024-03-20 Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval Aymene Berriche et.al. 2403.13747v1 null
2024-03-20 Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval Haoyu Liu et.al. 2403.13317v1 null
2024-03-19 Learning Neural Volumetric Pose Features for Camera Localization Jingyu Lin et.al. 2403.12800v1 null
2024-03-19 Quantixar: High-performance Vector Data Management System Gulshan Yadav et.al. 2403.12583v1 null
2024-03-17 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization Peng Jiang et.al. 2403.11367v1 null
2024-03-17 MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paul S. Scotti et.al. 2403.11207v1 link
2024-03-16 Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval Shunsuke Tsubaki et.al. 2403.10756v1 null
2024-03-16 Vector search with small radiuses Gergely Szilvasy et.al. 2403.10746v1 null
2024-03-13 Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer Kenta Tsukahara et.al. 2403.10552v1 null
2024-03-20 Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression Huy-Hoang Bui et.al. 2403.10297v2 link
2024-03-15 Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline Fangming Yuan et.al. 2403.10283v1 null
2024-03-14 The NeRFect Match: Exploring NeRF Features for Visual Localization Qunjie Zhou et.al. 2403.09577v1 null
2024-03-14 VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition Benjamin Ramtoula et.al. 2403.09025v1 null
2024-03-13 PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models Siddharth Mishra-Sharma et.al. 2403.08851v1 link
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156v1 link
2024-03-12 It's All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234v1 link
2024-03-12 You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval Subhadeep Koley et.al. 2403.07222v1 null
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214v1 null
2024-03-11 How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? Subhadeep Koley et.al. 2403.07203v1 null
2024-03-11 EarthLoc: Astronaut Photography Localization by Indexing Earth from Space Gabriele Berton et.al. 2403.06758v1 link
2024-03-11 BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues Fudong Ge et.al. 2403.06600v1 link
2024-03-11 Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology Stefan Denner et.al. 2403.06567v1 link
2024-03-10 RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation Mathieu Labbé et.al. 2403.06341v1 null
2024-03-10 Texture image retrieval using a classification and contourlet-based features Asal Rouhafzay et.al. 2403.06048v1 null
2024-03-11 LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map Xinrui Wu et.al. 2403.05002v2 link
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765v2 null
2024-03-07 mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar Chengzhen Meng et.al. 2403.04703v1 null
2024-03-06 Self-supervised Photographic Image Layout Representation Learning Zhaoran Zhao et.al. 2403.03740v1 link
2024-03-04 Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models Benedikt Blumenstiel et.al. 2403.02059v1 link
2024-03-03 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval Yongchao Du et.al. 2403.01431v1 null
2024-03-01 Asymmetric Feature Fusion for Image Retrieval Hui Wu et.al. 2403.00671v1 null
2024-03-01 Structure Similarity Preservation Learning for Asymmetric Image Retrieval Hui Wu et.al. 2403.00648v1 link
2024-02-29 CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition Feng Lu et.al. 2402.19231v1 link
2024-02-28 Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport Bin Li et.al. 2402.18411v1 link
2024-02-28 Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning Hanyao Wang et.al. 2402.18400v1 null
2024-02-28 Representing 3D sparse map points and lines for camera relocalization Bach-Thuan Bui et.al. 2402.18011v1 link
2024-02-27 Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control Thong Nguyen et.al. 2402.17535v1 link
2024-02-29 Active propulsion noise shaping for multi-rotor aircraft localization Gabriele Serussi et.al. 2402.17289v2 link
2024-02-27 NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer Bingxi Liu et.al. 2402.17159v1 link
2024-02-25 Deep Homography Estimation for Visual Place Recognition Feng Lu et.al. 2402.16086v1 link
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961v1 link
2024-02-28 Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries Zijun Long et.al. 2402.15276v2 null
2024-02-23 Fine-tuning CLIP Text Encoders with Two-step Paraphrasing Hyunjae Kim et.al. 2402.15120v1 null
2024-02-22 Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition Feng Lu et.al. 2402.14505v1 link
2024-02-16 Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition Chenming Hu et.al. 2402.10476v1 null
2024-02-15 Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task Mirko Nava et.al. 2402.09886v1 link
2024-02-14 Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency Yannis Kalantidis et.al. 2402.09237v1 null
2024-02-13 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Xiangming Gu et.al. 2402.08567v1 link
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359v1 link
2024-02-10 Semantic Object-level Modeling for Robust Visual Camera Relocalization Yifan Zhu et.al. 2402.06951v1 null
2024-02-09 Large Language Models for Captioning and Retrieving Remote Sensing Images João Daniel Silva et.al. 2402.06475v1 null
2024-02-09 PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes Xinggang Hu et.al. 2402.06131v1 null
2024-02-21 MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction Heng Zhou et.al. 2402.03762v3 null
2024-02-04 Region-Based Representations Revisited Michal Shlapentokh-Rothman et.al. 2402.02352v1 link
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141v1 link
2024-02-01 BrainSLAM: SLAM on Neural Population Activity Data Kipp Freud et.al. 2402.00588v1 null
2024-02-01 Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering Tianxiao Gao et.al. 2402.00330v1 link
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083v1 link
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592v1 link
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459v1 null
2024-01-29 Cross-Modal Coordination Across a Diverse Set of Input Modalities Jorge Sánchez et.al. 2401.16347v1 null
2024-01-29 Regressing Transformers for Data-efficient Visual Place Recognition María Leyva-Vallina et.al. 2401.16304v1 null
2024-01-27 Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval Ayush Dubey et.al. 2401.15362v1 null
2024-01-24 Enhancing Image Retrieval: A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613v1 null
2024-01-23 PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion Shyam Sundar Kannan et.al. 2401.13082v1 null
2024-01-23 SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization Mingyang Li et.al. 2401.13076v1 link
2024-01-25 CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios Xiangshuo Qiao et.al. 2401.10475v2 link
2024-01-19 PhotoScout: Synthesis-Powered Multi-Modal Image Search Celeste Barnaby et.al. 2401.10464v1 null
2024-01-19 Cross-Modality Perturbation Synergy Attack for Person Re-identification Yunpeng Gong et.al. 2401.10090v2 null
2024-01-16 Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging Zahra Tabatabaei et.al. 2401.08272v1 null
2024-01-16 Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2401.08263v1 null
2024-01-15 Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing Jakob Hackstein et.al. 2401.07782v1 link
2024-01-14 HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval Zexuan Qiu et.al. 2401.07212v1 link
2024-01-11 UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization Rouwan Wu et.al. 2401.05971v1 link
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860v1 link
2024-01-05 Benchmarking PathCLIP for Pathology Image Analysis Sunyi Zheng et.al. 2401.02651v1 null
2024-01-03 DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding Mingrui Li et.al. 2401.01545v1 null
2024-01-02 BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving Dafeng Wei et.al. 2401.01065v1 null
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371v1 link
2023-12-29 Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering Long-Kun Du et.al. 2401.00032v1 null
2023-12-27 LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization Sai Shubodh Puligilla et.al. 2312.16648v1 null
2023-12-26 Recursive Distillation for Open-Set Distributed Robot Localization Kenta Tsukahara et.al. 2312.15897v1 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471v1 null
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242v1 null
2023-12-20 Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2312.12995v1 null
2023-12-19 VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Chun-Mei Feng et.al. 2312.12273v1 link
2023-12-18 Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback Boaz Lerner et.al. 2312.11078v1 link
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649v1 null
2023-12-17 DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition Sijie Wang et.al. 2312.10616v1 link
2023-12-16 Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval Decheng Liu et.al. 2312.10320v1 link
2023-12-15 Data-Efficient Multimodal Fusion on a Single GPU Noël Vouitsis et.al. 2312.10144v1 link
2023-12-13 Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques Hamed Qazanfari et.al. 2312.10089v1 null
2023-12-15 Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval Zhe Ma et.al. 2312.09716v1 link
2023-12-14 Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition Oliver Grainge et.al. 2312.09028v1 null
2023-12-14 Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking Shitong Sun et.al. 2312.08924v1 null
2023-12-13 C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation Florian Fervers et.al. 2312.08060v1 null
2023-12-12 Contextually Affinitive Neighborhood Refinery for Deep Clustering Chunlin Yu et.al. 2312.07806v1 link
2023-12-12 Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval Qiwei Tian et.al. 2312.07364v1 link
2023-12-12 Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection Jonathan J. Y. Kim et.al. 2312.06991v1 null
2023-12-11 Dynamic Weighted Combiner for Mixed-Modal Image Retrieval Fuxiang Huang et.al. 2312.06179v1 link
2023-12-06 Lite-Mind: Towards Efficient and Versatile Brain Representation Network Zixuan Gong et.al. 2312.03781v1 link
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428v2 link
2023-12-04 Implicit Learning of Scene Geometry from Poses for Global Localization Mohammad Altillawi et.al. 2312.02029v1 null
2023-12-04 Language-only Efficient Training of Zero-shot Composed Image Retrieval Geonmo Gu et.al. 2312.01998v1 link
2023-12-03 G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu et.al. 2312.01522v1 link
2023-12-01 Improve Supervised Representation Learning with Masked Image Modeling Kaifeng Chen et.al. 2312.00950v1 null
2023-12-05 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers Walid Bousselham et.al. 2312.00878v2 link
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500v1 null
2023-11-30 HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance Zhuohao Yin et.al. 2311.18273v1 link
2023-11-30 Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models Raviteja Vemulapalli et.al. 2311.18237v1 link
2023-11-29 Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce Chang Liu et.al. 2311.17954v1 null
2023-11-28 Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames Chao Chen et.al. 2311.17940v1 null
2023-11-29 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries Huajian Huang et.al. 2311.17389v1 link
2023-11-27 Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation Samuele Poppi et.al. 2311.16254v1 link
2023-11-27 Optimal Transport Aggregation for Visual Place Recognition Sergio Izquierdo et.al. 2311.15937v1 link
2023-11-27 AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval Shicheng Xu et.al. 2311.14084v2 link
2023-11-23 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology Asma Ben Abacha et.al. 2311.13752v1 link
2023-11-22 Medical Image Retrieval Using Pretrained Embeddings Farnaz Khun Jush et.al. 2311.13547v1 null
2023-11-22 Applications of Spiking Neural Networks in Visual Place Recognition Somayeh Hussaini et.al. 2311.13186v1 link
2023-11-21 Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval Xiu-Shen Wei et.al. 2311.12894v1 null
2023-11-21 Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs Zhentian Qian et.al. 2311.12245v1 null
2023-11-19 From Categories to Classifier: Name-Only Continual Learning by Exploring the Web Ameya Prabhu et.al. 2311.11293v1 null
2023-11-18 Lesion Search with Self-supervised Learning Kristin Qi et.al. 2311.11014v1 null
2023-11-15 Flow reconstruction and particle characterization from inertial Lagrangian tracks Ke Zhou et.al. 2311.09076v1 null
2023-11-15 Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval Junyang Chen et.al. 2311.07622v2 null
2023-11-13 VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search Shuting He et.al. 2311.07514v1 null
2023-11-10 Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval Xin Lu et.al. 2311.06067v1 null
2023-11-08 Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model Junya Shiraishi et.al. 2311.04788v1 null
2023-11-08 Training CLIP models on Data from Scientific Papers Calvin Metzger et.al. 2311.04711v1 link
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098v1 link
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345v1 null
2023-11-06 FocusTune: Tuning Visual Localization through Focus-Guided Sampling Son Tung Nguyen et.al. 2311.02872v1 link
2023-11-01 DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing Gaoshuang Huang et.al. 2311.00230v1 link
2023-10-29 Identifiable Contrastive Learning with Automatic Feature Importance Discovery Qi Zhang et.al. 2310.18904v1 link
2023-10-27 LipSim: A Provably Robust Perceptual Similarity Metric Sara Ghazanfari et.al. 2310.18274v1 link
2023-10-27 Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation Susu Fang et.al. 2310.17879v1 null
2023-10-25 FoundLoc: Vision-based Onboard Aerial Localization in the Wild Yao He et.al. 2310.16299v1 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504v1 null
2023-10-23 Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval Xu Yuan et.al. 2310.14637v1 link
2023-10-21 Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation Anastasia Kritharoula et.al. 2310.14025v1 link
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605v1 null
2023-10-20 CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants Shaoan Wang et.al. 2310.13320v1 link
2023-10-27 Representation Learning via Consistent Assignment of Views over Random Partitions Thalles Silva et.al. 2310.12692v2 link
2023-10-18 Evaluating the Fairness of Discriminative Foundation Models in Computer Vision Junaid Ali et.al. 2310.11867v1 null
2023-10-17 Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification Shuanglin Yan et.al. 2310.11210v1 null
2023-10-16 Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People Dharmateja Adapa et.al. 2310.10290v1 null
2023-10-16 EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge Tom Bryan et.al. 2310.10050v1 null
2023-10-15 CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes Yulei Qin et.al. 2310.09761v1 link
2023-10-13 Pairwise Similarity Learning is SimPLE Yandong Wen et.al. 2310.09449v1 link
2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval Shyamgopal Karthik et.al. 2310.09291v1 link
2023-10-12 Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning Shiyang Yan et.al. 2310.08390v1 null
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082v1 null
2023-10-10 Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization Le Chen et.al. 2310.06984v1 null
2023-10-10 Distillation Improves Visual Place Recognition for Low-Quality Queries Anbang Yang et.al. 2310.06906v1 link
2023-10-10 Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets Jiajun Zhang et.al. 2310.06566v1 null
2023-10-10 Topological RANSAC for instance verification and retrieval without fine-tuning Guoyuan An et.al. 2310.06486v1 null
2023-10-10 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments Ghanta Sai Krishna et.al. 2310.06385v1 null
2023-10-09 Collaborative Visual Place Recognition Yiming Li et.al. 2310.05541v1 null
2023-10-09 Sentence-level Prompts Benefit Composed Image Retrieval Yang Bai et.al. 2310.05473v1 link
2023-10-08 AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition Feng Lu et.al. 2310.05184v1 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134v1 null
2023-10-12 ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer Yifan Xu et.al. 2310.04099v2 null
2023-10-06 Sub-token ViT Embedding via Stochastic Resonance Transformers Dong Lao et.al. 2310.03967v1 link
2023-10-04 Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach Matthew Hanlon et.al. 2310.02650v1 null
2023-10-02 NEUCORE: Neural Concept Reasoning for Composed Image Retrieval Shu Zhao et.al. 2310.01358v1 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092v1 null
2023-10-05 PlaceNav: Topological Navigation through Place Recognition Lauri Suomela et.al. 2309.17260v3 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992v1 link
2023-09-28 Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning Albert Mohwald et.al. 2309.16351v1 link
2023-09-28 FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding Pengxiang Wu et.al. 2309.16249v1 link
2023-09-28 Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2309.16137v1 link
2023-09-27 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization Vicente Vivanco Cepeda et.al. 2309.16020v1 link
2023-09-27 Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization Zhenbo Song et.al. 2309.15556v1 null
2023-09-26 Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features Hila Levi et.al. 2309.14999v1 null
2023-09-23 Resolving References in Visually-Grounded Dialogue via Text Generation Bram Willemsen et.al. 2309.13430v1 link
2023-09-21 Face Identity-Aware Disentanglement in StyleGAN Adrian Suwała et.al. 2309.12033v1 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883v1 link
2023-09-20 2D-3D Pose Tracking with Multi-View Constraints Huai Yu et.al. 2309.11335v1 null
2023-09-19 VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition Adam D. Hines et.al. 2309.10225v1 link
2023-09-18 DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach Chenghao Xu et.al. 2309.09879v1 null
2023-09-18 Decompose Semantic Shifts for Composed Image Retrieval Xingyu Yang et.al. 2309.09531v1 null
2023-09-16 Efficient Object Rearrangement via Multi-view Fusion Dehao Huang et.al. 2309.08994v1 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927v1 link
2023-09-16 Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning Pengyu Yin et.al. 2309.08914v1 link
2023-09-15 Active Learning for Fine-Grained Sketch-Based Image Retrieval Himanshu Thakur et.al. 2309.08743v1 null
2023-09-15 Optimization of Rank Losses for Image Retrieval Elias Ramzi et.al. 2309.08250v1 link
2023-09-18 Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer Yaoting Wang et.al. 2309.07929v2 link
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471v1 link
2023-09-13 RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline Mirko Usuelli et.al. 2309.07094v1 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438v1 link
2023-09-08 Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning Hiroki Nakamura et.al. 2309.04148v1 null
2023-09-05 Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection Natalia Pavlasek et.al. 2309.02394v1 null
2023-09-05 Dual Relation Alignment for Composed Image Retrieval Xintong Jiang et.al. 2309.02169v1 null
2023-09-04 NLLB-CLIP -- train performant multilingual image retrieval model on a budget Alexander Visheratin et.al. 2309.01859v1 null
2023-09-04 Target-Guided Composed Image Retrieval Haokun Wen et.al. 2309.01366v1 null
2023-09-02 Deep supervised hashing for fast retrieval of radio image cubes Steven Ndung'u et.al. 2309.00932v1 null
2023-08-31 Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval Prateksha Udhayanan et.al. 2308.16649v1 null
2023-08-28 Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics Nils Böhne et.al. 2308.14786v1 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746v1 link
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039v1 null
2023-08-26 Learning Efficient Representations for Image-Based Patent Retrieval Hongsong Wang et.al. 2308.13749v1 null
2023-08-25 Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers Mohammad Javad Rajabi et.al. 2308.13671v1 null
2023-08-24 Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities Jinze Bai et.al. 2308.12966v1 link
2023-08-23 Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval Huafeng Li et.al. 2308.11994v1 null
2023-08-23 OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes Tao Xie et.al. 2308.11928v1 link
2023-08-22 Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features Alberto Baldrati et.al. 2308.11485v1 link
2023-08-22 GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training Xinchi Deng et.al. 2308.11331v1 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223v1 null
2023-08-21 EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition Gabriele Berton et.al. 2308.10832v1 link
2023-08-20 FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory Anwesan Pal et.al. 2308.10170v1 null
2023-08-18 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion Yanmei Jiao et.al. 2308.09566v1 null
2023-08-17 FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Yulin Su et.al. 2308.09012v1 link
2023-08-16 Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval Aishwarya Venkataramanan et.al. 2308.08431v1 link
2023-08-16 Ranking-aware Uncertainty for Text-guided Image Retrieval Junyang Chen et.al. 2308.08131v1 null
2023-08-19 Global Features are All You Need for Image Retrieval and Reranking Shihao Shao et.al. 2308.06954v2 link
2023-08-14 MixBCT: Towards Self-Adapting Backward-Compatible Training Yu Liang et.al. 2308.06948v1 link
2023-08-10 KS-APR: Keyframe Selection for Robust Absolute Pose Regression Changkun Liu et.al. 2308.05459v1 null
2023-08-09 AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities Jingdan Zhang et.al. 2308.04992v1 link
2023-08-08 Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval Yi Bin et.al. 2308.04343v1 link
2023-08-08 Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval Yunquan Zhu et.al. 2308.04008v1 link
2023-08-05 A Comprehensive Analysis of Real-World Image Captioning and Scene Identification Sai Suprabhanu Nallapaneni et.al. 2308.02833v1 null
2023-08-03 Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies Eunsuk Seo et.al. 2308.01871v1 null
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688v1 link
2023-07-31 Guiding Image Captioning Models Toward More Specific Captions Simon Kornblith et.al. 2307.16686v1 null
2023-07-31 Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks Kousik Rajesh et.al. 2307.16395v1 null
2023-07-28 D2S: Representing local descriptors and global scene coordinates for camera relocalization Bach-Thuan Bui et.al. 2307.15250v1 link
2023-07-26 Neural-based Cross-modal Search and Retrieval of Artwork Yan Gong et.al. 2307.14244v1 null
2023-07-26 Boon: A Neural Search Engine for Cross-Modal Information Retrieval Yan Gong et.al. 2307.14240v1 null
2023-07-25 Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network Chull Hwan Song et.al. 2307.13254v1 null
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702v2 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981v1 null
2023-07-19 Quantum Optics based Algorithm for Measuring the Similarity between Images Vivek Mehta et.al. 2307.09789v1 null
2023-07-18 Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments Max Moebius et.al. 2307.09172v1 null
2023-07-18 3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving Qipeng Li et.al. 2307.09044v1 null
2023-07-19 Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation Rundong Luo et.al. 2307.08779v2 null
2023-07-17 Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition Gabriele Trivigno et.al. 2307.08417v1 link
2023-07-17 Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification Tengfei Liang et.al. 2307.08316v1 link
2023-07-17 NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM Lizhou Liao et.al. 2307.08221v1 link
2023-07-20 Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer Yujiao Shi et.al. 2307.08015v3 link
2023-07-10 Phoneme-retrieval; voice recognition; vowels recognition Brunello Tirozzi et.al. 2307.07407v1 null
2023-07-14 Risk Controlled Image Retrieval Kaiwen Cai et.al. 2307.07336v1 link
2023-07-11 ResMatch: Residual Attention Learning for Local Feature Matching Yuxin Deng et.al. 2307.05180v1 link
2023-07-11 Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification Yi Liao et.al. 2307.05017v1 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520v1 null
2023-07-10 RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold Hyesu Jang et.al. 2307.04321v1 link
2023-07-08 Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning Qin Zhang et.al. 2307.04047v1 null
2023-07-04 Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition Helen Carson et.al. 2307.01464v1 null
2023-07-04 Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network Zizhuo Li et.al. 2307.01447v1 null
2023-07-03 Cross-modal Place Recognition in Image Databases using Event-based Sensors Xiang Ji et.al. 2307.01047v1 null
2023-06-30 DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions Stephen Hausler et.al. 2306.17536v1 null
2023-06-30 Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization Stephen Hausler et.al. 2306.17529v1 null
2023-06-27 Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research Tanjida Kabir et.al. 2306.15651v1 null
2023-06-27 Mean Field Theory in Deep Metric Learning Takuya Furusawa et.al. 2306.15368v1 null
2023-06-26 Hierarchical Matching and Reasoning for Multi-Query Image Retrieval Zhong Ji et.al. 2306.14460v1 link
2023-06-25 Enhancing Dynamic Image Advertising with Vision-Language Pre-training Zhoufutu Wen et.al. 2306.14112v1 null
2023-06-23 Catching Image Retrieval Generalization Maksim Zhdanov et.al. 2306.13357v1 null
2023-06-22 Deep Metric Learning with Soft Orthogonal Proxies Farshad Saberi-Movahed et.al. 2306.13055v1 null
2023-06-22 What to Learn: Features, Image Transformations, or Both? Yuxuan Chen et.al. 2306.13040v1 null
2023-06-22 Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval Katrin Glinka et.al. 2306.12843v1 null
2023-06-26 Annotation Cost Efficient Active Learning for Content Based Image Retrieval Julia Henkel et.al. 2306.11605v2 null
2023-06-19 Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning Shivaen Ramshetty et.al. 2306.11065v1 link
2023-06-18 LiDAR-Based Place Recognition For Autonomous Driving: A Survey Pengcheng Shi et.al. 2306.10561v1 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012v1 link
2023-06-15 Prompt Performance Prediction for Generative IR Nicolas Bizzozzero et.al. 2306.08915v1 null
2023-06-15 Graph Convolution Based Efficient Re-Ranking for Visual Retrieval Yuqi Zhang et.al. 2306.08792v1 link
2023-06-13 GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze et.al. 2306.07969v1 null
2023-06-13 MOFI: Learning Image Representations from Noisy Entity Annotated Images Wentao Wu et.al. 2306.07952v1 link
2023-06-12 Zero-shot Composed Text-Image Retrieval Yikun Liu et.al. 2306.07272v1 link
2023-06-12 Sticker820K: Empowering Interactive Retrieval with Stickers Sijie Zhao et.al. 2306.06870v1 null
2023-06-11 Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models Yuguang Yang et.al. 2306.06691v1 null
2023-06-03 Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval Xu Zhang et.al. 2306.02092v1 null
2023-06-03 Class Anchor Margin Loss for Content-Based Image Retrieval Alexandru Ghita et.al. 2306.00630v2 null
2023-05-31 Chatting Makes Perfect -- Chat-based Image Retrieval Matan Levy et.al. 2305.20062v1 link
2023-05-31 Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization Junan Chen et.al. 2305.20044v1 null
2023-05-30 A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation Omar Seddati et.al. 2305.18988v1 null
2023-05-29 Synfeal: A Data-Driven Simulator for End-to-End Camera Localization Daniel Coelho et.al. 2305.18260v1 link
2023-05-29 Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films Shrinkhala Sharma et.al. 2305.18197v1 null
2023-05-29 TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition Tiago Barros et.al. 2305.18013v1 null
2023-05-28 ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval Jiapeng Wang et.al. 2305.17652v1 null
2023-06-01 FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing Zhuang Li et.al. 2305.17497v2 link
2023-05-27 Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation Yueh-Cheng Huang et.al. 2305.17463v1 null
2023-05-26 Generating Images with Multimodal Language Models Jing Yu Koh et.al. 2305.17216v1 link
2023-05-25 Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder Zheyuan Liu et.al. 2305.16304v1 link
2023-05-23 Leveraging BEV Representation for 360-degree Visual Place Recognition Xuecheng Xu et.al. 2305.13814v1 link
2023-05-23 EDIS: Entity-Driven Image Search over Multimodal Web Content Siqi Liu et.al. 2305.13631v1 link
2023-05-20 DAC: Detector-Agnostic Spatial Covariances for Deep Local Features Javier Tirado-Garín et.al. 2305.12250v1 link
2023-05-19 Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach Zahra Tabatabaei et.al. 2305.11728v1 null
2023-05-19 Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition Fenglin Zhang et.al. 2305.11467v1 link
2023-05-12 IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images Varuna Krishna et.al. 2305.10438v1 null
2023-05-17 Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval Haokun Wen et.al. 2305.09979v1 null
2023-05-13 Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance Xinyu Lin et.al. 2305.07943v1 link
2023-05-11 Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems Nathan Hughes et.al. 2305.07154v1 link
2023-05-09 Visual Place Recognition with Low-Resolution Images Mihnea-Alexandru Tomita et.al. 2305.05776v1 null
2023-05-09 Vision-Language Models in Remote Sensing: Current Progress and Future Trends Congcong Wen et.al. 2305.05726v1 null
2023-05-09 An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition Maria Waheed et.al. 2305.05705v1 null
2023-05-09 Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query Ho Hin Lee et.al. 2305.05598v1 null
2023-05-09 ColonMapper: topological mapping and localization for colonoscopy Javier Morlana et.al. 2305.05546v1 null
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301v1 link
2023-05-09 Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2305.05256v1 null
2023-05-09 Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval Shiyin Dong et.al. 2305.05144v1 null
2023-05-08 Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size Andrei Potapov et.al. 2305.04856v1 null
2023-05-08 Privacy-Preserving Representations are not Enough -- Recovering Scene Content from Camera Poses Kunal Chelani et.al. 2305.04603v1 link
2023-05-06 Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer Minyi Zhao et.al. 2305.04072v1 null
2023-05-06 Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing Swagatika Dash et.al. 2305.03881v1 link
2023-05-05 COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Arijit Ray et.al. 2305.03689v1 link
2023-05-05 HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer Shuzhe Wang et.al. 2305.03595v1 null
2023-05-05 WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval Zahra Tabatabaei et.al. 2305.03383v1 null
2023-05-04 Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval Tan Pan et.al. 2305.02610v1 link
2023-05-03 Learning-based Relational Object Matching Across Views Cathrin Elich et.al. 2305.02398v1 null
2023-05-05 A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text Yunxin Li et.al. 2305.02265v2 link
2023-05-03 AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation Shentong Mo et.al. 2305.01836v1 null
2023-04-30 Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection Jie Ren et.al. 2305.00435v1 null
2023-04-28 SFD2: Semantic-guided Feature Detection and Description Fei Xue et.al. 2304.14845v1 link
2023-04-28 Quantum enhanced non-interferometric quantitative phase imaging Giuseppe Ortolano et.al. 2304.14727v1 null
2023-04-26 Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams Yun Chang et.al. 2304.13487v1 null
2023-04-27 STIR: Siamese Transformer for Image Retrieval Postprocessing Aleksei Shabanov et.al. 2304.13393v2 null
2023-04-25 DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design Jiahao Weng et.al. 2304.12506v1 null
2023-04-24 Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning Lucas Pascotti Valem et.al. 2304.12448v1 link
2023-04-23 IDLL: Inverse Depth Line based Visual Localization in Challenging Environments Wanting Li et.al. 2304.11748v1 null
2023-04-23 Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval Mehdi Rafiei et.al. 2304.11734v1 null
2023-04-17 Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference Haotian Wu et.al. 2304.08221v1 null
2023-04-17 NeRF-Loc: Visual Localization with Conditional Neural Radiance Field Jianlin Liu et.al. 2304.07979v1 link
2023-04-16 Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification Luca Piano et.al. 2304.07883v1 null
2023-04-16 Language Guided Local Infiltration for Interactive Image Retrieval Fuxiang Huang et.al. 2304.07747v1 null
2023-04-16 Long-term Visual Localization with Mobile Sensors Shen Yan et.al. 2304.07691v1 null
2023-04-16 Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging Jielin Qiu et.al. 2304.07675v1 null
2023-04-14 CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression Mubariz Zaffar et.al. 2304.07426v1 null
2023-04-14 FM-Loc: Using Foundation Models for Improved Vision-based Localization Reihaneh Mirjalili et.al. 2304.07058v1 null
2023-04-17 Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning Seyed Mahdi Roostaiyan et.al. 2304.06907v2 link
2023-04-17 You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset Matteo Toso et.al. 2304.06373v3 link
2023-04-12 Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation Yifeng Shi et.al. 2304.06051v1 link
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947v1 link
2023-04-12 Are Local Features All You Need for Cross-Domain Visual Place Recognition? Giovanni Barbarani et.al. 2304.05887v1 link
2023-04-12 Unicom: Universal and Compact Representation Learning for Image Retrieval Xiang An et.al. 2304.05884v1 link
2023-04-12 SGL: Structure Guidance Learning for Camera Localization Xudong Zhang et.al. 2304.05571v1 null
2023-04-14 Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency Xingwu Ji et.al. 2304.05146v2 link
2023-04-10 CAVL: Learning Contrastive and Adaptive Representations of Vision and Language Shentong Mo et.al. 2304.04399v1 null
2023-04-09 Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval Yanru Xiao et.al. 2304.04228v1 null
2023-04-08 SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes Baosheng Zhang et.al. 2304.03872v1 null
2023-04-06 $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition Sijie Zhu et.al. 2304.03410v1 null
2023-04-06 Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements Viktor Walter et.al. 2304.03057v1 link
2023-04-05 Efficient OCR for Building a Diverse Digital History Jacob Carlson et.al. 2304.02737v1 link
2023-04-05 LogoNet: a fine-grained network for instance-level logo sketch retrieval Binbin Feng et.al. 2304.02214v1 link
2023-04-04 OrienterNet: Visual Localization in 2D Public Maps with Neural Matching Paul-Edouard Sarlin et.al. 2304.02009v1 link
2023-04-04 Cross-Domain Image Captioning with Discriminative Finetuning Roberto Dessì et.al. 2304.01662v1 link
2023-04-02 Learning Similarity between Scene Graphs and Images with Transformers Yuren Cong et.al. 2304.00590v1 link
2023-04-01 NPR: Nocturnal Place Recognition in Street Bingxi Liu et.al. 2304.00276v1 null
2023-03-31 Unsupervised crack detection on complex stone masonry surfaces Panagiotis Agrafiotis et.al. 2303.17989v1 null
2023-03-30 If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval Finlay G. C. Hudson et.al. 2303.17703v1 null
2023-03-30 Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime Rhydian Windsor et.al. 2303.17644v1 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504v1 link
2023-03-30 Methods and advancement of content-based fashion image retrieval: A Review Amin Muhammad Shoib et.al. 2303.17371v1 null
2023-03-30 Adaptive Cross Batch Normalization for Metric Learning Thalaiyasingam Ajanthan et.al. 2303.17127v1 null
2023-03-30 MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Weicheng Kuo et.al. 2303.16839v2 null
2023-03-29 Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval Leo Sampaio Ferraz Ribeiro et.al. 2303.16769v1 null
2023-03-29 Bi-directional Training for Composed Image Retrieval via Text Prompt Learning Zheyuan Liu et.al. 2303.16604v1 link
2023-03-27 Model Cascades for Efficient Image Search Robert Hönig et.al. 2303.15595v1 null
2023-03-27 Zero-Shot Composed Image Retrieval with Textual Inversion Alberto Baldrati et.al. 2303.15247v1 link
2023-03-27 What Can Human Sketches Do for Object Detection? Pinaki Nath Chowdhury et.al. 2303.15149v1 null
2023-03-25 Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style Fengyin Lin et.al. 2303.14348v1 link
2023-03-24 A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2303.14247v1 null
2023-03-24 PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View Ze Shi et.al. 2303.14095v1 link
2023-03-24 Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR Aneeshan Sain et.al. 2303.13779v1 null
2023-03-28 CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not Aneeshan Sain et.al. 2303.13440v3 null
2023-03-22 Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval Xunguang Wang et.al. 2303.12658v1 null
2023-03-21 CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion Geonmo Gu et.al. 2303.11916v1 link
2023-03-21 LIMITR: Leveraging Local Information for Medical Image-Text Representation Gefen Dawidowicz et.al. 2303.11755v1 null
2023-03-25 Data-efficient Large Scale Place Recognition with Graded Similarity Supervision Maria Leyva-Vallina et.al. 2303.11739v2 link
2023-03-20 Picture that Sketch: Photorealistic Image Generation from Abstract Sketches Subhadeep Koley et.al. 2303.11162v1 null
2023-03-19 Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths Ming Xu et.al. 2303.10778v1 link
2023-03-17 MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities Boqi Chen et.al. 2303.10249v1 null
2023-03-17 IRGen: Generative Modeling for Image Retrieval Yidan Zhang et.al. 2303.10126v1 link
2023-03-16 Data Roaming and Early Fusion for Composed Image Retrieval Matan Levy et.al. 2303.09429v1 link
2023-03-16 Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval Yi Xie et.al. 2303.09230v1 null
2023-03-16 Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space Yuhang He et.al. 2303.09192v1 null
2023-03-16 Unsupervised Facial Expression Representation Learning with Contrastive Local Warping Fanglei Xue et.al. 2303.09034v1 null
2023-03-15 A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval Saeideh Yousefzadeh et.al. 2303.08398v1 null
2023-03-14 Data-Free Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2303.07775v1 link
2023-03-14 PATS: Patch Area Transportation with Subdivision for Local Feature Matching Junjie Ni et.al. 2303.07700v1 null
2023-03-10 Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors Kento Kawaharazuka et.al. 2303.05674v1 null
2023-03-09 Dominating Set Database Selection for Visual Place Recognition Anastasiia Kornilova et.al. 2303.05123v1 null
2023-03-07 Graph Neural Networks in Vision-Language Image Understanding: A Survey Henry Senior et.al. 2303.03761v1 null
2023-03-07 Sketch-based Medical Image Retrieval Kazuma Kobayashi et.al. 2303.03633v1 link
2023-03-06 Visual Place Recognition: A Tutorial Stefan Schubert et.al. 2303.03281v1 link
2023-03-06 MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval Rohit Agarwal et.al. 2303.03050v1 link
2023-03-06 Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints Chenjie Cao et.al. 2303.02885v1 link
2023-03-05 Composing Mood Board with User Feedback in Concept Space Shin Sano et.al. 2303.02547v1 null
2023-03-04 FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Xiao Han et.al. 2303.02483v1 link
2023-03-09 Self-Supervised Learning for Place Representation Generalization across Appearance Changes Mohamed Adel Musallam et.al. 2303.02370v2 null
2023-03-03 MixVPR: Feature Mixing for Visual Place Recognition Amar Ali-bey et.al. 2303.02190v1 link
2023-03-01 A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition Maria Waheed et.al. 2303.00714v1 null
2023-03-01 ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards T. Barros et.al. 2303.00477v1 link
2023-03-03 Renderable Neural Radiance Map for Visual Navigation Obin Kwon et.al. 2303.00304v2 null
2023-03-01 Region Prediction for Efficient Robot Localization on Large Maps Matteo Scucchia et.al. 2303.00295v1 link
2023-02-28 OEKG: The Open Event Knowledge Graph Simon Gottschalk et.al. 2302.14688v1 null
2023-02-28 Global Proxy-based Hard Mining for Visual Place Recognition Amar Ali-bey et.al. 2302.14217v1 link
2023-02-27 Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation Yue Xiang et.al. 2302.13929v1 link
2023-02-26 Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images Mihnea-Alexandru Tomita et.al. 2302.13314v1 null
2023-02-26 Learning cross space mapping via DNN using large scale click-through logs Wei Yu et.al. 2302.13275v1 null
2023-02-25 DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification Lemuel Puglisi et.al. 2302.13057v1 null
2023-02-23 Teaching CLIP to Count to Ten Roni Paiss et.al. 2302.12066v1 null
2023-02-22 Steerable Equivariant Representation Learning Sangnie Bhardwaj et.al. 2302.11349v1 null
2023-02-21 iQPP: A Benchmark for Image Query Performance Prediction Eduard Poesina et.al. 2302.10126v2 link
2023-02-20 Ontology-aware Network for Zero-shot Sketch-based Image Retrieval Haoxiang Zhang et.al. 2302.10040v1 null
2023-02-20 TBPos: Dataset for Large-Scale Precision Visual Localization Masud Fahim et.al. 2302.09825v1 link
2023-02-17 Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts Zhihong Chen et.al. 2302.08958v1 link
2023-02-22 Fashion Image Retrieval with Multi-Granular Alignment Jinkuan Zhu et.al. 2302.08902v2 null
2023-02-15 Unsupervised Hashing via Similarity Distribution Calibration Kam Woh Ng et.al. 2302.07669v1 link
2023-02-13 Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior Shen Yan et.al. 2302.06287v1 link
2023-02-13 Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation Binqian Jiang et.al. 2302.06149v1 link
2023-02-13 Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval Xu Wang et.al. 2302.06081v1 link
2023-02-11 Sketch Less Face Image Retrieval: A New Challenge Dawei Dai et.al. 2302.05576v1 link
2023-02-10 Is multi-modal vision supervision beneficial to language? Avinash Madasu et.al. 2302.05016v1 link
2023-02-06 Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval Kuniaki Saito et.al. 2302.03084v1 link
2023-02-06 Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs Michael Kirchhof et.al. 2302.02865v1 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572v1 link
2023-02-04 Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval Frederik Warburg et.al. 2302.01332v2 link
2023-01-31 Grounding Language Models to Images for Multimodal Generation Jing Yu Koh et.al. 2301.13823v1 link
2023-01-31 UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Dachuan Shi et.al. 2301.13741v1 link
2023-01-23 Lexi: Self-Supervised Learning of the UI Language Pratyay Banerjee et.al. 2301.10165v1 link
2023-01-17 Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval Yuchen Wu et.al. 2301.06685v1 null
2023-01-19 High-bandwidth Close-Range Information Transport through Light Pipes Joowon Lim et.al. 2301.06496v2 null
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604v1 null
2023-01-12 GH-Feat: Learning Versatile Generative Hierarchical Features from GANs Yinghao Xu et.al. 2301.05315v1 null
2023-01-10 Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images Xindi Wu et.al. 2301.04224v1 null
2023-01-10 Collaborative Semantic Communication at the Edge Wing Fei Lo et.al. 2301.03996v1 null
2023-01-10 Online Backfilling with No Regret for Large-Scale Image Retrieval Seonguk Seo et.al. 2301.03767v1 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403v1 null
2023-01-05 A Probabilistic Framework for Visual Localization in Ambiguous Scenes Fereidoon Zangeneh et.al. 2301.02086v1 link
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147v1 null
2022-12-30 HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images Dmitry Yudin et.al. 2212.14649v1 link
2022-12-27 Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning Wooyoung Kang et.al. 2212.13563v1 link
2022-12-23 SuperGF: Unifying Local and Global Features for Visual Localization Wenzheng Song et.al. 2212.13105v1 null
2022-12-24 GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration Parker C. Lusk et.al. 2212.12745v1 null
2022-12-19 From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration Zekun Qian et.al. 2212.09298v1 link
2022-12-14 The Infinite Index: Information Retrieval on Generative Text-To-Image Models Niklas Deckers et.al. 2212.07476v1 null
2022-12-14 Shared Coupling-bridge for Weakly Supervised Local Feature Learning Jiayuan Sun et.al. 2212.07047v1 link
2022-12-08 Group Generalized Mean Pooling for Vision Transformer Byungsoo Ko et.al. 2212.04114v1 null
2022-12-12 Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models Gowthami Somepalli et.al. 2212.03860v3 null
2022-12-07 LSVL: Large-scale season-invariant visual localization for UAVs Jouko Kinnari et.al. 2212.03581v1 null
2022-12-06 ADIR: Adaptive Diffusion for Image Reconstruction Shady Abu-Hussein et.al. 2212.03221v1 null
2022-12-08 Privacy-Preserving Visual Localization with Event Cameras Junho Kim et.al. 2212.03177v2 link
2022-12-06 Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach Wenjun Xu et.al. 2212.03037v1 null
2022-12-06 Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds Zhipeng Zhao et.al. 2212.02757v1 null
2022-12-04 Fast and Lightweight Scene Regressor for Camera Relocalization Thuan B. Bui et.al. 2212.01830v1 link
2022-12-02 Information Retrieval from the Digitized Books Riya Gupta et.al. 2212.00999v1 null
2022-12-09 StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen et.al. 2212.00937v2 null
2022-11-30 Self-Supervised Feature Learning for Long-Term Metric Visual Localization Yuxuan Chen et.al. 2212.00122v1 null
2022-11-30 SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation Tianyu Zhang et.al. 2211.16697v1 link
2022-11-28 SLAN: Self-Locator Aided Network for Cross-Modal Understanding Jiang-Tian Zhai et.al. 2211.16208v1 null
2022-11-29 RankDNN: Learning to Rank for Few-shot Learning Qianyu Guo et.al. 2211.15320v2 link
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127v1 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069v1 link
2022-11-27 BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images Zhihuang Zhang et.al. 2211.14927v1 null
2022-11-27 A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition Rui Huang et.al. 2211.14864v1 null
2022-11-26 Visual Place Recognition Bailu Guo et.al. 2211.14533v1 null
2022-11-26 Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval Fan Yang et.al. 2211.14515v1 link
2022-11-30 Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark Floriana Ciaglia et.al. 2211.13523v3 link
2022-11-23 InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images Konstantin Kobs et.al. 2211.12760v1 link
2022-11-29 Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments Joshua Knights et.al. 2211.12732v2 link
2022-11-23 FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events Kuanxu Hou et.al. 2211.12244v2 null
2022-11-22 Multimorbidity Content-Based Medical Image Retrieval Using Proxies Yunyan Xing et.al. 2211.12185v1 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988v1 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704v1 null
2022-11-21 LISA: Localized Image Stylization with Audio via Implicit Neural Representation Seung Hyun Lee et.al. 2211.11381v1 null
2022-11-21 NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization Shitao Tang et.al. 2211.11177v1 link
2022-11-16 Improving Feature-based Visual Localization by Geometry-Aided Matching Hailin Yu et.al. 2211.08712v1 link
2022-11-15 LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process Mikhail Kurenkov et.al. 2211.08480v1 null
2022-11-14 Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair Lin-Ding Yuan et.al. 2211.07803v1 null
2022-11-14 Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition Farid Alijani et.al. 2211.07696v1 null
2022-11-14 Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization Yiyang Chen et.al. 2211.07394v1 link
2022-11-14 Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment Junyang Wang et.al. 2211.07275v1 null
2022-11-14 ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations Chanda Grover et.al. 2211.07122v1 null
2022-11-14 Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval Deunsol Jung et.al. 2211.07116v1 null
2022-11-12 Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning Ryotaro Shimizu et.al. 2211.06688v1 null
2022-11-09 Visual Named Entity Linking: A New Dataset and A Baseline Wenxiang Sun et.al. 2211.04872v1 link
2022-11-07 Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System Julian Gamboa et.al. 2211.03881v1 null
2022-11-06 A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography Yueh-Cheng Huang et.al. 2211.03007v1 null
2022-11-02 Optimizing Fiducial Marker Placement for Improved Visual Localization Qiangqiang Huang et.al. 2211.01513v1 link
2022-11-02 A comparison of uncertainty estimation approaches for DNN-based camera localization Matteo Vaghi et.al. 2211.01234v1 null
2022-11-02 M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval Layne Berry et.al. 2211.01180v1 null
2022-11-11 Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality Anuj Diwan et.al. 2211.00768v3 link
2022-11-07 Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding Ryotaro Shimizu et.al. 2210.17417v2 null
2022-10-27 Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis Miriam Anschütz et.al. 2210.15377v1 link
2022-10-27 Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings Daniel Kvak et.al. 2210.15300v1 null
2022-10-27 Towards Practicality of Sketch-Based Visual Understanding Ayan Kumar Bhunia et.al. 2210.15146v1 null
2022-10-27 MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval Chen Bao et.al. 2210.15128v1 null
2022-10-26 FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning Suvir Mirchandani et.al. 2210.15028v1 null
2022-10-26 FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization Junyang Wang et.al. 2210.14562v1 null
2022-11-02 A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets Lukas Bernreiter et.al. 2210.13856v2 null
2022-10-27 Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Tzu-Jui Julius Wang et.al. 2210.13591v2 null
2022-10-24 Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval Zhaopeng Dou et.al. 2210.13440v1 link
2022-10-23 Neural Eigenfunctions Are Structured Representation Learners Zhijie Deng et.al. 2210.12637v1 link
2022-10-21 Boosting vision transformers for image retrieval Chull Hwan Song et.al. 2210.11909v1 link
2022-10-20 Communication breakdown: On the low mutual intelligibility between human and neural captioning Roberto Dessì et.al. 2210.11512v1 link
2022-10-19 Image Semantic Relation Generation Mingzhe Du et.al. 2210.11253v1 null
2022-10-20 General Image Descriptors for Open World Image Retrieval using ViT CLIP Marcos V. Conde et.al. 2210.11141v1 link
2022-10-20 DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition Sha Lu et.al. 2210.11029v1 null
2022-10-19 Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2210.10486v1 link
2022-10-19 GSV-Cities: Toward Appropriate Supervised Visual Place Recognition Amar Ali-bey et.al. 2210.10239v1 link
2022-10-18 A Real-Time Fusion Framework for Long-term Visual Localization Yuchen Yang et.al. 2210.09757v1 null
2022-10-17 Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval Yousef Alqasrawi et.al. 2210.08875v1 null
2022-10-17 SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation Woo Suk Choi et.al. 2210.08675v1 null
2022-10-16 Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers Tao Tang et.al. 2210.08458v1 link
2022-10-14 Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding Xuetong Xue et.al. 2210.07572v1 link
2022-10-14 Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique Connor Malone et.al. 2210.07509v1 null
2022-10-11 Large-to-small Image Resolution Asymmetry in Deep Metric Learning Pavel Suma et.al. 2210.05463v1 link
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236v1 null
2022-10-05 Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features Deepak Gupta et.al. 2210.02401v1 link
2022-10-05 Granularity-aware Adaptation for Image Retrieval over Multiple Tasks Jon Almazán et.al. 2210.02254v1 null
2022-10-05 Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective Zijian Zhang et.al. 2210.02206v1 link
2022-10-04 Supervised Metric Learning for Retrieval via Contextual Similarity Optimization Christopher Liao et.al. 2210.01908v1 link
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320v1 null
2022-10-03 Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments Bruno Arcanjo et.al. 2210.00834v1 null
2022-10-02 Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval Kei Nishimaki et.al. 2210.00506v1 null
2022-09-29 Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval Nicolae-Cătălin Ristea et.al. 2209.15034v1 null
2022-09-28 TVLT: Textless Vision-Language Transformer Zineng Tang et.al. 2209.14156v1 link
2022-09-28 SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval Yang Shen et.al. 2209.13833v1 link
2022-09-28 Learning Deep Representations via Contrastive Learning for Instance Retrieval Tao Wu et.al. 2209.13832v1 null
2022-09-28 Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text Cheng-An Hsieh et.al. 2209.13764v1 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586v1 link
2022-09-27 Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability Peisong Wen et.al. 2209.13262v1 link
2022-09-26 NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection Ruihao Zhou et.al. 2209.12513v1 link
2022-09-25 Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis Jiawen Kang et.al. 2209.12274v1 link
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894v1 null
2022-09-23 Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs Youya Xia et.al. 2209.11673v1 null
2022-09-23 Query-based Hard-Image Retrieval for Object Detection at Test Time Edward Ayers et.al. 2209.11559v1 link
2022-09-23 Unsupervised Hashing with Semantic Concept Mining Rong-Cheng Tu et.al. 2209.11475v1 link
2022-09-22 UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision Anbang Yang et.al. 2209.11336v1 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710v1 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699v1 link
2022-09-19 Deep Metric Learning with Chance Constraints Yeti Z. Gurbuz et.al. 2209.09060v1 link
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608v1 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578v1 link
2022-09-17 Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images Mihnea-Alexandru Tomita et.al. 2209.08343v1 null
2022-09-15 Efficient Planar Pose Estimation via UWB Measurements Haodong Jiang et.al. 2209.06779v2 link
2022-09-14 Transformers and CNNs both Beat Humans on SBIR Omar Seddati et.al. 2209.06629v1 null
2022-09-14 Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch J. Lu et.al. 2209.06545v1 link
2022-09-14 iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images Peng Yin et.al. 2209.06376v1 null
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497v1 link
2022-09-09 Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet Alnur Alimanov et.al. 2209.04234v1 link
2022-09-13 Segment Augmentation and Differentiable Ranking for Logo Retrieval Feyza Yavuz et.al. 2209.02482v2 null
2022-09-12 ScaleFace: Uncertainty-aware Deep Metric Learning Roman Kail et.al. 2209.01880v2 link
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605v1 null
2022-08-31 EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing Qihua Feng et.al. 2208.14657v1 link
2022-08-25 A Deep Perceptual Measure for Lens and Camera Calibration Yannick Hold-Geoffroy et.al. 2208.12300v1 null
2022-08-25 A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme Zhixun Lu et.al. 2208.11876v1 null
2022-08-23 Satellite Image Search in AgoraEO Ahmet Kerem Aksoy et.al. 2208.10830v1 null
2022-08-20 Fuse and Attend: Generalized Embedding Learning for Art and Sketches Ujjal Kr Dutta et.al. 2208.09698v1 null
2022-08-19 Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods Chao Chen et.al. 2208.09315v1 link
2022-08-19 TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval Soumava Paul et.al. 2208.09198v1 link
2022-08-17 Visual Cross-View Metric Localization with Dense Uncertainty Estimates Zimin Xia et.al. 2208.08519v1 link
2022-08-17 Understanding Attention for Vision-and-Language Tasks Feiqi Cao et.al. 2208.08104v1 link
2022-08-14 Visual Localization via Few-Shot Scene Region Classification Siyan Dong et.al. 2208.06933v1 link
2022-08-14 HyP$^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval Chengyin Xu et.al. 2208.06866v1 link
2022-08-13 Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization Ming Dai et.al. 2208.06561v1 link
2022-08-16 Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation Georgios Kouros et.al. 2208.06195v2 link
2022-08-12 Instance Image Retrieval by Learning Purely From Within the Dataset Zhongyan Zhang et.al. 2208.06119v1 null
2022-08-07 CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi et.al. 2208.03660v1 null
2022-08-05 A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch Patsorn Sangkloy et.al. 2208.03354v1 null
2022-08-05 ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding Bingning Wang et.al. 2208.03030v1 link
2022-08-04 Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing Caio da S. Dias et.al. 2208.02397v1 null
2022-07-27 On the robustness of self-supervised representations for multi-view object classification David Torpey et.al. 2208.00787v1 null
2022-07-26 Multimodal Neural Machine Translation with Search Engine Based Image Retrieval ZhenHao Tang et.al. 2208.00767v1 null
2022-07-30 Towards Privacy-Preserving, Real-Time and Lossless Feature Matching Qiang Meng et.al. 2208.00214v1 link
2022-07-30 DAS: Densely-Anchored Sampling for Deep Metric Learning Lizhao Liu et.al. 2208.00119v1 link
2022-07-29 Curriculum Learning for Data-Efficient Vision-Language Alignment Tejas Srinivasan et.al. 2207.14525v1 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455v1 link
2022-07-27 Abstracting Sketches through Simple Primitives Stephan Alaniz et.al. 2207.13543v1 link
2022-07-27 Satellite Image Based Cross-view Localization for Autonomous Vehicle Shan Wang et.al. 2207.13506v1 null
2022-07-26 RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments Jiahui Zhang et.al. 2207.12579v1 null
2022-07-25 A hybrid-qudit representation of digital RGB images Sreetama Das et.al. 2207.12550v1 null
2022-07-19 ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization Ivan Cisneros et.al. 2207.12317v1 link
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916v1 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762v2 link
2022-07-20 Revisiting Hotels-50K and Hotel-ID Aarash Feizi et.al. 2207.10200v1 link
2022-07-20 Feature Representation Learning for Unsupervised Cross-domain Image Retrieval Conghui Hu et.al. 2207.09721v1 link
2022-07-19 SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany Dominik Koßmann et.al. 2207.09507v1 null
2022-07-19 Context Unaware Knowledge Distillation for Image Retrieval Bytasandram Yaswanth Reddy et.al. 2207.09070v1 link
2022-07-17 FashionViL: Fashion-Focused Vision-and-Language Representation Learning Xiao Han et.al. 2207.08150v1 link
2022-07-14 AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments Peng Yin et.al. 2207.06965v1 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738v1 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732v1 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058v2 link
2022-07-12 CPO: Change Robust Panorama to Point Cloud Localization Junho Kim et.al. 2207.05317v1 link
2022-07-05 Hierarchical Average Precision Training for Pertinent Image Retrieval Elias Ramzi et.al. 2207.04873v1 link
2022-07-11 A clinically motivated self-supervised approach for content-based image retrieval of CT liver images Kristoffer Knutsen Wickstrøm et.al. 2207.04812v1 link
2022-07-09 BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval Wenqiao Zhang et.al. 2207.04211v1 null
2022-07-08 Learning Sequential Descriptors for Sequence-based Visual Place Recognition Riccardo Mereu et.al. 2207.03868v1 link
2022-07-08 GEMS: Scene Expansion using Generative Models of Graphs Rishi Agarwal et.al. 2207.03729v1 null
2022-07-05 Object-Level Targeted Selection via Deep Template Matching Suraj Kothawade et.al. 2207.01778v1 null
2022-07-06 Adaptive Fine-Grained Sketch-Based Image Retrieval Ayan Kumar Bhunia et.al. 2207.01723v2 link
2022-07-04 Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets Paul Albert et.al. 2207.01573v1 link
2022-07-08 Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval Keyu Wen et.al. 2207.00733v2 null
2022-07-01 DALG: Deep Attentive Local and Global Modeling for Image Retrieval Yuxin Song et.al. 2207.00287v1 null
2022-07-04 BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label Shengshan Hu et.al. 2207.00278v2 link
2022-06-28 Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems Stephen Hausler et.al. 2206.13883v1 null
2022-07-08 How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels Tobias Fischer et.al. 2206.13673v2 link
2022-06-25 FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance Yongzhi Fan et.al. 2206.12628v1 link
2022-06-25 Inverted Semantic-Index for Image Retrieval Ying Wang et.al. 2206.12623v1 null
2022-06-17 RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval Yihan Wu et.al. 2206.11225v1 null
2022-06-22 ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas Prathmesh Madhu et.al. 2206.11115v1 null
2022-06-20 Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval Guile Wu et.al. 2206.09806v1 null
2022-06-18 Attention-based Dynamic Subspace Learners for Medical Image Analysis Sukesh Adiga V et.al. 2206.09068v1 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733v1 null
2022-06-06 Learning Treatment Plan Representations for Content Based Image Retrieval Charles Huang et.al. 2206.02912v1 null
2022-06-19 NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation Ekaterina Nepovinnykh et.al. 2206.02498v3 link
2022-06-05 Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks B. G. Palm et.al. 2206.02278v1 null
2022-05-28 FaIRCoP: Facial Image Retrieval using Contrastive Personalization Devansh Gupta et.al. 2205.15870v1 null
2022-05-31 Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark Martin Humenberger et.al. 2205.15761v1 link
2022-05-27 Improving Road Segmentation in Challenging Domains Using Similar Place Priors Connor Malone et.al. 2205.14112v1 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135v2 link
2022-05-26 Fine-grained Image Captioning with CLIP Reward Jaemin Cho et.al. 2205.13115v1 link
2022-05-25 Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization Kyung Ho Park et.al. 2205.12544v1 null
2022-05-24 OnePose: One-Shot Object Pose Estimation without CAD Models Jiaming Sun et.al. 2205.12257v1 link
2022-05-23 VPAIR -- Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments Michael Schleiss et.al. 2205.11567v1 link
2022-05-23 VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering Yanan Wang et.al. 2205.11501v1 null
2022-05-23 Deep Image Retrieval is not Robust to Label Noise Stanislav Dereka et.al. 2205.11195v1 null
2022-05-22 Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval Zelong Zeng et.al. 2205.10878v1 link
2022-05-20 Visually-Augmented Language Modeling Weizhi Wang et.al. 2205.10178v1 link
2022-05-18 Deep Features for CBIR with Scarce Data using Hebbian Learning Gabriele Lagani et.al. 2205.08935v1 null
2022-05-19 Text Detection & Recognition in the Wild for Robot Localization Zobeir Raisi et.al. 2205.08565v2 null
2022-05-12 One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code Yong Dai et.al. 2205.06126v1 null
2022-05-11 Review on Panoramic Imaging and Its Applications in Scene Understanding Shaohua Gao et.al. 2205.05570v1 null
2022-05-18 Identical Image Retrieval using Deep Learning Sayan Nath et.al. 2205.04883v2 link
2022-05-09 Introspective Deep Metric Learning Chengkun Wang et.al. 2205.04449v1 link
2022-05-11 Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting Kai Uwe Barthel et.al. 2205.04255v2 link
2022-05-08 Adversarial Learning of Hard Positives for Place Recognition Wenxuan Fang et.al. 2205.03871v1 null
2022-05-10 AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching Khanh Nguyen et.al. 2205.02849v2 link
2022-04-29 Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval Shupeng Su et.al. 2204.13919v1 null
2022-04-29 Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval Siyu Ren et.al. 2204.13913v1 link
2022-04-28 Spatio-Temporal Graph Localization Networks for Image-based Navigation Takahiro Niwa et.al. 2204.13237v1 null
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831v1 null
2022-04-25 SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo Pinaki Nath Chowdhury et.al. 2204.11964v1 null
2022-04-23 On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning Muhammad Umer Anwaar et.al. 2204.11848v1 null
2022-04-24 Progressive Learning for Image Retrieval with Hybrid-Modality Queries Yida Zhao et.al. 2204.11212v1 null
2022-04-23 Training and challenging models for text-guided fashion image retrieval Eric Dodds et.al. 2204.11004v1 link
2022-04-18 Centralized Adversarial Learning for Robust Deep Hashing Xunguang Wang et.al. 2204.10779v1 link
2022-04-22 Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views Kanya Kurauchi et.al. 2204.10497v1 null
2022-04-21 Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval Zhiqiang Yuan et.al. 2204.09868v1 link
2022-04-21 Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information Zhiqiang Yuan et.al. 2204.09860v1 link
2022-04-20 Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Leila Pishdad et.al. 2204.09268v1 null
2022-04-19 Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing Georgii Mikriukov et.al. 2204.08707v1 null
2022-04-18 Multiple-environment Self-adaptive Network for Aerial-view Geo-localization Tingyu Wang et.al. 2204.08381v1 link
2022-04-15 Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder Hanjing Ye et.al. 2204.07350v1 link
2022-04-14 Composite Code Sparse Autoencoders for first stage retrieval Carlos Lassance et.al. 2204.07023v1 null
2022-04-13 Reuse your features: unifying retrieval and feature-metric alignment Javier Morlana et.al. 2204.06292v1 link
2022-04-12 Probabilistic Compositional Embeddings for Multimodal Image Retrieval Andrei Neculai et.al. 2204.05845v1 link
2022-04-12 Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval Yu-Wei Zhan et.al. 2204.05666v1 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481v1 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932v1 link
2022-04-10 Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image Yujiao Shi et.al. 2204.04752v1 link
2022-04-08 A Generic Image Retrieval Method for Date Estimation of Historical Document Collections Adrià Molina et.al. 2204.04028v1 null
2022-04-08 SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies Narges Norouzi et.al. 2204.03998v1 null
2022-04-05 Leveraging Equivariant Features for Absolute Pose Regression Mohamed Adel Musallam et.al. 2204.02163v1 null
2022-04-04 "This is my unicorn, Fluffy": Personalizing frozen vision-language representations Niv Cohen et.al. 2204.01694v1 link
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524v1 null
2022-04-01 LASER: LAtent SpacE Rendering for 2D Visual Localization Zhixiang Min et.al. 2204.00157v1 link
2022-03-31 Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning Semih Orhan et.al. 2203.16945v1 null
2022-03-30 AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift Burak Yildiz et.al. 2203.16291v1 link
2022-03-29 Long-term Visual Map Sparsification with Heterogeneous GNN Ming-Fang Chang et.al. 2203.15182v1 null
2022-04-01 A Simulation Benchmark for Vision-based Autonomous Navigation Lauri Suomela et.al. 2203.13048v2 link
2022-03-24 Is Geometry Enough for Matching in Visual Localization? Qunjie Zhou et.al. 2203.12979v1 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645v2 link
2022-03-10 ReF -- Rotation Equivariant Features for Local Feature Matching Abhishek Peri et.al. 2203.05206v1 null
2022-03-09 Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction Matthieu Zins et.al. 2203.04613v1 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446v1 link
2022-03-07 ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization Simon Maurer et.al. 2203.03610v1 link
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454v1 link
2022-03-01 SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments Maria Waheed et.al. 2203.00591v1 null
2022-02-28 Deep Camera Pose Regression Using Pseudo-LiDAR Ali Raza et.al. 2203.00080v1 null
2022-02-25 RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation Praveen Kumar Rajendran et.al. 2202.12838v1 null
2022-02-24 Highly-Efficient Binary Neural Networks for Visual Place Recognition Bruno Ferrarini et.al. 2202.12375v1 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146v1 link
2022-02-14 Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition Y. Shen et.al. 2202.06470v1 null
2022-02-11 Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition Yingfeng Cai et.al. 2202.05738v1 null
2022-02-09 Object-Guided Day-Night Visual Localization in Urban Scenes Assia Benbihi et.al. 2202.04445v1 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677v1 null
2022-02-25 CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938v2 null
2022-02-03 Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization Andrea Vallone et.al. 2202.01821v1 null
2022-02-02 Training Semantic Descriptors for Image-Based Localization Ibrahim Cinaroglu et.al. 2202.01212v1 null
2022-01-31 Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization Nathan Hughes et.al. 2201.13360v1 null
2022-01-31 Rigidity Preserving Image Transformations and Equivariance in Perspective Lucas Brynte et.al. 2201.13065v1 null
2022-01-25 Learning Semantics for Visual Place Recognition through Multi-Scale Attention Valerio Paolicelli et.al. 2201.09701v2 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048v1 link
2022-01-15 A Critical Analysis of Image-based Camera Pose Estimation Techniques Meng Xu et.al. 2201.05816v1 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386v1 link
2021-12-23 NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning Tony Ng et.al. 2112.12785v1 null
2021-12-16 CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data Qi Yan et.al. 2112.09081v1 link
2021-12-05 RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather Jialu Wang et.al. 2112.02469v1 null
2021-11-25 MegLoc: A Robust and Accurate Visual Localization Pipeline Shuxue Peng et.al. 2111.13063v1 null
2021-10-08 Semantic Image Alignment for Vehicle Localization Markus Herb et.al. 2110.04162v1 null
2021-10-05 Season-invariant GNSS-denied visual localization for UAVs Jouko Kinnari et.al. 2110.01967v1 link
2021-09-30 Forming a sparse representation for visual place recognition using a neurorobotic approach Sylvain Colomer et.al. 2109.14916v1 null
2021-09-22 Audio-Visual Grounding Referring Expression for Robotic Manipulation Yefei Wang et.al. 2109.10571v1 null
2021-09-20 Efficient shape mapping through dense touch and vision Sudharshan Suresh et.al. 2109.09884v1 link
2021-09-15 S3LAM: Structured Scene SLAM Mathieu Gonzalez et.al. 2109.07339v1 null
2021-09-13 Monocular Camera Localization for Automated Vehicles Using Image Retrieval Eunhyek Joa et.al. 2109.06296v1 null
2021-09-10 Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization Sungho Yoon et.al. 2109.04753v1 link
2021-09-09 CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization Ara Jafarzadeh et.al. 2109.04527v1 null
2021-09-09 Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization Mona Gridseth et.al. 2109.04041v1 link

(back to top)

Keypoint Detection

Publish Date Title Authors PDF Code
2025-03-06 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499v1 null
2025-03-04 A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection Junyi Wang et.al. 2503.02481v1 null
2025-03-01 Autonomous Dissection in Robotic Cholecystectomy Ki-Hwan Oh et.al. 2503.00666v1 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132v1 null
2025-02-27 Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets Jisoo Lee et.al. 2502.19766v1 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522v1 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484v1 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453v1 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361v1 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110v1 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844v1 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299v1 null
2025-01-19 Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation Shibang Liu et.al. 2501.11069v1 null
2025-01-13 Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications Lukas Rustler et.al. 2501.07421v1 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399v1 null
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221v1 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755v1 null
2024-12-19 Corn Ear Detection and Orientation Estimation Using Deep Learning Nathan Sprague et.al. 2412.14954v1 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739v1 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488v1 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292v1 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487v1 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472v1 link
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422v1 null
2024-11-23 OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin et.al. 2411.15653v1 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676v1 null
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851v1 null
2024-11-04 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang et.al. 2411.01846v1 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906v1 null
2024-10-04 Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation Aman Anand et.al. 2410.14700v1 null
2024-11-27 Sim2real Cattle Joint Estimation in 3D point clouds Mohammad Okour et.al. 2410.14419v2 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742v1 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718v1 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848v1 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155v1 null
2024-10-08 Unsupervised Model Diagnosis Yinong Oliver Wang et.al. 2410.06243v1 null
2024-10-08 Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration Xueyang Kang et.al. 2410.05729v1 link
2024-10-16 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237v2 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404v1 null
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899v1 link
2024-10-07 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082v2 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502v1 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668v1 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695v3 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060v1 null
2024-10-01 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202v2 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117v1 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907v1 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232v1 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791v1 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730v1 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857v1 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830v1 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014v1 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672v1 null
2024-07-23 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837v2 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315v1 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676v3 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085v1 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476v2 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434v1 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300v2 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594v1 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943v1 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650v1 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311v1 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560v2 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625v1 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985v1 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885v1 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527v1 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259v1 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662v1 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217v1 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241v1 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488v2 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172v4 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724v3 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173v1 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336v1 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742v1 link
2024-03-22 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029v3 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502v1 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471v1 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706v1 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480v1 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793v1 link
2023-12-11 VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Jian Shi et.al. 2312.08871v1 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865v1 link
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592v1 link
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281v1 null
2023-11-29 Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features Thomas Wimmer et.al. 2311.18113v1 link
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024v1 link
2023-11-28 Riemannian Self-Attention Mechanism for SPD Networks Rui Wang et.al. 2311.16738v1 null
2023-11-27 A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor Jialin Liu et.al. 2311.15609v1 null
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291v1 null
2023-11-20 CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement Boni Hu et.al. 2311.11604v1 link
2023-11-17 Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration Paul J. Claasen et.al. 2311.10361v1 link
2023-11-13 Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning Tomáš Kunzo et.al. 2311.07398v1 null
2023-11-11 CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer Haoyu Ma et.al. 2311.06443v1 link
2023-11-08 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud Jianchao Ci et.al. 2311.04699v1 null
2023-11-06 TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains Alexander Naumann et.al. 2311.03124v1 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842v1 null
2023-10-20 Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification Mateus Roder et.al. 2310.13490v1 null
2023-10-12 UniPose: Detecting Any Keypoints Jie Yang et.al. 2310.08530v1 link
2023-10-10 l-dyno: framework to learn consistent visual features using robot's motion Kartikeya Singh et.al. 2310.06249v1 link
2023-10-10 Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face Hao Zhang et.al. 2310.05056v2 link
2023-10-13 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404v2 link
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527v3 link
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268v2 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436v1 link
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563v1 null
2023-09-17 CryoAlign: feature-based method for global and local 3D alignment of EM density maps Bintao He et.al. 2309.09217v1 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471v1 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750v1 link
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895v1 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324v1 null
2023-09-12 Improving the matching of deformable objects by learning to detect keypoints Felipe Cadar et.al. 2309.00434v2 link
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen et.al. 2308.16876v1 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984v1 link
2023-08-29 A lightweight 3D dense facial landmark estimation model from position map data Shubhajit Basak et.al. 2308.15170v1 link
2023-08-27 Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors Francesco Pirotti et.al. 2308.14047v1 null
2023-08-24 VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition Gengxuan Tian et.al. 2308.12870v1 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223v1 null
2023-08-20 Neural Interactive Keypoint Detection Jie Yang et.al. 2308.10174v1 link
2023-08-19 ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment Bingyang Zhou et.al. 2308.09987v1 null
2023-09-03 DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479v2 link
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926v1 link
2023-08-15 ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition Wenyuan Xue et.al. 2308.07743v1 null
2023-08-14 DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport Sk Aziz Ali et.al. 2308.07153v1 null
2023-08-14 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds Minhao Li et.al. 2308.05667v2 link
2023-08-02 Automated Hit-frame Detection for Badminton Match Analysis Yu-Hang Chien et.al. 2307.16000v2 link
2023-07-25 Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception Chuanyu Luo et.al. 2307.13300v1 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698v2 link
2023-07-19 SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid Zi Li et.al. 2307.09727v1 link
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306v1 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669v1 link
2023-06-26 CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild Li Ding et.al. 2306.15073v1 null
2023-06-28 Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset Ziqiao Weng et.al. 2306.07089v2 link
2023-06-07 Learning Probabilistic Coordinate Fields for Robust Correspondences Weiyue Zhao et.al. 2306.04231v1 null
2023-06-03 LDEB -- Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues Amitabha Dey et.al. 2306.02193v1 null
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938v1 null
2023-06-01 A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm Onur Beker et.al. 2306.00892v1 null
2023-05-30 Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection Supeng Wang et.al. 2305.18714v1 link
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334v1 null
2023-05-15 Non-Separable Multi-Dimensional Network Flows for Visual Computing Viktoria Ehm et.al. 2305.08628v1 null
2023-05-13 Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance Xinyu Lin et.al. 2305.07943v1 link
2023-05-05 HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration Canhui Tang et.al. 2305.03487v1 link
2023-04-17 Human Pose Estimation in Monocular Omnidirectional Top-View Images Jingrui Yu et.al. 2304.08186v1 null
2023-04-14 CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression Mubariz Zaffar et.al. 2304.07426v1 null
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194v1 link
2023-04-06 From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection Changsheng Lu et.al. 2304.03140v1 null
2023-03-29 NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud Xiangyu Zhu et.al. 2303.16465v1 link
2023-03-24 PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View Ze Shi et.al. 2303.14095v1 link
2023-03-23 Semantic Image Attack for Visual Model Diagnosis Jinqi Luo et.al. 2303.13010v1 null
2023-03-22 Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation Heng Yang et.al. 2303.12246v1 link
2023-03-21 RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network Sangmin Yoo et.al. 2303.10770v2 null
2023-03-17 ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty Vanessa Wirth et.al. 2303.10042v1 null
2023-03-15 Descriptor Distillation for Efficient Multi-Robot SLAM Xiyue Guo et.al. 2303.08420v1 null
2023-03-15 From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning Zhuo Su et.al. 2303.08414v1 null
2023-03-16 KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input Yiye Chen et.al. 2303.05617v2 link
2023-03-07 External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors Simon Bultmann et.al. 2303.03797v1 null
2023-02-26 PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection Shenwei Xie et.al. 2302.13263v1 null
2023-02-24 Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks Julian Lißner et.al. 2302.12545v1 null
2023-02-21 Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging Yuhong Deng et.al. 2302.10446v1 null
2023-02-12 A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training Jingnan Shi et.al. 2302.06019v1 null
2023-02-11 Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing Zitong Yu et.al. 2302.05744v1 null
2023-02-09 MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection Yuhe Ding et.al. 2302.04589v1 link
2023-02-03 Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation Jie Yang et.al. 2302.01593v1 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572v1 link
2023-01-21 Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection Feiyang Wen et.al. 2301.08973v1 null
2023-01-18 OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models Xingyi He et.al. 2301.07673v1 null
2023-01-12 Towards High Performance One-Stage Human Pose Estimation Ling Li et.al. 2301.04842v1 null
2022-12-31 Rethinking Rotation Invariance with Point Cloud Registration Jianhui Yu et.al. 2301.00149v1 null
2023-02-06 Fruit Ripeness Classification: a Survey Matteo Rizzo et.al. 2212.14441v2 null
2022-12-28 NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action Kuan-Chieh Wang et.al. 2212.13660v1 link
2022-12-24 HandsOff: Labeled Dataset Generation With No Additional Human Annotations Austin Xu et.al. 2212.12645v1 null
2022-12-13 Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images Welerson Melo et.al. 2212.09589v1 link
2022-12-15 Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation Bugra C. Sefercik et.al. 2212.07567v1 null
2023-02-01 DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization Xiangyu Xu et.al. 2212.04575v2 null
2022-12-07 ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation Yufei Xu et.al. 2212.04246v1 link
2022-12-15 Designing Feature Vector Representations: A case study from Chemistry Signe Sidwall Thygesen et.al. 2212.03731v2 null
2022-12-09 DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model Jeongjun Choi et.al. 2212.02796v2 link
2022-12-05 Images Speak in Images: A Generalist Painter for In-Context Visual Learning Xinlong Wang et.al. 2212.02499v1 link
2022-12-06 R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor Bai Zhu et.al. 2212.02277v2 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069v1 link
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731v2 null
2022-11-21 Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching Paul Roetzer et.al. 2211.11589v1 link
2022-11-07 Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration Zixin Yang et.al. 2211.03688v1 null
2022-10-31 Tree Detection and Diameter Estimation Based on Deep Learning Vincent Grondin et.al. 2210.17424v1 link
2022-10-26 Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds Zhiyuan Zhang et.al. 2210.14899v1 null
2022-10-23 Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders Ömer Sümer et.al. 2210.12705v1 null
2022-10-21 Real-time Detection of 2D Tool Landmarks with Synthetic Training Data Bram Vanherle et.al. 2210.11991v1 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236v1 null
2022-10-04 Centroid Distance Keypoint Detector for Colored Point Clouds Hanzhe Teng et.al. 2210.01298v1 link
2022-09-28 Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences Jun-Jee Chao et.al. 2209.14419v1 null
2022-09-28 USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation Zhengrong Xue et.al. 2209.13864v1 null
2022-10-16 Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection Neelay Joglekar et.al. 2209.13657v2 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586v1 link
2022-09-26 Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments Kyungmin Jung et.al. 2209.12881v1 null
2022-10-07 Long-Lived Accurate Keypoints in Event Streams Philippe Chiberre et.al. 2209.10385v2 null
2022-09-20 Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence Sunghwan Hong et.al. 2209.08742v2 null
2022-09-15 Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections Bastian Pätzold et.al. 2209.07393v1 link
2022-09-07 Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip Yang Li et.al. 2209.03440v1 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997v1 null
2022-08-24 Self-Supervised Endoscopic Image Key-Points Matching Manel Farhat et.al. 2208.11424v1 link
2022-08-19 Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture Muhammad Muzammel et.al. 2208.08224v2 null
2022-08-08 MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis Maximilian Gilles et.al. 2208.03963v1 null
2022-08-07 CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi et.al. 2208.03660v1 null
2022-07-29 Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation Qihao Liu et.al. 2208.00090v1 null
2022-07-25 Translating a Visual LEGO Manual to a Machine-Executable Plan Ruocheng Wang et.al. 2207.12572v1 null
2022-07-21 Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network Aline Sindel et.al. 2207.10506v1 null
2022-07-15 Human keypoint detection for close proximity human-robot interaction Jan Docekal et.al. 2207.07742v1 null
2022-07-15 Adversarial Focal Loss: Asking Your Discriminator for Hard Examples Chen Liu et.al. 2207.07739v1 null
2022-07-13 Rapid Person Re-Identification via Sub-space Consistency Regularization Qingze Yin et.al. 2207.05933v1 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539v1 null
2022-08-15 Semi-supervised Human Pose Estimation in Art-historical Images Matthias Springstein et.al. 2207.02976v3 link
2022-07-01 Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling Jiamin Liang et.al. 2207.00474v1 null
2022-06-24 Motion Estimation for Large Displacements and Deformations Qiao Chen et.al. 2206.12464v1 null
2022-06-24 Deep embedded clustering algorithm for clustering PACS repositories Teo Manojlović et.al. 2206.12417v1 null
2022-06-21 KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences Xuanhan Wang et.al. 2206.10090v1 link
2022-06-20 Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval Guile Wu et.al. 2206.09806v1 null
2022-06-15 A Unified Sequence Interface for Vision Tasks Ting Chen et.al. 2206.07669v1 link
2022-06-09 Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields Mingtong Zhang et.al. 2206.04669v1 null
2022-06-03 SNAKE: Shape-aware Neural 3D Keypoint Field Chengliang Zhong et.al. 2206.01724v1 link
2022-05-17 MulT: An End-to-End Multitask Learning Transformer Deblina Bhattacharjee et.al. 2205.08303v1 null
2022-05-10 ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild Chirag Raman et.al. 2205.05177v1 link
2022-04-28 Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept Emilio Gomez-Gonzalez et.al. 2204.14050v1 null
2022-05-02 GRIT: General Robust Image Task Benchmark Tanmay Gupta et.al. 2204.13653v2 link
2022-05-24 ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Yufei Xu et.al. 2204.12484v2 link
2022-04-26 Unified GCNs: Towards Connecting GCNs with CNNs Ziyan Zhang et.al. 2204.12300v1 null
2022-04-19 Self-Supervised Equivariant Learning for Oriented Keypoint Detection Jongmin Lee et.al. 2204.08613v1 link
2022-04-17 The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation Bao Zhao et.al. 2204.08024v1 null
2022-04-15 2D Human Pose Estimation: A Survey Haoming Chen et.al. 2204.07370v1 null
2022-04-11 Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification Haojie Liu et.al. 2204.04842v1 null
2022-04-07 Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification Yanan Wang et.al. 2204.02611v2 link
2022-04-02 SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning Nilaksh Das et.al. 2204.00734v1 link
2022-04-01 MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration Chenzhong Gao et.al. 2204.00260v1 null
2022-03-29 Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning David Howard et.al. 2203.15172v1 null
2022-03-28 REGTR: End-to-end Point Cloud Correspondences with Transformers Zi Jian Yew et.al. 2203.14517v1 link
2022-03-27 UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection Ye Liu et.al. 2203.12745v2 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645v2 link
2022-03-16 PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research R. James Cotton et.al. 2203.08792v1 link
2022-03-11 DRTAM: Dual Rank-1 Tensor Attention Module Hanxing Chi et.al. 2203.05893v1 null
2022-03-07 Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation Meng Tian et.al. 2203.03498v1 null
2022-02-10 Motion-Aware Transformer For Occluded Person Re-identification Mi Zhou et.al. 2202.04243v2 null
2022-02-03 Sim2Real Object-Centric Keypoint Detection and Description Chengliang Zhong et.al. 2202.00448v2 null
2022-01-16 Cross-Centroid Ripple Pattern for Facial Expression Recognition Monu Verma et.al. 2201.05958v1 null
2022-01-14 Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words Harry Nguyen et.al. 2201.03556v2 link
2022-01-10 Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials Jinnavat Sanalohit et.al. 2201.03170v1 null
2022-01-06 A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration Aline Sindel et.al. 2201.02242v1 null
2021-12-28 Skin feature point tracking using deep feature encodings Jose Ramon Chang et.al. 2112.14159v1 null
2021-12-23 Data-efficient learning for 3D mirror symmetry detection Yancong Lin et.al. 2112.12579v1 null
2021-12-22 Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model Michael Zwölfer et.al. 2112.12193v1 null
2021-12-22 Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction Henrique Siqueira et.al. 2112.12002v1 link
2021-12-19 Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection Renjie Li et.al. 2112.10275v1 null
2021-12-19 GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor Jean-Baptiste Carluer et.al. 2112.10258v1 link
2021-12-16 Masked Feature Prediction for Self-Supervised Visual Pre-Training Chen Wei et.al. 2112.09133v1 link
2021-12-13 DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points Zhengfei Kuang et.al. 2112.06910v1 null
2021-12-12 Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species Changsheng Lu et.al. 2112.06183v1 link
2021-12-13 Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings Mel Vecerik et.al. 2112.04910v2 null
2021-12-06 ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction Xiaoming Zhao et.al. 2112.02906v1 link
2021-11-25 Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association Sen Yang et.al. 2111.12892v1 link
2021-11-08 Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images Jianfei Guo et.al. 2111.04237v1 null
2021-11-04 Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image Feng Liu et.al. 2111.03098v1 null
2021-11-01 Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision Ali Safa et.al. 2111.00791v2 null
2021-10-30 Geometry-Aware Hierarchical Bayesian Learning on Manifolds Yonghui Fan et.al. 2111.00184v1 null
2021-10-26 CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration Hao Yu et.al. 2110.14076v1 link
2021-10-23 HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware James Hegarty et.al. 2110.12106v1 null
2021-10-18 Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning Shengzeng Huo et.al. 2110.08962v1 null
2021-10-11 High-order Tensor Pooling with Attention for Action Recognition Piotr Koniusz et.al. 2110.05216v1 null
2021-10-10 Digging Into Self-Supervised Learning of Feature Descriptors Iaroslav Melekhov et.al. 2110.04773v1 null
2021-10-04 BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion Zhaoqun Li et.al. 2110.01179v1 link
2021-10-01 Machine learning aided noise filtration and signal classification for CREDO experiment Łukasz Bibrzycki et.al. 2110.00297v1 null
2021-09-28 PDC-Net+: Enhanced Probabilistic Dense Correspondence Network Prune Truong et.al. 2109.13912v2 link
2021-09-27 HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines Fabio Bellavia et.al. 2109.12925v3 null
2021-09-24 Catadioptric Stereo on a Smartphone Kristijan Bartol et.al. 2109.11872v1 null
2021-09-20 Semi-supervised Dense Keypoints using Unlabeled Multiview Images Zhixuan Yu et.al. 2109.09299v1 null
2021-08-31 A Novel Dataset for Keypoint Detection of quadruped Animals from Images Prianka Banik et.al. 2108.13958v1 link
2021-08-27 A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images Xiaoteng Zhou et.al. 2108.12151v1 null

(back to top)

Image Matching

Publish Date Title Authors PDF Code
2025-03-06 Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis Xingcan Hu et.al. 2503.04205v1 null
2025-03-06 Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration Qianliang Wu et.al. 2503.04127v1 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132v1 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036v1 link
2025-02-27 RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges Thibaut Loiseau et.al. 2502.19955v1 null
2025-02-26 BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure Haoxin Cai et.al. 2502.19242v1 link
2025-02-25 PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching Han Nie et.al. 2502.18104v1 link
2025-02-25 Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking Xin Tong et.al. 2502.17766v1 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779v3 null
2025-02-16 FeaKM: Robust Collaborative Perception under Noisy Pose Conditions Jiuwu Hao et.al. 2502.11003v1 link
2025-02-24 Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation Emanuele Mule et.al. 2502.06288v3 link
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O'Donnell et.al. 2502.02624v1 null
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277v1 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299v1 null
2025-01-13 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Xingyi He et.al. 2501.07556v1 null
2025-01-13 Matching Free Depth Recovery from Structured Light Zhuohang Yu et.al. 2501.07113v1 null
2025-01-02 Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views Yulun Wu et.al. 2501.01196v1 null
2024-12-31 Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights Bharath Kumar Agnur et.al. 2412.20210v2 null
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412v1 link
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221v1 link
2024-12-17 Bringing Multimodality to Amazon Visual Search System Xinliang Zhu et.al. 2412.13364v1 null
2024-12-04 Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis Siyoon Jin et.al. 2412.03150v1 null
2024-11-20 DT-LSD: Deformable Transformer-based Line Segment Detection Sebastian Janampa et.al. 2411.13005v1 link
2024-11-15 Image Matching Filtering and Refinement by Planes and Beyond Fabio Bellavia et.al. 2411.09484v2 link
2024-11-11 XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration Ismail Can Yagmur et.al. 2411.07430v1 link
2024-11-07 The Impact of Semi-Supervised Learning on Line Segment Detection Johanna Engman et.al. 2411.04596v1 link
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851v1 null
2024-10-30 Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants Azadeh Sharafi et.al. 2410.23329v1 null
2024-11-05 RelationBooth: Towards Relation-Aware Customized Object Generation Qingyu Shi et.al. 2410.23280v2 null
2024-10-30 LoFLAT: Local Feature Matching using Focused Linear Attention Transformer Naijian Cao et.al. 2410.22710v1 null
2024-10-26 Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification Yue Su et.al. 2410.20097v1 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848v1 null
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673v1 null
2024-09-25 Game4Loc: A UAV Geo-Localization Benchmark from Game Data Yuxiang Ji et.al. 2409.16925v1 link
2024-09-24 Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge Marek Wodzinski et.al. 2409.15931v1 null
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471v1 link
2024-09-05 Enabling Practical and Privacy-Preserving Image Processing Chao Wang et.al. 2409.03568v1 null
2024-09-20 A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering Shuang Song et.al. 2409.03032v2 link
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445v2 link
2024-08-26 Affine steerers for structured keypoint description Georg Bökman et.al. 2408.14186v1 link
2024-09-11 Coarse-to-fine Alignment Makes Better Speech-image Retrieval Lifeng Zhou et.al. 2408.13119v2 null
2024-08-19 BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval Zhenyu Lu et.al. 2408.10383v1 null
2024-08-14 RSD-DOG: A New Image Descriptor based on Second Order Derivatives Darshan Venkatrayappa et.al. 2408.07687v1 null
2024-08-07 PRISM: PRogressive dependency maxImization for Scale-invariant image Matching Xudong Cai et.al. 2408.03598v1 null
2024-08-05 ConDL: Detector-Free Dense Image Matching Monika Kwiatkowski et.al. 2408.02766v1 null
2024-08-04 Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image Xinlin Ren et.al. 2408.02079v1 link
2024-07-29 Image-text matching for large-scale book collections Artemis Llabrés et.al. 2407.19812v1 link
2024-07-26 PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis Sohyeong Kim et.al. 2407.18695v1 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791v1 null
2024-07-16 REMM: Rotation-Equivariant Framework for End-to-End Multimodal Image Matching Han Nie et.al. 2407.11637v1 link
2024-07-16 A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation Chengsheng Li et.al. 2407.11287v1 null
2024-07-14 Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching Xiaoyong Lu et.al. 2407.07789v2 null
2024-07-10 Mutual Information calculation on different appearances Jiecheng Liao et.al. 2407.07410v1 null
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939v3 null
2024-07-03 IMC 2024 Methods & Solutions Review Shyam Gupta et.al. 2407.03172v1 null
2024-06-21 High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method F. S. Mortazavi et.al. 2406.15121v1 null
2024-06-16 Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models Yikai Zhang et.al. 2406.10902v1 link
2024-06-14 Grounding Image Matching in 3D with MASt3R Vincent Leroy et.al. 2406.09756v1 link
2024-05-22 Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching Hongkai Chen et.al. 2405.13874v1 null
2024-05-21 OmniGlue: Generalizable Feature Matching with Foundation Model Guidance Hanwen Jiang et.al. 2405.12979v1 link
2024-07-09 Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation Rezkellah Noureddine Khiati et.al. 2405.08556v2 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434v1 null
2024-05-13 Authentic Hand Avatar from a Phone Scan via Universal Hand Model Gyeongsik Moon et.al. 2405.07933v1 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311v1 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174v1 null
2024-06-10 MinBackProp -- Backpropagating through Minimal Solvers Diana Sungatullina et.al. 2404.17993v2 link
2024-04-23 FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction Hang Hua et.al. 2404.14715v1 null
2024-05-23 A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching Francesco Pro et.al. 2404.11302v2 link
2024-04-16 Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction John Francis et.al. 2404.10626v1 null
2024-04-15 XoFTR: Cross-modal Feature Matching Transformer Önder Tuzcuoğlu et.al. 2404.09692v1 link
2024-04-13 DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector Johan Edstedt et.al. 2404.08928v1 link
2024-04-09 Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences Axel Barroso-Laguna et.al. 2404.06337v1 link
2024-04-01 Marrying NeRF with Feature Matching for One-step Pose Estimation Ronghan Chen et.al. 2404.00891v1 null
2024-04-01 3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching Yibin Ye et.al. 2404.00838v1 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546v1 null
2024-03-30 Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation Yuan Wang et.al. 2404.00262v1 null
2024-03-26 Staircase Localization for Autonomous Exploration in Urban Environments Jinrae Kim et.al. 2403.17330v1 null
2024-03-23 MatchSeg: Towards Better Segmentation via Reference Image Matching Ruiqiang Xiao et.al. 2403.15901v1 link
2024-03-19 HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching Ying Chen et.al. 2403.12543v1 null
2024-03-16 Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval Shunsuke Tsubaki et.al. 2403.10756v1 null
2024-03-16 Vector search with small radiuses Gergely Szilvasy et.al. 2403.10746v1 null
2024-03-15 Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline Fangming Yuan et.al. 2403.10283v1 null
2024-03-15 Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning Meixuan Li et.al. 2403.10252v1 null
2024-03-14 Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning Xilin Yang et.al. 2403.09100v1 null
2024-03-18 Matching Non-Identical Objects Yusuke Marumo et.al. 2403.08227v2 null
2024-03-07 Scene Depth Estimation from Traditional Oriental Landscape Paintings Sungho Kang et.al. 2403.03408v2 null
2024-02-21 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974v2 link
2024-02-16 GIM: Learning Generalizable Image Matcher From Internet Videos Xuelun Shen et.al. 2402.11095v1 link
2024-02-13 Are Semi-Dense Detector-Free Methods Good at Matching Local Features? Matthieu Vilain et.al. 2402.08671v1 null
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359v1 link
2024-01-24 Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry Qi Cai et.al. 2401.13357v1 null
2024-01-18 Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation Songhe Deng et.al. 2401.09883v1 link
2024-01-26 RomniStereo: Recurrent Omnidirectional Stereo Matching Hualie Jiang et.al. 2401.04345v2 link
2024-01-05 CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs Daoan Zhang et.al. 2401.02582v1 null
2024-01-03 Local Adaptive Clustering Based Image Matching for Automatic Visual Identification Zhizhen Wang et.al. 2401.01720v1 null
2024-01-03 A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization Shishen Li et.al. 2401.01574v1 null
2023-12-23 BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation Tavis Shore et.al. 2312.15363v1 link
2023-12-22 Harnessing Diffusion Models for Visual Perception with Meta Prompts Qiang Wan et.al. 2312.14733v1 link
2024-01-05 MatchDet: A Collaborative Framework for Image Matching and Object Detection Jinxiang Lai et.al. 2312.10983v2 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563v1 null
2023-12-04 Steerers: A framework for rotation equivariant keypoint descriptors Georg Bökman et.al. 2312.02152v1 link
2023-11-30 DSeg: Direct Line Segments Detection Berger Cyrille et.al. 2311.18344v1 null
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281v1 null
2023-11-29 LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching Wenhao Zhong et.al. 2311.17571v1 link
2023-11-08 Zero-shot Translation of Attention Patterns in VQA Models to Natural Language Leonard Salewski et.al. 2311.05043v1 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842v1 null
2023-10-23 RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments Jinyu Li et.al. 2310.15072v1 link
2023-10-23 Player Re-Identification Using Body Part Appearences Mahesh Bhosale et.al. 2310.14469v1 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605v1 null
2023-10-07 UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation Shuai Yuan et.al. 2310.04712v1 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092v1 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992v1 link
2023-09-27 KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping Renlang Huang et.al. 2309.15394v1 null
2023-10-13 A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo Debao Huang et.al. 2309.09379v2 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438v1 link
2023-09-09 Neural Semantic Surface Maps Luca Morreale et.al. 2309.04836v1 null
2023-09-05 Doppelgangers: Learning to Disambiguate Images of Similar Structures Ruojin Cai et.al. 2309.02420v1 link
2023-08-14 Occ$^2$Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions Miao Fan et.al. 2308.16160v1 null
2023-08-22 Scene-Aware Feature Matching Xiaoyong Lu et.al. 2308.09949v2 null
2023-08-02 ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation Bo Zhang et.al. 2308.00400v2 link
2023-07-28 Cross-Modal Concept Learning and Inference for Vision-Language Models Yi Zhang et.al. 2307.15460v1 null
2023-07-22 CryptoMask: Privacy-preserving Face Recognition Jianli Bai et.al. 2307.12010v1 null
2023-07-22 A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration Jing Hao et.al. 2307.11997v1 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698v2 link
2023-08-08 Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education Neel Kanwal et.al. 2307.09426v2 null
2023-08-01 Unsupervised Deep Graph Matching Based on Cycle Consistency Siddharth Tourani et.al. 2307.08930v4 link
2023-07-15 Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents Ke Cao et.al. 2307.07763v1 null
2023-07-09 Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion Jie S. Li et.al. 2307.05564v1 null
2023-07-11 TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation Paul Grimal et.al. 2307.05134v1 link
2023-07-02 TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching Khang Truong Giang et.al. 2307.00485v1 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669v1 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-25 Enhancing Dynamic Image Advertising with Vision-Language Pre-training Zhoufutu Wen et.al. 2306.14112v1 null
2023-06-19 Graph Self-Supervised Learning for Endoscopic Image Matching Manel Farhat et.al. 2306.11141v1 link
2023-06-07 A2B: Anchor to Barycentric Coordinate for Robust Correspondence Weiyue Zhao et.al. 2306.02760v2 null
2023-05-27 Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation Yueh-Cheng Huang et.al. 2305.17463v1 null
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036v1 link
2023-05-18 LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation Yujie Lu et.al. 2305.11116v1 link
2023-05-16 A Method for Training-free Person Image Picture Generation Tianyu Chen et.al. 2305.09817v1 null
2023-05-15 Image Matching by Bare Homography Fabio Bellavia et.al. 2305.08946v1 null
2023-05-12 CLIP-Count: Towards Text-Guided Zero-Shot Object Counting Ruixiang Jiang et.al. 2305.07304v1 link
2023-05-10 SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking Adam Schmidt et.al. 2305.06477v1 null
2023-05-10 Level-line Guided Edge Drawing for Robust Line Segment Detection Xinyu Lin et.al. 2305.05883v1 link
2023-05-09 ColonMapper: topological mapping and localization for colonoscopy Javier Morlana et.al. 2305.05546v1 null
2023-04-29 A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges Xinyu Lin et.al. 2305.00264v1 link
2023-04-28 SFD2: Semantic-guided Feature Detection and Description Fei Xue et.al. 2304.14845v1 link
2023-04-17 DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching Mohamed Ali Chebbi et.al. 2304.08056v1 link
2023-04-16 Long-term Visual Localization with Mobile Sensors Shen Yan et.al. 2304.07691v1 null
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194v1 link
2023-04-16 ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation Xiaoming Zhao et.al. 2304.03608v2 link
2023-04-04 GlueStick: Robust Image Matching by Sticking Points and Lines Together Rémi Pautrat et.al. 2304.02008v1 link
2023-04-03 PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching Pedro Castro et.al. 2304.01382v1 null
2023-04-02 Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints Guilherme Potje et.al. 2304.00583v1 link
2023-04-13 Structured Epipolar Matcher for Local Feature Matching Jiahao Chang et.al. 2303.16646v3 null
2023-03-28 ASIC: Aligning Sparse in-the-wild Image Collections Kamal Gupta et.al. 2303.16201v1 null
2023-03-25 Learning Rotation-Equivariant Features for Visual Correspondence Jongmin Lee et.al. 2303.15472v1 null
2023-03-27 Learnable Graph Matching: A Practical Paradigm for Data Association Jiawei He et.al. 2303.15414v1 link
2023-03-24 Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance Hongjian Song et.al. 2303.13794v1 null
2023-03-15 Rethinking Optical Flow from Geometric Matching Consistent Perspective Qiaole Dong et.al. 2303.08384v1 link
2023-03-07 Parsing Line Segments of Floor Plan Images Using Graph Neural Networks Mingxiang Chen et.al. 2303.03851v1 null
2023-03-06 Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints Chenjie Cao et.al. 2303.02885v1 link
2023-03-10 ParaFormer: Parallel Attention Transformer for Efficient Feature Matching Xiaoyong Lu et.al. 2303.00941v2 null
2023-03-01 RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique Jiayuan Li et.al. 2303.00319v1 link
2023-02-28 Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images Zhongli Fan et.al. 2302.14239v1 link
2023-02-25 BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI Yulong Liu et.al. 2302.12971v1 link
2023-02-24 Classification of structural building damage grades from multi-temporal photogrammetric point clouds using a machine learning model trained on virtual laser scanning data Vivien Zahs et.al. 2302.12591v1 null
2023-02-20 A Large Scale Homography Benchmark Daniel Barath et.al. 2302.09997v1 link
2023-02-10 General, Single-shot, Target-less, and Automatic LiDAR-Camera Extrinsic Calibration Toolbox Kenji Koide et.al. 2302.05094v1 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572v1 link
2023-01-27 Harmonizing Flows: Unsupervised MR harmonization based on normalizing flows Farzad Beizaee et.al. 2301.11551v1 link
2023-01-24 Feature-based Image Matching for Identifying Individual Kākā Fintan O'Sullivan et.al. 2301.06678v2 null
2023-01-18 Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images Johannes Bayer et.al. 2301.03155v2 null
2023-01-07 Deep Learning-Based UAV Aerial Triangulation without Image Control Points Jiageng Zhong et.al. 2301.02869v1 null
2023-01-06 The UNCOVER Survey: A first-look HST+JWST catalog of 50,000 galaxies near Abell 2744 and beyond John R. Weaver et.al. 2301.02671v1 link
2023-02-13 Translating Text Synopses to Video Storyboards Xu Gu et.al. 2301.00135v2 link
2022-12-23 SuperGF: Unifying Local and Global Features for Visual Localization Wenzheng Song et.al. 2212.13105v1 null
2022-12-26 Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images Weizhi Du et.al. 2212.13068v1 null
2022-12-20 Seafloor-Invariant Caustics Removal from Underwater Imagery Panagiotis Agrafiotis et.al. 2212.10167v1 null
2022-12-15 DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients Rémi Pautrat et.al. 2212.07766v1 link
2022-12-14 Shared Coupling-bridge for Weakly Supervised Local Feature Learning Jiayuan Sun et.al. 2212.07047v1 link
2022-12-05 Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter Suleyman Melih Portakal et.al. 2212.02302v1 null
2022-12-05 ObjectMatch: Robust Registration using Canonical Object Correspondences Can Gümeli et.al. 2212.01985v1 null
2022-12-07 Universe Points Representation Learning for Partial Multi-Graph Matching Zhakshylyk Nurlanov et.al. 2212.00780v2 null
2022-11-30 Self-Supervised Feature Learning for Long-Term Metric Visual Localization Yuxuan Chen et.al. 2212.00122v1 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069v1 link
2022-11-19 Person Text-Image Matching via Text-Feature Interpretability Embedding and External Attack Node Implantation Fan Li et.al. 2211.08657v2 link
2022-11-20 Detecting Line Segments in Motion-blurred Images with Events Huai Yu et.al. 2211.07365v2 link
2022-11-15 Fast Key Points Detection and Matching for Tree-Structured Images Hao Wang et.al. 2211.03242v2 null
2022-10-25 A Comparative Study on Deep-Learning Methods for Dense Image Matching of Multi-angle and Multi-date Remote Sensing Stereo Images Hessah Albanwan et.al. 2210.14031v1 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517v1 null
2022-10-07 Mars Rover Localization Based on A2G Obstacle Distribution Pattern Matching Lang Zhou et.al. 2210.03398v1 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586v1 link
2022-09-25 ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement Dongli Tan et.al. 2209.12213v1 null
2022-09-22 DRKF: Distilled Rotated Kernel Fusion for Efficiently Boosting Rotation Invariance in Image Matching Chao Li et.al. 2209.10907v1 null
2022-11-15 Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology Arpan Kusari et.al. 2209.09090v2 null
2022-09-16 SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence Lei Li et.al. 2209.07806v1 link
2022-08-30 ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer Hongkai Chen et.al. 2208.14201v1 link
2022-08-25 A Gis Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning Jianli Wei et.al. 2208.12251v1 link
2022-08-25 UAS Navigation in the Real World Using Visual Observation Yuci Han et.al. 2208.12125v1 null
2022-08-24 Self-Supervised Endoscopic Image Key-Points Matching Manel Farhat et.al. 2208.11424v1 link
2022-08-22 Equivariant Hypergraph Neural Networks Jinwoo Kim et.al. 2208.10428v1 link
2022-09-22 Understanding Attention for Vision-and-Language Tasks Feiqi Cao et.al. 2208.08104v2 link
2022-08-16 Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning Dongwoo Park et.al. 2208.07039v2 link
2022-08-04 Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification Xinyu Lin et.al. 2208.02450v1 link
2022-08-04 OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images Weijia Li et.al. 2208.00928v2 null
2022-07-29 Testing Relational Understanding in Text-Guided Image Generation Colin Conwell et.al. 2208.00005v1 null
2022-07-21 Pose for Everything: Towards Category-Agnostic Pose Estimation Lumin Xu et.al. 2207.10387v1 link
2022-07-20 Explaining Deepfake Detection by Analysing Image Matching Shichao Dong et.al. 2207.09679v1 link
2022-07-18 Adaptive Assignment for Geometry Aware Local Feature Matching Dihe Huang et.al. 2207.08427v1 link
2022-07-16 Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching Jiazhen Liu et.al. 2207.07932v1 link
2022-07-06 Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks Yijie Zhang et.al. 2207.02946v1 null
2022-07-01 TopicFM: Robust and Interpretable Feature Matching with Topic-assisted Khang Truong Giang et.al. 2207.00328v1 link
2022-06-16 Virtual Correspondence: Humans as a Cue for Extreme-View Geometry Wei-Chiu Ma et.al. 2206.08365v1 null
2022-06-15 Self-Supervised Learning of Image Scale and Orientation Jongmin Lee et.al. 2206.07259v1 link
2022-05-27 Image Keypoint Matching using Graph Neural Networks Nancy Xu et.al. 2205.14275v1 null
2022-05-27 Fine-tuning deep learning models for stereo matching using results from semi-global matching Hessah Albanwan et.al. 2205.14051v1 null
2022-05-23 TransforMatcher: Match-to-Match Attention for Semantic Correspondence Seungwook Kim et.al. 2205.11634v1 link
2022-05-16 ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning Yuxin Deng et.al. 2205.07439v1 null
2022-05-06 BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching Jingwei Song et.al. 2205.03133v1 link
2022-05-10 AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching Khanh Nguyen et.al. 2205.02849v2 link
2022-04-27 Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points Chao Li et.al. 2204.12884v1 null
2022-04-22 SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite Runzhe Zhu et.al. 2204.10704v1 link
2022-04-20 Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Leila Pishdad et.al. 2204.09268v1 null
2022-04-19 OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching Ostap Viniavskyi et.al. 2204.08870v1 link
2022-04-19 Self-Supervised Equivariant Learning for Oriented Keypoint Detection Jongmin Lee et.al. 2204.08613v1 link
2022-04-22 Efficient Linear Attention for Fast and Accurate Keypoint Matching Suwichaya Suwanwimolkul et.al. 2204.07731v3 null
2022-04-08 Lightweight starshade position sensing with convolutional neural networks and simulation-based inference Andrew Chen et.al. 2204.03853v1 link
2022-03-30 AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift Burak Yildiz et.al. 2203.16291v1 link
2022-03-29 Photographic Visualization of Weather Forecasts with Generative Adversarial Networks Christian Sigg et.al. 2203.15601v1 link
2022-03-29 Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots Pranay Mathur et.al. 2203.15272v1 null
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901v1 link
2022-03-28 S2-Net: Self-supervision Guided Feature Representation Learning for Cross-Modality Images Shasha Mei et.al. 2203.14581v1 null
2022-03-26 Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching Yujiao Shi et.al. 2203.14148v1 link
2022-03-24 Keypoints Tracking via Transformer Networks Oleksii Nasypanyi et.al. 2203.12848v1 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645v2 link
2022-03-14 There's no difference: Convolutional Neural Networks for transient detection without template subtraction Tatiana Acero-Cuellar et.al. 2203.07390v1 link
2022-03-25 Cross Language Image Matching for Weakly Supervised Semantic Segmentation Jinheng Xie et.al. 2203.02668v2 link
2022-03-01 CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP Zihao Wang et.al. 2203.00386v1 null
2022-03-09 Time-resolved Imaging of Stochastic Cascade Reactions over a Submillisecond to Second Time Range at the Angstrom Level Toshiki Shimizu et.al. 2202.13332v2 null
2022-02-16 Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images Matheus M. Dos Santos et.al. 2202.07817v1 null
2022-02-14 CATs++: Boosting Cost Aggregation with Convolutions and Transformers Seokju Cho et.al. 2202.06817v1 link
2022-02-11 Improving Image-recognition Edge Caches with a Generative Adversarial Network Guilherme B. Souza et.al. 2202.05929v1 null
2022-02-08 Learning Optical Flow with Adaptive Graph Reasoning Ao Luo et.al. 2202.03857v1 link
2022-02-03 Sim2Real Object-Centric Keypoint Detection and Description Chengliang Zhong et.al. 2202.00448v2 null
2022-01-27 Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context Jie Shao et.al. 2201.11296v1 null
2021-12-24 Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation Zhiwei Liu et.al. 2112.12917v1 null
2021-12-20 Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching Yujie Fu et.al. 2112.10485v1 null
2021-12-19 GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor Jean-Baptiste Carluer et.al. 2112.10258v1 link
2021-12-14 More Control for Free! Image Synthesis with Semantic Diffusion Guidance Xihui Liu et.al. 2112.05744v2 null
2021-12-08 Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning Bijie Bai et.al. 2112.05240v1 null
2021-12-01 FaSS-MVS -- Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery Boitumelo Ruf et.al. 2112.00821v1 null
2021-12-01 CLIPstyler: Image Style Transfer with a Single Text Condition Gihyun Kwon et.al. 2112.00374v1 link
2021-11-29 Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution Features Xiaoteng Zhou et.al. 2111.15514v1 null
2021-11-29 Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic Yoad Tewel et.al. 2111.14447v1 link
2021-11-29 Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality Discriminator Usman Cheema et.al. 2111.14339v1 null
2021-11-17 Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network Xiaoming Zhao et.al. 2111.09006v2 null
2021-11-17 Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features Xiaoteng Zhou et.al. 2111.08994v3 null
2021-10-30 A Deep Search for Faint Chandra X-ray Sources, Radio Sources, and Optical Counterparts in NGC 6752 Haldan N. Cohn et.al. 2111.00357v1 null
2021-10-01 Robustly Removing Deep Sea Lighting Effects for Visual Mapping of Abyssal Plains Kevin Köser et.al. 2110.00480v1 null
2021-09-29 Visually Grounded Concept Composition Bowen Zhang et.al. 2109.14115v1 null
2021-09-27 HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines Fabio Bellavia et.al. 2109.12925v3 null
2021-09-20 Viewpoint Invariant Dense Matching for Visual Geolocalization Gabriele Berton et.al. 2109.09827v1 link
2021-09-20 Image Subtraction in Fourier Space Lei Hu et.al. 2109.09334v1 link
2021-09-10 Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization Sungho Yoon et.al. 2109.04753v1 link
2021-09-08 Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes Wenzheng Song et.al. 2109.03585v2 null
2021-08-27 A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images Xiaoteng Zhou et.al. 2108.12151v1 null
2021-08-27 Matching Underwater Sonar Images by the Learned Descriptor Based on Style Transfer Method Xiaoteng Zhou et.al. 2108.12072v1 null
2021-08-26 Efficient Joint Object Matching via Linear Programming Antonio De Rosa et.al. 2108.11911v1 null

(back to top)

NeRF

Publish Date Title Authors PDF Code
2025-03-06 Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering Idris O. Sunmola et.al. 2503.04079v1 null
2025-03-05 LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation Qian Feng et.al. 2503.03890v1 null
2025-03-04 Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries Zeqing Wang et.al. 2503.02558v1 null
2025-03-04 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting Qipeng Yan et.al. 2503.02452v1 null
2025-03-04 Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views Yingji Zhong et.al. 2503.02230v1 null
2025-03-04 Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints Yan Miao et.al. 2503.02198v1 null
2025-03-03 Data Augmentation for NeRFs in the Low Data Limit Ayush Gaggar et.al. 2503.02092v1 null
2025-03-03 Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jay Zhangjie Wu et.al. 2503.01774v1 null
2025-03-05 Category-level Meta-learned NeRF Priors for Efficient Object Mapping Saad Ejaz et.al. 2503.01582v2 null
2025-03-03 LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training Kaimin Liao et.al. 2503.01199v1 null
2025-03-02 DreamPrinting: Volumetric Printing Primitives for High-Fidelity 3D Printing Youjia Wang et.al. 2503.00887v1 null
2025-03-01 Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups Nicholas Pfaff et.al. 2503.00370v1 null
2025-02-27 Identity-preserving Distillation Sampling by Fixed-Point Iterator SeonHwa Kim et.al. 2502.19930v1 null
2025-02-27 NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission Weijie Yue et.al. 2502.19873v1 null
2025-02-26 Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions Muhammad Salman Ali et.al. 2502.19457v1 null
2025-02-26 Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? Adam Celarek et.al. 2502.19318v1 link
2025-02-26 The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields Ziyuan Luo et.al. 2502.19125v1 null
2025-02-24 Semantic Neural Radiance Fields for Multi-Date Satellite Data Valentin Wagner et.al. 2502.16992v1 link
2025-02-22 AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal Luca Gough et.al. 2502.16351v1 null
2025-02-22 DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation Yuxuan Xiong et.al. 2502.16302v1 null
2025-02-24 Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis Ziqian Ni et.al. 2502.15635v2 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931v1 null
2025-02-20 NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis Xiaoxing Liu et.al. 2502.14178v1 null
2025-02-19 GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian Bang Du et.al. 2502.14129v1 null
2025-02-18 Geometry-Aware Diffusion Models for Multiview Scene Inpainting Ahmad Salimi et.al. 2502.13335v1 null
2025-02-18 GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis Pedro Martin et.al. 2502.13196v1 null
2025-02-18 ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition Quoc-Anh Bui et.al. 2502.12673v1 null
2025-02-21 HumanGif: Single-View Human Diffusion with Generative Prior Shoukang Hu et.al. 2502.12080v2 link
2025-02-17 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency Sheng-Yu Huang et.al. 2502.11801v1 null
2025-02-13 Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures Francesco Ballerini et.al. 2502.09623v1 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111v1 null
2025-02-12 Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision Tianle Liu et.al. 2502.08352v1 null
2025-02-10 PrismAvatar: Real-time animated 3D neural head avatars on edge devices Prashant Raina et.al. 2502.07030v1 null
2025-02-10 Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC Siwei Meng et.al. 2502.07007v1 null
2025-02-08 GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling Kang Yang et.al. 2502.05708v1 null
2025-02-05 VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning Jayram Palamadai et.al. 2502.05222v1 null
2025-02-11 PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression Feifei Li et.al. 2502.04843v2 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657v1 null
2025-02-04 MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning Shengbo Gu et.al. 2502.02372v1 null
2025-02-03 FourieRF: Few-Shot NeRFs via Progressive Fourier Frequency Control Diego Gomez et.al. 2502.01405v1 null
2025-01-31 VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting Mateusz Nowak et.al. 2501.17978v2 null
2025-01-28 LinPrim: Linear Primitives for Differentiable Volumetric Rendering Nicolas von Lützow et.al. 2501.16312v2 null
2025-01-24 SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation Yujian Liu et.al. 2501.14646v1 null
2025-02-05 GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting Junzhe Jiang et.al. 2501.13971v2 link
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402v1 null
2025-01-22 Neural Radiance Fields for the Real World: A Survey Wenhui Xiao et.al. 2501.13104v1 null
2025-02-02 DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform Hung Nguyen et.al. 2501.12637v2 null
2025-01-21 DNRSelect: Active Best View Selection for Deferred Neural Rendering Dongli Wu et.al. 2501.12150v1 null
2025-01-21 Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging Shuyi Hu et.al. 2501.11884v1 null
2025-01-16 Poxel: Voxel Reconstruction for 3D Printing Ruixiang Cao et.al. 2501.10474v1 null
2025-01-17 Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation Xiaoyun Zheng et.al. 2501.09947v1 link
2025-01-16 Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes Ji Shi et.al. 2501.09460v1 link
2025-01-15 SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM Yuhang Ming et.al. 2501.08880v1 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286v1 null
2025-01-13 Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes Yuhang Zhang et.al. 2501.08072v1 null
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015v2 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927v1 link
2025-01-12 ActiveGAMER: Active GAussian Mapping through Efficient Rendering Liyan Chen et.al. 2501.06897v1 null
2025-01-17 SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis Peng Zheng et.al. 2501.06770v2 null
2025-01-11 NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References Qiang Qu et.al. 2501.06488v1 link
2025-01-10 UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping Yanjie Li et.al. 2501.05783v1 null
2025-01-13 Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Ludwic Leonard et.al. 2501.05226v2 null
2025-01-07 NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives Leif Van Holland et.al. 2501.04074v1 link
2025-01-07 NeuralSVG: An Implicit Representation for Text-to-Vector Generation Sagi Polaczek et.al. 2501.03992v1 null
2025-01-14 DehazeGS: Seeing Through Fog with 3D Gaussian Splatting Jinze Yu et.al. 2501.03659v2 null
2025-01-07 ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting Yifeng Yang et.al. 2501.03605v1 link
2025-01-07 AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene Chaoran Feng et.al. 2501.02807v2 null
2024-12-29 Bringing Objects to Life: 4D generation from 3D objects Ohad Rahamim et.al. 2412.20422v1 null
2024-12-27 Learning Radiance Fields from a Single Snapshot Compressive Image Yunhao Li et.al. 2412.19483v1 null
2025-01-05 BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream Gopi Raju Matta et.al. 2412.19370v2 null
2024-12-26 Generating Editable Head Avatars with 3D Gaussian GANs Guohao Li et.al. 2412.19149v1 link
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130v1 null
2024-12-26 Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos Changwoon Choi et.al. 2412.19089v1 null
2024-12-23 Editing Implicit and Explicit Representations of Radiance Fields: A Survey Arthur Hubert et.al. 2412.17628v1 null
2024-12-23 Exploring Dynamic Novel View Synthesis Technologies for Cinematography Adrian Azzarelli et.al. 2412.17532v1 null
2024-12-21 LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo Fotios Logothetis et.al. 2412.16737v1 null
2024-12-20 NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems Laura Weihl et.al. 2412.16141v1 null
2024-12-20 NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images Yue Guo et.al. 2412.15890v1 null
2024-12-19 LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction Pou-Chun Kung et.al. 2412.15447v1 null
2024-12-18 DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields Xingyu Zhu et.al. 2412.15278v1 null
2024-12-19 GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting Qianpu Sun et.al. 2412.14579v1 null
2024-12-19 Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images Min Wang et.al. 2412.14547v1 null
2024-12-18 GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians Xiaobao Wei et.al. 2412.13983v1 **[link](https://github.com/ucwxb/graphav
