GitHub - agipro/cv-arxiv-daily: 🎓 Automatically Update CV Papers Daily using GitHub Actions (Updates Every 12 Hours)

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2025.03.09

Table of Contents

SLAM
SFM
Visual Localization
Keypoint Detection
Image Matching
NeRF

SLAM

Publish Date	Title	Authors	PDF	Code
2025-03-06	Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes	Hui Zhang et.al.	2503.04235v1	null
2025-03-05	Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments	Jie Deng et.al.	2503.03373v1	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661v1	null
2025-03-04	DnD Filter: Differentiable State Estimation for Dynamic Systems using Diffusion Models	Ziyu Wan et.al.	2503.01274v2	null
2025-02-27	BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground	Yufei Wei et.al.	2502.20078v1	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932v1	null
2025-02-26	Efficient and Distributed Large-Scale Point Cloud Bundle Adjustment via Majorization-Minimization	Rundong Li et.al.	2502.18801v1	null
2025-02-23	Improving Monocular Visual-Inertial Initialization with Structureless Visual-Inertial Bundle Adjustment	Junlin Song et.al.	2502.16598v1	null
2025-02-19	Active Illumination for Visual Ego-Motion Estimation in the Dark	Francesco Crocetti et.al.	2502.13708v1	null
2025-02-19	pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM	Luigi Freda et.al.	2502.11955v2	link
2025-03-05	Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions	Dario Pisanti et.al.	2502.09795v2	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111v1	null
2025-02-13	PTZ-Calib: Robust Pan-Tilt-Zoom Camera Calibration	Jinhui Guo et.al.	2502.09075v1	link
2025-02-12	LIR-LIVO: A Lightweight, Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features	Shujie Zhou et.al.	2502.08676v1	link
2025-02-10	Building Rome with Convex Optimization	Haoyu Han et.al.	2502.04640v2	null
2025-01-31	Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping	Yiming Huang et.al.	2501.19319v1	link
2025-01-23	FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation	Bingyang Zhou et.al.	2501.13876v1	null
2025-02-14	DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM	Jesse Morris et.al.	2501.11893v2	link
2025-01-19	Tracking Mouse from Incomplete Body-Part Observations and Deep-Learned Deformable-Mouse Model Motion-Track Constraint for Behavior Analysis	Olaf Hellwich et.al.	2501.11030v1	null
2025-01-15	SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM	Yuhang Ming et.al.	2501.08880v1	null
2025-01-16	BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module	Dongzhihan Wang et.al.	2501.08659v2	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286v1	null
2025-01-07	MAD-BA: 3D LiDAR Bundle Adjustment -- from Uncertainty Modelling to Structure Optimization	Krzysztof Ćwian et.al.	2501.03972v1	null
2025-01-06	Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation	Yuezhang Lv et.al.	2501.02821v1	null
2024-12-28	MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing	Shuo Wang et.al.	2412.20082v1	null
2025-01-18	Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry	Zhaoxing Zhang et.al.	2412.16923v3	null
2024-12-18	Event-based Photometric Bundle Adjustment	Shuang Guo et.al.	2412.14111v1	link
2024-12-18	4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching	Fernando Amodeo et.al.	2412.13639v1	link
2024-12-17	NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment	Andrea Dunn Beltran et.al.	2412.13176v1	null
2024-12-16	Efficient LiDAR Bundle Adjustment for Multi-Scan Alignment Utilizing Continuous-Time Trajectories	Louis Wiesmann et.al.	2412.11760v1	null
2024-12-19	RoMeO: Robust Metric Visual Odometry	Junda Cheng et.al.	2412.11530v2	null
2024-12-12	eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction	Jad Mansour et.al.	2412.09209v1	link
2024-12-08	GBR: Generative Bundle Refinement for High-fidelity Gaussian Splatting and Meshing	Jianing Zhang et.al.	2412.05908v1	null
2024-12-04	BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement	Miguel Arturo Vega Torres et.al.	2412.03434v1	link
2024-12-04	MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras	Huai Yu et.al.	2412.03146v1	link
2024-12-04	An indoor DSO-based ceiling-vision odometry system for indoor industrial environments	Abdelhak Bougouffa et.al.	2412.02950v1	null
2024-12-13	SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure Frames	Yuxuan Zhou et.al.	2412.01500v2	link
2024-12-01	DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair	Weihang Li et.al.	2412.00851v1	null
2024-11-29	Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction	Shaoxiang Wang et.al.	2412.00242v1	null
2024-11-27	SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images	Yanyan Li et.al.	2411.18072v1	null
2024-11-27	HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction	Wei Zhang et.al.	2411.17982v1	null
2024-11-24	Bundle Adjusted Gaussian Avatars Deblurring	Muyao Niu et.al.	2411.16758v1	null
2024-11-21	InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation	Marziyeh Bamdad et.al.	2411.14358v1	link
2024-11-20	Robust Monocular Visual Odometry using Curriculum Learning	Assaf Lahiany et.al.	2411.13438v1	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291v1	null
2024-11-15	BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation	Yufei Wei et.al.	2411.10195v1	null
2024-11-24	Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments	Ankit Shaw et.al.	2411.08231v2	null
2024-11-10	A novel algorithm for optimizing bundle adjustment in image sequence alignment	Hailin Xu et.al.	2411.06343v1	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796v1	null
2024-11-13	DEIO: Deep Event Inertial Odometry	Weipeng Guan et.al.	2411.03928v3	link
2024-11-08	GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting	Jilan Mei et.al.	2411.03807v3	null
2024-10-30	LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM	Yucheng Huang et.al.	2410.23231v1	link
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213v1	null
2024-10-09	Very High-Resolution Bridge Deformation Monitoring Using UAV-based Photogrammetry	Mehdi Maboudi et.al.	2410.18984v1	null
2024-10-22	EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting	Bohao Liao et.al.	2410.15392v2	null
2024-10-18	Graph Optimality-Aware Stochastic LiDAR Bundle Adjustment with Progressive Spatial Smoothing	Jianping Li et.al.	2410.14565v1	null
2024-10-17	Hybrid bundle-adjusting 3D Gaussians for view consistent rendering with pose optimization	Yanan Guo et.al.	2410.13280v1	link
2024-10-12	ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras	Junkai Niu et.al.	2410.09374v1	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935v1	link
2024-10-18	IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Jian Huang et.al.	2410.08107v2	link
2024-10-02	SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment	Xingyu Ji et.al.	2410.01618v1	null
2024-09-30	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111v1	null
2024-09-26	Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot	Justin Yu et.al.	2409.18108v1	null
2024-09-20	Learning Visual Information Utility with PIXER	Yash Turkar et.al.	2409.13151v1	null
2024-09-18	Bundle Adjustment in the Eager Mode	Zitong Zhan et.al.	2409.12190v1	null
2024-09-18	Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments	Lei Cheng et.al.	2409.11854v1	null
2024-09-18	ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation	Yanlin Jin et.al.	2409.11692v1	null
2024-09-17	LVBA: LiDAR-Visual Bundle Adjustment for RGB Point Cloud Mapping	Rundong Li et.al.	2409.10868v1	null
2024-09-14	MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry	Yuheng Qiu et.al.	2409.09479v1	null
2024-09-14	GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians	Dasong Gao et.al.	2409.09295v1	link
2024-09-14	Panoramic Direct LiDAR-assisted Visual Odometry	Zikang Yuan et.al.	2409.09287v1	link
2024-09-13	SLIM: Scalable and Lightweight LiDAR Mapping in Urban Environments	Zehuan Yu et.al.	2409.08681v1	link
2024-09-11	Event-based Mosaicing Bundle Adjustment	Shuang Guo et.al.	2409.07365v1	link
2024-09-23	Robust Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric	Tingchen Ma et.al.	2409.01856v2	null
2024-09-02	Robust Vehicle Localization and Tracking in Rain using Street Maps	Yu Xiang Tan et.al.	2409.01038v1	link
2024-09-05	EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System	Bonan Liu et.al.	2409.00343v2	null
2024-08-30	Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning	Shuyang Zhang et.al.	2408.17005v1	link
2024-08-29	Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry	Michael Adlerstein et.al.	2408.16472v1	null
2024-08-28	Single-Photon 3D Imaging with Equi-Depth Photon Histograms	Kaustubh Sadekar et.al.	2408.16150v1	null
2024-08-28	ES-PTAM: Event-based Stereo Parallel Tracking and Mapping	Suman Ghosh et.al.	2408.15605v1	link
2024-08-28	FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2408.14035v2	link
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682v1	null
2024-08-20	TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks	Jinjie Mai et.al.	2408.10739v1	null
2024-08-20	LoopSplat: Loop Closure by Registering 3D Gaussian Splats	Liyuan Zhu et.al.	2408.10154v2	link
2024-08-10	RSL-BA: Rolling Shutter Line Bundle Adjustment	Yongcong Zhang et.al.	2408.05409v1	null
2024-08-07	Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR	Racquel Fygenson et.al.	2408.03503v1	link
2024-08-03	FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields	Yifan Wu et.al.	2408.01878v1	null
2024-08-03	Deep Patch Visual SLAM	Lahav Lipson et.al.	2408.01654v1	link
2024-07-25	CodedVO: Coded Visual Odometry	Sachin Shah et.al.	2407.18240v1	null
2024-07-25	PGD-VIO: An Accurate Plane-Aided Visual-Inertial Odometry with Graph-Based Drift Suppression	Yidi Zhang et.al.	2407.17709v1	null
2024-07-22	Reinforcement Learning Meets Visual Odometry	Nico Messikommer et.al.	2407.15626v1	link
2024-07-21	Semi-Supervised Pipe Video Temporal Defect Interval Localization	Zhu Huang et.al.	2407.15170v1	null
2024-07-18	Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain	Bach Nguyen Gia et.al.	2407.13159v1	link
2024-07-17	Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge	Andrea Albanese et.al.	2407.12663v1	null
2024-07-15	LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning	Zhuozhu Jian et.al.	2407.10782v1	null
2024-07-06	Incremental Multiview Point Cloud Registration	Xiaoya Cheng et.al.	2407.05021v1	link
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939v3	null
2024-07-01	Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation	Lianjie Guo et.al.	2407.01292v1	link
2024-05-29	Rotation Averaging: A Primal-Dual Method and Closed-Forms in Cycle Graphs	Gabriel Moreira et.al.	2406.18564v1	null
2024-07-25	Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy	Chen Wang et.al.	2406.16087v3	null
2024-06-20	Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment	Yunshan Qi et.al.	2406.14360v1	null
2024-06-16	Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry	Boris Chidlovskii et.al.	2406.11019v1	null
2024-06-12	From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers	Swaminathan Gurumurthy et.al.	2406.07785v1	link
2024-06-03	The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry	Paolo Cudrano et.al.	2406.01797v1	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929v1	null
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614v1	null
2024-05-27	Adaptive VIO: Deep Visual-Inertial Odometry with Online Continual Learning	Youqi Pan et.al.	2405.16754v1	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599v1	null
2024-06-20	Advancements in Translation Accuracy for Stereo Visual-Inertial Initialization	Han Song et.al.	2405.15082v3	null
2024-06-08	EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving	Boyi Liu et.al.	2405.12120v2	null
2024-05-13	SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling	Yijun Yuan et.al.	2405.07847v1	null
2024-05-10	MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization	Pengcheng Zhu et.al.	2405.06241v1	null
2024-05-09	Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment	Simon Weber et.al.	2405.05079v2	link
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290v1	null
2024-05-07	IMU-Aided Event-based Stereo Visual Odometry	Junkai Niu et.al.	2405.04071v1	link
2024-05-05	Blending Distributed NeRFs with Tri-stage Robust Pose Optimization	Baijun Ye et.al.	2405.02880v1	null
2024-04-29	$\nu$-DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction	Yunxuan Mao et.al.	2404.18439v1	null
2024-04-28	S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM	Zhiyao Zhang et.al.	2404.18284v1	null
2024-04-27	An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation	Olivier Brochu Dufour et.al.	2404.17745v1	null
2024-04-26	Camera Motion Estimation from RGB-D-Inertial Scene Flow	Samuel Cerezo et.al.	2404.17251v1	null
2024-04-23	Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization	Lahav Lipson et.al.	2404.15263v1	link
2024-04-23	FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent	Cameron Smith et.al.	2404.15259v1	link
2024-04-22	RESFM: Robust Equivariant Multiview Structure from Motion	Fadi Khatib et.al.	2404.14280v1	null
2024-04-23	CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory	Yunlong Ran et.al.	2404.13896v2	null
2024-04-20	EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment	Guanghao Li et.al.	2404.13346v1	link
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339v1	null
2024-04-17	SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping	Vincent Cartillier et.al.	2404.11419v1	null
2024-04-17	VBR: A Vision Benchmark in Rome	Leonardo Brizi et.al.	2404.11322v1	link
2024-04-14	Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration	Yanhao Zhang et.al.	2404.09169v1	link
2024-04-09	Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes	Tianchen Deng et.al.	2404.06050v1	null
2024-04-06	Salient Sparse Visual Odometry With Pose-Only Supervision	Siyu Chen et.al.	2404.04677v1	null
2024-04-01	Visual-inertial state estimation based on Chebyshev polynomial optimization	Hongyu Zhang et.al.	2404.01150v1	null
2024-04-01	BundledSLAM: An Accurate Visual SLAM System Using Multiple Cameras	Han Song et.al.	2403.19886v2	null
2024-03-30	GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM	Ganlin Zhang et.al.	2403.19549v2	link
2024-03-25	A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments	Gianluca D'Amico et.al.	2403.17084v1	null
2024-03-20	DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping	Yuxuan Zhou et.al.	2403.13714v1	link
2024-03-19	On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine	Jagatpreet Singh Nir et.al.	2403.13170v1	null
2024-03-18	The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions	Margaret Hansen et.al.	2403.12194v1	null
2024-03-19	BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting	Lingzhe Zhao et.al.	2403.11831v2	link
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639v1	null
2024-03-17	Compact 3D Gaussian Splatting For Dense Visual SLAM	Tianchen Deng et.al.	2403.11247v1	link
2024-03-16	Efficient Domain Adaptation for Endoscopic Visual Odometry	Junyang Wu et.al.	2403.10860v1	null
2024-03-25	URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields	Bo Xu et.al.	2403.10119v2	null
2024-03-14	Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO)	Matthew Lisondra et.al.	2403.09882v1	null
2024-03-12	CMax-SLAM: Event-based Rotational-Motion Bundle Adjustment and SLAM System using Contrast Maximization	Shuang Guo et.al.	2403.08119v1	link
2024-03-12	SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM	Siting Zhu et.al.	2403.07494v1	link
2024-03-12	Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints	Weihan Wang et.al.	2403.07225v1	link
2024-03-10	PSS-BA: LiDAR Bundle Adjustment with Progressive Spatial Smoothing	Jianping Li et.al.	2403.06124v1	null
2024-03-02	RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking	Ray Zhang et.al.	2403.01254v1	link
2024-03-02	Grid-based Fast and Structural Visual Odometry	Zhang Zhihe et.al.	2403.01110v1	null
2024-02-27	Differentiable Biomechanics Unlocks Opportunities for Markerless Motion Capture	R. James Cotton et.al.	2402.17192v1	null
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961v1	link
2024-02-22	Secure Navigation using Landmark-based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2402.14280v1	null
2024-02-26	VOOM: Robust Visual Object Odometry and Mapping using Hierarchical Landmarks	Yutong Wang et.al.	2402.13609v2	link
2024-02-19	Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment	Ganesh Sapkota et.al.	2402.12551v1	null
2024-02-07	Online and Certifiably Correct Visual Odometry and Mapping	Devansh R Agrawal et.al.	2402.05254v1	null
2024-02-06	YOLOPoint Joint Keypoint and Object Detection	Anton Backhaus et.al.	2402.03989v1	link
2024-02-11	BA-LINS: A Frame-to-Frame Bundle Adjustment for LiDAR-Inertial Navigation	Hailiang Tang et.al.	2401.11491v2	null
2024-01-19	Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning	André O. Françani et.al.	2401.10857v1	null
2024-01-17	Event-Based Visual Odometry on Non-Holonomic Ground Vehicles	Wanting Xu et.al.	2401.09331v1	link
2024-01-11	On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering	Feng Zhu et.al.	2401.05836v1	null
2023-12-19	Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry	Olaya Álvarez-Tuñón et.al.	2401.05396v1	link
2024-01-07	Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people	Ali Samadzadeh et.al.	2401.03604v1	link
2024-01-03	LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry	Weirong Chen et.al.	2401.01887v1	null
2023-12-28	SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction	Zikang Yuan et.al.	2312.16800v1	link
2023-12-20	NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields	Jens Naumann et.al.	2312.13471v1	null
2023-12-22	Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM	Junru Lin et.al.	2312.13332v2	null
2023-12-20	Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach	Habib Boloorchi Tabrizi et.al.	2312.13162v1	link
2023-12-20	Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera	Abdulkadhem A. Abdulkadhem et.al.	2312.12680v1	null
2023-12-15	PLGSLAM: Progressive Neural Scene Represenation with Local to Global Bundle Adjustment	Tianchen Deng et.al.	2312.09866v1	null
2023-12-15	Deep Event Visual Odometry	Simon Klenk et.al.	2312.09800v1	link
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889v1	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563v1	null
2023-12-04	iMatching: Imperative Correspondence Learning	Zitong Zhan et.al.	2312.02141v1	link
2023-12-04	Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks	Yan Xu et.al.	2312.01561v1	null
2023-11-30	Event-based Visual Inertial Velometer	Xiuyuan Lu et.al.	2311.18189v1	null
2023-11-21	CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems	Young-Hee Lee et.al.	2311.12580v1	null
2023-11-21	Implicit Event-RGBD Neural SLAM	Delin Qu et.al.	2311.11013v2	null
2023-11-14	CP-SLAM: Collaborative Neural Point-based SLAM System	Jiarui Hu et.al.	2311.08013v1	null
2023-11-10	Dense Visual Odometry Using Genetic Algorithm	Slimane Djema et.al.	2311.06149v1	null
2023-11-07	Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM	Seongwook Yoon et.al.	2311.03722v1	null
2023-11-02	Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images	Hermes McGriff et.al.	2311.01292v1	link
2023-10-29	3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets	Ta-Ying Cheng et.al.	2310.19188v1	null
2023-10-23	RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments	Jinyu Li et.al.	2310.15072v1	link
2023-10-23	Converting Depth Images and Point Clouds for Feature-based Pose Estimation	Robert Lösch et.al.	2310.14924v1	link
2023-10-20	PACE: Human and Camera Motion Estimation from in-the-wild Videos	Muhammed Kocabas et.al.	2310.13768v1	null
2023-10-17	Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms	Yanyan Li et.al.	2310.10931v1	link
2023-10-15	CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses	Hongyu Fu et.al.	2310.09776v1	null
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082v1	null
2023-10-10	l-dyno: framework to learn consistent visual features using robot's motion	Kartikeya Singh et.al.	2310.06249v1	link
2023-10-07	HI-SLAM: Monocular Real-time Dense Mapping with Hybrid Implicit Fields	Wei Zhang et.al.	2310.04787v1	null
2023-10-05	USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields	Moyang Li et.al.	2310.02687v2	link
2023-10-08	XVO: Generalized Visual Odometry via Cross-Modal Self-Training	Lei Lai et.al.	2309.16772v3	null
2023-09-27	Handbook on Leveraging Lines for Two-View Relative Pose Estimation	Petr Hruby et.al.	2309.16040v1	null
2023-09-27	BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields	Shreya Saha et.al.	2309.15329v1	null
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268v2	link
2023-09-23	Tag-based Visual Odometry Estimation for Indoor UAVs Localization	Massimiliano Bertoni et.al.	2309.13311v1	null
2023-09-22	Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms	Olivier Gamache et.al.	2309.13139v1	link
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883v1	link
2023-09-20	Conformalized Multimodal Uncertainty Regression and Reasoning	Domenico Parente et.al.	2309.11018v1	null
2023-09-20	OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving	Heng Li et.al.	2309.11011v1	link
2023-09-19	PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation	Luigi Freda et.al.	2309.10896v1	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436v1	link
2023-09-21	Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration	Hongbo Zhao et.al.	2309.10314v2	null
2023-09-18	End-to-End Learned Event- and Image-based Visual Odometry	Roberto Pellerito et.al.	2309.09947v1	link
2023-09-18	DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach	Chenghao Xu et.al.	2309.09879v1	null
2023-09-17	a critical analysis of internal reliability for uncertainty quantification of dense image matching in multi-view stereo	Debao Huang et.al.	2309.09379v1	null
2023-09-14	MC-NeRF: Muti-Camera Neural Radiance Fields for Muti-Camera Image Acquisition Systems	Yu Gao et.al.	2309.07846v1	null
2023-09-14	An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments	Yehao Liu et.al.	2309.07408v1	null
2023-09-11	Evaluating Visual Odometry Methods for Autonomous Driving in Rain	Yu Xiang Tan et.al.	2309.05249v1	null
2023-09-11	SIM-Sync: From Certifiably Optimal Synchronization over the 3D Similarity Group to Scene Reconstruction with Learned Depth	Xihang Yu et.al.	2309.05184v1	link
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147v1	null
2023-09-08	Depth Completion with Multiple Balanced Bases and Confidence for Dense Monocular SLAM	Weijian Xie et.al.	2309.04145v1	null
2023-09-05	GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction	Youmin Zhang et.al.	2309.02436v1	link
2023-09-04	EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity	Zijie Jiang et.al.	2309.01296v1	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984v1	link
2023-08-28	R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras	Aron Schmied et.al.	2308.14713v1	null
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039v1	null
2023-08-25	A Game of Bundle Adjustment -- Learning Efficient Convergence	Amir Belder et.al.	2308.13270v1	null
2023-08-24	Joint Intrinsic and Extrinsic LiDAR-Camera Calibration in Targetless Environments Using Plane-Constrained Bundle Adjustment	Liang Li et.al.	2308.12629v1	link
2023-08-19	Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters	Xiao Liu et.al.	2308.09870v1	link
2023-08-24	MIPS-Fusion: Multi-Implicit-Submaps for Scalable and Robust Online Neural RGB-D Reconstruction	Yijie Tang et.al.	2308.08741v2	null
2023-08-12	4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion	Guirong Zhuo et.al.	2308.06573v1	null
2023-08-10	Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU	U. V. B. L. Udugama et.al.	2308.05515v1	null
2023-08-01	NR-SLAM: Non-Rigid Monocular SLAM	Juan J. Gomez Rodriguez et.al.	2308.04036v1	null
2023-08-02	A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry	Cora A. Dimmig et.al.	2308.01398v1	null
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125v1	null
2023-08-02	Preliminary Design of the Dragonfly Navigation Filter	Ben Schilling et.al.	2307.13513v2	null
2023-07-19	Optimizing the extended Fourier Mellin Transformation Algorithm	Wenqing Jiang et.al.	2307.10015v1	link
2023-08-13	Distributed bundle adjustment with block-based sparse matrix compression for super large scale datasets	Maoteng Zheng et.al.	2307.08383v2	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763v1	null
2023-07-14	Multi-Session, Localization-oriented and Lightweight LiDAR Mapping Using Semantic Lines and Planes	Zehuan Yu et.al.	2307.07126v1	null
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667v2	null
2023-06-24	3D Reconstruction of Spherical Images based on Incremental Structure from Motion	San Jiang et.al.	2306.12770v2	link
2023-06-08	2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction	Jiawei He et.al.	2306.05418v1	null
2023-06-09	BAA-NGP: Bundle-Adjusting Accelerated Neural Graphics Primitives	Sainan Liu et.al.	2306.04166v2	link
2023-07-26	Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression	Jianeng Wang et.al.	2306.01188v2	null
2023-06-14	BAMF-SLAM: Bundle Adjusted Multi-Fisheye Visual-Inertial SLAM Using Recurrent Field Transforms	Wei Zhang et.al.	2306.01173v2	null
2023-07-06	OSPC: Online Sequential Photometric Calibration	Jawad Haidar et.al.	2305.17673v2	null
2023-05-20	DAC: Detector-Agnostic Spatial Covariances for Deep Local Features	Javier Tirado-Garín et.al.	2305.12250v1	link
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036v1	link
2023-05-15	Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface	Shifan Zhu et.al.	2305.08962v1	null
2023-05-15	Decentralization and Acceleration Enables Large-Scale Bundle Adjustment	Taosha Fan et.al.	2305.07026v2	link
2023-05-10	Transformer-based model for monocular visual odometry: a video understanding approach	André O. Françani et.al.	2305.06121v1	link
2023-04-29	Modality-invariant Visual Odometry for Embodied Vision	Marius Memmel et.al.	2305.00348v1	link
2023-04-29	An Efficient Plane Extraction Approach for Bundle Adjustment on LiDAR Point clouds	Zheng Liu et.al.	2305.00287v1	null
2023-04-27	Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM	Hengyi Wang et.al.	2304.14377v1	link
2023-04-23	IDLL: Inverse Depth Line based Visual Localization in Challenging Environments	Wanting Li et.al.	2304.11748v1	null
2023-04-21	FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving	Yuxuan Liu et.al.	2304.10719v1	null
2023-04-18	Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Motion Compensation	Hanyu Cai et.al.	2304.08978v1	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194v1	link
2023-04-12	SGL: Structure Guidance Learning for Camera Localization	Xudong Zhang et.al.	2304.05571v1	null
2023-04-14	Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency	Xingwu Ji et.al.	2304.05146v2	link
2023-04-11	Pointless Global Bundle Adjustment With Relative Motions Hessians	Ewelina Rupnik et.al.	2304.05118v1	link
2023-04-11	ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster	Yifei Dong et.al.	2304.04943v1	null
2023-04-04	Distributed Block Coordinate Moving Horizon Estimation for 2D Visual-Inertial-Odometry SLAM	Emilien Flayac et.al.	2304.01613v1	null
2023-03-31	LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses	Noah Stier et.al.	2304.00054v1	link
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504v1	link
2023-03-29	Photometric LiDAR and RGB-D Bundle Adjustment	Luca Di Giammarino et.al.	2303.16878v1	link
2023-03-27	3D Video Object Detection with Learnable Object-Centric Global Optimization	Jiawei He et.al.	2303.15416v1	link
2023-03-25	DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields	Yu Chen et.al.	2303.14478v1	null
2023-03-23	RGB-D-Inertial SLAM in Indoor Dynamic Environments with Long-term Large Occlusion	Ran Long et.al.	2303.13316v1	null
2023-03-21	Learning a Depth Covariance Function	Eric Dexheimer et.al.	2303.12157v1	null
2023-03-21	Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network	Alessandro Navone et.al.	2303.11725v1	null
2023-03-20	VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors	Thien Hoang Nguyen et.al.	2303.10903v1	null
2023-03-17	CoVIO: Online Continual Learning for Visual-Inertial Odometry	Niclas Vödisch et.al.	2303.10149v1	link
2023-03-15	UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry	Chaoyang Jiang et.al.	2303.08550v1	null
2023-03-13	Discovering Multiple Algorithm Configurations	Leonid Keselman et.al.	2303.07434v1	null
2023-03-09	Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation	Masahiro Hirano et.al.	2303.05192v1	null
2023-03-16	Stereo Event-based Visual-Inertial Odometry	Kunfeng Wang et.al.	2303.05086v2	link
2023-03-07	Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor	Eduardo Gallo et.al.	2303.03804v1	null
2023-03-03	Lightweight, Uncertainty-Aware Conformalized Visual Odometry	Alex C. Stutts et.al.	2303.02207v1	null
2023-02-28	LIW-OAM: Lidar-Inertial-Wheel Odometry and Mapping	Zikang Yuan et.al.	2302.14298v1	link
2023-02-24	FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets	Yelena Randall et.al.	2302.12772v1	null
2023-02-27	CP+: Camera Poses Augmentation with Large-scale LiDAR Maps	Jiadi Cui et.al.	2302.12198v2	null
2023-02-19	EdgeVO: An Efficient and Accurate Edge-based Visual Odometry	Hui Zhao et.al.	2302.09493v1	null
2023-02-12	Uncertainty-Driven Dense Two-View Structure from Motion	Weirong Chen et.al.	2302.00523v2	null
2023-01-31	Design and Implementation of A Soccer Ball Detection System with Multiple Cameras	Lei Li et.al.	2302.00123v1	null
2023-01-27	HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera	Mostafa Ahmadi et.al.	2301.11823v1	null
2023-01-26	Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial	Ola Shorinwa et.al.	2301.11313v1	null
2023-01-24	Generalized Object Search	Kaiyu Zheng et.al.	2301.10121v1	null
2023-01-22	Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories	Hanlin Chen et.al.	2301.09194v1	null
2023-01-21	Dense RGB SLAM with Neural Implicit Maps	Heng Li et.al.	2301.08930v1	null
2023-01-18	Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information	Junshi Chen et.al.	2301.07560v1	null
2023-01-17	COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM	Manthan Patel et.al.	2301.07147v1	link
2023-01-31	Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems	Pierre-Yves Lajoie et.al.	2301.06230v2	link
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604v1	null
2023-01-11	AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization	Ying Chen et.al.	2301.04620v1	link
2023-01-12	TBV Radar SLAM -- trust but verify loop candidates	Daniel Adolfsson et.al.	2301.04397v2	link
2022-12-31	Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges	Maxwell McManus et.al.	2301.03359v1	null
2023-01-09	Motion Addition and Motion Optimization	Liqun Qi et.al.	2301.03174v1	null
2023-01-08	Towards Open World NeRF-Based SLAM	Daniil Lisus et.al.	2301.03102v1	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403v1	null
2023-01-03	LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation	Shreyansh Daftry et.al.	2301.01350v1	null
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147v1	null
2023-01-03	BS3D: Building-scale 3D Reconstruction from RGB-D Images	Janne Mustaniemi et.al.	2301.01057v1	null
2023-01-10	An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping	Masoud Dayani Najafabadi et.al.	2301.00618v2	link
2022-12-25	A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion	Nadia Figueroa et.al.	2212.14772v1	null
2022-12-29	An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping	Kangcheng Liu et.al.	2212.14209v1	link
2022-12-27	Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands	Felipe Gómez-Cuba et.al.	2212.13477v1	link
2022-12-26	ESVIO: Event-based Stereo Visual Inertial Odometry	Peiyu Chen et.al.	2212.13184v1	link
2022-12-24	A Comprehensive Review on Autonomous Navigation	Saeid Nahavandi et.al.	2212.12808v1	null
2022-12-23	Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation	Marina Lotti et.al.	2212.12388v1	null
2022-12-23	Implementation of a Blind navigation method in outdoors/indoors areas	Mohammad Javadian Farzaneh et.al.	2212.12185v1	null
2022-12-22	S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations	Hriday Bavle et.al.	2212.11770v1	link
2022-12-22	Active SLAM: A Review On Last Decade	Muhammad Farhan Ahmed et.al.	2212.11654v1	null
2022-12-27	Motion, Unit Dual Quaternion and Motion Optimization	Liqun Qi et.al.	2212.11593v2	null
2022-12-22	Vision-Based Environmental Perception for Autonomous Driving	Fei Liu et.al.	2212.11453v1	null
2022-12-19	Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models	Yong Cheng et.al.	2212.09553v1	null
2022-12-16	Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments	Lasitha Weerakoon et.al.	2212.08633v1	null
2022-12-16	rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments	Bo Wei et.al.	2212.08418v1	null
2022-12-15	AirVO: An Illumination-Robust Point-Line Visual Odometry	Kuan Xu et.al.	2212.07595v1	link
2022-12-14	Autonomous Vehicle Navigation with LIDAR using Path Planning	Rahul M K et.al.	2212.07155v1	null
2022-12-14	RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping	Hyowon Kim et.al.	2212.07141v1	null
2022-12-13	Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version)	Daniil Lisus et.al.	2212.06923v1	null
2022-12-13	SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance	Chenyangguang Zhang et.al.	2212.06524v1	null
2022-12-13	Localization and Navigation System for Indoor Mobile Robot	Yanbaihui Liu et.al.	2212.06391v1	null
2022-12-12	Evaluation of RGB-D SLAM in Large Indoor Environments	Kirill Muravyev et.al.	2212.05980v1	null
2022-12-19	A Light-Weight LiDAR-Inertial SLAM System with Loop Closing	Kangcheng Liu et.al.	2212.05743v2	link
2022-12-12	An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds	Kangcheng Liu et.al.	2212.05705v1	link
2022-12-09	SLAM for Visually Impaired People: A Survey	Marziyeh Bamdad et.al.	2212.04745v1	null
2022-12-09	Ego-Body Pose Estimation via Ego-Head Pose Estimation	Jiaman Li et.al.	2212.04636v1	null
2022-12-06	Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles	Sushant Veer et.al.	2212.03323v1	link
2022-12-06	PRISM: Probabilistic Real-Time Inference in Spatial World Models	Atanas Mirchev et.al.	2212.02988v1	null
2022-12-06	RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps	Florian Sauerbeck et.al.	2212.02085v2	link
2022-12-05	DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization	Xuebo Tian et.al.	2212.02077v1	null
2022-12-05	ObjectMatch: Robust Registration using Canonical Object Correspondences	Can Gümeli et.al.	2212.01985v1	null
2022-12-02	Sparse SPN: Depth Completion from Sparse Keypoints	Yuqun Wu et.al.	2212.00987v1	null
2022-12-01	maplab 2.0 -- A Modular and Multi-Modal Mapping Framework	Andrei Cramariuc et.al.	2212.00654v1	link
2022-12-01	AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments	Mehregan Dor et.al.	2212.00350v1	null
2022-11-30	MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves	Pranjali Pathre et.al.	2211.16882v1	null
2022-11-29	PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images	Hartmut Surmann et.al.	2211.16266v1	link
2022-11-29	MmWave Mapping and SLAM for 5G and Beyond	Yu Ge et.al.	2211.16024v1	null
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127v1	null
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731v2	null
2022-11-27	Development of a Modular Real-time Shared-control System for a Smart Wheelchair	Vaishanth Ramaraj et.al.	2211.14711v1	null
2022-11-26	A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors	Jerred Chen et.al.	2211.14432v1	link
2022-11-23	ActiveRMAP: Radiance Field for Active Mapping And Planning	Huangying Zhan et.al.	2211.12656v1	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988v1	null
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836v1	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704v1	null
2022-11-24	Data Fusion for Multipath-Based SLAM: Combining Information from Multiple Propagation Paths	Erik Leitinger et.al.	2211.09241v2	null
2022-11-16	Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery	Hao Qu et.al.	2211.08904v1	null
2022-11-20	Detecting Line Segments in Motion-blurred Images with Events	Huai Yu et.al.	2211.07365v2	link
2022-11-13	Automatic Eye-in-Hand Calibration using EKF	Aditya Ramakrishnan et.al.	2211.06881v1	null
2022-11-12	Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling	Zhihao Wang et.al.	2211.06557v1	link
2022-11-11	Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications	Jie Yang et.al.	2211.05982v1	null
2022-11-10	Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time	Ignacio Torroba et.al.	2211.05601v1	link
2022-11-07	When Geometry is not Enough: Using Reflector Markers in Lidar SLAM	Gerhard Kurz et.al.	2211.03484v1	null
2022-11-07	Detecting Invalid Map Merges in Lifelong SLAM	Matthias Holoch et.al.	2211.03423v1	null
2022-11-06	Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU	Yibin Wu et.al.	2211.03174v1	link
2022-11-07	Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments	Daniel Adolfsson et.al.	2211.02445v2	link
2022-11-03	DyOb-SLAM : Dynamic Object Tracking SLAM System	Rushmian Annoy Wadud et.al.	2211.01941v1	null
2022-11-03	Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM	Yang Chen et.al.	2211.01749v1	null
2022-11-04	$D^2$SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm	Hao Xu et.al.	2211.01538v2	link
2022-11-02	Semantic SuperPoint: A Deep Semantic Descriptor	Gabriel S. Gama et.al.	2211.01098v1	link
2022-11-02	Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation	Myung-Hwan Jeon et.al.	2211.00960v1	link
2022-10-31	Mapping Extended Landmarks for Radar SLAM	Shuai Sun et.al.	2210.17207v1	null
2022-10-25	MAROAM: Map-based Radar SLAM through Two-step Feature Selection	Dequan Wang et.al.	2210.13797v1	null
2022-10-25	S3E: A Large-scale Multimodal Dataset for Collaborative SLAM	Dapeng Feng et.al.	2210.13723v1	link
2022-10-24	NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields	Antoni Rosinol et.al.	2210.13641v1	link
2022-10-24	Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging	Geng Wang et.al.	2210.13556v1	null
2022-10-28	VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points	Andreas Georgis et.al.	2210.12756v2	null
2022-10-22	SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation	Junliang Chen et.al.	2210.12417v1	null
2022-10-21	DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm	Shipeng Zhong et.al.	2210.11978v1	link
2022-10-21	Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments	Shubham Kedia et.al.	2210.11652v1	null
2022-10-22	Visual SLAM: What are the Current Trends and What to Expect?	Ali Tourani et.al.	2210.10491v2	null
2022-10-18	Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM	Geon Choi et.al.	2210.09636v1	null
2022-10-16	D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments	Ayman Beghdadi et.al.	2210.08647v1	null
2022-10-16	Indoor Smartphone SLAM with Learned Echoic Location Features	Wenjie Luo et.al.	2210.08493v1	null
2022-10-15	Self-Improving SLAM in Dynamic Environments: Learning When to Mask	Adrian Bojko et.al.	2210.08350v1	link
2022-10-13	Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems	Pushyami Kaveti et.al.	2210.07315v1	link
2022-10-12	RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map	Xuecheng Xu et.al.	2210.05984v1	link
2022-10-11	Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization	Yuanzheng He et.al.	2210.05600v1	null
2022-10-11	Autonomous Asteroid Characterization Through Nanosatellite Swarming	Kaitlin Dennison et.al.	2210.05518v1	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517v1	null
2022-10-11	Multi-Object Navigation with dynamically learned neural implicit representations	Pierre Marza et.al.	2210.05129v1	link
2022-10-12	Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation	Yulun Tian et.al.	2210.05020v2	null
2022-10-10	Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios	Xingyu Chen et.al.	2210.04562v1	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236v1	null
2022-10-06	SCORE: A Second-Order Conic Initialization for Range-Aided SLAM	Alan Papalia et.al.	2210.03177v1	link
2022-10-06	Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Kirill Mazur et.al.	2210.03043v1	null
2022-10-06	Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence	Osian Morgan et.al.	2210.02642v1	null
2022-10-05	MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation	Hanwei Zhang et.al.	2210.02038v1	null
2022-10-04	O2S: Open-source open shuttle	Nwankwo Linus et.al.	2210.01627v1	null
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320v1	null
2022-10-03	Probabilistic Volumetric Fusion for Dense Monocular SLAM	Antoni Rosinol et.al.	2210.01276v1	null
2022-10-03	DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams	John McConnell et.al.	2210.00867v1	link
2022-10-03	A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments	Ha Sier et.al.	2210.00812v1	link
2022-10-01	Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2	Ali Eslamian et.al.	2210.00278v1	null
2022-09-30	PyPose: A Library for Robot Learning with Physics-based Optimization	Chen Wang et.al.	2209.15428v1	link
2022-09-29	DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment	Mariia Gladkova et.al.	2209.14965v1	null
2022-09-28	Robust Incremental Smoothing and Mapping (riSAM)	Daniel McGann et.al.	2209.14359v1	null
2022-09-27	Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping	Chi-Ming Chung et.al.	2209.13274v1	link
2022-09-24	Graph Neural Networks for Multi-Robot Active Information Acquisition	Mariliza Tzes et.al.	2209.12091v1	null
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894v1	null
2022-09-23	involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs	Gilad Rotman et.al.	2209.11591v1	null
2022-09-23	Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot	David Balaban et.al.	2209.11432v1	null
2022-09-22	SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation	Xiao Han et.al.	2209.10817v1	null
2022-09-22	Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio	Wenhao Qiu et.al.	2209.10726v1	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710v1	null
2022-09-20	Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM	Sabir Hossain et.al.	2209.10047v1	null
2022-09-20	WGICP: Differentiable Weighted GICP-Based Lidar Odometry	Sanghyun Son et.al.	2209.09777v1	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699v1	link
2022-09-19	MeSLAM: Memory Efficient SLAM based on Neural Fields	Evgenii Kruzhkov et.al.	2209.09357v1	null
2022-09-19	LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM	Letian Zhang et.al.	2209.08810v1	null
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608v1	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578v1	link
2022-09-17	DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments	Shihao Shen et.al.	2209.08430v1	link
2022-09-17	OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM	Matthieu Zins et.al.	2209.08338v1	null
2022-09-17	PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments	Adam Dai et.al.	2209.08248v1	link
2022-09-16	ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM	Aditya Arun et.al.	2209.08091v1	null
2022-09-16	iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking	Yuhang Ming et.al.	2209.07919v1	null
2022-09-16	TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM	Mathieu Gonzalez et.al.	2209.07888v1	null
2022-09-15	Landmark Management in the Application of Radar SLAM	Shuai Sun et.al.	2209.07199v1	link
2022-09-15	PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization	Xianwei Meng et.al.	2209.07061v1	null
2022-09-14	Semantic Visual Simultaneous Localization and Mapping: A Survey	Kaiqi Chen et.al.	2209.06428v1	null
2022-09-13	Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets	Islam Ali et.al.	2209.06316v1	null
2022-09-12	A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding	Tin Lai et.al.	2209.05222v1	null
2022-09-12	Attitude-Guided Loop Closure for Cameras with Negative Plane	Ze Wang et.al.	2209.05167v1	link
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497v1	link
2022-09-08	ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology	Julio A. Placed et.al.	2209.03693v1	link
2022-09-08	R$^3$LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator	Jiarong Lin et.al.	2209.03666v1	link
2022-09-06	Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection	Brendon Forsgren et.al.	2209.02658v1	link
2022-09-05	Neuromorphic Visual Odometry with Resonator Networks	Alpha Renner et.al.	2209.02000v1	null
2022-09-05	MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM	Pavel Karpyshev et.al.	2209.01936v1	null
2022-09-05	ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics	Boyi Liu et.al.	2209.01774v1	null
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605v1	null
2022-08-31	PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM	Yifan Duan et.al.	2208.14848v1	null
2022-08-30	BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition	Peng Yin et.al.	2208.14543v1	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997v1	null
2022-08-25	FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms	Jianhao Jiao et.al.	2208.11865v1	null
2022-08-25	Lidar SLAM for Autonomous Driving Vehicles	Farhad Aghili et.al.	2208.11855v1	null
2022-08-24	DynaVINS: A Visual-Inertial SLAM for Dynamic Environments	Seungwon Song et.al.	2208.11500v1	link
2022-08-22	Doppler Exploitation in Bistatic mmWave Radio SLAM	Yu Ge et.al.	2208.10204v1	null
2022-08-21	Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping	Lintong Zhang et.al.	2208.09825v1	link
2022-08-26	JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario	Longrui Dong et.al.	2208.09777v2	null
2022-08-15	BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM	Yunge Cui et.al.	2208.07473v1	link
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325v1	null
2022-08-11	RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild	Jason Y. Zhang et.al.	2208.05963v1	null
2022-08-08	Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation	Yifei Ren et.al.	2208.04274v1	link
2022-08-08	SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty	Shuai Zhang et.al.	2208.03945v1	link
2022-08-05	A Survey on Visual Map Localization Using LiDARs and Cameras	Elhousni Mahdi et.al.	2208.03376v1	null
2022-08-04	SROS2: Usable Cyber Security Tools for ROS 2	Victor Mayoral Vilches et.al.	2208.02615v1	link
2022-08-03	Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms	Bharath Garigipati et.al.	2208.02063v1	null
2022-08-02	Present and Future of SLAM in Extreme Underground Environments	Kamak Ebadi et.al.	2208.01787v1	null
2022-08-01	Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion	Simon Boche et.al.	2208.00709v1	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455v1	link
2022-07-25	DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions	Tristan Laidlow et.al.	2207.12244v1	null
2022-07-25	Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration	Kenji Koide et.al.	2207.11942v1	null
2022-07-22	NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction	Yunlong Ran et.al.	2207.10985v1	null
2022-07-22	Dense RGB-D-Inertial SLAM with Map Deformations	Tristan Laidlow et.al.	2207.10940v1	null
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916v1	null
2022-07-21	Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion	Suman Ghosh et.al.	2207.10494v1	link
2022-07-21	Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions	Quentin Serdel et.al.	2207.10489v1	link
2022-07-21	On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity	Yujin Lu et.al.	2207.10413v1	null
2022-07-19	Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM	Tuvy Lemberg et.al.	2207.09103v1	null
2022-07-18	DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM	Weicai Ye et.al.	2207.08794v1	link
2022-07-18	Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction	Marco Orsingher et.al.	2207.08439v1	null
2022-07-18	ORB-based SLAM accelerator on SoC FPGA	Vibhakar Vemulapati et.al.	2207.08405v1	null
2022-07-14	Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset	Riccardo Giubilato et.al.	2207.06815v1	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738v1	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732v1	null
2022-07-13	SLAM: SLO-Aware Memory Optimization for Serverless Applications	Gor Safaryan et.al.	2207.06183v1	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058v2	link
2022-07-12	Accelerating Certifiable Estimation with Preconditioned Eigensolvers	David M. Rosen et.al.	2207.05257v1	null
2022-07-12	Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features	Meiyu Zhi et.al.	2207.05244v1	null
2022-07-14	SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial	Chih-Yuan Chiu et.al.	2207.05043v2	null
2022-07-08	BlindSpotNet: Seeing Where We Cannot See	Taichi Fukuda et.al.	2207.03870v1	null
2022-07-08	Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints	Philipp Glira et.al.	2207.03785v1	null
2022-07-08	Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements	Ran Liu et.al.	2207.03700v1	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539v1	null
2022-07-06	VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization	Marius Laska et.al.	2207.02668v1	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396v1	null
2022-07-04	VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM	Ling Gao et.al.	2207.01404v1	null
2022-07-04	VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM	Danpeng Chen et.al.	2207.01158v1	null
2022-07-03	Wireless Channel Prediction in Partially Observed Environments	Mingsheng Yin et.al.	2207.00934v1	null
2022-07-01	A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers	Julio A. Placed et.al.	2207.00254v1	null
2022-07-01	Keeping Less is More: Point Sparsification for Visual SLAM	Yeonsoo Park et.al.	2207.00225v1	null
2022-06-30	Controlled and impulsive compression of an entrapped air bubble during impact	Utkarsh Jain et.al.	2206.15297v1	null
2022-06-30	Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery	Yuehao Wang et.al.	2206.15255v1	link
2022-06-27	IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments	Abanob Soliman et.al.	2206.13455v1	link
2022-06-26	An Efficient Global Optimality Certificate for Landmark-Based SLAM	Connor Holmes et.al.	2206.12961v1	link
2022-06-21	Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping	Davide Tateo et.al.	2206.10263v1	link
2022-06-20	Data Fusion for Radio Frequency SLAM with Robust Sampling	Erik Leitinger et.al.	2206.09746v1	null
2022-06-19	RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments	Chenglong Qian et.al.	2206.09463v1	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733v1	null
2022-06-17	An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions	Yijun Yuan et.al.	2206.08712v1	link
2022-06-13	ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy	Hao Bai et.al.	2206.06435v1	null
2022-06-10	Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming	Javier Cremona et.al.	2206.05066v1	link
2022-06-09	SparseFormer: Attention-based Depth Completion Network	Frederik Warburg et.al.	2206.04557v1	null
2022-06-07	Robot Self-Calibration Using Actuated 3D Sensors	Arne Peters et.al.	2206.03430v1	null
2022-06-07	Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map	Haodong Yuan et.al.	2206.03062v1	null
2022-06-05	DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions	Alena Savinykh et.al.	2206.02199v1	null
2022-06-04	C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy	Erez Posner et.al.	2206.01961v1	null
2022-06-01	PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry	Dong-Uk Seo et.al.	2206.00266v1	link
2022-05-27	A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching	Arno Solin et.al.	2205.13821v1	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135v2	link
2022-05-25	Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM	Milad Ramezani et.al.	2205.12595v1	null
2022-05-24	Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM	Christopher E. Denniston et.al.	2205.12402v1	link
2022-05-22	ALITA: A Large-scale Incremental Dataset for Long-term Autonomy	Peng Yin et.al.	2205.10737v1	link
2022-05-19	FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2	Jeffrey Ichnowski et.al.	2205.09778v1	link
2022-05-17	Global Data Association for SLAM with 3D Grassmannian Manifold Objects	Parker C. Lusk et.al.	2205.08556v1	null
2022-05-19	Cluster on Wheels	Yuanyuan Yang et.al.	2205.08151v2	null
2022-05-12	Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry	Shihao Shen et.al.	2205.05916v1	link
2022-05-12	S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization	Ran Cheng et.al.	2205.05861v1	null
2022-05-14	Multi-modal Semantic SLAM for Complex Dynamic Environments	Han Wang et.al.	2205.04300v2	link
2022-05-06	OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations	Carmen Delgado et.al.	2205.03256v1	null
2022-05-05	CNN-Augmented Visual-Inertial SLAM with Planar Constraints	Pan Ji et.al.	2205.02940v1	null
2022-05-05	PMBM-based SLAM Filters in 5G mmWave Vehicular Networks	Hyowon Kim et.al.	2205.02502v1	null
2022-05-04	BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking	Dorian Henning et.al.	2205.02301v1	null
2022-05-04	A Global Asymptotic Convergent Observer for SLAM	Seyed Hamed Hashemi et.al.	2205.01953v1	null
2022-05-04	Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation	Nathaniel Merrill et.al.	2205.01823v1	link
2022-05-03	GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping	Pan Ji et.al.	2205.01656v1	null
2022-04-29	Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM	Jinwoo Jeon et.al.	2204.13877v1	link
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831v1	null
2022-04-27	Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment	Wenyu Li et.al.	2204.12769v1	null
2022-04-29	MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment	Tingchen Ma et.al.	2204.11621v2	null
2022-04-23	Indoor simultaneous localization and mapping based on fringe projection profilometry	Yang Zhao et.al.	2204.11020v1	null
2022-04-22	Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria	Julio A. Placed et.al.	2204.10631v1	null
2022-04-22	Fast Autonomous Robotic Exploration Using the Underlying Graph Structure	Julio A. Placed et.al.	2204.10610v1	null
2022-04-22	Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions	Yutong Hu et.al.	2204.10552v1	null
2022-04-22	Implicit Object Mapping With Noisy Data	Jad Abou-Chakra et.al.	2204.10516v1	link
2022-04-19	Photometric single-view dense 3D reconstruction in endoscopy	Victor M. Batlle et.al.	2204.09083v1	null
2022-04-18	Pulsar skips: Understanding variations in the regular periods of rotating neutron stars	Clayton Miller et.al.	2204.08449v1	null
2022-04-18	Tracking monocular camera pose and deformation for SLAM inside the human body	Juan J. Gomez Rodriguez et.al.	2204.08309v1	null
2022-04-18	Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker	Hanjing Ye et.al.	2204.08163v1	null
2022-04-14	ViViD++: Vision for Visibility Dataset	Alex Junho Lee et.al.	2204.06183v2	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481v1	null
2022-04-12	RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room	Cong Gao et.al.	2204.05467v1	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932v1	link
2022-04-04	Monitoring social distancing with single image depth estimation	Alessio Mingozzi et.al.	2204.01693v1	null
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524v1	null
2022-04-04	IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers	Lei Sun et.al.	2204.01324v1	link
2022-04-03	Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor	Wenyan Ou et.al.	2204.01154v1	null
2022-04-02	UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps	Ayyappa Swamy Thatavarthy et.al.	2204.00865v1	link
2022-03-31	Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects	Yujie Lu et.al.	2204.00035v1	null
2022-03-30	GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios	Chih-Yuan Chiu et.al.	2203.16690v1	null
2022-03-29	Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field	Mostafa Osman et.al.	2203.15866v1	null
2022-03-29	Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform	Mingjun Li et.al.	2203.15439v1	null
2022-03-29	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots	Pranay Mathur et.al.	2203.15272v1	null
2022-03-28	Are High-Resolution Event Cameras Really Needed?	Daniel Gehrig et.al.	2203.14672v1	null
2022-03-25	Spectral Measurement Sparsification for Pose-Graph SLAM	Kevin J. Doherty et.al.	2203.13897v1	link
2022-03-25	FD-SLAM: 3-D Reconstruction Using Features and Dense Matching	Xingrui Yang et.al.	2203.13861v1	null
2022-03-25	Gravity-constrained point cloud registration	Vladimír Kubelka et.al.	2203.13799v1	null
2022-03-24	MD-SLAM: Multi-cue Direct SLAM	Luca Di Giammarino et.al.	2203.13237v1	link
2022-03-24	Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video	Shun Taguchi et.al.	2203.12804v1	null
2022-03-19	Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems	Jie Yang et.al.	2203.10267v1	null
2022-03-16	Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR	Ian D. Miller et.al.	2203.08925v1	link
2022-03-15	Neural RF SLAM for unsupervised positioning and mapping with channel state information	Shreya Kadambi et.al.	2203.08264v1	null
2022-03-15	Simultaneous Localisation and Mapping with Quadric Surfaces	Tristan Laidlow et.al.	2203.08040v1	null
2022-03-14	Drift Reduced Navigation with Deep Explainable Features	Mohd Omama et.al.	2203.06897v1	link
2022-03-11	An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs	Keisuke Sugiura et.al.	2203.05763v1	null
2022-03-10	High Definition, Inexpensive, Underwater Mapping	Bharat Joshi et.al.	2203.05640v1	link
2022-03-10	SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning	Jaehoon Choi et.al.	2203.05332v1	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446v1	link
2022-03-08	SLAM-Supported Self-Training for 6D Object Pose Estimation	Ziqi Lu et.al.	2203.04424v1	link
2022-03-08	An Online Semantic Mapping System for Extending and Enhancing Visual SLAM	Thorsten Hempel et.al.	2203.03944v1	null
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454v1	link
2022-03-07	OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition	Junyi Ma et.al.	2203.03397v1	link
2022-03-06	Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM	Kazushi Aiba et.al.	2203.02887v1	null
2022-03-06	RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects	Ran Long et.al.	2203.02882v1	null
2022-03-03	STUN: Self-Teaching Uncertainty Estimation for Place Recognition	Kaiwen Cai et.al.	2203.01851v1	link
2022-03-03	Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning	Niclas Vödisch et.al.	2203.01578v1	link
2022-03-02	FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2203.00893v1	link
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851v1	null
2022-03-01	Descriptellation: Deep Learned Constellation Descriptors for SLAM	Chunwei Xing et.al.	2203.00567v1	null
2022-03-01	Collaborative Robot Mapping using Spectral Graph Analysis	Lukas Bernreiter et.al.	2203.00308v1	null
2022-02-26	RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization	Nikolaos Kourtzanidis et.al.	2202.13221v1	link
2022-02-25	Probabilistic Data Association for Semantic SLAM at Scale	Elad Michael et.al.	2202.12802v1	link
2022-02-24	TwistSLAM: Constrained SLAM in Dynamic Environment	Mathieu Gonzalez et.al.	2202.12384v1	null
2022-02-24	Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion	Hyeonsoo Jang et.al.	2202.12108v1	null
2022-02-23	MITI: SLAM Benchmark for Laparoscopic Surgery	Regine Hartwig et.al.	2202.11496v1	null
2022-02-23	DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization	Xuebo Tian et.al.	2202.11431v1	null
2022-02-23	Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets	Islam Ali et.al.	2202.11312v1	null
2022-02-22	SAGE: SLAM with Appearance and Geometry Prior for Endoscopy	Xingtong Liu et.al.	2202.09487v2	link
2022-02-18	OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure	Stefan Leutenegger et.al.	2202.09199v1	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146v1	link
2022-02-18	An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems	Qiang Liu et.al.	2202.08952v1	null
2022-02-17	Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study	Giovanni Cioffi et.al.	2202.08894v1	link
2022-02-17	LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building	Jiashi Zhang et.al.	2202.08487v1	null
2022-02-16	Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments	Jinkun Wang et.al.	2202.08359v1	null
2022-02-11	Overhead Image Factors for Underwater Sonar-based SLAM	John McConnell et.al.	2202.05811v1	null
2022-02-10	Scale Estimation with Dual Quadrics for Monocular Object SLAM	Shuangfu Song et.al.	2202.04816v1	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677v1	null
2022-01-25	Autonomous Vehicles: Open-Source Technologies, Considerations, and Development	Oussama Saoudi et.al.	2202.03148v1	null
2022-02-07	Temporal Point Cloud Completion with Pose Disturbance	Jieqi Shi et.al.	2202.03084v1	null
2022-02-04	DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938v1	null
2022-02-01	A Model for Multi-View Residual Covariances based on Perspective Deformation	Alejandro Fontan et.al.	2202.00765v1	null
2022-01-30	Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM	Xinghe Chu et.al.	2201.12726v1	null
2022-01-28	RGB-D SLAM Using Attention Guided Frame Association	Ali Caglayan et.al.	2201.12047v1	null
2022-02-04	Learning to Act with Affordance-Aware Multimodal Neural SLAM	Zhiwei Jia et.al.	2201.09862v2	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048v1	link
2022-01-17	SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System	Giseop Kim et.al.	2201.06423v1	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386v1	link
2022-01-19	Multi-Hypothesis Scan Matching through Clustering	Giorgio Iavicoli et.al.	2201.03814v2	null
2022-01-11	Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM	Kevin J. Doherty et.al.	2201.03773v1	null
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364v1	link
2022-01-10	Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition	M. Usman Maqbool Bhutta et.al.	2201.03212v1	link
2022-01-04	Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds	Xueliang Wen et.al.	2201.00959v1	null
2021-12-29	Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic	Khen Elimelech et.al.	2112.14428v1	null
2021-12-19	M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots	Jie Yin et.al.	2112.13659v1	link
2021-12-27	UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping	Hyunjun Lim et.al.	2112.13515v1	link
2021-12-25	Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs	Yusheng Wang et.al.	2112.13224v1	null
2021-12-25	Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping	Peng Huang et.al.	2112.13222v1	null
2021-12-24	3D Point Cloud Reconstruction and SLAM as an Input	Ziyu Li et.al.	2112.12907v1	null
2021-12-22	NICE-SLAM: Neural Implicit Scalable Encoding for SLAM	Zihan Zhu et.al.	2112.12130v1	link
2021-12-18	Fast and Robust Registration of Partially Overlapping Point Clouds	Eduardo Arnold et.al.	2112.09922v1	link
2021-12-17	Symmetry-aware Neural Architecture for Embodied Visual Navigation	Shuang Liu et.al.	2112.09515v1	null
2021-12-27	Homography Decomposition Networks for Planar Object Tracking	Xinrui Zhan et.al.	2112.07909v3	link
2021-12-14	Autonomous Navigation System from Simultaneous Localization and Mapping	Micheal Caracciolo et.al.	2112.07723v1	link
2021-12-12	360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation	Bolivar Solarte et.al.	2112.06180v2	link
2021-12-11	Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization	Amay Saxena et.al.	2112.05921v1	null
2021-12-07	Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems	Gideon Billings et.al.	2112.03826v1	link
2021-12-05	Iterated Posterior Linearization PMB Filter for 5G SLAM	Yu Ge et.al.	2112.02575v1	null
2021-12-03	Fast Direct Stereo Visual SLAM	Jiawei Mo et.al.	2112.01890v1	link
2021-12-02	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349v2	link
2021-12-01	Research on Event Accumulator Settings for Event-Based SLAM	Kun Xiao et.al.	2112.00427v1	link
2021-11-29	An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments	Assem Sadek et.al.	2111.14666v1	null
2021-11-29	Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report	Hartmut Surmann et.al.	2111.14542v1	null
2021-11-24	Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment	V. Ayala-Alfaro et.al.	2111.12690v1	null
2021-11-24	Autonomous bot with ML-based reactive navigation for indoor environment	Yash Srivastava et.al.	2111.12542v1	null
2021-11-22	A General Framework for Lifelong Localization and Mapping in Changing Environment	Min Zhao et.al.	2111.10946v1	link
2021-11-17	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network	Xiaoming Zhao et.al.	2111.09006v2	null
2021-11-10	Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models	Bruno Santos et.al.	2111.05631v1	null
2021-11-10	TomoSLAM: factor graph optimization for rotation angle refinement in microtomography	Mark Griguletskii et.al.	2111.05562v1	null
2021-11-07	Hierarchical Segment-based Optimization for SLAM	Yuxin Tian et.al.	2111.04101v1	null
2021-11-07	Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM	Shing Yan Loo et.al.	2111.04096v2	null
2021-11-05	MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry	Joan P. Company-Corcoles et.al.	2111.03408v1	null
2021-10-31	Loop closure detection using local 3D deep descriptors	Youjie Zhou et.al.	2111.00440v1	link
2021-10-27	Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification	Mingsheng Yin et.al.	2110.14789v2	link
2021-10-27	Efficient Placard Discovery for Semantic Mapping During Frontier Exploration	David Balaban et.al.	2110.14742v1	null
2021-10-26	Robust Multi-view Registration of Point Sets with Laplacian Mixture Model	Jin Zhang et.al.	2110.13744v1	null
2021-10-25	WOLF: A modular estimation framework for robotics based on factor graphs	Joan Sola et.al.	2110.12919v1	null
2021-10-21	Real-Time Ground-Plane Refined LiDAR SLAM	Fan Yang et.al.	2110.11517v1	null
2021-10-21	SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words	Jonathan J. Y. Kim et.al.	2110.11491v1	null
2021-10-21	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion	Zhenkun Zhu et.al.	2110.11040v2	null
2021-10-20	SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training	Ankur Bapna et.al.	2110.10329v1	null
2021-10-18	Enhancing exploration algorithms for navigation with visual SLAM	Kirill Muravyev et.al.	2110.09156v1	null
2021-10-18	Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment	Rui Tian et.al.	2110.08977v1	null
2021-10-16	Partial Hierarchical Pose Graph Optimization for SLAM	Alexander Korovko et.al.	2110.08639v1	null
2021-10-14	Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach	Shumon Koga et.al.	2110.07546v1	null
2021-10-13	Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity	Ran Liu et.al.	2110.06541v2	null
2021-10-12	Learning Efficient Multi-Agent Cooperative Visual Exploration	Chao Yu et.al.	2110.05734v1	null
2021-10-07	Self-Supervised Depth Completion for Active Stereo	Frederik Warburg et.al.	2110.03234v1	null
2021-10-06	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes	Zhenkun Zhu et.al.	2110.02593v1	null
2021-10-03	AEROS: Adaptive RObust least-Squares for Graph-Based SLAM	Milad Ramezani et.al.	2110.02018v1	null
2021-10-04	Fast Uncertainty Quantification for Active Graph SLAM	Julio A. Placed et.al.	2110.01289v1	link
2021-10-04	Geometry-based Graph Pruning for Lifelong SLAM	Gerhard Kurz et.al.	2110.01286v1	null
2021-10-03	Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration	Marcus Greiff et.al.	2110.01099v1	null
2021-10-02	Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows	Qiangqiang Huang et.al.	2110.00876v1	link


SFM

Publish Date	Title	Authors	PDF	Code
2025-03-06	PLMP -- Point-Line Minimal Problems for Projective SfM	Kim Kiehn et.al.	2503.04351v1	null
2025-03-03	ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization	Anas Abdelkarim et.al.	2503.01311v1	null
2025-03-05	A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping	Jialei He et.al.	2503.01202v3	null
2025-03-02	MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain	Rui Yi Yong et.al.	2503.00853v1	null
2025-03-02	PSRGS: Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery	BoCheng Li et.al.	2503.00848v1	null
2025-03-02	Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration	Jinjiang You et.al.	2503.00737v1	link
2025-02-27	Best Foot Forward: Robust Foot Reconstruction in-the-wild	Kyle Fogarty et.al.	2502.20511v1	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779v3	null
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684v1	link
2025-02-19	Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections	Seong Jong Yoo et.al.	2502.13986v1	null
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras	Dongki Jung et.al.	2502.12545v2	null
2025-02-10	FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences	Oliver Boyne et.al.	2502.06367v1	link
2025-02-10	Building Rome with Convex Optimization	Haoyu Han et.al.	2502.04640v2	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657v1	null
2025-03-02	GP-GS: Gaussian Processes for Enhanced Gaussian Splatting	Zhihao Guo et.al.	2502.02283v3	link
2025-02-03	XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications	Shangjin Zhai et.al.	2502.01297v1	null
2025-01-28	Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction	Tim Flückiger et.al.	2501.16221v2	null
2025-01-25	Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos	Zhen-Hui Dong et.al.	2501.15096v1	null
2025-01-24	MATCHA: Towards Matching Anything	Fei Xue et.al.	2501.14945v1	null
2025-01-24	Light3R-SfM: Towards Feed-forward Structure-from-Motion	Sven Elflein et.al.	2501.14914v1	null
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277v1	null
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015v2	null
2025-02-02	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927v2	link
2025-01-11	Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis	Aditya Rauniyar et.al.	2501.06431v1	null
2025-01-06	Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation	Yuezhang Lv et.al.	2501.02821v1	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409v1	null
2025-01-02	EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy	Ao Gao et.al.	2501.01003v1	null
2024-12-30	KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences	Keng-Wei Chang et.al.	2412.20767v1	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806v1	null
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103v1	null
2024-12-18	SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video	Jongmin Park et.al.	2412.09982v2	null
2024-12-10	Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling	Hui Deng et.al.	2412.07230v1	null
2024-12-08	Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features	Yuanbo Xiangli et.al.	2412.05826v1	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463v2	null
2024-12-02	SfM-Free 3D Gaussian Splatting via Hierarchical Training	Bo Ji et.al.	2412.01553v1	link
2024-12-02	MVImgNet2.0: A Larger-scale Dataset of Multi-view Images	Xiaoguang Han et.al.	2412.01430v1	null
2024-12-02	Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM	Alejandro Fontan et.al.	2412.01116v1	null
2024-11-27	RoMo: Robust Motion Segmentation Improves Structure from Motion	Lily Goli et.al.	2411.18650v1	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779v1	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291v1	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592v1	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546v1	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879v1	null
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453v1	null
2024-11-08	From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $\alpha$-NeuS	Haoran Zhang et.al.	2411.05362v1	link
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213v1	null
2024-10-25	A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint	Changshi Mu et.al.	2410.19473v1	link
2024-10-30	Large Spatial Model: End-to-end Unposed Images to Semantic 3D	Zhiwen Fan et.al.	2410.18956v2	link
2024-10-23	PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting	Yu Wang et.al.	2410.17505v1	null
2024-10-20	Neural Active Structure-from-Motion in Dark and Textureless Environment	Kazuto Ichimaru et.al.	2410.15378v1	null
2024-10-16	Gravity-aligned Rotation Averaging with Circular Regression	Linfei Pan et.al.	2410.12763v1	link
2024-10-15	SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection	Yizhe Liu et.al.	2410.12080v1	link
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505v1	null
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533v1	link
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434v1	null
2024-10-08	Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?	Charalambos Tzamos et.al.	2410.05984v1	link
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861v1	null
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386v1	null
2024-09-29	Robust Incremental Structure-from-Motion with Hybrid Features	Shaohui Liu et.al.	2409.19811v1	null
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152v1	null
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673v1	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981v1	null
2024-09-24	Frequency-based View Selection in Gaussian Splatting Reconstruction	Monica M. Q. Li et.al.	2409.16470v1	null
2024-10-07	Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion	Juan-Diego Florez et.al.	2409.16465v2	null
2024-09-24	Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research	Vandita Shukla et.al.	2409.15914v1	null
2024-09-23	Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments	Francisco Roza de Moraes et.al.	2409.15602v1	null
2024-09-17	GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module	Yichen Zhang et.al.	2409.11307v1	null
2024-09-13	Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints	Shan Chen et.al.	2409.08613v1	null
2024-09-09	KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction	Davide Di Nucci et.al.	2409.05407v1	null
2024-09-04	Object Gaussian for Monocular 6D Pose Estimation from Sparse Views	Luqing Luo et.al.	2409.02581v1	null
2024-09-25	Geometry-aware Feature Matching for Large-Scale Structure from Motion	Gonglin Chen et.al.	2409.02310v3	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373v3	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445v2	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966v1	null
2024-08-20	TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks	Jinjie Mai et.al.	2408.10739v1	null
2024-08-16	Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS	Wei Sun et.al.	2408.08723v1	null
2024-08-15	CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning	Wei Zhu et.al.	2408.08134v1	link
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648v1	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825v1	null
2024-08-04	Birational geometry of critical loci in Algebraic Vision	Marina Bertolini et.al.	2408.02067v1	null
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone	Xin Yang et.al.	2408.02053v1	null
2024-08-02	Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris	Kentaro Uno et.al.	2408.01035v1	null
2024-08-01	LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting	Zhenyu Bao et.al.	2408.00254v1	null
2024-07-29	Global Structure-from-Motion Revisited	Linfei Pan et.al.	2407.20219v1	link
2024-08-06	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166v2	null
2024-07-16	NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models	Francesco Milano et.al.	2407.12207v1	link
2024-07-15	LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning	Zhuozhu Jian et.al.	2407.10782v1	null
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406v1	null
2024-07-14	3DEgo: 3D Editing on the Go!	Umar Khalid et.al.	2407.10102v1	null
2024-07-10	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization	Jinjie Mai et.al.	2407.08023v1	link
2024-07-09	Computer vision tasks for intelligent aerospace missions: An overview	Huilin Chen et.al.	2407.06513v1	null
2024-07-08	Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views	Jiawei Guo et.al.	2407.05666v1	null
2024-07-05	Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization	Shaohan Li et.al.	2407.04260v1	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939v3	null
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918v1	link
2024-07-02	Indoor 3D Reconstruction with an Unknown Camera-Projector Pair	Zhaoshuai Qi et.al.	2407.01945v1	null
2024-05-29	Rotation Averaging: A Primal-Dual Method and Closed-Forms in Cycle Graphs	Gabriel Moreira et.al.	2406.18564v1	null
2024-06-26	VDG: Vision-Only Dynamic Gaussian for Driving Simulation	Hao Li et.al.	2406.18198v1	null
2024-06-25	Consensus Learning with Deep Sets for Essential Matrix Estimation	Dror Moran et.al.	2406.17414v1	link
2024-06-24	Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction	Tong Qin et.al.	2406.16289v1	null
2024-06-19	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	Umair Haroon et.al.	2406.13515v1	null
2024-06-17	MegaScenes: Scene-Level View Synthesis at Scale	Joseph Tung et.al.	2406.11819v1	link
2024-06-10	Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Xin Jin et.al.	2406.06216v1	link
2024-06-13	Gaussian Splatting with Localized Points Management	Haosen Yang et.al.	2406.04251v2	null
2024-06-04	CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation	Dejia Xu et.al.	2406.02509v1	null
2024-05-29	Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy	Zijie Jiang et.al.	2405.18863v1	null
2024-05-29	3D Reconstruction with Fast Dipole Sums	Hanyu Chen et.al.	2405.16788v3	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599v1	null
2024-05-09	Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment	Simon Weber et.al.	2405.05079v2	link
2024-05-07	Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications	Markus Hillemann et.al.	2405.04345v1	null
2024-05-07	Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling	Jiawei Shi et.al.	2405.04309v1	null
2024-05-03	HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2	Miriam Jäger et.al.	2405.02005v1	null
2024-04-22	Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer	Eric Brachmann et.al.	2404.14351v1	null
2024-04-22	RESFM: Robust Equivariant Multiview Structure from Motion	Fadi Khatib et.al.	2404.14280v1	null
2024-05-23	Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting	Yalda Foroutan et.al.	2404.12547v3	null
2024-05-07	A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion	Feng Yu et.al.	2404.11590v2	link
2024-04-18	DeblurGS: Gaussian Splatting for Camera Motion Blur	Jeongtaek Oh et.al.	2404.11358v2	null
2024-05-21	LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives	Jiadi Cui et.al.	2404.09748v2	null
2024-04-12	MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance	Yuqun Wu et.al.	2404.08252v1	null
2024-04-11	Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Keonhee Han et.al.	2404.07933v1	null
2024-04-07	NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization	Peng Tu et.al.	2404.04875v1	null
2024-04-04	GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis	Emmanouil Nikolakakis et.al.	2404.03126v1	null
2024-03-29	InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds	Zhiwen Fan et.al.	2403.20309v1	link
2024-03-29	HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes	Zhuopeng Li et.al.	2403.20032v1	null
2024-03-26	NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation	Jiahao Chen et.al.	2403.17537v1	null
2024-03-25	INPC: Implicit Neural Point Clouds for Radiance Field Rendering	Florian Hahlbohm et.al.	2403.16862v1	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639v1	null
2024-03-14	Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting	Jaewoo Jung et.al.	2403.09413v1	link
2024-03-13	Refractive COLMAP: Refractive Structure-from-Motion Revisited	Mengkun She et.al.	2403.08640v1	null
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156v1	link
2024-03-11	SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection	Yifu Tao et.al.	2403.06877v1	null
2024-03-24	BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling	Cheng Peng et.al.	2403.04926v2	link
2024-02-22	GaussianPro: 3D Gaussian Splatting with Progressive Propagation	Kai Cheng et.al.	2402.14650v1	null
2024-02-25	A Robust Error-Resistant View Selection Method for 3D Reconstruction	Shaojie Zhang et.al.	2402.11431v2	null
2024-02-17	Dense Matchers for Dense Tracking	Tomáš Jelínek et.al.	2402.11287v1	null
2024-03-11	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592v2	link
2024-01-22	HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs	Zelin Gao et.al.	2401.11711v1	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886v1	null
2024-01-15	3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data	Mathilde Letard et.al.	2401.09481v1	link
2024-01-17	3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey	Thiago Lopes Trugillo da Silveira et.al.	2401.09252v1	null
2024-01-17	ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization	Weiyao Wang et.al.	2401.08937v1	null
2024-01-16	Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions	Yi-Fan Zuo et.al.	2401.08043v1	link
2024-01-10	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects	Tianhang Cheng et.al.	2401.05236v1	link
2024-01-07	A Classification of Critical Configurations for any Number of Projective Views	Martin Bråtelund et.al.	2401.03450v1	link
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471v1	null
2023-12-16	Transformers in Unsupervised Structure-from-Motion	Hemang Chawla et.al.	2312.10529v1	link
2023-12-14	HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video	Xueying Wang et.al.	2312.08863v1	null
2023-12-14	CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning	Qingsong Yan et.al.	2312.08760v1	null
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865v1	link
2023-12-11	Gaussian Splatting SLAM	Hidenobu Matsuki et.al.	2312.06741v1	null
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889v1	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563v1	null
2023-11-30	Distributed Global Structure-from-Motion with a Deep Front-End	Ayush Baid et.al.	2311.18801v1	link
2023-11-21	Robot Hand-Eye Calibration using Structure-from-Motion	Nicolas Andreff et.al.	2311.11808v2	null
2023-11-18	LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation	Sébastien Henry et.al.	2311.11171v1	null
2023-11-10	MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty	Rémi Marsal et.al.	2311.06137v1	link
2023-11-08	VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering	Linus Franke et.al.	2311.04634v1	link
2023-10-22	A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video	Jan Emily Mangulabnan et.al.	2310.14364v1	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605v1	null
2023-10-09	Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration	Chunge Bai et.al.	2310.05504v1	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134v1	null
2023-11-29	Pose-Free Generalizable Rendering Transformer	Zhiwen Fan et.al.	2310.03704v2	link
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092v1	null
2023-10-01	Propagating Semantic Labels in Video Data	David Balaban et.al.	2310.00783v1	null
2023-09-22	Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning	Jonathan Sauder et.al.	2309.12804v1	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883v1	link
2023-09-19	Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water	Jayesh Tripathi et.al.	2309.10269v1	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927v1	link
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147v1	null
2023-09-01	SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation	Youhong Wang et.al.	2309.00526v1	null
2023-09-01	Dense Voxel 3D Reconstruction Using a Monocular Event Camera	Haodong Chen et.al.	2309.00385v1	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984v1	link
2023-08-26	Disjoint Pose and Shape for 3D Face Reconstruction	Raja Kumar et.al.	2308.13903v1	null
2023-08-30	CamP: Camera Preconditioning for Neural Radiance Fields	Keunhong Park et.al.	2308.10902v2	null
2023-08-18	Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling	Haorui Ji et.al.	2308.10705v1	null
2023-08-14	Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation	Tao Liu et.al.	2308.07231v1	link
2023-08-11	Efficient Large-scale AUV-based Visual Seafloor Mapping	Mengkun She et.al.	2308.06147v1	null
2023-08-04	EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems	Weihan Wang et.al.	2308.02670v1	null
2023-08-15	Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites	Jyotirmaya Shivottam et.al.	2308.01246v2	link
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125v1	null
2023-07-27	PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking	Yang Zheng et.al.	2307.15055v1	link
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702v2	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981v1	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520v1	null
2023-07-07	RGB-D Mapping and Tracking in a Plenoxel Radiance Field	Andreas L. Teigen et.al.	2307.03404v1	link
2023-06-29	The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes	David Recasens et.al.	2306.16917v1	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669v1	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667v2	null
2023-06-24	3D Reconstruction of Spherical Images based on Incremental Structure from Motion	San Jiang et.al.	2306.12770v2	link
2023-06-15	NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations	Varun Jampani et.al.	2306.09109v1	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012v1	link
2023-06-10	3D reconstruction using Structure for Motion	Kshitij Karnawat et.al.	2306.06360v1	link
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938v1	null
2023-05-31	FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow	Cameron Smith et.al.	2306.00180v1	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036v1	link
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301v1	link
2023-05-09	Rotation Synchronization via Deep Matrix Factorization	Gk Tejus et.al.	2305.05268v1	link
2023-04-20	A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion	Miriam Jäger et.al.	2304.10664v1	null
2023-04-14	Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments	Felix Ott et.al.	2304.07250v1	null
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947v1	link
2023-04-08	Photometric Correction for Infrared Sensors	Jincheng Zhang et.al.	2304.03930v1	null
2023-04-07	DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium	Antyanta Bangunharcana et.al.	2304.03560v1	link
2023-04-05	Semantic Validation in Structure from Motion	Joseph Rowell et.al.	2304.02420v1	link
2023-03-31	Learning Internal Representations of 3D Transformations from 2D Projected Inputs	Marissa Connor et.al.	2303.17776v1	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504v1	link
2023-03-27	TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering	Jaehoon Choi et.al.	2303.15060v1	null
2023-03-26	On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks	HyunJun Jung et.al.	2303.14840v1	link
2023-03-24	Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container	Jinguang Tong et.al.	2303.13805v1	link
2023-03-24	Progressively Optimized Local Radiance Fields for Robust View Synthesis	Andreas Meuleman et.al.	2303.13791v1	null
2023-03-15	RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters	Shuja Khalid et.al.	2303.08695v1	null
2023-03-09	Revisiting Rotation Averaging: Uncertainties and Robust Losses	Ganlin Zhang et.al.	2303.05195v1	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239v1	link
2023-03-25	BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling	Sameera Ramasinghe et.al.	2302.13543v3	null
2023-02-21	EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images	Zhichao Ye et.al.	2302.10544v1	link
2023-02-18	Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering	Tatsuro Yamane et.al.	2302.09208v1	null
2023-02-12	Uncertainty-Driven Dense Two-View Structure from Motion	Weirong Chen et.al.	2302.00523v2	null
2023-01-28	AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion	Yu Chen et.al.	2301.12135v1	null
2023-01-20	A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles	Zhefan Xu et.al.	2301.08422v1	link
2023-03-21	Robust Dynamic Radiance Fields	Yu-Lun Liu et.al.	2301.02239v2	link
2022-12-24	Polarimetric Multi-View Inverse Rendering	Jinyu Zhao et.al.	2212.12721v1	null
2022-12-13	Accidental Turntables: Learning 3D Pose by Watching Objects Turn	Zezhou Cheng et.al.	2212.06300v1	null
2022-12-04	3D Object Aided Self-Supervised Monocular Depth Estimation	Songlin Wei et.al.	2212.01768v1	null
2022-12-02	High-Res Facial Appearance Capture from Polarized Smartphone Images	Dejan Azinović et.al.	2212.01160v1	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069v1	link
2022-11-24	JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models	Sepidehsadat Hosseini et.al.	2211.13785v1	null
2022-11-24	SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks	Sergio Izquierdo et.al.	2211.13551v1	link
2022-11-22	Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces	Yuxi Xiao et.al.	2211.12018v1	link
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836v1	null
2022-11-14	Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion	René Haas et.al.	2211.07195v1	null
2022-10-13	Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach	Zhiang Chen et.al.	2210.07349v1	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517v1	null
2022-10-07	Leveraging Structure from Motion to Localize Inaccessible Bus Stops	Indu Panigrahi et.al.	2210.03646v1	link
2022-10-01	Structure-Aware NeRF without Posed Camera via Epipolar Constraint	Shu Chen et.al.	2210.00183v1	link
2022-10-05	FAST-LIO, Then Bayesian ICP, Then GTSFM	Jerred Chen et.al.	2210.00146v2	null
2022-09-20	BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction	Ahalya Ravendran et.al.	2209.09470v1	null
2022-09-19	A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion	Gerry Chen et.al.	2209.08690v1	null
2022-09-14	End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes	Qiao Chen et.al.	2209.06926v1	null
2022-09-07	Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021	Hartmut Surmann et.al.	2209.03084v1	null
2022-08-27	Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data	Thomas A. Ciarfuglia et.al.	2208.13001v1	null
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325v1	null
2022-08-04	Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training	Yao-Chih Lee et.al.	2208.02709v1	link
2022-07-31	One Object at a Time: Accurate and Robust Structure From Motion for Robots	Aravind Battaje et.al.	2208.00487v1	null
2022-07-23	Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks	Daniel Posada et.al.	2207.11413v1	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762v2	link
2022-07-19	ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild	Wang Zhao et.al.	2207.09137v1	link
2022-07-16	Organic Priors in Non-Rigid Structure from Motion	Suryansh Kumar et.al.	2207.06262v3	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396v1	null
2022-06-24	Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set	San Jiang et.al.	2206.11499v2	null
2022-06-13	TC-SfM: Robust Track-Community-Based Structure-from-Motion	Lei Wang et.al.	2206.05866v1	null
2022-06-10	EigenFairing: 3D Model Fairing using Image Coherence	Pragyana Mishra et.al.	2206.05309v1	null
2022-06-01	Semantic Room Wireframe Detection from a Single View	David Gillsjö et.al.	2206.00491v1	link
2022-05-31	Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction	Qiancheng Fu et.al.	2205.15848v1	null
2022-05-09	Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression	HyunJun Jung et.al.	2205.04565v1	null
2022-05-07	Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs	Pedro F. Proença et.al.	2205.03522v1	null
2022-05-06	EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms	Levi Burner et.al.	2205.03467v1	null
2022-04-20	Learned Monocular Depth Priors in Visual-Inertial Initialization	Yunwen Zhou et.al.	2204.09171v1	null
2022-04-10	Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective	Hui Deng et.al.	2204.04730v1	null
2022-04-08	Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems	Debao Huang et.al.	2204.04145v1	null
2022-04-07	SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation	Yi Wei et.al.	2204.03636v1	link
2022-04-06	Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion	Lukas Bommes et.al.	2204.02733v1	link
2022-04-05	Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows	Sheng Liu et.al.	2204.02509v1	link
2022-03-31	Fast, Accurate and Memory-Efficient Partial Permutation Synchronization	Shaohan Li et.al.	2203.16505v2	null
2022-03-28	Visual Odometry for RGB-D Cameras	Afonso Fontes et.al.	2203.15119v1	null
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901v1	link
2022-03-23	Event-Based Dense Reconstruction Pipeline	Kun Xiao et.al.	2203.12270v1	null
2022-03-21	DiffPoseNet: Direct Differentiable Camera Pose Estimation	Chethan M. Parameshwara et.al.	2203.11174v1	null
2022-03-02	Asynchronous Optimisation for Event-based Visual Odometry	Daqi Liu et.al.	2203.01037v1	null
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851v1	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146v1	link
2022-01-20	GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry	Yunhan Zhao et.al.	2201.08131v1	null
2022-01-13	Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching	Yunpeng Shi et.al.	2201.04797v1	link
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364v1	link
2022-01-06	De-rendering 3D Objects in the Wild	Felix Wimbauer et.al.	2201.02279v1	link
2021-12-29	On the Instability of Relative Pose Estimation and RANSAC's Role	Hongyi Fan et.al.	2112.14651v1	null
2021-12-16	Road-aware Monocular Structure from Motion and Homography Estimation	Wei Sui et.al.	2112.08635v1	null
2021-12-10	Critical configurations for three projective views	Martin Bråtelund et.al.	2112.05478v1	null
2021-12-09	Critical configurations for two projective views, a new approach	Martin Bråtelund et.al.	2112.05074v1	null
2021-12-06	Dense Depth Priors for Neural Radiance Fields from Sparse Input Views	Barbara Roessle et.al.	2112.03288v1	link
2021-12-10	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349v2	link
2021-11-11	Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft	Pascal Schoppmann et.al.	2111.06271v1	null
2021-11-10	Damage Estimation and Localization from Sparse Aerial Imagery	Rene Garcia Franceschini et.al.	2111.03708v2	null
2021-11-03	Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems	Swarnabja Bhaumik et.al.	2111.02064v1	null
2021-10-14	Modeling dynamic target deformation in camera calibration	Annika Hagemann et.al.	2110.07322v1	null
2021-10-13	Hyperspectral 3D Mapping of Underwater Environments	Maxime Ferrera et.al.	2110.06571v1	null
2021-09-24	Automatic Map Update Using Dashcam Videos	Aziza Zhanabatyrova et.al.	2109.12131v1	null
2021-09-16	Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs	Gabriel Moreira et.al.	2109.08046v1	link
2021-09-06	Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications	Tejas Mane et.al.	2109.02740v1	null
2021-09-02	Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency	Beatrix-Emőke Fülöp-Balogh et.al.	2109.01018v1	null
2021-09-01	On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation	Eric Brachmann et.al.	2109.00524v1	link
2021-08-31	DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension	Roman Shapovalov et.al.	2109.00033v1	null
2021-08-29	Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration	Seyed-Mahdi Nasiri et.al.	2108.12876v1	null
2021-08-23	Burst Imaging for Light-Constrained Structure-From-Motion	Ahalya Ravendran et.al.	2108.09895v1	null

(back to top)

Visual Localization

Publish Date	Title	Authors	PDF	Code
2025-03-06	RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining	Tengfei Zhang et.al.	2503.04653v1	null
2025-03-06	ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images	Yanqing Shen et.al.	2503.04475v1	null
2025-03-06	Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes	Hui Zhang et.al.	2503.04235v1	null
2025-03-06	Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior	Haitao Wu et.al.	2503.04207v1	null
2025-03-06	Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments	Beverley Gorry et.al.	2503.04096v1	null
2025-03-04	TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition	Oliver Grainge et.al.	2503.02511v1	null
2025-03-04	Introspective Loop Closure for SLAM with 4D Imaging Radar	Maximilian Hilger et.al.	2503.02383v1	null
2025-03-04	Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models	Kenta Tsukahara et.al.	2503.02256v1	null
2025-03-03	Composed Multi-modal Retrieval: A Survey of Approaches and Applications	Kun Zhang et.al.	2503.01334v1	link
2025-03-03	AirRoom: Objects Matter in Room Reidentification	Runmao Yao et.al.	2503.01130v1	null
2025-03-02	Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching	Jinyu Miao et.al.	2503.00862v1	null
2025-03-01	Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning	Songlin Dong et.al.	2503.00515v1	null
2025-02-28	EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration	Kuangyi Chen et.al.	2503.00167v1	null
2025-02-28	CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval	Zelong Sun et.al.	2502.20826v1	null
2025-02-28	SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition	Shanshan Wan et.al.	2502.20676v1	null
2025-02-27	A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization	Yejun Zhang et.al.	2502.20036v1	link
2025-02-27	On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation	Ruben T. Lucassen et.al.	2502.19285v2	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242v1	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932v1	null
2025-02-19	A Comprehensive Survey on Composed Image Retrieval	Xuemeng Song et.al.	2502.18495v1	null
2025-02-25	MegaLoc: One Retrieval to Place Them All	Gabriele Berton et.al.	2502.17237v2	link
2025-02-23	Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries	Yin Wu et.al.	2502.16636v1	link
2025-02-23	SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition	Feng Lu et.al.	2502.16601v1	link
2025-02-21	ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval	Guanqi Zhan et.al.	2502.15682v1	null
2025-02-20	Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition	Tianyi Shang et.al.	2502.14195v1	link
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803v1	null
2025-02-18	Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization	Shuo Xing et.al.	2502.13146v1	link
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras	Dongki Jung et.al.	2502.12545v2	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303v1	null
2025-02-17	Descriminative-Generative Custom Tokens for Vision-Language Models	Pramuditha Perera et.al.	2502.12095v1	null
2025-02-17	ILIAS: Instance-Level Image retrieval At Scale	Giorgos Kordopatis-Zilos et.al.	2502.11748v1	null
2025-02-17	Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition	Jianyi Peng et.al.	2502.11742v1	null
2025-02-17	Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics	Francesco Croce et.al.	2502.11725v1	link
2025-02-17	Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization	Yuanze Xu et.al.	2502.11408v1	null
2025-02-12	E2LVLM: Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection	Junjie Wu et.al.	2502.10455v1	null
2025-02-11	Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning	Yuhang Dong et.al.	2502.09649v1	null
2025-02-13	ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation	Rotem Shalev-Arkushin et.al.	2502.09411v1	null
2025-02-12	SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization	Artem Dementyev et.al.	2502.08848v1	null
2025-02-12	Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions	Prajwal Gatti et.al.	2502.08438v1	null
2025-02-11	Captured by Captions: On Memorization and its Mitigation in CLIP Models	Wenhao Wang et.al.	2502.07830v1	null
2025-02-11	Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields	Petr Koutenský et.al.	2502.07338v1	null
2025-02-11	Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos	Haowen Gao et.al.	2502.07327v1	null
2025-02-11	PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval	Osman Tursun et.al.	2502.07215v1	null
2025-02-10	AstroLoc: Robust Space to Ground Image Localizer	Gabriele Berton et.al.	2502.07003v1	null
2025-02-09	Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education	Yanhao Jia et.al.	2502.05863v1	null
2025-02-07	Learning Street View Representations with Spatiotemporal Contrast	Yong Li et.al.	2502.04638v1	null
2025-02-06	Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion	Marco Mistretta et.al.	2502.04263v1	link
2025-02-05	Human-Aligned Image Models Improve Visual Decoding from the Brain	Nona Rajabi et.al.	2502.03081v1	null
2025-02-03	ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies	Costin F. Ciusdel et.al.	2502.01335v1	null
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382v1	link
2025-01-27	Freestyle Sketch-in-the-Loop Image Segmentation	Subhadeep Koley et.al.	2501.16022v1	null
2025-01-26	Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations	Zijun Long et.al.	2501.15379v1	null
2025-01-24	Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection	Viktor Kozák et.al.	2501.14587v1	null
2025-01-23	Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models	Jakob Krogh Petersen et.al.	2501.14051v1	link
2025-01-22	Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation	Kenta Uesugi et.al.	2501.13968v1	null
2025-01-19	Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection	Zhipeng Yu et.al.	2501.11063v1	link
2025-01-18	A Resource-Efficient Training Framework for Remote Sensing Text-Image Retrieval	Weihang Zhang et.al.	2501.10638v1	null
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887v1	null
2025-01-15	Vision Foundation Models for Computed Tomography	Suraj Pai et.al.	2501.09001v1	link
2025-01-12	SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval	Bhavin Jawade et.al.	2501.08347v1	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286v1	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399v1	null
2025-01-12	Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation	Zhenyang Feng et.al.	2501.06749v1	null
2025-01-06	Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI	Xujin Li et.al.	2501.02841v1	null
2025-01-03	A Minimal Subset Approach for Efficient and Scalable Loop Closure	Nikolaos Stathoulopoulos et.al.	2501.01791v1	link
2025-01-03	iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings	Shuhei Tomoshige et.al.	2501.01642v1	null
2025-01-02	R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization	Xudong Jiang et.al.	2501.01421v1	null
2025-01-02	Training Medical Large Vision-Language Models with Abnormal-Aware Feedback	Yucheng Zhou et.al.	2501.01377v1	null
2025-01-02	Domain-invariant feature learning in brain MR imaging for content-based image retrieval	Shuya Tobari et.al.	2501.01326v1	null
2024-12-28	GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting	Atticus J. Zeller et.al.	2412.20056v1	link
2024-12-25	FOR: Finetuning for Object Level Open Vocabulary Image Retrieval	Hila Levi et.al.	2412.18806v1	null
2024-12-24	ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval	Le Dong et.al.	2412.18136v1	link
2024-12-22	Where am I? Cross-View Geo-localization with Natural Language Descriptions	Junyan Ye et.al.	2412.17007v1	null
2024-12-22	Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process	Shenghai Yuan et.al.	2412.16880v1	null
2024-12-24	Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling	Daichi Yashima et.al.	2412.16576v2	link
2024-12-20	A New Method to Capturing Compositional Knowledge in Linguistic Space	Jiahe Wan et.al.	2412.15632v1	null
2024-12-20	Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation	Samantha J Alloo et.al.	2412.15513v1	null
2024-12-19	Learning Visual Composition through Improved Semantic Guidance	Austin Stone et.al.	2412.15396v1	null
2024-12-19	MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval	Junjie Zhou et.al.	2412.14475v1	null
2024-12-18	Adversarial Hubness in Multi-Modal Retrieval	Tingwei Zhang et.al.	2412.14113v1	link
2024-12-18	Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval	Giacomo Pacini et.al.	2412.13834v1	null
2024-12-18	ConDo: Continual Domain Expansion for Absolute Pose Regression	Zijun Li et.al.	2412.13452v1	link
2024-12-17	Three Things to Know about Deep Metric Learning	Yash Patel et.al.	2412.12432v1	null
2024-12-15	Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval	Zelong Sun et.al.	2412.11087v1	null
2024-12-20	Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2412.11077v3	null
2024-12-13	MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition	Qiwen Gu et.al.	2412.09199v2	null
2024-12-12	A Flexible Plug-and-Play Module for Generating Variable-Length	Liyang He et.al.	2412.08922v1	link
2024-12-11	Image Retrieval Methods in the Dissimilarity Space	Madhu Kiran et.al.	2412.08618v1	null
2024-12-11	Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization	Siyan Dong et.al.	2412.08376v1	link
2024-12-11	Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin	Benjamin D. Killeen et.al.	2412.08020v1	null
2024-12-10	On Motion Blur and Deblurring in Visual Place Recognition	Timur Ismagilov et.al.	2412.07751v1	null
2024-12-10	Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance	Wanwen Chen et.al.	2412.07741v1	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488v1	link
2024-12-09	A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition	Connor Malone et.al.	2412.06153v1	null
2024-12-07	Compositional Image Retrieval via Instruction-Aware Contrastive Learning	Wenliang Zhong et.al.	2412.05756v1	link
2024-12-06	DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification	Ying Jin et.al.	2412.04828v1	null
2024-12-04	Distillation of Diffusion Features for Semantic Correspondence	Frank Fundel et.al.	2412.03512v1	null
2024-12-04	Composed Image Retrieval for Training-Free Domain Conversion	Nikos Efthymiadis et.al.	2412.03297v1	link
2024-12-03	A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration	Thulio Amorim et.al.	2412.02881v1	null
2024-12-03	Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval	Leah Bar et.al.	2412.02310v1	link
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039v1	link
2024-12-02	Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features	MD Shaikh Rahman et.al.	2412.01555v1	null
2024-12-02	Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models	Yi Liao et.al.	2412.01202v1	null
2024-12-01	EDTformer: An Efficient Decoder Transformer for Visual Place Recognition	Tong Jin et.al.	2412.00784v1	null
2024-11-28	EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval	Muhammad Huzaifa et.al.	2412.00139v1	null
2024-11-28	Unleashing the Power of Data Synthesis in Visual Localization	Sihang Li et.al.	2412.00138v1	null
2024-11-28	Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval	Yang Liu et.al.	2412.00120v1	null
2024-11-29	A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications	Liqiang Zhang et.al.	2411.19845v1	null
2024-11-27	Optimizing Image Retrieval with an Extended b-Metric Space	Abdelkader Belhenniche et.al.	2411.18800v1	null
2024-11-26	Learning Visual Hierarchies with Hyperbolic Embeddings	Ziwei Wang et.al.	2411.17490v1	null
2024-12-02	Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy	You Li et.al.	2411.16752v2	null
2024-12-02	AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks	You Li et.al.	2411.16749v2	null
2024-11-25	Image Generation Diversity Issues and How to Tame Them	Mischa Dombrowski et.al.	2411.16171v1	link
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800v1	null
2024-11-22	Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval	Zengbao Sun et.al.	2411.14704v1	null
2024-11-20	Globally Correlation-Aware Hard Negative Generation	Wenjie Peng et.al.	2411.13145v1	link
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481v1	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665v1	link
2024-11-13	Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval	Saul Santos et.al.	2411.08590v1	link
2024-11-22	Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments	Ashkan Nejad et.al.	2411.08567v2	link
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279v1	link
2024-11-05	From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing	Xintian Sun et.al.	2411.05826v1	null
2024-11-04	TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives	Maitreya Patel et.al.	2411.02545v1	null
2024-11-11	INQUIRE: A Natural World Text-to-Image Retrieval Benchmark	Edward Vendrow et.al.	2411.02537v3	link
2024-11-20	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925v2	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804v1	null
2024-11-03	Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification	MD Shaikh Rahman et.al.	2411.01473v1	null
2024-11-01	Identifying Implicit Social Biases in Vision-Language Models	Kimia Hamidieh et.al.	2411.00997v1	null
2024-10-31	Nearest Neighbor Normalization Improves Multimodal Retrieval	Neil Chowdhury et.al.	2410.24114v1	link
2024-10-31	MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval	Haiwen Li et.al.	2410.23736v1	null
2024-10-30	Decoupling Semantic Similarity from Spatial Alignment for Neural Networks	Tassilo Wald et.al.	2410.23107v1	link
2024-10-29	Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Monica Riedler et.al.	2410.21943v1	link
2024-10-28	NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments	Taiyi Pan et.al.	2410.21615v1	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341v1	link
2024-10-24	ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval	Zijia Zhao et.al.	2410.18715v1	link
2024-10-25	On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features	Tomáš Pivoňka et.al.	2410.18573v2	null
2024-10-22	Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2410.17393v1	null
2024-10-20	GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Haiwen Diao et.al.	2410.15266v1	link
2024-10-19	Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection	Marie Roald et.al.	2410.14969v1	link
2024-10-16	Development of Image Collection Method Using YOLO and Siamese Network	Chan Young Shin et.al.	2410.12561v1	null
2024-10-16	LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment	Juelin Zhu et.al.	2410.12269v1	link
2024-10-16	Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization	Nanda Febri Istighfarin et.al.	2410.12240v1	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505v1	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187v1	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533v1	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935v1	link
2024-10-16	Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP	Eunji Kim et.al.	2410.08469v2	null
2024-10-11	A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification	Eugene P. W. Ang et.al.	2410.08456v1	null
2024-10-10	A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks	Hoin Jung et.al.	2410.07593v1	link
2024-10-09	Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval	Mohammad Omama et.al.	2410.07022v1	null
2024-10-09	Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers	Stephen Hausler et.al.	2410.06614v1	link
2024-10-09	MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging	Noel C. F. Codella et.al.	2410.06542v1	null
2024-10-08	Temporal Image Caption Retrieval Competition -- Description and Results	Jakub Pokrywka et.al.	2410.06314v1	null
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285v1	null
2024-10-08	GSLoc: Visual Localization with 3D Gaussian Splatting	Kazii Botashev et.al.	2410.06165v1	null
2024-10-08	Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning	Ayush Singh et.al.	2410.05928v1	null
2024-10-08	RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps	Minsoo Kim et.al.	2410.05621v1	null
2024-10-11	LoTLIP: Improving Language-Image Pre-training for Long Text Understanding	Wei Wu et.al.	2410.05249v3	null
2024-10-06	LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation	Jianhao Jiao et.al.	2410.04419v1	null
2024-10-02	Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension	Zaiquan Yang et.al.	2410.01544v1	null
2024-10-03	EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections	Francesc Net et.al.	2410.01536v2	link
2024-10-04	CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment	Safouane El Ghazouali et.al.	2410.01411v2	link
2024-09-30	Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation	Aleyna Kütük et.al.	2410.00266v1	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597v1	null
2024-09-28	VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition	Ahmad Khaliq et.al.	2409.19293v1	link
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152v1	null
2024-09-26	Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval	Mankeerat Sidhu et.al.	2409.18733v1	null
2024-09-26	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049v1	link
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502v1	link
2024-09-23	CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis	Xiang Zhang et.al.	2409.15169v1	null
2024-09-21	Combining Absolute and Semi-Generalized Relative Poses for Visual Localization	Vojtech Panek et.al.	2409.14269v1	null
2024-09-21	SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality	Hongjia Zhai et.al.	2409.14067v1	null
2024-09-20	Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval	Morris Florek et.al.	2409.13513v1	link
2024-09-18	Towards Global Localization using Multi-Modal Object-Instance Re-Identification	Aneesh Chavan et.al.	2409.12002v1	link
2024-09-17	Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching	Kurran Singh et.al.	2409.11555v1	null
2024-09-17	Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information	Kunal Chelani et.al.	2409.11536v1	null
2024-09-17	Improving the Efficiency of Visually Augmented Language Models	Paula Ontalvilla et.al.	2409.11148v1	link
2024-09-21	HGSLoc: 3DGS-based Heuristic Camera Pose Refinement	Zhongyan Niu et.al.	2409.10925v2	null
2024-09-16	SOLVR: Submap Oriented LiDAR-Visual Re-Localisation	Joshua Knights et.al.	2409.10247v1	null
2024-09-16	Garment Attribute Manipulation with Multi-level Attention	Vittorio Casula et.al.	2409.10206v1	null
2024-09-14	Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval	Amirreza Mahbod et.al.	2409.09430v1	link
2024-09-12	Structured Pruning for Efficient Visual Place Recognition	Oliver Grainge et.al.	2409.07834v1	null
2024-09-10	GeoCalib: Learning Single-image Calibration with Geometric Optimization	Alexander Veicht et.al.	2409.06704v1	link
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471v1	link
2024-09-10	A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions	Zhicong Wu et.al.	2409.06381v1	null
2024-09-09	Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding	Bram Willemsen et.al.	2409.05721v1	link
2024-09-09	Open-World Dynamic Prompt and Continual Visual Representation Learning	Youngeun Kim et.al.	2409.05312v1	null
2024-09-12	Training-free ZS-CIR via Weighted Modality Fusion and Similarity	Ren-Di Wu et.al.	2409.04918v2	link
2024-09-12	Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models	Saghir Alfasly et.al.	2409.04631v2	null
2024-09-06	Reprojection Errors as Prompts for Efficient Scene Coordinate Regression	Ting-Ru Liu et.al.	2409.04178v1	null
2024-09-06	Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments	Therese Joseph et.al.	2409.03998v1	null
2024-09-04	Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications	Abby Stylianou et.al.	2409.03012v1	null
2024-09-04	NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval	Sepanta Zeighami et.al.	2409.02343v1	link
2024-09-03	Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment	Konstantin Schall et.al.	2409.01936v1	link
2024-09-02	A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches	Kim Jinwoo et.al.	2409.01219v1	null
2024-09-02	Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection	Manon Kok et.al.	2409.01091v1	null
2024-09-02	Evidential Transformers for Improved Image Retrieval	Danilo Dordevic et.al.	2409.01082v1	null
2024-09-05	EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System	Bonan Liu et.al.	2409.00343v2	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373v3	null
2024-09-02	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance	Avideep Mukherjee et.al.	2408.17095v2	null
2024-08-29	A compact neuromorphic system for ultra energy-efficient, on-device robot localization	Adam D. Hines et.al.	2408.16754v1	link
2024-08-29	Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models	Kengo Nakata et.al.	2408.16296v1	null
2024-08-28	Temporal Attention for Cross-View Sequential Image Localization	Dong Yuan et.al.	2408.15569v1	link
2024-08-27	Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild	Tianqi Wei et.al.	2408.14723v1	null
2024-08-25	LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task	Ali Asgarov et.al.	2408.13909v1	link
2024-08-15	Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval	Lifeng Zhou et.al.	2408.13705v1	null
2024-08-15	Coarse-to-fine Alignment Makes Better Speech-image Retrieval	Lifeng Zhou et.al.	2408.13119v1	null
2024-08-21	FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization	Son Tung Nguyen et.al.	2408.12037v1	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966v1	null
2024-08-21	UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation	Xiangyu Zhao et.al.	2408.11305v1	link
2024-08-20	GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting	Changkun Liu et.al.	2408.11085v1	link
2024-08-19	BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval	Zhenyu Lu et.al.	2408.10383v1	null
2024-08-23	Fashion Image-to-Image Translation for Complementary Item Retrieval	Matteo Attimonelli et.al.	2408.09847v2	link
2024-08-20	MambaLoc: Efficient Camera Localisation via State Space Model	Jialu Wang et.al.	2408.09680v2	null
2024-08-15	DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions	Ryosuke Korekata et.al.	2408.07910v1	null
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648v1	null
2024-08-10	Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network	Junyan Ye et.al.	2408.05475v1	link
2024-08-09	Spherical World-Locking for Audio-Visual Localization in Egocentric Videos	Heeseung Yun et.al.	2408.05364v1	null
2024-08-06	AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval	Pavel Suma et.al.	2408.03282v1	link
2024-08-05	CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration	Gongxin Yao et.al.	2408.02394v1	null
2024-08-09	BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles	Lun Luo et.al.	2408.01841v2	link
2024-08-02	On Validation of Search & Retrieval of Tissue Images in Digital Pathology	H. R. Tizhoosh et.al.	2408.01570v1	null
2024-07-31	VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning	Yuhang Ming et.al.	2407.21416v1	null
2024-07-31	SuperVINS: A visual-inertial SLAM framework integrated deep learning features	Hongkun Luo et.al.	2407.21348v1	link
2024-07-30	Re-localization acceleration with Medoid Silhouette Clustering	Hongyi Zhang et.al.	2407.20749v1	null
2024-07-29	A flexible framework for accurate LiDAR odometry, map manipulation, and localization	José Luis Blanco-Claraco et.al.	2407.20465v1	link
2024-07-26	From 2D to 3D: AISG-SLA Visual Localization Challenge	Jialin Gao et.al.	2407.18590v1	null
2024-07-24	Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation	Yongqi Li et.al.	2407.17274v1	null
2024-07-24	Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments	Wei Gao et.al.	2407.17078v1	null
2024-07-24	Pose Estimation from Camera Images for Underwater Inspection	Luyuan Peng et.al.	2407.16961v1	null
2024-07-22	Memory Management for Real-Time Appearance-Based Loop Closure Detection	Mathieu Labbé et.al.	2407.15890v1	null
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791v1	null
2024-07-22	Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM	Mathieu Labbé et.al.	2407.15305v1	null
2024-07-22	Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation	Mathieu Labbé et.al.	2407.15304v1	null
2024-07-19	Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization	Yuehua Ding et.al.	2407.14643v1	null
2024-07-18	Visual Haystacks: Answering Harder Questions About Sets of Images	Tsung-Han Wu et.al.	2407.13766v1	link
2024-07-17	Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM	Markus Weißflog et.al.	2407.12408v1	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736v2	link
2024-07-16	EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis	Ruijie Yang et.al.	2407.11401v1	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964v1	link
2024-07-15	DINO Pre-training for Vision-based End-to-end Autonomous Driving	Shubham Juneja et.al.	2407.10803v1	null
2024-07-15	Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval	Youngsun Lim et.al.	2407.10683v1	null
2024-07-15	An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots	J. J. Cabrera et.al.	2407.10596v1	link
2024-07-15	An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments	J. J. Cabrera et.al.	2407.10536v1	null
2024-07-12	Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval	Vaibhav Balloli et.al.	2407.08908v1	link
2024-07-11	Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates	Owen Claxton et.al.	2407.08162v1	link
2024-07-12	Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal	Xinyu Zhu et.al.	2407.08153v2	link
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106v1	link
2024-07-09	LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition	Teng Wang et.al.	2407.06730v1	null
2024-07-09	CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding	Wenhao Xu et.al.	2407.06611v1	null
2024-07-08	Pseudo-triplet Guided Few-shot Composed Image Retrieval	Bohan Hou et.al.	2407.06001v1	null
2024-07-09	HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels	Yingying Jiang et.al.	2407.05795v2	null
2024-07-05	Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning	Mainak Singha et.al.	2407.04207v1	link
2024-07-04	Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models	Chang-Sheng Kao et.al.	2407.03615v1	link
2024-07-03	Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach	Pronay Debnath et.al.	2407.03486v1	null
2024-07-02	Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition	Sergio Izquierdo et.al.	2407.02422v1	link
2024-07-01	Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval	Aneeshan Sain et.al.	2407.01810v1	null
2024-07-01	Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval	Hanwen Su et.al.	2407.00979v1	null
2024-07-01	Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios	Connor Malone et.al.	2407.00863v1	null
2024-06-27	PathAlign: A vision-language model for whole slide images in histopathology	Faruk Ahmed et.al.	2406.19578v1	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898v2	null
2024-06-27	Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs	Huaying Zhang et.al.	2406.18836v1	null
2024-06-26	WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images	Yannik Glaser et.al.	2406.18765v1	null
2024-06-26	View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis	Subin Varghese et.al.	2406.18012v1	null
2024-06-25	Tell Me Where You Are: Multimodal LLMs Meet Place Recognition	Zonglin Lyu et.al.	2406.17520v1	null
2024-06-25	SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation	Xu Liu et.al.	2406.17249v1	link
2024-06-23	Breaking the Frame: Image Retrieval by Visual Overlap Prediction	Tong Wei et.al.	2406.16204v1	link
2024-06-19	Towards a multimodal framework for remote sensing image change retrieval and captioning	Roger Ferrod et.al.	2406.13424v1	link
2024-06-19	CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval	Christian Lülf et.al.	2406.13322v1	link
2024-06-17	Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization	Huaiji Zhou et.al.	2406.11766v1	null
2024-06-22	Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment	Jianan Jiang et.al.	2406.11551v2	link
2024-06-17	They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias	Salma Abdel Magid et.al.	2406.11331v1	null
2024-06-17	Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion	Guoyuan An et.al.	2406.11242v1	null
2024-06-14	Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval	Genc Hoxha et.al.	2406.10107v1	null
2024-06-14	BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval	Imanol Miranda et.al.	2406.09952v1	link
2024-06-13	Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases	Meng Wang et.al.	2406.09317v1	link
2024-06-13	Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval	Jaeseok Byun et.al.	2406.09188v1	null
2024-06-13	DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification	Zhengrui Xu et.al.	2406.08773v1	link
2024-06-12	Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement	Maxime Pietrantoni et.al.	2406.08463v1	null
2024-06-12	ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery	Kam Woh Ng et.al.	2406.08457v1	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502v1	link
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450v1	link
2024-06-16	Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval	Adrià Molina et.al.	2406.07315v2	null
2024-06-10	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374v1	link
2024-06-09	Unified Text-to-Image Generation and Retrieval	Leigang Qu et.al.	2406.05814v1	null
2024-06-07	The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better	Scott Geng et.al.	2406.05184v1	link
2024-06-07	PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction	Eduard Poesina et.al.	2406.04746v1	link
2024-06-06	GLACE: Global Local Accelerated Coordinate Encoding	Fangjinhua Wang et.al.	2406.04340v1	link
2024-06-06	Monocular Localization with Semantics Map for Autonomous Vehicles	Jixiang Wan et.al.	2406.03835v1	null
2024-06-05	Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach	Saehyung Lee et.al.	2406.03411v1	link
2024-06-04	MeshVPR: Citywide Visual Place Recognition Using 3D Meshes	Gabriele Berton et.al.	2406.02776v1	null
2024-06-04	Can CLIP help CLIP in learning 3D?	Cristian Sbrolli et.al.	2406.02202v1	null
2024-06-03	Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP	Sriram Balasubramanian et.al.	2406.01583v1	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315v1	link
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885v1	link
2024-06-01	NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization	Wugang Meng et.al.	2406.00312v1	null
2024-05-31	DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models	Linli Yao et.al.	2405.20985v1	link
2024-05-29	Multi-Modal Generative Embedding Model	Feipeng Ma et.al.	2405.19333v1	null
2024-05-29	ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions	Honglin Lin et.al.	2405.19226v1	null
2024-05-30	CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval	Xintong Jiang et.al.	2405.19149v2	link
2024-05-29	SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation	Zhenbei Wu et.al.	2405.18801v1	null
2024-05-29	Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs	Jialiang Xu et.al.	2405.18740v1	link
2024-05-28	EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition	Issar Tzachor et.al.	2405.18065v1	null
2024-05-28	AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval	Sihe Zhang et.al.	2405.17718v1	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599v1	null
2024-05-29	Composed Image Retrieval for Remote Sensing	Bill Psomas et.al.	2405.15587v2	link
2024-05-24	Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval	Yiming Wu et.al.	2405.15451v1	null
2024-05-20	UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization	Wenjia Xu et.al.	2405.11936v1	link
2024-05-19	Register assisted aggregation for Visual Place Recognition	Xuan Yu et.al.	2405.11526v1	null
2024-05-26	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793v2	null
2024-05-16	FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models	Adrian Bulat et.al.	2405.10286v1	null
2024-05-15	Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study	Farnaz Khun Jush et.al.	2405.09334v1	null
2024-05-14	BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment	Lihong Jin et.al.	2405.09001v1	null
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434v1	null
2024-05-13	OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition	Qiuchi Xiang et.al.	2405.07966v1	link
2024-05-14	HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval	Chao He et.al.	2405.07524v2	link
2024-05-13	JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation	Xubo Luo et.al.	2405.07429v1	link
2024-05-12	BoQ: A Place is Worth a Bag of Learnable Queries	Amar Ali-bey et.al.	2405.07364v1	link
2024-05-07	Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction	Nematollah Saeidi et.al.	2405.04211v1	null
2024-05-06	A New Robust Partial $p$-Wasserstein-Based Metric for Comparing Distributions	Sharath Raghvendra et.al.	2405.03664v1	null
2024-05-06	Knowledge-aware Text-Image Retrieval for Remote Sensing Images	Li Mi et.al.	2405.03373v1	null
2024-05-06	Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval	Jiacheng Cheng et.al.	2405.03190v1	null
2024-05-05	iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval	Lorenzo Agnolucci et.al.	2405.02951v1	link
2024-05-01	Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval	Young Kyun Jang et.al.	2405.00571v1	null
2024-04-30	Large Language Model Informed Patent Image Retrieval	Hao-Cheng Lo et.al.	2404.19360v1	null
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174v1	null
2024-04-29	Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models	Hongyi Zhu et.al.	2404.18746v1	null
2024-04-29	Dual-Modal Prompting for Sketch-Based Image Retrieval	Liying Gao et.al.	2404.18695v1	null
2024-05-01	Semantic Line Combination Detector	Jinwon Ko et.al.	2404.18399v2	link
2024-04-26	Learning text-to-video retrieval from image captioning	Lucas Ventura et.al.	2404.17498v1	null
2024-04-25	CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching	Samia Shafique et.al.	2404.16972v1	link
2024-04-29	Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval	Ryoya Nara et.al.	2404.16398v2	null
2024-04-24	Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval	Haokun Wen et.al.	2404.15875v1	link
2024-04-24	DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines	Xin Jiang et.al.	2404.15771v1	null
2024-04-23	Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval	Young Kyun Jang et.al.	2404.15516v1	null
2024-04-22	EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models	Mathias Thorsager et.al.	2404.14236v1	null
2024-04-22	Hierarchical localization with panoramic views and triplet loss functions	Marcos Alfaro et.al.	2404.14117v1	link
2024-04-20	High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces	Baoru Huang et.al.	2404.13437v1	null
2024-04-20	Collaborative Visual Place Recognition through Federated Learning	Mattia Dutto et.al.	2404.13324v1	null
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339v1	null
2024-04-17	Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives	Zhangchi Feng et.al.	2404.11317v1	link
2024-04-17	Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing	Sanggeon Yun et.al.	2404.11025v1	null
2024-04-16	SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments	Niklas Gard et.al.	2404.10527v1	link
2024-04-20	CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning	Haojian Huang et.al.	2404.09640v3	link
2024-04-11	PRAM: Place Recognition Anywhere Model for Efficient Visual Localization	Fei Xue et.al.	2404.07785v1	null
2024-04-16	2DLIW-SLAM: 2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure	Bin Zhang et.al.	2404.07644v4	link
2024-04-11	Semantically-correlated memories in a dense associative model	Thomas F Burns et.al.	2404.07123v2	link
2024-04-09	Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation	Luca Barsellotti et.al.	2404.06542v1	null
2024-04-09	Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping	Anas Gouda et.al.	2404.06277v1	link
2024-04-07	Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval	Jinpeng Wang et.al.	2404.04998v1	link
2024-04-06	Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning	Juncheng Yang et.al.	2404.04538v1	link
2024-04-05	Towards introspective loop closure in 4D radar SLAM	Maximilian Hilger et.al.	2404.03940v1	null
2024-04-02	TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation	Yehui Shen et.al.	2404.01587v1	link
2024-04-01	On Train-Test Class Overlap and Detection for Image Retrieval	Chull Hwan Song et.al.	2404.01524v1	link
2024-04-01	NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification	Juyeop Han et.al.	2404.01400v1	null
2024-03-31	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546v1	null
2024-03-31	NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation	Diwei Sheng et.al.	2404.00504v1	null
2024-03-30	SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs	Yang Miao et.al.	2404.00469v1	null
2024-03-30	Do Vision-Language Models Understand Compound Nouns?	Sonal Kumar et.al.	2404.00419v1	link
2024-04-05	FairRAG: Fair Human Generation via Fair Retrieval Augmentation	Robik Shrestha et.al.	2403.19964v3	null
2024-03-28	JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition	Gabriele Berton et.al.	2403.19787v1	link
2024-03-28	MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Kai Zhang et.al.	2403.19651v1	link
2024-03-27	AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation	Changkun Liu et.al.	2403.18281v1	null
2024-03-26	Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge	Dongjin Kim et.al.	2403.17420v1	link
2024-03-25	Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras	Gokul B. Nair et.al.	2403.16425v1	link
2024-03-24	Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval	Yucheng Suo et.al.	2403.16005v1	link
2024-03-24	BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval	Yinda Chen et.al.	2403.15992v1	null
2024-03-22	Long-CLIP: Unlocking the Long-Text Capability of CLIP	Beichen Zhang et.al.	2403.15378v1	link
2024-03-22	A Multimodal Approach for Cross-Domain Image Retrieval	Lucas Iijima et.al.	2403.15152v1	null
2024-03-22	Piecewise-Linear Manifolds for Deep Metric Learning	Shubhang Bhatnagar et.al.	2403.14977v1	null
2024-03-21	Enhancing Historical Image Retrieval with Compositional Cues	Tingyu Lin et.al.	2403.14287v1	link
2024-03-20	Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval	Aymene Berriche et.al.	2403.13747v1	null
2024-03-20	Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval	Haoyu Liu et.al.	2403.13317v1	null
2024-03-19	Learning Neural Volumetric Pose Features for Camera Localization	Jingyu Lin et.al.	2403.12800v1	null
2024-03-19	Quantixar: High-performance Vector Data Management System	Gulshan Yadav et.al.	2403.12583v1	null
2024-03-17	3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization	Peng Jiang et.al.	2403.11367v1	null
2024-03-17	MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data	Paul S. Scotti et.al.	2403.11207v1	link
2024-03-16	Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	Shunsuke Tsubaki et.al.	2403.10756v1	null
2024-03-16	Vector search with small radiuses	Gergely Szilvasy et.al.	2403.10746v1	null
2024-03-13	Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer	Kenta Tsukahara et.al.	2403.10552v1	null
2024-03-20	Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression	Huy-Hoang Bui et.al.	2403.10297v2	link
2024-03-15	Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline	Fangming Yuan et.al.	2403.10283v1	null
2024-03-14	The NeRFect Match: Exploring NeRF Features for Visual Localization	Qunjie Zhou et.al.	2403.09577v1	null
2024-03-14	VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition	Benjamin Ramtoula et.al.	2403.09025v1	null
2024-03-13	PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models	Siddharth Mishra-Sharma et.al.	2403.08851v1	link
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156v1	link
2024-03-12	It's All About Your Sketch: Democratising Sketch Control in Diffusion Models	Subhadeep Koley et.al.	2403.07234v1	link
2024-03-12	You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval	Subhadeep Koley et.al.	2403.07222v1	null
2024-03-12	Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers	Subhadeep Koley et.al.	2403.07214v1	null
2024-03-11	How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?	Subhadeep Koley et.al.	2403.07203v1	null
2024-03-11	EarthLoc: Astronaut Photography Localization by Indexing Earth from Space	Gabriele Berton et.al.	2403.06758v1	link
2024-03-11	BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues	Fudong Ge et.al.	2403.06600v1	link
2024-03-11	Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology	Stefan Denner et.al.	2403.06567v1	link
2024-03-10	RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation	Mathieu Labbé et.al.	2403.06341v1	null
2024-03-10	Texture image retrieval using a classification and contourlet-based features	Asal Rouhafzay et.al.	2403.06048v1	null
2024-03-11	LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map	Xinrui Wu et.al.	2403.05002v2	link
2024-03-11	Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Yifan Wang et.al.	2403.04765v2	null
2024-03-07	mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar	Chengzhen Meng et.al.	2403.04703v1	null
2024-03-06	Self-supervised Photographic Image Layout Representation Learning	Zhaoran Zhao et.al.	2403.03740v1	link
2024-03-04	Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models	Benedikt Blumenstiel et.al.	2403.02059v1	link
2024-03-03	Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval	Yongchao Du et.al.	2403.01431v1	null
2024-03-01	Asymmetric Feature Fusion for Image Retrieval	Hui Wu et.al.	2403.00671v1	null
2024-03-01	Structure Similarity Preservation Learning for Asymmetric Image Retrieval	Hui Wu et.al.	2403.00648v1	link
2024-02-29	CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition	Feng Lu et.al.	2402.19231v1	link
2024-02-28	Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport	Bin Li et.al.	2402.18411v1	link
2024-02-28	Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning	Hanyao Wang et.al.	2402.18400v1	null
2024-02-28	Representing 3D sparse map points and lines for camera relocalization	Bach-Thuan Bui et.al.	2402.18011v1	link
2024-02-27	Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control	Thong Nguyen et.al.	2402.17535v1	link
2024-02-29	Active propulsion noise shaping for multi-rotor aircraft localization	Gabriele Serussi et.al.	2402.17289v2	link
2024-02-27	NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer	Bingxi Liu et.al.	2402.17159v1	link
2024-02-25	Deep Homography Estimation for Visual Place Recognition	Feng Lu et.al.	2402.16086v1	link
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961v1	link
2024-02-28	Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries	Zijun Long et.al.	2402.15276v2	null
2024-02-23	Fine-tuning CLIP Text Encoders with Two-step Paraphrasing	Hyunjae Kim et.al.	2402.15120v1	null
2024-02-22	Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition	Feng Lu et.al.	2402.14505v1	link
2024-02-16	Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition	Chenming Hu et.al.	2402.10476v1	null
2024-02-15	Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task	Mirko Nava et.al.	2402.09886v1	link
2024-02-14	Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency	Yannis Kalantidis et.al.	2402.09237v1	null
2024-02-13	Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast	Xiangming Gu et.al.	2402.08567v1	link
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359v1	link
2024-02-10	Semantic Object-level Modeling for Robust Visual Camera Relocalization	Yifan Zhu et.al.	2402.06951v1	null
2024-02-09	Large Language Models for Captioning and Retrieving Remote Sensing Images	João Daniel Silva et.al.	2402.06475v1	null
2024-02-09	PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes	Xinggang Hu et.al.	2402.06131v1	null
2024-02-21	MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction	Heng Zhou et.al.	2402.03762v3	null
2024-02-04	Region-Based Representations Revisited	Michal Shlapentokh-Rothman et.al.	2402.02352v1	link
2024-02-03	Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization	Bo Yang et.al.	2402.02141v1	link
2024-02-01	BrainSLAM: SLAM on Neural Population Activity Data	Kipp Freud et.al.	2402.00588v1	null
2024-02-01	Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering	Tianxiao Gao et.al.	2402.00330v1	link
2024-01-31	Improved Scene Landmark Detection for Camera Localization	Tien Do et.al.	2401.18083v1	link
2024-01-31	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592v1	link
2024-01-29	Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors	Shiyin Dong et.al.	2401.16459v1	null
2024-01-29	Cross-Modal Coordination Across a Diverse Set of Input Modalities	Jorge Sánchez et.al.	2401.16347v1	null
2024-01-29	Regressing Transformers for Data-efficient Visual Place Recognition	María Leyva-Vallina et.al.	2401.16304v1	null
2024-01-27	Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval	Ayush Dubey et.al.	2401.15362v1	null
2024-01-24	Enhancing Image Retrieval: A Comprehensive Study on Photo Search using the CLIP Mode	Naresh Kumar Lahajal et.al.	2401.13613v1	null
2024-01-23	PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion	Shyam Sundar Kannan et.al.	2401.13082v1	null
2024-01-23	SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization	Mingyang Li et.al.	2401.13076v1	link
2024-01-25	CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios	Xiangshuo Qiao et.al.	2401.10475v2	link
2024-01-19	PhotoScout: Synthesis-Powered Multi-Modal Image Search	Celeste Barnaby et.al.	2401.10464v1	null
2024-01-19	Cross-Modality Perturbation Synergy Attack for Person Re-identification	Yunpeng Gong et.al.	2401.10090v2	null
2024-01-16	Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging	Zahra Tabatabaei et.al.	2401.08272v1	null
2024-01-16	Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments	Bruno Arcanjo et.al.	2401.08263v1	null
2024-01-15	Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing	Jakob Hackstein et.al.	2401.07782v1	link
2024-01-14	HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval	Zexuan Qiu et.al.	2401.07212v1	link
2024-01-11	UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization	Rouwan Wu et.al.	2401.05971v1	link
2024-01-10	Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval	Eunyi Lyou et.al.	2401.04860v1	link
2024-01-05	Benchmarking PathCLIP for Pathology Image Analysis	Sunyi Zheng et.al.	2401.02651v1	null
2024-01-03	DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding	Mingrui Li et.al.	2401.01545v1	null
2024-01-02	BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving	Dafeng Wei et.al.	2401.01065v1	null
2023-12-31	Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval	Liang Wang et.al.	2401.00371v1	link
2023-12-29	Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering	Long-Kun Du et.al.	2401.00032v1	null
2023-12-27	LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization	Sai Shubodh Puligilla et.al.	2312.16648v1	null
2023-12-26	Recursive Distillation for Open-Set Distributed Robot Localization	Kenta Tsukahara et.al.	2312.15897v1	null
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471v1	null
2023-12-23	CaLDiff: Camera Localization in NeRF via Pose Diffusion	Rashik Shrestha et.al.	2312.15242v1	null
2023-12-20	Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition	Bruno Arcanjo et.al.	2312.12995v1	null
2023-12-19	VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering	Chun-Mei Feng et.al.	2312.12273v1	link
2023-12-18	Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback	Boaz Lerner et.al.	2312.11078v1	link
2023-12-17	PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields	Boming Zhao et.al.	2312.10649v1	null
2023-12-17	DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition	Sijie Wang et.al.	2312.10616v1	link
2023-12-16	Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval	Decheng Liu et.al.	2312.10320v1	link
2023-12-15	Data-Efficient Multimodal Fusion on a Single GPU	Noël Vouitsis et.al.	2312.10144v1	link
2023-12-13	Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques	Hamed Qazanfari et.al.	2312.10089v1	null
2023-12-15	Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval	Zhe Ma et.al.	2312.09716v1	link
2023-12-14	Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition	Oliver Grainge et.al.	2312.09028v1	null
2023-12-14	Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking	Shitong Sun et.al.	2312.08924v1	null
2023-12-13	C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation	Florian Fervers et.al.	2312.08060v1	null
2023-12-12	Contextually Affinitive Neighborhood Refinery for Deep Clustering	Chunlin Yu et.al.	2312.07806v1	link
2023-12-12	Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval	Qiwei Tian et.al.	2312.07364v1	link
2023-12-12	Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection	Jonathan J. Y. Kim et.al.	2312.06991v1	null
2023-12-11	Dynamic Weighted Combiner for Mixed-Modal Image Retrieval	Fuxiang Huang et.al.	2312.06179v1	link
2023-12-06	Lite-Mind: Towards Efficient and Versatile Brain Representation Network	Zixuan Gong et.al.	2312.03781v1	link
2023-12-08	FreestyleRet: Retrieving Images from Style-Diversified Queries	Hao Li et.al.	2312.02428v2	link
2023-12-04	Implicit Learning of Scene Geometry from Poses for Global Localization	Mohammad Altillawi et.al.	2312.02029v1	null
2023-12-04	Language-only Efficient Training of Zero-shot Composed Image Retrieval	Geonmo Gu et.al.	2312.01998v1	link
2023-12-03	G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training	Che Liu et.al.	2312.01522v1	link
2023-12-01	Improve Supervised Representation Learning with Masked Image Modeling	Kaifeng Chen et.al.	2312.00950v1	null
2023-12-05	Grounding Everything: Emerging Localization Properties in Vision-Language Transformers	Walid Bousselham et.al.	2312.00878v2	link
2023-12-01	Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras	Mohammad Altillawi et.al.	2312.00500v1	null
2023-11-30	HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance	Zhuohao Yin et.al.	2311.18273v1	link
2023-11-30	Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models	Raviteja Vemulapalli et.al.	2311.18237v1	link
2023-11-29	Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce	Chang Liu et.al.	2311.17954v1	null
2023-11-28	Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames	Chao Chen et.al.	2311.17940v1	null
2023-11-29	360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries	Huajian Huang et.al.	2311.17389v1	link
2023-11-27	Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation	Samuele Poppi et.al.	2311.16254v1	link
2023-11-27	Optimal Transport Aggregation for Visual Place Recognition	Sergio Izquierdo et.al.	2311.15937v1	link
2023-11-27	AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval	Shicheng Xu et.al.	2311.14084v2	link
2023-11-23	3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology	Asma Ben Abacha et.al.	2311.13752v1	link
2023-11-22	Medical Image Retrieval Using Pretrained Embeddings	Farnaz Khun Jush et.al.	2311.13547v1	null
2023-11-22	Applications of Spiking Neural Networks in Visual Place Recognition	Somayeh Hussaini et.al.	2311.13186v1	link
2023-11-21	Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval	Xiu-Shen Wei et.al.	2311.12894v1	null
2023-11-21	Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs	Zhentian Qian et.al.	2311.12245v1	null
2023-11-19	From Categories to Classifier: Name-Only Continual Learning by Exploring the Web	Ameya Prabhu et.al.	2311.11293v1	null
2023-11-18	Lesion Search with Self-supervised Learning	Kristin Qi et.al.	2311.11014v1	null
2023-11-15	Flow reconstruction and particle characterization from inertial Lagrangian tracks	Ke Zhou et.al.	2311.09076v1	null
2023-11-15	Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval	Junyang Chen et.al.	2311.07622v2	null
2023-11-13	VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search	Shuting He et.al.	2311.07514v1	null
2023-11-10	Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval	Xin Lu et.al.	2311.06067v1	null
2023-11-08	Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model	Junya Shiraishi et.al.	2311.04788v1	null
2023-11-08	Training CLIP models on Data from Scientific Papers	Calvin Metzger et.al.	2311.04711v1	link
2023-11-07	DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding	Kehinde Ajayi et.al.	2311.04098v1	link
2023-11-06	Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences	Zador Pataki et.al.	2311.03345v1	null
2023-11-06	FocusTune: Tuning Visual Localization through Focus-Guided Sampling	Son Tung Nguyen et.al.	2311.02872v1	link
2023-11-01	DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing	Gaoshuang Huang et.al.	2311.00230v1	link
2023-10-29	Identifiable Contrastive Learning with Automatic Feature Importance Discovery	Qi Zhang et.al.	2310.18904v1	link
2023-10-27	LipSim: A Provably Robust Perceptual Similarity Metric	Sara Ghazanfari et.al.	2310.18274v1	link
2023-10-27	Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation	Susu Fang et.al.	2310.17879v1	null
2023-10-25	FoundLoc: Vision-based Onboard Aerial Localization in the Wild	Yao He et.al.	2310.16299v1	null
2023-10-24	Cross-view Self-localization from Synthesized Scene-graphs	Ryogo Yamamoto et.al.	2310.15504v1	null
2023-10-23	Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval	Xu Yuan et.al.	2310.14637v1	link
2023-10-21	Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation	Anastasia Kritharoula et.al.	2310.14025v1	link
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605v1	null
2023-10-20	CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants	Shaoan Wang et.al.	2310.13320v1	link
2023-10-27	Representation Learning via Consistent Assignment of Views over Random Partitions	Thalles Silva et.al.	2310.12692v2	link
2023-10-18	Evaluating the Fairness of Discriminative Foundation Models in Computer Vision	Junaid Ali et.al.	2310.11867v1	null
2023-10-17	Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification	Shuanglin Yan et.al.	2310.11210v1	null
2023-10-16	Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People	Dharmateja Adapa et.al.	2310.10290v1	null
2023-10-16	EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge	Tom Bryan et.al.	2310.10050v1	null
2023-10-15	CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes	Yulei Qin et.al.	2310.09761v1	link
2023-10-13	Pairwise Similarity Learning is SimPLE	Yandong Wen et.al.	2310.09449v1	link
2023-10-13	Vision-by-Language for Training-Free Compositional Image Retrieval	Shyamgopal Karthik et.al.	2310.09291v1	link
2023-10-12	Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning	Shiyang Yan et.al.	2310.08390v1	null
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082v1	null
2023-10-10	Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization	Le Chen et.al.	2310.06984v1	null
2023-10-10	Distillation Improves Visual Place Recognition for Low-Quality Queries	Anbang Yang et.al.	2310.06906v1	link
2023-10-10	Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets	Jiajun Zhang et.al.	2310.06566v1	null
2023-10-10	Topological RANSAC for instance verification and retrieval without fine-tuning	Guoyuan An et.al.	2310.06486v1	null
2023-10-10	3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments	Ghanta Sai Krishna et.al.	2310.06385v1	null
2023-10-09	Collaborative Visual Place Recognition	Yiming Li et.al.	2310.05541v1	null
2023-10-09	Sentence-level Prompts Benefit Composed Image Retrieval	Yang Bai et.al.	2310.05473v1	link
2023-10-08	AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition	Feng Lu et.al.	2310.05184v1	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134v1	null
2023-10-12	ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer	Yifan Xu et.al.	2310.04099v2	null
2023-10-06	Sub-token ViT Embedding via Stochastic Resonance Transformers	Dong Lao et.al.	2310.03967v1	link
2023-10-04	Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach	Matthew Hanlon et.al.	2310.02650v1	null
2023-10-02	NEUCORE: Neural Concept Reasoning for Composed Image Retrieval	Shu Zhao et.al.	2310.01358v1	null
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092v1	null
2023-10-05	PlaceNav: Topological Navigation through Place Recognition	Lauri Suomela et.al.	2309.17260v3	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992v1	link
2023-09-28	Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning	Albert Mohwald et.al.	2309.16351v1	link
2023-09-28	FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding	Pengxiang Wu et.al.	2309.16249v1	link
2023-09-28	Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2309.16137v1	link
2023-09-27	GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization	Vicente Vivanco Cepeda et.al.	2309.16020v1	link
2023-09-27	Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization	Zhenbo Song et.al.	2309.15556v1	null
2023-09-26	Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features	Hila Levi et.al.	2309.14999v1	null
2023-09-23	Resolving References in Visually-Grounded Dialogue via Text Generation	Bram Willemsen et.al.	2309.13430v1	link
2023-09-21	Face Identity-Aware Disentanglement in StyleGAN	Adrian Suwała et.al.	2309.12033v1	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883v1	link
2023-09-20	2D-3D Pose Tracking with Multi-View Constraints	Huai Yu et.al.	2309.11335v1	null
2023-09-19	VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition	Adam D. Hines et.al.	2309.10225v1	link
2023-09-18	DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach	Chenghao Xu et.al.	2309.09879v1	null
2023-09-18	Decompose Semantic Shifts for Composed Image Retrieval	Xingyu Yang et.al.	2309.09531v1	null
2023-09-16	Efficient Object Rearrangement via Multi-view Fusion	Dehao Huang et.al.	2309.08994v1	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927v1	link
2023-09-16	Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning	Pengyu Yin et.al.	2309.08914v1	link
2023-09-15	Active Learning for Fine-Grained Sketch-Based Image Retrieval	Himanshu Thakur et.al.	2309.08743v1	null
2023-09-15	Optimization of Rank Losses for Image Retrieval	Elias Ramzi et.al.	2309.08250v1	link
2023-09-18	Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer	Yaoting Wang et.al.	2309.07929v2	link
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471v1	link
2023-09-13	RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline	Mirko Usuelli et.al.	2309.07094v1	null
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438v1	link
2023-09-08	Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning	Hiroki Nakamura et.al.	2309.04148v1	null
2023-09-05	Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection	Natalia Pavlasek et.al.	2309.02394v1	null
2023-09-05	Dual Relation Alignment for Composed Image Retrieval	Xintong Jiang et.al.	2309.02169v1	null
2023-09-04	NLLB-CLIP -- train performant multilingual image retrieval model on a budget	Alexander Visheratin et.al.	2309.01859v1	null
2023-09-04	Target-Guided Composed Image Retrieval	Haokun Wen et.al.	2309.01366v1	null
2023-09-02	Deep supervised hashing for fast retrieval of radio image cubes	Steven Ndung'u et.al.	2309.00932v1	null
2023-08-31	Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval	Prateksha Udhayanan et.al.	2308.16649v1	null
2023-08-28	Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics	Nils Böhne et.al.	2308.14786v1	null
2023-08-28	CoVR: Learning Composed Video Retrieval from Web Video Captions	Lucas Ventura et.al.	2308.14746v1	link
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039v1	null
2023-08-26	Learning Efficient Representations for Image-Based Patent Retrieval	Hongsong Wang et.al.	2308.13749v1	null
2023-08-25	Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers	Mohammad Javad Rajabi et.al.	2308.13671v1	null
2023-08-24	Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities	Jinze Bai et.al.	2308.12966v1	link
2023-08-23	Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval	Huafeng Li et.al.	2308.11994v1	null
2023-08-23	OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes	Tao Xie et.al.	2308.11928v1	link
2023-08-22	Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features	Alberto Baldrati et.al.	2308.11485v1	link
2023-08-22	GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training	Xinchi Deng et.al.	2308.11331v1	null
2023-08-22	LDP-Feat: Image Features with Local Differential Privacy	Francesco Pittaluga et.al.	2308.11223v1	null
2023-08-21	EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition	Gabriele Berton et.al.	2308.10832v1	link
2023-08-20	FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory	Anwesan Pal et.al.	2308.10170v1	null
2023-08-18	3D Model-free Visual localization System from Essential Matrix under Local Planar Motion	Yanmei Jiao et.al.	2308.09566v1	null
2023-08-17	FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings	Yulin Su et.al.	2308.09012v1	link
2023-08-16	Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval	Aishwarya Venkataramanan et.al.	2308.08431v1	link
2023-08-16	Ranking-aware Uncertainty for Text-guided Image Retrieval	Junyang Chen et.al.	2308.08131v1	null
2023-08-19	Global Features are All You Need for Image Retrieval and Reranking	Shihao Shao et.al.	2308.06954v2	link
2023-08-14	MixBCT: Towards Self-Adapting Backward-Compatible Training	Yu Liang et.al.	2308.06948v1	link
2023-08-10	KS-APR: Keyframe Selection for Robust Absolute Pose Regression	Changkun Liu et.al.	2308.05459v1	null
2023-08-09	AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities	Jingdan Zhang et.al.	2308.04992v1	link
2023-08-08	Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval	Yi Bin et.al.	2308.04343v1	link
2023-08-08	Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval	Yunquan Zhu et.al.	2308.04008v1	link
2023-08-05	A Comprehensive Analysis of Real-World Image Captioning and Scene Identification	Sai Suprabhanu Nallapaneni et.al.	2308.02833v1	null
2023-08-03	Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies	Eunsuk Seo et.al.	2308.01871v1	null
2023-08-01	AnyLoc: Towards Universal Visual Place Recognition	Nikhil Keetha et.al.	2308.00688v1	link
2023-07-31	Guiding Image Captioning Models Toward More Specific Captions	Simon Kornblith et.al.	2307.16686v1	null
2023-07-31	Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks	Kousik Rajesh et.al.	2307.16395v1	null
2023-07-28	D2S: Representing local descriptors and global scene coordinates for camera relocalization	Bach-Thuan Bui et.al.	2307.15250v1	link
2023-07-26	Neural-based Cross-modal Search and Retrieval of Artwork	Yan Gong et.al.	2307.14244v1	null
2023-07-26	Boon: A Neural Search Engine for Cross-Modal Information Retrieval	Yan Gong et.al.	2307.14240v1	null
2023-07-25	Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network	Chull Hwan Song et.al.	2307.13254v1	null
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702v2	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981v1	null
2023-07-19	Quantum Optics based Algorithm for Measuring the Similarity between Images	Vivek Mehta et.al.	2307.09789v1	null
2023-07-18	Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments	Max Moebius et.al.	2307.09172v1	null
2023-07-18	3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving	Qipeng Li et.al.	2307.09044v1	null
2023-07-19	Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation	Rundong Luo et.al.	2307.08779v2	null
2023-07-17	Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition	Gabriele Trivigno et.al.	2307.08417v1	link
2023-07-17	Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification	Tengfei Liang et.al.	2307.08316v1	link
2023-07-17	NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM	Lizhou Liao et.al.	2307.08221v1	link
2023-07-20	Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer	Yujiao Shi et.al.	2307.08015v3	link
2023-07-10	Phoneme-retrieval; voice recognition; vowels recognition	Brunello Tirozzi et.al.	2307.07407v1	null
2023-07-14	Risk Controlled Image Retrieval	Kaiwen Cai et.al.	2307.07336v1	link
2023-07-11	ResMatch: Residual Attention Learning for Local Feature Matching	Yuxin Deng et.al.	2307.05180v1	link
2023-07-11	Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification	Yi Liao et.al.	2307.05017v1	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520v1	null
2023-07-10	RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold	Hyesu Jang et.al.	2307.04321v1	link
2023-07-08	Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning	Qin Zhang et.al.	2307.04047v1	null
2023-07-04	Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition	Helen Carson et.al.	2307.01464v1	null
2023-07-04	Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network	Zizhuo Li et.al.	2307.01447v1	null
2023-07-03	Cross-modal Place Recognition in Image Databases using Event-based Sensors	Xiang Ji et.al.	2307.01047v1	null
2023-06-30	DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions	Stephen Hausler et.al.	2306.17536v1	null
2023-06-30	Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization	Stephen Hausler et.al.	2306.17529v1	null
2023-06-27	Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research	Tanjida Kabir et.al.	2306.15651v1	null
2023-06-27	Mean Field Theory in Deep Metric Learning	Takuya Furusawa et.al.	2306.15368v1	null
2023-06-26	Hierarchical Matching and Reasoning for Multi-Query Image Retrieval	Zhong Ji et.al.	2306.14460v1	link
2023-06-25	Enhancing Dynamic Image Advertising with Vision-Language Pre-training	Zhoufutu Wen et.al.	2306.14112v1	null
2023-06-23	Catching Image Retrieval Generalization	Maksim Zhdanov et.al.	2306.13357v1	null
2023-06-22	Deep Metric Learning with Soft Orthogonal Proxies	Farshad Saberi-Movahed et.al.	2306.13055v1	null
2023-06-22	What to Learn: Features, Image Transformations, or Both?	Yuxuan Chen et.al.	2306.13040v1	null
2023-06-22	Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval	Katrin Glinka et.al.	2306.12843v1	null
2023-06-26	Annotation Cost Efficient Active Learning for Content Based Image Retrieval	Julia Henkel et.al.	2306.11605v2	null
2023-06-19	Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning	Shivaen Ramshetty et.al.	2306.11065v1	link
2023-06-18	LiDAR-Based Place Recognition For Autonomous Driving: A Survey	Pengcheng Shi et.al.	2306.10561v1	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012v1	link
2023-06-15	Prompt Performance Prediction for Generative IR	Nicolas Bizzozzero et.al.	2306.08915v1	null
2023-06-15	Graph Convolution Based Efficient Re-Ranking for Visual Retrieval	Yuqi Zhang et.al.	2306.08792v1	link
2023-06-13	GeneCIS: A Benchmark for General Conditional Image Similarity	Sagar Vaze et.al.	2306.07969v1	null
2023-06-13	MOFI: Learning Image Representations from Noisy Entity Annotated Images	Wentao Wu et.al.	2306.07952v1	link
2023-06-12	Zero-shot Composed Text-Image Retrieval	Yikun Liu et.al.	2306.07272v1	link
2023-06-12	Sticker820K: Empowering Interactive Retrieval with Stickers	Sijie Zhao et.al.	2306.06870v1	null
2023-06-11	Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models	Yuguang Yang et.al.	2306.06691v1	null
2023-06-03	Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval	Xu Zhang et.al.	2306.02092v1	null
2023-06-03	Class Anchor Margin Loss for Content-Based Image Retrieval	Alexandru Ghita et.al.	2306.00630v2	null
2023-05-31	Chatting Makes Perfect -- Chat-based Image Retrieval	Matan Levy et.al.	2305.20062v1	link
2023-05-31	Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization	Junan Chen et.al.	2305.20044v1	null
2023-05-30	A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation	Omar Seddati et.al.	2305.18988v1	null
2023-05-29	Synfeal: A Data-Driven Simulator for End-to-End Camera Localization	Daniel Coelho et.al.	2305.18260v1	link
2023-05-29	Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films	Shrinkhala Sharma et.al.	2305.18197v1	null
2023-05-29	TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition	Tiago Barros et.al.	2305.18013v1	null
2023-05-28	ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval	Jiapeng Wang et.al.	2305.17652v1	null
2023-06-01	FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing	Zhuang Li et.al.	2305.17497v2	link
2023-05-27	Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation	Yueh-Cheng Huang et.al.	2305.17463v1	null
2023-05-26	Generating Images with Multimodal Language Models	Jing Yu Koh et.al.	2305.17216v1	link
2023-05-25	Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder	Zheyuan Liu et.al.	2305.16304v1	link
2023-05-23	Leveraging BEV Representation for 360-degree Visual Place Recognition	Xuecheng Xu et.al.	2305.13814v1	link
2023-05-23	EDIS: Entity-Driven Image Search over Multimodal Web Content	Siqi Liu et.al.	2305.13631v1	link
2023-05-20	DAC: Detector-Agnostic Spatial Covariances for Deep Local Features	Javier Tirado-Garín et.al.	2305.12250v1	link
2023-05-19	Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach	Zahra Tabatabaei et.al.	2305.11728v1	null
2023-05-19	Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition	Fenglin Zhang et.al.	2305.11467v1	link
2023-05-12	IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images	Varuna Krishna et.al.	2305.10438v1	null
2023-05-17	Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval	Haokun Wen et.al.	2305.09979v1	null
2023-05-13	Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance	Xinyu Lin et.al.	2305.07943v1	link
2023-05-11	Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems	Nathan Hughes et.al.	2305.07154v1	link
2023-05-09	Visual Place Recognition with Low-Resolution Images	Mihnea-Alexandru Tomita et.al.	2305.05776v1	null
2023-05-09	Vision-Language Models in Remote Sensing: Current Progress and Future Trends	Congcong Wen et.al.	2305.05726v1	null
2023-05-09	An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition	Maria Waheed et.al.	2305.05705v1	null
2023-05-09	Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query	Ho Hin Lee et.al.	2305.05598v1	null
2023-05-09	ColonMapper: topological mapping and localization for colonoscopy	Javier Morlana et.al.	2305.05546v1	null
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301v1	link
2023-05-09	Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition	Bruno Arcanjo et.al.	2305.05256v1	null
2023-05-09	Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval	Shiyin Dong et.al.	2305.05144v1	null
2023-05-08	Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size	Andrei Potapov et.al.	2305.04856v1	null
2023-05-08	Privacy-Preserving Representations are not Enough -- Recovering Scene Content from Camera Poses	Kunal Chelani et.al.	2305.04603v1	link
2023-05-06	Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer	Minyi Zhao et.al.	2305.04072v1	null
2023-05-06	Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing	Swagatika Dash et.al.	2305.03881v1	link
2023-05-05	COLA: How to adapt vision-language models to Compose Objects Localized with Attributes?	Arijit Ray et.al.	2305.03689v1	link
2023-05-05	HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer	Shuzhe Wang et.al.	2305.03595v1	null
2023-05-05	WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval	Zahra Tabatabaei et.al.	2305.03383v1	null
2023-05-04	Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval	Tan Pan et.al.	2305.02610v1	link
2023-05-03	Learning-based Relational Object Matching Across Views	Cathrin Elich et.al.	2305.02398v1	null
2023-05-05	A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text	Yunxin Li et.al.	2305.02265v2	link
2023-05-03	AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation	Shentong Mo et.al.	2305.01836v1	null
2023-04-30	Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection	Jie Ren et.al.	2305.00435v1	null
2023-04-28	SFD2: Semantic-guided Feature Detection and Description	Fei Xue et.al.	2304.14845v1	link
2023-04-28	Quantum enhanced non-interferometric quantitative phase imaging	Giuseppe Ortolano et.al.	2304.14727v1	null
2023-04-26	Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams	Yun Chang et.al.	2304.13487v1	null
2023-04-27	STIR: Siamese Transformer for Image Retrieval Postprocessing	Aleksei Shabanov et.al.	2304.13393v2	null
2023-04-25	DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design	Jiahao Weng et.al.	2304.12506v1	null
2023-04-24	Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning	Lucas Pascotti Valem et.al.	2304.12448v1	link
2023-04-23	IDLL: Inverse Depth Line based Visual Localization in Challenging Environments	Wanting Li et.al.	2304.11748v1	null
2023-04-23	Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval	Mehdi Rafiei et.al.	2304.11734v1	null
2023-04-17	Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference	Haotian Wu et.al.	2304.08221v1	null
2023-04-17	NeRF-Loc: Visual Localization with Conditional Neural Radiance Field	Jianlin Liu et.al.	2304.07979v1	link
2023-04-16	Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification	Luca Piano et.al.	2304.07883v1	null
2023-04-16	Language Guided Local Infiltration for Interactive Image Retrieval	Fuxiang Huang et.al.	2304.07747v1	null
2023-04-16	Long-term Visual Localization with Mobile Sensors	Shen Yan et.al.	2304.07691v1	null
2023-04-16	Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging	Jielin Qiu et.al.	2304.07675v1	null
2023-04-14	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426v1	null
2023-04-14	FM-Loc: Using Foundation Models for Improved Vision-based Localization	Reihaneh Mirjalili et.al.	2304.07058v1	null
2023-04-17	Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning	Seyed Mahdi Roostaiyan et.al.	2304.06907v2	link
2023-04-17	You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset	Matteo Toso et.al.	2304.06373v3	link
2023-04-12	Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation	Yifeng Shi et.al.	2304.06051v1	link
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947v1	link
2023-04-12	Are Local Features All You Need for Cross-Domain Visual Place Recognition?	Giovanni Barbarani et.al.	2304.05887v1	link
2023-04-12	Unicom: Universal and Compact Representation Learning for Image Retrieval	Xiang An et.al.	2304.05884v1	link
2023-04-12	SGL: Structure Guidance Learning for Camera Localization	Xudong Zhang et.al.	2304.05571v1	null
2023-04-14	Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency	Xingwu Ji et.al.	2304.05146v2	link
2023-04-10	CAVL: Learning Contrastive and Adaptive Representations of Vision and Language	Shentong Mo et.al.	2304.04399v1	null
2023-04-09	Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval	Yanru Xiao et.al.	2304.04228v1	null
2023-04-08	SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes	Baosheng Zhang et.al.	2304.03872v1	null
2023-04-06	$R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition	Sijie Zhu et.al.	2304.03410v1	null
2023-04-06	Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements	Viktor Walter et.al.	2304.03057v1	link
2023-04-05	Efficient OCR for Building a Diverse Digital History	Jacob Carlson et.al.	2304.02737v1	link
2023-04-05	LogoNet: a fine-grained network for instance-level logo sketch retrieval	Binbin Feng et.al.	2304.02214v1	link
2023-04-04	OrienterNet: Visual Localization in 2D Public Maps with Neural Matching	Paul-Edouard Sarlin et.al.	2304.02009v1	link
2023-04-04	Cross-Domain Image Captioning with Discriminative Finetuning	Roberto Dessì et.al.	2304.01662v1	link
2023-04-02	Learning Similarity between Scene Graphs and Images with Transformers	Yuren Cong et.al.	2304.00590v1	link
2023-04-01	NPR: Nocturnal Place Recognition in Street	Bingxi Liu et.al.	2304.00276v1	null
2023-03-31	Unsupervised crack detection on complex stone masonry surfaces	Panagiotis Agrafiotis et.al.	2303.17989v1	null
2023-03-30	If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval	Finlay G. C. Hudson et.al.	2303.17703v1	null
2023-03-30	Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime	Rhydian Windsor et.al.	2303.17644v1	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504v1	link
2023-03-30	Methods and advancement of content-based fashion image retrieval: A Review	Amin Muhammad Shoib et.al.	2303.17371v1	null
2023-03-30	Adaptive Cross Batch Normalization for Metric Learning	Thalaiyasingam Ajanthan et.al.	2303.17127v1	null
2023-03-30	MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	Weicheng Kuo et.al.	2303.16839v2	null
2023-03-29	Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval	Leo Sampaio Ferraz Ribeiro et.al.	2303.16769v1	null
2023-03-29	Bi-directional Training for Composed Image Retrieval via Text Prompt Learning	Zheyuan Liu et.al.	2303.16604v1	link
2023-03-27	Model Cascades for Efficient Image Search	Robert Hönig et.al.	2303.15595v1	null
2023-03-27	Zero-Shot Composed Image Retrieval with Textual Inversion	Alberto Baldrati et.al.	2303.15247v1	link
2023-03-27	What Can Human Sketches Do for Object Detection?	Pinaki Nath Chowdhury et.al.	2303.15149v1	null
2023-03-25	Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style	Fengyin Lin et.al.	2303.14348v1	link
2023-03-24	A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments	Bruno Arcanjo et.al.	2303.14247v1	null
2023-03-24	PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View	Ze Shi et.al.	2303.14095v1	link
2023-03-24	Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR	Aneeshan Sain et.al.	2303.13779v1	null
2023-03-28	CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not	Aneeshan Sain et.al.	2303.13440v3	null
2023-03-22	Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval	Xunguang Wang et.al.	2303.12658v1	null
2023-03-21	CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion	Geonmo Gu et.al.	2303.11916v1	link
2023-03-21	LIMITR: Leveraging Local Information for Medical Image-Text Representation	Gefen Dawidowicz et.al.	2303.11755v1	null
2023-03-25	Data-efficient Large Scale Place Recognition with Graded Similarity Supervision	Maria Leyva-Vallina et.al.	2303.11739v2	link
2023-03-20	Picture that Sketch: Photorealistic Image Generation from Abstract Sketches	Subhadeep Koley et.al.	2303.11162v1	null
2023-03-19	Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths	Ming Xu et.al.	2303.10778v1	link
2023-03-17	MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities	Boqi Chen et.al.	2303.10249v1	null
2023-03-17	IRGen: Generative Modeling for Image Retrieval	Yidan Zhang et.al.	2303.10126v1	link
2023-03-16	Data Roaming and Early Fusion for Composed Image Retrieval	Matan Levy et.al.	2303.09429v1	link
2023-03-16	Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval	Yi Xie et.al.	2303.09230v1	null
2023-03-16	Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space	Yuhang He et.al.	2303.09192v1	null
2023-03-16	Unsupervised Facial Expression Representation Learning with Contrastive Local Warping	Fanglei Xue et.al.	2303.09034v1	null
2023-03-15	A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval	Saeideh Yousefzadeh et.al.	2303.08398v1	null
2023-03-14	Data-Free Sketch-Based Image Retrieval	Abhra Chaudhuri et.al.	2303.07775v1	link
2023-03-14	PATS: Patch Area Transportation with Subdivision for Local Feature Matching	Junjie Ni et.al.	2303.07700v1	null
2023-03-10	Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors	Kento Kawaharazuka et.al.	2303.05674v1	null
2023-03-09	Dominating Set Database Selection for Visual Place Recognition	Anastasiia Kornilova et.al.	2303.05123v1	null
2023-03-07	Graph Neural Networks in Vision-Language Image Understanding: A Survey	Henry Senior et.al.	2303.03761v1	null
2023-03-07	Sketch-based Medical Image Retrieval	Kazuma Kobayashi et.al.	2303.03633v1	link
2023-03-06	Visual Place Recognition: A Tutorial	Stefan Schubert et.al.	2303.03281v1	link
2023-03-06	MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval	Rohit Agarwal et.al.	2303.03050v1	link
2023-03-06	Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints	Chenjie Cao et.al.	2303.02885v1	link
2023-03-05	Composing Mood Board with User Feedback in Concept Space	Shin Sano et.al.	2303.02547v1	null
2023-03-04	FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks	Xiao Han et.al.	2303.02483v1	link
2023-03-09	Self-Supervised Learning for Place Representation Generalization across Appearance Changes	Mohamed Adel Musallam et.al.	2303.02370v2	null
2023-03-03	MixVPR: Feature Mixing for Visual Place Recognition	Amar Ali-bey et.al.	2303.02190v1	link
2023-03-01	A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition	Maria Waheed et.al.	2303.00714v1	null
2023-03-01	ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards	T. Barros et.al.	2303.00477v1	link
2023-03-03	Renderable Neural Radiance Map for Visual Navigation	Obin Kwon et.al.	2303.00304v2	null
2023-03-01	Region Prediction for Efficient Robot Localization on Large Maps	Matteo Scucchia et.al.	2303.00295v1	link
2023-02-28	OEKG: The Open Event Knowledge Graph	Simon Gottschalk et.al.	2302.14688v1	null
2023-02-28	Global Proxy-based Hard Mining for Visual Place Recognition	Amar Ali-bey et.al.	2302.14217v1	link
2023-02-27	Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation	Yue Xiang et.al.	2302.13929v1	link
2023-02-26	Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images	Mihnea-Alexandru Tomita et.al.	2302.13314v1	null
2023-02-26	Learning cross space mapping via DNN using large scale click-through logs	Wei Yu et.al.	2302.13275v1	null
2023-02-25	DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification	Lemuel Puglisi et.al.	2302.13057v1	null
2023-02-23	Teaching CLIP to Count to Ten	Roni Paiss et.al.	2302.12066v1	null
2023-02-22	Steerable Equivariant Representation Learning	Sangnie Bhardwaj et.al.	2302.11349v1	null
2023-02-21	iQPP: A Benchmark for Image Query Performance Prediction	Eduard Poesina et.al.	2302.10126v2	link
2023-02-20	Ontology-aware Network for Zero-shot Sketch-based Image Retrieval	Haoxiang Zhang et.al.	2302.10040v1	null
2023-02-20	TBPos: Dataset for Large-Scale Precision Visual Localization	Masud Fahim et.al.	2302.09825v1	link
2023-02-17	Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts	Zhihong Chen et.al.	2302.08958v1	link
2023-02-22	Fashion Image Retrieval with Multi-Granular Alignment	Jinkuan Zhu et.al.	2302.08902v2	null
2023-02-15	Unsupervised Hashing via Similarity Distribution Calibration	Kam Woh Ng et.al.	2302.07669v1	link
2023-02-13	Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior	Shen Yan et.al.	2302.06287v1	link
2023-02-13	Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation	Binqian Jiang et.al.	2302.06149v1	link
2023-02-13	Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval	Xu Wang et.al.	2302.06081v1	link
2023-02-11	Sketch Less Face Image Retrieval: A New Challenge	Dawei Dai et.al.	2302.05576v1	link
2023-02-10	Is multi-modal vision supervision beneficial to language?	Avinash Madasu et.al.	2302.05016v1	link
2023-02-06	Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval	Kuniaki Saito et.al.	2302.03084v1	link
2023-02-06	Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs	Michael Kirchhof et.al.	2302.02865v1	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572v1	link
2023-02-04	Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval	Frederik Warburg et.al.	2302.01332v2	link
2023-01-31	Grounding Language Models to Images for Multimodal Generation	Jing Yu Koh et.al.	2301.13823v1	link
2023-01-31	UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers	Dachuan Shi et.al.	2301.13741v1	link
2023-01-23	Lexi: Self-Supervised Learning of the UI Language	Pratyay Banerjee et.al.	2301.10165v1	link
2023-01-17	Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval	Yuchen Wu et.al.	2301.06685v1	null
2023-01-19	High-bandwidth Close-Range Information Transport through Light Pipes	Joowon Lim et.al.	2301.06496v2	null
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604v1	null
2023-01-12	GH-Feat: Learning Versatile Generative Hierarchical Features from GANs	Yinghao Xu et.al.	2301.05315v1	null
2023-01-10	Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images	Xindi Wu et.al.	2301.04224v1	null
2023-01-10	Collaborative Semantic Communication at the Edge	Wing Fei Lo et.al.	2301.03996v1	null
2023-01-10	Online Backfilling with No Regret for Large-Scale Image Retrieval	Seonguk Seo et.al.	2301.03767v1	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403v1	null
2023-01-05	A Probabilistic Framework for Visual Localization in Ambiguous Scenes	Fereidoon Zangeneh et.al.	2301.02086v1	link
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147v1	null
2022-12-30	HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images	Dmitry Yudin et.al.	2212.14649v1	link
2022-12-27	Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning	Wooyoung Kang et.al.	2212.13563v1	link
2022-12-23	SuperGF: Unifying Local and Global Features for Visual Localization	Wenzheng Song et.al.	2212.13105v1	null
2022-12-24	GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration	Parker C. Lusk et.al.	2212.12745v1	null
2022-12-19	From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration	Zekun Qian et.al.	2212.09298v1	link
2022-12-14	The Infinite Index: Information Retrieval on Generative Text-To-Image Models	Niklas Deckers et.al.	2212.07476v1	null
2022-12-14	Shared Coupling-bridge for Weakly Supervised Local Feature Learning	Jiayuan Sun et.al.	2212.07047v1	link
2022-12-08	Group Generalized Mean Pooling for Vision Transformer	Byungsoo Ko et.al.	2212.04114v1	null
2022-12-12	Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models	Gowthami Somepalli et.al.	2212.03860v3	null
2022-12-07	LSVL: Large-scale season-invariant visual localization for UAVs	Jouko Kinnari et.al.	2212.03581v1	null
2022-12-06	ADIR: Adaptive Diffusion for Image Reconstruction	Shady Abu-Hussein et.al.	2212.03221v1	null
2022-12-08	Privacy-Preserving Visual Localization with Event Cameras	Junho Kim et.al.	2212.03177v2	link
2022-12-06	Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach	Wenjun Xu et.al.	2212.03037v1	null
2022-12-06	Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds	Zhipeng Zhao et.al.	2212.02757v1	null
2022-12-04	Fast and Lightweight Scene Regressor for Camera Relocalization	Thuan B. Bui et.al.	2212.01830v1	link
2022-12-02	Information Retrieval from the Digitized Books	Riya Gupta et.al.	2212.00999v1	null
2022-12-09	StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition	Yanqing Shen et.al.	2212.00937v2	null
2022-11-30	Self-Supervised Feature Learning for Long-Term Metric Visual Localization	Yuxuan Chen et.al.	2212.00122v1	null
2022-11-30	SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation	Tianyu Zhang et.al.	2211.16697v1	link
2022-11-28	SLAN: Self-Locator Aided Network for Cross-Modal Understanding	Jiang-Tian Zhai et.al.	2211.16208v1	null
2022-11-29	RankDNN: Learning to Rank for Few-shot Learning	Qianyu Guo et.al.	2211.15320v2	link
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127v1	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069v1	link
2022-11-27	BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images	Zhihuang Zhang et.al.	2211.14927v1	null
2022-11-27	A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition	Rui Huang et.al.	2211.14864v1	null
2022-11-26	Visual Place Recognition	Bailu Guo et.al.	2211.14533v1	null
2022-11-26	Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval	Fan Yang et.al.	2211.14515v1	link
2022-11-30	Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark	Floriana Ciaglia et.al.	2211.13523v3	link
2022-11-23	InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images	Konstantin Kobs et.al.	2211.12760v1	link
2022-11-29	Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments	Joshua Knights et.al.	2211.12732v2	link
2022-11-23	FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events	Kuanxu Hou et.al.	2211.12244v2	null
2022-11-22	Multimorbidity Content-Based Medical Image Retrieval Using Proxies	Yunyan Xing et.al.	2211.12185v1	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988v1	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704v1	null
2022-11-21	LISA: Localized Image Stylization with Audio via Implicit Neural Representation	Seung Hyun Lee et.al.	2211.11381v1	null
2022-11-21	NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization	Shitao Tang et.al.	2211.11177v1	link
2022-11-16	Improving Feature-based Visual Localization by Geometry-Aided Matching	Hailin Yu et.al.	2211.08712v1	link
2022-11-15	LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process	Mikhail Kurenkov et.al.	2211.08480v1	null
2022-11-14	Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair	Lin-Ding Yuan et.al.	2211.07803v1	null
2022-11-14	Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition	Farid Alijani et.al.	2211.07696v1	null
2022-11-14	Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization	Yiyang Chen et.al.	2211.07394v1	link
2022-11-14	Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment	Junyang Wang et.al.	2211.07275v1	null
2022-11-14	ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations	Chanda Grover et.al.	2211.07122v1	null
2022-11-14	Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval	Deunsol Jung et.al.	2211.07116v1	null
2022-11-12	Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning	Ryotaro Shimizu et.al.	2211.06688v1	null
2022-11-09	Visual Named Entity Linking: A New Dataset and A Baseline	Wenxiang Sun et.al.	2211.04872v1	link
2022-11-07	Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System	Julian Gamboa et.al.	2211.03881v1	null
2022-11-06	A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography	Yueh-Cheng Huang et.al.	2211.03007v1	null
2022-11-02	Optimizing Fiducial Marker Placement for Improved Visual Localization	Qiangqiang Huang et.al.	2211.01513v1	link
2022-11-02	A comparison of uncertainty estimation approaches for DNN-based camera localization	Matteo Vaghi et.al.	2211.01234v1	null
2022-11-02	M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval	Layne Berry et.al.	2211.01180v1	null
2022-11-11	Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality	Anuj Diwan et.al.	2211.00768v3	link
2022-11-07	Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding	Ryotaro Shimizu et.al.	2210.17417v2	null
2022-10-27	Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis	Miriam Anschütz et.al.	2210.15377v1	link
2022-10-27	Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings	Daniel Kvak et.al.	2210.15300v1	null
2022-10-27	Towards Practicality of Sketch-Based Visual Understanding	Ayan Kumar Bhunia et.al.	2210.15146v1	null
2022-10-27	MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval	Chen Bao et.al.	2210.15128v1	null
2022-10-26	FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning	Suvir Mirchandani et.al.	2210.15028v1	null
2022-10-26	FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization	Junyang Wang et.al.	2210.14562v1	null
2022-11-02	A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets	Lukas Bernreiter et.al.	2210.13856v2	null
2022-10-27	Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision	Tzu-Jui Julius Wang et.al.	2210.13591v2	null
2022-10-24	Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval	Zhaopeng Dou et.al.	2210.13440v1	link
2022-10-23	Neural Eigenfunctions Are Structured Representation Learners	Zhijie Deng et.al.	2210.12637v1	link
2022-10-21	Boosting vision transformers for image retrieval	Chull Hwan Song et.al.	2210.11909v1	link
2022-10-20	Communication breakdown: On the low mutual intelligibility between human and neural captioning	Roberto Dessì et.al.	2210.11512v1	link
2022-10-19	Image Semantic Relation Generation	Mingzhe Du et.al.	2210.11253v1	null
2022-10-20	General Image Descriptors for Open World Image Retrieval using ViT CLIP	Marcos V. Conde et.al.	2210.11141v1	link
2022-10-20	DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition	Sha Lu et.al.	2210.11029v1	null
2022-10-19	Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval	Abhra Chaudhuri et.al.	2210.10486v1	link
2022-10-19	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition	Amar Ali-bey et.al.	2210.10239v1	link
2022-10-18	A Real-Time Fusion Framework for Long-term Visual Localization	Yuchen Yang et.al.	2210.09757v1	null
2022-10-17	Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval	Yousef Alqasrawi et.al.	2210.08875v1	null
2022-10-17	SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation	Woo Suk Choi et.al.	2210.08675v1	null
2022-10-16	Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers	Tao Tang et.al.	2210.08458v1	link
2022-10-14	Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding	Xuetong Xue et.al.	2210.07572v1	link
2022-10-14	Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique	Connor Malone et.al.	2210.07509v1	null
2022-10-11	Large-to-small Image Resolution Asymmetry in Deep Metric Learning	Pavel Suma et.al.	2210.05463v1	link
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236v1	null
2022-10-05	Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features	Deepak Gupta et.al.	2210.02401v1	link
2022-10-05	Granularity-aware Adaptation for Image Retrieval over Multiple Tasks	Jon Almazán et.al.	2210.02254v1	null
2022-10-05	Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective	Zijian Zhang et.al.	2210.02206v1	link
2022-10-04	Supervised Metric Learning for Retrieval via Contextual Similarity Optimization	Christopher Liao et.al.	2210.01908v1	link
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320v1	null
2022-10-03	Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments	Bruno Arcanjo et.al.	2210.00834v1	null
2022-10-02	Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval	Kei Nishimaki et.al.	2210.00506v1	null
2022-09-29	Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval	Nicolae-Cătălin Ristea et.al.	2209.15034v1	null
2022-09-28	TVLT: Textless Vision-Language Transformer	Zineng Tang et.al.	2209.14156v1	link
2022-09-28	SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval	Yang Shen et.al.	2209.13833v1	link
2022-09-28	Learning Deep Representations via Contrastive Learning for Instance Retrieval	Tao Wu et.al.	2209.13832v1	null
2022-09-28	Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text	Cheng-An Hsieh et.al.	2209.13764v1	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586v1	link
2022-09-27	Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability	Peisong Wen et.al.	2209.13262v1	link
2022-09-26	NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection	Ruihao Zhou et.al.	2209.12513v1	link
2022-09-25	Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis	Jiawen Kang et.al.	2209.12274v1	link
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894v1	null
2022-09-23	Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs	Youya Xia et.al.	2209.11673v1	null
2022-09-23	Query-based Hard-Image Retrieval for Object Detection at Test Time	Edward Ayers et.al.	2209.11559v1	link
2022-09-23	Unsupervised Hashing with Semantic Concept Mining	Rong-Cheng Tu et.al.	2209.11475v1	link
2022-09-22	UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision	Anbang Yang et.al.	2209.11336v1	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710v1	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699v1	link
2022-09-19	Deep Metric Learning with Chance Constraints	Yeti Z. Gurbuz et.al.	2209.09060v1	link
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608v1	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578v1	link
2022-09-17	Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images	Mihnea-Alexandru Tomita et.al.	2209.08343v1	null
2022-09-15	Efficient Planar Pose Estimation via UWB Measurements	Haodong Jiang et.al.	2209.06779v2	link
2022-09-14	Transformers and CNNs both Beat Humans on SBIR	Omar Seddati et.al.	2209.06629v1	null
2022-09-14	Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch	J. Lu et.al.	2209.06545v1	link
2022-09-14	iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images	Peng Yin et.al.	2209.06376v1	null
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497v1	link
2022-09-09	Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet	Alnur Alimanov et.al.	2209.04234v1	link
2022-09-13	Segment Augmentation and Differentiable Ranking for Logo Retrieval	Feyza Yavuz et.al.	2209.02482v2	null
2022-09-12	ScaleFace: Uncertainty-aware Deep Metric Learning	Roman Kail et.al.	2209.01880v2	link
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605v1	null
2022-08-31	EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing	Qihua Feng et.al.	2208.14657v1	link
2022-08-25	A Deep Perceptual Measure for Lens and Camera Calibration	Yannick Hold-Geoffroy et.al.	2208.12300v1	null
2022-08-25	A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme	Zhixun Lu et.al.	2208.11876v1	null
2022-08-23	Satellite Image Search in AgoraEO	Ahmet Kerem Aksoy et.al.	2208.10830v1	null
2022-08-20	Fuse and Attend: Generalized Embedding Learning for Art and Sketches	Ujjal Kr Dutta et.al.	2208.09698v1	null
2022-08-19	Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods	Chao Chen et.al.	2208.09315v1	link
2022-08-19	TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval	Soumava Paul et.al.	2208.09198v1	link
2022-08-17	Visual Cross-View Metric Localization with Dense Uncertainty Estimates	Zimin Xia et.al.	2208.08519v1	link
2022-08-17	Understanding Attention for Vision-and-Language Tasks	Feiqi Cao et.al.	2208.08104v1	link
2022-08-14	Visual Localization via Few-Shot Scene Region Classification	Siyan Dong et.al.	2208.06933v1	link
2022-08-14	HyP$^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval	Chengyin Xu et.al.	2208.06866v1	link
2022-08-13	Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization	Ming Dai et.al.	2208.06561v1	link
2022-08-16	Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation	Georgios Kouros et.al.	2208.06195v2	link
2022-08-12	Instance Image Retrieval by Learning Purely From Within the Dataset	Zhongyan Zhang et.al.	2208.06119v1	null
2022-08-07	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization	Yujiao Shi et.al.	2208.03660v1	null
2022-08-05	A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch	Patsorn Sangkloy et.al.	2208.03354v1	null
2022-08-05	ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding	Bingning Wang et.al.	2208.03030v1	link
2022-08-04	Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing	Caio da S. Dias et.al.	2208.02397v1	null
2022-07-27	On the robustness of self-supervised representations for multi-view object classification	David Torpey et.al.	2208.00787v1	null
2022-07-26	Multimodal Neural Machine Translation with Search Engine Based Image Retrieval	ZhenHao Tang et.al.	2208.00767v1	null
2022-07-30	Towards Privacy-Preserving, Real-Time and Lossless Feature Matching	Qiang Meng et.al.	2208.00214v1	link
2022-07-30	DAS: Densely-Anchored Sampling for Deep Metric Learning	Lizhao Liu et.al.	2208.00119v1	link
2022-07-29	Curriculum Learning for Data-Efficient Vision-Language Alignment	Tejas Srinivasan et.al.	2207.14525v1	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455v1	link
2022-07-27	Abstracting Sketches through Simple Primitives	Stephan Alaniz et.al.	2207.13543v1	link
2022-07-27	Satellite Image Based Cross-view Localization for Autonomous Vehicle	Shan Wang et.al.	2207.13506v1	null
2022-07-26	RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments	Jiahui Zhang et.al.	2207.12579v1	null
2022-07-25	A hybrid-qudit representation of digital RGB images	Sreetama Das et.al.	2207.12550v1	null
2022-07-19	ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization	Ivan Cisneros et.al.	2207.12317v1	link
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916v1	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762v2	link
2022-07-20	Revisiting Hotels-50K and Hotel-ID	Aarash Feizi et.al.	2207.10200v1	link
2022-07-20	Feature Representation Learning for Unsupervised Cross-domain Image Retrieval	Conghui Hu et.al.	2207.09721v1	link
2022-07-19	SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Dominik Koßmann et.al.	2207.09507v1	null
2022-07-19	Context Unaware Knowledge Distillation for Image Retrieval	Bytasandram Yaswanth Reddy et.al.	2207.09070v1	link
2022-07-17	FashionViL: Fashion-Focused Vision-and-Language Representation Learning	Xiao Han et.al.	2207.08150v1	link
2022-07-14	AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments	Peng Yin et.al.	2207.06965v1	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738v1	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732v1	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058v2	link
2022-07-12	CPO: Change Robust Panorama to Point Cloud Localization	Junho Kim et.al.	2207.05317v1	link
2022-07-05	Hierarchical Average Precision Training for Pertinent Image Retrieval	Elias Ramzi et.al.	2207.04873v1	link
2022-07-11	A clinically motivated self-supervised approach for content-based image retrieval of CT liver images	Kristoffer Knutsen Wickstrøm et.al.	2207.04812v1	link
2022-07-09	BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval	Wenqiao Zhang et.al.	2207.04211v1	null
2022-07-08	Learning Sequential Descriptors for Sequence-based Visual Place Recognition	Riccardo Mereu et.al.	2207.03868v1	link
2022-07-08	GEMS: Scene Expansion using Generative Models of Graphs	Rishi Agarwal et.al.	2207.03729v1	null
2022-07-05	Object-Level Targeted Selection via Deep Template Matching	Suraj Kothawade et.al.	2207.01778v1	null
2022-07-06	Adaptive Fine-Grained Sketch-Based Image Retrieval	Ayan Kumar Bhunia et.al.	2207.01723v2	link
2022-07-04	Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets	Paul Albert et.al.	2207.01573v1	link
2022-07-08	Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval	Keyu Wen et.al.	2207.00733v2	null
2022-07-01	DALG: Deep Attentive Local and Global Modeling for Image Retrieval	Yuxin Song et.al.	2207.00287v1	null
2022-07-04	BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label	Shengshan Hu et.al.	2207.00278v2	link
2022-06-28	Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems	Stephen Hausler et.al.	2206.13883v1	null
2022-07-08	How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels	Tobias Fischer et.al.	2206.13673v2	link
2022-06-25	FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance	Yongzhi Fan et.al.	2206.12628v1	link
2022-06-25	Inverted Semantic-Index for Image Retrieval	Ying Wang et.al.	2206.12623v1	null
2022-06-17	RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval	Yihan Wu et.al.	2206.11225v1	null
2022-06-22	ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas	Prathmesh Madhu et.al.	2206.11115v1	null
2022-06-20	Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval	Guile Wu et.al.	2206.09806v1	null
2022-06-18	Attention-based Dynamic Subspace Learners for Medical Image Analysis	Sukesh Adiga V et.al.	2206.09068v1	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733v1	null
2022-06-06	Learning Treatment Plan Representations for Content Based Image Retrieval	Charles Huang et.al.	2206.02912v1	null
2022-06-19	NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation	Ekaterina Nepovinnykh et.al.	2206.02498v3	link
2022-06-05	Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks	B. G. Palm et.al.	2206.02278v1	null
2022-05-28	FaIRCoP: Facial Image Retrieval using Contrastive Personalization	Devansh Gupta et.al.	2205.15870v1	null
2022-05-31	Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark	Martin Humenberger et.al.	2205.15761v1	link
2022-05-27	Improving Road Segmentation in Challenging Domains Using Similar Place Priors	Connor Malone et.al.	2205.14112v1	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135v2	link
2022-05-26	Fine-grained Image Captioning with CLIP Reward	Jaemin Cho et.al.	2205.13115v1	link
2022-05-25	Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization	Kyung Ho Park et.al.	2205.12544v1	null
2022-05-24	OnePose: One-Shot Object Pose Estimation without CAD Models	Jiaming Sun et.al.	2205.12257v1	link
2022-05-23	VPAIR -- Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments	Michael Schleiss et.al.	2205.11567v1	link
2022-05-23	VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering	Yanan Wang et.al.	2205.11501v1	null
2022-05-23	Deep Image Retrieval is not Robust to Label Noise	Stanislav Dereka et.al.	2205.11195v1	null
2022-05-22	Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval	Zelong Zeng et.al.	2205.10878v1	link
2022-05-20	Visually-Augmented Language Modeling	Weizhi Wang et.al.	2205.10178v1	link
2022-05-18	Deep Features for CBIR with Scarce Data using Hebbian Learning	Gabriele Lagani et.al.	2205.08935v1	null
2022-05-19	Text Detection & Recognition in the Wild for Robot Localization	Zobeir Raisi et.al.	2205.08565v2	null
2022-05-12	One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code	Yong Dai et.al.	2205.06126v1	null
2022-05-11	Review on Panoramic Imaging and Its Applications in Scene Understanding	Shaohua Gao et.al.	2205.05570v1	null
2022-05-18	Identical Image Retrieval using Deep Learning	Sayan Nath et.al.	2205.04883v2	link
2022-05-09	Introspective Deep Metric Learning	Chengkun Wang et.al.	2205.04449v1	link
2022-05-11	Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting	Kai Uwe Barthel et.al.	2205.04255v2	link
2022-05-08	Adversarial Learning of Hard Positives for Place Recognition	Wenxuan Fang et.al.	2205.03871v1	null
2022-05-10	AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching	Khanh Nguyen et.al.	2205.02849v2	link
2022-04-29	Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval	Shupeng Su et.al.	2204.13919v1	null
2022-04-29	Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval	Siyu Ren et.al.	2204.13913v1	link
2022-04-28	Spatio-Temporal Graph Localization Networks for Image-based Navigation	Takahiro Niwa et.al.	2204.13237v1	null
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831v1	null
2022-04-25	SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo	Pinaki Nath Chowdhury et.al.	2204.11964v1	null
2022-04-23	On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning	Muhammad Umer Anwaar et.al.	2204.11848v1	null
2022-04-24	Progressive Learning for Image Retrieval with Hybrid-Modality Queries	Yida Zhao et.al.	2204.11212v1	null
2022-04-23	Training and challenging models for text-guided fashion image retrieval	Eric Dodds et.al.	2204.11004v1	link
2022-04-18	Centralized Adversarial Learning for Robust Deep Hashing	Xunguang Wang et.al.	2204.10779v1	link
2022-04-22	Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views	Kanya Kurauchi et.al.	2204.10497v1	null
2022-04-21	Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	Zhiqiang Yuan et.al.	2204.09868v1	link
2022-04-21	Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information	Zhiqiang Yuan et.al.	2204.09860v1	link
2022-04-20	Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Leila Pishdad et.al.	2204.09268v1	null
2022-04-19	Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing	Georgii Mikriukov et.al.	2204.08707v1	null
2022-04-18	Multiple-environment Self-adaptive Network for Aerial-view Geo-localization	Tingyu Wang et.al.	2204.08381v1	link
2022-04-15	Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder	Hanjing Ye et.al.	2204.07350v1	link
2022-04-14	Composite Code Sparse Autoencoders for first stage retrieval	Carlos Lassance et.al.	2204.07023v1	null
2022-04-13	Reuse your features: unifying retrieval and feature-metric alignment	Javier Morlana et.al.	2204.06292v1	link
2022-04-12	Probabilistic Compositional Embeddings for Multimodal Image Retrieval	Andrei Neculai et.al.	2204.05845v1	link
2022-04-12	Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval	Yu-Wei Zhan et.al.	2204.05666v1	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481v1	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932v1	link
2022-04-10	Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image	Yujiao Shi et.al.	2204.04752v1	link
2022-04-08	A Generic Image Retrieval Method for Date Estimation of Historical Document Collections	Adrià Molina et.al.	2204.04028v1	null
2022-04-08	SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies	Narges Norouzi et.al.	2204.03998v1	null
2022-04-05	Leveraging Equivariant Features for Absolute Pose Regression	Mohamed Adel Musallam et.al.	2204.02163v1	null
2022-04-04	"This is my unicorn, Fluffy": Personalizing frozen vision-language representations	Niv Cohen et.al.	2204.01694v1	link
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524v1	null
2022-04-01	LASER: LAtent SpacE Rendering for 2D Visual Localization	Zhixiang Min et.al.	2204.00157v1	link
2022-03-31	Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning	Semih Orhan et.al.	2203.16945v1	null
2022-03-30	AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift	Burak Yildiz et.al.	2203.16291v1	link
2022-03-29	Long-term Visual Map Sparsification with Heterogeneous GNN	Ming-Fang Chang et.al.	2203.15182v1	null
2022-04-01	A Simulation Benchmark for Vision-based Autonomous Navigation	Lauri Suomela et.al.	2203.13048v2	link
2022-03-24	Is Geometry Enough for Matching in Visual Localization?	Qunjie Zhou et.al.	2203.12979v1	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645v2	link
2022-03-10	ReF -- Rotation Equivariant Features for Local Feature Matching	Abhishek Peri et.al.	2203.05206v1	null
2022-03-09	Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction	Matthieu Zins et.al.	2203.04613v1	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446v1	link
2022-03-07	ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization	Simon Maurer et.al.	2203.03610v1	link
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454v1	link
2022-03-01	SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments	Maria Waheed et.al.	2203.00591v1	null
2022-02-28	Deep Camera Pose Regression Using Pseudo-LiDAR	Ali Raza et.al.	2203.00080v1	null
2022-02-25	RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation	Praveen Kumar Rajendran et.al.	2202.12838v1	null
2022-02-24	Highly-Efficient Binary Neural Networks for Visual Place Recognition	Bruno Ferrarini et.al.	2202.12375v1	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146v1	link
2022-02-14	Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition	Y. Shen et.al.	2202.06470v1	null
2022-02-11	Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition	Yingfeng Cai et.al.	2202.05738v1	null
2022-02-09	Object-Guided Day-Night Visual Localization in Urban Scenes	Assia Benbihi et.al.	2202.04445v1	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677v1	null
2022-02-25	CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938v2	null
2022-02-03	Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization	Andrea Vallone et.al.	2202.01821v1	null
2022-02-02	Training Semantic Descriptors for Image-Based Localization	Ibrahim Cinaroglu et.al.	2202.01212v1	null
2022-01-31	Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization	Nathan Hughes et.al.	2201.13360v1	null
2022-01-31	Rigidity Preserving Image Transformations and Equivariance in Perspective	Lucas Brynte et.al.	2201.13065v1	null
2022-01-25	Learning Semantics for Visual Place Recognition through Multi-Scale Attention	Valerio Paolicelli et.al.	2201.09701v2	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048v1	link
2022-01-15	A Critical Analysis of Image-based Camera Pose Estimation Techniques	Meng Xu et.al.	2201.05816v1	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386v1	link
2021-12-23	NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning	Tony Ng et.al.	2112.12785v1	null
2021-12-16	CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data	Qi Yan et.al.	2112.09081v1	link
2021-12-05	RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather	Jialu Wang et.al.	2112.02469v1	null
2021-11-25	MegLoc: A Robust and Accurate Visual Localization Pipeline	Shuxue Peng et.al.	2111.13063v1	null
2021-10-08	Semantic Image Alignment for Vehicle Localization	Markus Herb et.al.	2110.04162v1	null
2021-10-05	Season-invariant GNSS-denied visual localization for UAVs	Jouko Kinnari et.al.	2110.01967v1	link
2021-09-30	Forming a sparse representation for visual place recognition using a neurorobotic approach	Sylvain Colomer et.al.	2109.14916v1	null
2021-09-22	Audio-Visual Grounding Referring Expression for Robotic Manipulation	Yefei Wang et.al.	2109.10571v1	null
2021-09-20	Efficient shape mapping through dense touch and vision	Sudharshan Suresh et.al.	2109.09884v1	link
2021-09-15	S3LAM: Structured Scene SLAM	Mathieu Gonzalez et.al.	2109.07339v1	null
2021-09-13	Monocular Camera Localization for Automated Vehicles Using Image Retrieval	Eunhyek Joa et.al.	2109.06296v1	null
2021-09-10	Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization	Sungho Yoon et.al.	2109.04753v1	link
2021-09-09	CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization	Ara Jafarzadeh et.al.	2109.04527v1	null
2021-09-09	Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization	Mona Gridseth et.al.	2109.04041v1	link

(back to top)

Keypoint Detection

Publish Date	Title	Authors	PDF	Code
2025-03-06	Spatial regularisation for improved accuracy and interpretability in keypoint-based registration	Benjamin Billot et.al.	2503.04499v1	null
2025-03-04	A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection	Junyi Wang et.al.	2503.02481v1	null
2025-03-01	Autonomous Dissection in Robotic Cholecystectomy	Ki-Hwan Oh et.al.	2503.00666v1	null
2025-02-28	CNSv2: Probabilistic Correspondence Encoded Neural Image Servo	Anzhe Chen et.al.	2503.00132v1	null
2025-02-27	Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets	Jisoo Lee et.al.	2502.19766v1	null
2025-02-23	Rewards-based image analysis in microscopy	Kamyar Barakati et.al.	2502.18522v1	null
2025-02-19	2.5D U-Net with Depth Reduction for 3D CryoET Object Identification	Yusuke Uchida et.al.	2502.13484v1	link
2025-01-30	Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images	Wei-Lun Chen et.al.	2501.18453v1	null
2025-01-30	Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models	Bhargav Ghanekar et.al.	2501.18361v1	null
2025-01-30	Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems	Liudi Yang et.al.	2501.18110v1	null
2025-01-21	Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction	Mengyuan Li et.al.	2501.11844v1	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299v1	null
2025-01-19	Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation	Shibang Liu et.al.	2501.11069v1	null
2025-01-13	Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications	Lukas Rustler et.al.	2501.07421v1	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399v1	null
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221v1	link
2024-12-21	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2412.16755v1	null
2024-12-19	Corn Ear Detection and Orientation Estimation Using Deep Learning	Nathan Sprague et.al.	2412.14954v1	null
2024-12-12	Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models	Faith Johnson et.al.	2412.09739v1	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488v1	link
2024-12-09	ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models	Bingchen Gong et.al.	2412.06292v1	null
2024-12-07	Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures	Muhammad Umar Farooq et.al.	2412.05487v1	null
2024-12-04	Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything	Yongkyu Lee et.al.	2412.03472v1	link
2024-12-02	MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection	Yonghao Dang et.al.	2412.01422v1	null
2024-11-23	OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs	Chen Xin et.al.	2411.15653v1	link
2024-11-19	IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose	Fei Ren et.al.	2411.12676v1	null
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851v1	null
2024-11-04	KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension	Jie Yang et.al.	2411.01846v1	null
2024-10-31	From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots	Vasileios Tzouras et.al.	2410.23906v1	null
2024-10-04	Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation	Aman Anand et.al.	2410.14700v1	null
2024-11-27	Sim2real Cattle Joint Estimation in 3D point clouds	Mohammad Okour et.al.	2410.14419v2	null
2024-10-16	PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network	Asish Bera et.al.	2410.12742v1	null
2024-10-16	RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition	Asish Bera et.al.	2410.12718v1	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848v1	null
2024-10-11	Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image	Marta Veganzones Rodriguez et.al.	2410.09155v1	null
2024-10-08	Unsupervised Model Diagnosis	Yinong Oliver Wang et.al.	2410.06243v1	null
2024-10-08	Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration	Xueyang Kang et.al.	2410.05729v1	link
2024-10-16	Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features	Chengkai Hou et.al.	2410.02237v2	null
2024-10-02	Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection	Hongru Yan et.al.	2410.01404v1	null
2024-09-30	OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection	Changsheng Lu et.al.	2409.19899v1	link
2024-10-07	SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation	Xin Li et.al.	2409.18082v2	null
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502v1	link
2024-09-20	Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators	Niloufar Amiri et.al.	2409.13668v1	null
2024-09-25	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695v3	link
2024-09-06	D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection	Kentaro Hirahara et.al.	2409.04060v1	null
2024-10-01	Towards Practical Human Motion Prediction with LiDAR Point Clouds	Xiao Han et.al.	2408.08202v2	null
2024-07-31	Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods	Xusheng Luo et.al.	2408.00117v1	null
2024-07-26	SHIC: Shape-Image Correspondences with no Keypoint Supervision	Aleksandar Shtedritski et.al.	2407.18907v1	null
2024-07-25	LION: Linear Group RNN for 3D Object Detection in Point Clouds	Zhe Liu et.al.	2407.18232v1	link
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791v1	null
2024-07-09	LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition	Teng Wang et.al.	2407.06730v1	null
2024-07-04	PFGS: High Fidelity Point Cloud Rendering via Feature Splatting	Jiaxu Wang et.al.	2407.03857v1	link
2024-07-03	A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes	Li Fang et.al.	2407.02830v1	link
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014v1	link
2024-06-28	Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics	Chengrui Gao et.al.	2406.19672v1	null
2024-07-23	A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking	Lorenzo Shaikewitz et.al.	2406.16837v2	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315v1	link
2024-06-23	W-Net: A Facial Feature-Guided Face Super-Resolution Network	Hao Liu et.al.	2406.00676v3	null
2024-05-25	Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration	Junjie Gao et.al.	2405.16085v1	null
2024-06-01	Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding	Weizhen Liu et.al.	2405.12476v2	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434v1	null
2024-05-15	Vector-Symbolic Architecture for Event-Based Optical Flow	Hongzhi You et.al.	2405.08300v2	null
2024-05-13	RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration	Congjia Chen et.al.	2405.07594v1	null
2024-05-08	Unsupervised Skin Feature Tracking with Deep Neural Networks	Jose Chang et.al.	2405.04943v1	null
2024-05-07	A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images	László Kopácsi et.al.	2405.04650v1	null
2024-04-30	A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images	Wang Zhang et.al.	2404.19311v1	null
2024-04-25	Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach	Tahmim Hossain et.al.	2404.14560v2	null
2024-04-19	SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers	Vandad Davoodnia et.al.	2404.12625v1	null
2024-04-17	Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images	Junbiao Pang et.al.	2404.10985v1	null
2024-03-28	Towards Long Term SLAM on Thermal Imagery	Colin Keil et.al.	2403.19885v1	link
2024-03-28	Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation	Xiao Lin et.al.	2403.19527v1	link
2024-03-27	RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation	Yang Tian et.al.	2403.18259v1	null
2024-03-18	FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events	Xiangyuan Wang et.al.	2403.11662v1	link
2024-03-05	Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion	Meng Zheng et.al.	2403.03217v1	null
2024-02-22	A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets	Chengzhang Yu et.al.	2402.14241v1	null
2024-02-25	A Feature Matching Method Based on Multi-Level Refinement Strategy	Shaojie Zhang et.al.	2402.13488v2	null
2024-03-05	3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data	Zhi-Yi Lin et.al.	2402.13172v4	null
2024-02-25	Region Feature Descriptor Adapted to High Affine Transformations	Shaojie Zhang et.al.	2402.09724v3	null
2024-01-29	Reconstructing Close Human Interactions from Multiple Views	Qing Shuai et.al.	2401.16173v1	link
2024-01-17	To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection	Luyi Han et.al.	2401.09336v1	link
2024-01-08	Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach	Huanyu Liu et.al.	2401.03742v1	link
2024-03-22	6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation	Li Xu et.al.	2401.00029v3	null
2023-12-27	Bezier-based Regression Feature Descriptor for Deformable Linear Objects	Fangqing Chen et.al.	2312.16502v1	null
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471v1	null
2023-12-22	BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions	Elias Marks et.al.	2312.14706v1	null
2023-12-19	Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation	Jiaming Liu et.al.	2312.12480v1	null
2023-12-19	An effective image copy-move forgery detection using entropy image	Zhaowei Lu et.al.	2312.11793v1	link
2023-12-11	VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data	Jian Shi et.al.	2312.08871v1	link
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865v1	link
2023-12-01	Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version)	Emma Cramer et.al.	2312.00592v1	link
2023-11-30	Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications	Sahar Almahfouz Nasser et.al.	2311.18281v1	null
2023-11-29	Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features	Thomas Wimmer et.al.	2311.18113v1	link
2023-11-28	Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features	Niladri Shekhar Dutt et.al.	2311.17024v1	link
2023-11-28	Riemannian Self-Attention Mechanism for SPD Networks	Rui Wang et.al.	2311.16738v1	null
2023-11-27	A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor	Jialin Liu et.al.	2311.15609v1	null
2023-11-21	Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers	Bo Sun et.al.	2311.12291v1	null
2023-11-20	CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement	Boni Hu et.al.	2311.11604v1	link
2023-11-17	Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration	Paul J. Claasen et.al.	2311.10361v1	link
2023-11-13	Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning	Tomáš Kunzo et.al.	2311.07398v1	null
2023-11-11	CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer	Haoyu Ma et.al.	2311.06443v1	link
2023-11-08	3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud	Jianchao Ci et.al.	2311.04699v1	null
2023-11-06	TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains	Alexander Naumann et.al.	2311.03124v1	link
2023-11-06	An invariant feature extraction for multi-modal images matching	Chenzhong Gao et.al.	2311.02842v1	null
2023-10-20	Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification	Mateus Roder et.al.	2310.13490v1	null
2023-10-12	UniPose: Detecting Any Keypoints	Jie Yang et.al.	2310.08530v1	link
2023-10-10	l-dyno: framework to learn consistent visual features using robot's motion	Kartikeya Singh et.al.	2310.06249v1	link
2023-10-10	Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face	Hao Zhang et.al.	2310.05056v2	link
2023-10-13	H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation	Yanjie Ze et.al.	2310.01404v2	link
2023-10-04	Self-supervised Learning of Contextualized Local Visual Embeddings	Thalles Santos Silva et.al.	2310.00527v3	link
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268v2	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436v1	link
2023-09-18	RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy	Mert Asim Karaoglu et.al.	2309.09563v1	null
2023-09-17	CryoAlign: feature-based method for global and local 3D alignment of EM density maps	Bintao He et.al.	2309.09217v1	null
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471v1	link
2023-09-09	Mirror-Aware Neural Humans	Daniel Ajisafe et.al.	2309.04750v1	link
2023-09-07	InstructDiffusion: A Generalist Modeling Interface for Vision Tasks	Zigang Geng et.al.	2309.03895v1	null
2023-09-04	SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras	Himanshu Pahadia et.al.	2309.01324v1	null
2023-09-12	Improving the matching of deformable objects by learning to detect keypoints	Felipe Cadar et.al.	2309.00434v2	link
2023-08-31	SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation	Jiaben Chen et.al.	2308.16876v1	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984v1	link
2023-08-29	A lightweight 3D dense facial landmark estimation model from position map data	Shubhajit Basak et.al.	2308.15170v1	link
2023-08-27	Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors	Francesco Pirotti et.al.	2308.14047v1	null
2023-08-24	VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition	Gengxuan Tian et.al.	2308.12870v1	null
2023-08-22	LDP-Feat: Image Features with Local Differential Privacy	Francesco Pittaluga et.al.	2308.11223v1	null
2023-08-20	Neural Interactive Keypoint Detection	Jie Yang et.al.	2308.10174v1	link
2023-08-19	ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment	Bingyang Zhou et.al.	2308.09987v1	null
2023-09-03	DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching	Johan Edstedt et.al.	2308.08479v2	link
2023-08-15	CoDeF: Content Deformation Fields for Temporally Consistent Video Processing	Hao Ouyang et.al.	2308.07926v1	link
2023-08-15	ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition	Wenyuan Xue et.al.	2308.07743v1	null
2023-08-14	DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport	Sk Aziz Ali et.al.	2308.07153v1	null
2023-08-14	2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds	Minhao Li et.al.	2308.05667v2	link
2023-08-02	Automated Hit-frame Detection for Badminton Match Analysis	Yu-Hang Chien et.al.	2307.16000v2	link
2023-07-25	Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception	Chuanyu Luo et.al.	2307.13300v1	null
2023-07-21	Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data	Sahar Almahfouz Nasser et.al.	2307.10698v2	link
2023-07-19	SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid	Zi Li et.al.	2307.09727v1	link
2023-07-01	SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation	Fabian Duffhauss et.al.	2307.00306v1	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669v1	link
2023-06-26	CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild	Li Ding et.al.	2306.15073v1	null
2023-06-28	Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset	Ziqiao Weng et.al.	2306.07089v2	link
2023-06-07	Learning Probabilistic Coordinate Fields for Robust Correspondences	Weiyue Zhao et.al.	2306.04231v1	null
2023-06-03	LDEB -- Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues	Amitabha Dey et.al.	2306.02193v1	null
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938v1	null
2023-06-01	A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm	Onur Beker et.al.	2306.00892v1	null
2023-05-30	Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection	Supeng Wang et.al.	2305.18714v1	link
2023-05-23	Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence	Grace Luo et.al.	2305.14334v1	null
2023-05-15	Non-Separable Multi-Dimensional Network Flows for Visual Computing	Viktoria Ehm et.al.	2305.08628v1	null
2023-05-13	Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance	Xinyu Lin et.al.	2305.07943v1	link
2023-05-05	HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration	Canhui Tang et.al.	2305.03487v1	link
2023-04-17	Human Pose Estimation in Monocular Omnidirectional Top-View Images	Jingrui Yu et.al.	2304.08186v1	null
2023-04-14	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426v1	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194v1	link
2023-04-06	From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection	Changsheng Lu et.al.	2304.03140v1	null
2023-03-29	NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud	Xiangyu Zhu et.al.	2303.16465v1	link
2023-03-24	PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View	Ze Shi et.al.	2303.14095v1	link
2023-03-23	Semantic Image Attack for Visual Model Diagnosis	Jinqi Luo et.al.	2303.13010v1	null
2023-03-22	Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation	Heng Yang et.al.	2303.12246v1	link
2023-03-21	RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network	Sangmin Yoo et.al.	2303.10770v2	null
2023-03-17	ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty	Vanessa Wirth et.al.	2303.10042v1	null
2023-03-15	Descriptor Distillation for Efficient Multi-Robot SLAM	Xiyue Guo et.al.	2303.08420v1	null
2023-03-15	From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning	Zhuo Su et.al.	2303.08414v1	null
2023-03-16	KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input	Yiye Chen et.al.	2303.05617v2	link
2023-03-07	External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors	Simon Bultmann et.al.	2303.03797v1	null
2023-02-26	PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection	Shenwei Xie et.al.	2302.13263v1	null
2023-02-24	Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks	Julian Lißner et.al.	2302.12545v1	null
2023-02-21	Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging	Yuhong Deng et.al.	2302.10446v1	null
2023-02-12	A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training	Jingnan Shi et.al.	2302.06019v1	null
2023-02-11	Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing	Zitong Yu et.al.	2302.05744v1	null
2023-02-09	MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection	Yuhe Ding et.al.	2302.04589v1	link
2023-02-03	Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation	Jie Yang et.al.	2302.01593v1	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572v1	link
2023-01-21	Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection	Feiyang Wen et.al.	2301.08973v1	null
2023-01-18	OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models	Xingyi He et.al.	2301.07673v1	null
2023-01-12	Towards High Performance One-Stage Human Pose Estimation	Ling Li et.al.	2301.04842v1	null
2022-12-31	Rethinking Rotation Invariance with Point Cloud Registration	Jianhui Yu et.al.	2301.00149v1	null
2023-02-06	Fruit Ripeness Classification: a Survey	Matteo Rizzo et.al.	2212.14441v2	null
2022-12-28	NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action	Kuan-Chieh Wang et.al.	2212.13660v1	link
2022-12-24	HandsOff: Labeled Dataset Generation With No Additional Human Annotations	Austin Xu et.al.	2212.12645v1	null
2022-12-13	Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images	Welerson Melo et.al.	2212.09589v1	link
2022-12-15	Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation	Bugra C. Sefercik et.al.	2212.07567v1	null
2023-02-01	DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization	Xiangyu Xu et.al.	2212.04575v2	null
2022-12-07	ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation	Yufei Xu et.al.	2212.04246v1	link
2022-12-15	Designing Feature Vector Representations: A case study from Chemistry	Signe Sidwall Thygesen et.al.	2212.03731v2	null
2022-12-09	DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model	Jeongjun Choi et.al.	2212.02796v2	link
2022-12-05	Images Speak in Images: A Generalist Painter for In-Context Visual Learning	Xinlong Wang et.al.	2212.02499v1	link
2022-12-06	R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor	Bai Zhu et.al.	2212.02277v2	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069v1	link
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731v2	null
2022-11-21	Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching	Paul Roetzer et.al.	2211.11589v1	link
2022-11-07	Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration	Zixin Yang et.al.	2211.03688v1	null
2022-10-31	Tree Detection and Diameter Estimation Based on Deep Learning	Vincent Grondin et.al.	2210.17424v1	link
2022-10-26	Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds	Zhiyuan Zhang et.al.	2210.14899v1	null
2022-10-23	Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders	Ömer Sümer et.al.	2210.12705v1	null
2022-10-21	Real-time Detection of 2D Tool Landmarks with Synthetic Training Data	Bram Vanherle et.al.	2210.11991v1	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236v1	null
2022-10-04	Centroid Distance Keypoint Detector for Colored Point Clouds	Hanzhe Teng et.al.	2210.01298v1	link
2022-09-28	Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences	Jun-Jee Chao et.al.	2209.14419v1	null
2022-09-28	USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation	Zhengrong Xue et.al.	2209.13864v1	null
2022-10-16	Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection	Neelay Joglekar et.al.	2209.13657v2	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586v1	link
2022-09-26	Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments	Kyungmin Jung et.al.	2209.12881v1	null
2022-10-07	Long-Lived Accurate Keypoints in Event Streams	Philippe Chiberre et.al.	2209.10385v2	null
2022-09-20	Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence	Sunghwan Hong et.al.	2209.08742v2	null
2022-09-15	Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections	Bastian Pätzold et.al.	2209.07393v1	link
2022-09-07	Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip	Yang Li et.al.	2209.03440v1	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997v1	null
2022-08-24	Self-Supervised Endoscopic Image Key-Points Matching	Manel Farhat et.al.	2208.11424v1	link
2022-08-19	Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture	Muhammad Muzammel et.al.	2208.08224v2	null
2022-08-08	MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis	Maximilian Gilles et.al.	2208.03963v1	null
2022-08-07	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization	Yujiao Shi et.al.	2208.03660v1	null
2022-07-29	Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation	Qihao Liu et.al.	2208.00090v1	null
2022-07-25	Translating a Visual LEGO Manual to a Machine-Executable Plan	Ruocheng Wang et.al.	2207.12572v1	null
2022-07-21	Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network	Aline Sindel et.al.	2207.10506v1	null
2022-07-15	Human keypoint detection for close proximity human-robot interaction	Jan Docekal et.al.	2207.07742v1	null
2022-07-15	Adversarial Focal Loss: Asking Your Discriminator for Hard Examples	Chen Liu et.al.	2207.07739v1	null
2022-07-13	Rapid Person Re-Identification via Sub-space Consistency Regularization	Qingze Yin et.al.	2207.05933v1	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539v1	null
2022-08-15	Semi-supervised Human Pose Estimation in Art-historical Images	Matthias Springstein et.al.	2207.02976v3	link
2022-07-01	Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling	Jiamin Liang et.al.	2207.00474v1	null
2022-06-24	Motion Estimation for Large Displacements and Deformations	Qiao Chen et.al.	2206.12464v1	null
2022-06-24	Deep embedded clustering algorithm for clustering PACS repositories	Teo Manojlović et.al.	2206.12417v1	null
2022-06-21	KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences	Xuanhan Wang et.al.	2206.10090v1	link
2022-06-20	Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval	Guile Wu et.al.	2206.09806v1	null
2022-06-15	A Unified Sequence Interface for Vision Tasks	Ting Chen et.al.	2206.07669v1	link
2022-06-09	Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields	Mingtong Zhang et.al.	2206.04669v1	null
2022-06-03	SNAKE: Shape-aware Neural 3D Keypoint Field	Chengliang Zhong et.al.	2206.01724v1	link
2022-05-17	MulT: An End-to-End Multitask Learning Transformer	Deblina Bhattacharjee et.al.	2205.08303v1	null
2022-05-10	ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild	Chirag Raman et.al.	2205.05177v1	link
2022-04-28	Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept	Emilio Gomez-Gonzalez et.al.	2204.14050v1	null
2022-05-02	GRIT: General Robust Image Task Benchmark	Tanmay Gupta et.al.	2204.13653v2	link
2022-05-24	ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation	Yufei Xu et.al.	2204.12484v2	link
2022-04-26	Unified GCNs: Towards Connecting GCNs with CNNs	Ziyan Zhang et.al.	2204.12300v1	null
2022-04-19	Self-Supervised Equivariant Learning for Oriented Keypoint Detection	Jongmin Lee et.al.	2204.08613v1	link
2022-04-17	The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation	Bao Zhao et.al.	2204.08024v1	null
2022-04-15	2D Human Pose Estimation: A Survey	Haoming Chen et.al.	2204.07370v1	null
2022-04-11	Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification	Haojie Liu et.al.	2204.04842v1	null
2022-04-07	Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification	Yanan Wang et.al.	2204.02611v2	link
2022-04-02	SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning	Nilaksh Das et.al.	2204.00734v1	link
2022-04-01	MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration	Chenzhong Gao et.al.	2204.00260v1	null
2022-03-29	Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning	David Howard et.al.	2203.15172v1	null
2022-03-28	REGTR: End-to-end Point Cloud Correspondences with Transformers	Zi Jian Yew et.al.	2203.14517v1	link
2022-03-27	UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection	Ye Liu et.al.	2203.12745v2	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645v2	link
2022-03-16	PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research	R. James Cotton et.al.	2203.08792v1	link
2022-03-11	DRTAM: Dual Rank-1 Tensor Attention Module	Hanxing Chi et.al.	2203.05893v1	null
2022-03-07	Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation	Meng Tian et.al.	2203.03498v1	null
2022-02-10	Motion-Aware Transformer For Occluded Person Re-identification	Mi Zhou et.al.	2202.04243v2	null
2022-02-03	Sim2Real Object-Centric Keypoint Detection and Description	Chengliang Zhong et.al.	2202.00448v2	null
2022-01-16	Cross-Centroid Ripple Pattern for Facial Expression Recognition	Monu Verma et.al.	2201.05958v1	null
2022-01-14	Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words	Harry Nguyen et.al.	2201.03556v2	link
2022-01-10	Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials	Jinnavat Sanalohit et.al.	2201.03170v1	null
2022-01-06	A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration	Aline Sindel et.al.	2201.02242v1	null
2021-12-28	Skin feature point tracking using deep feature encodings	Jose Ramon Chang et.al.	2112.14159v1	null
2021-12-23	Data-efficient learning for 3D mirror symmetry detection	Yancong Lin et.al.	2112.12579v1	null
2021-12-22	Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model	Michael Zwölfer et.al.	2112.12193v1	null
2021-12-22	Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction	Henrique Siqueira et.al.	2112.12002v1	link
2021-12-19	Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection	Renjie Li et.al.	2112.10275v1	null
2021-12-19	GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor	Jean-Baptiste Carluer et.al.	2112.10258v1	link
2021-12-16	Masked Feature Prediction for Self-Supervised Visual Pre-Training	Chen Wei et.al.	2112.09133v1	link
2021-12-13	DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points	Zhengfei Kuang et.al.	2112.06910v1	null
2021-12-12	Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species	Changsheng Lu et.al.	2112.06183v1	link
2021-12-13	Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings	Mel Vecerik et.al.	2112.04910v2	null
2021-12-06	ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction	Xiaoming Zhao et.al.	2112.02906v1	link
2021-11-25	Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association	Sen Yang et.al.	2111.12892v1	link
2021-11-08	Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images	Jianfei Guo et.al.	2111.04237v1	null
2021-11-04	Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image	Feng Liu et.al.	2111.03098v1	null
2021-11-01	Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision	Ali Safa et.al.	2111.00791v2	null
2021-10-30	Geometry-Aware Hierarchical Bayesian Learning on Manifolds	Yonghui Fan et.al.	2111.00184v1	null
2021-10-26	CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration	Hao Yu et.al.	2110.14076v1	link
2021-10-23	HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware	James Hegarty et.al.	2110.12106v1	null
2021-10-18	Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning	Shengzeng Huo et.al.	2110.08962v1	null
2021-10-11	High-order Tensor Pooling with Attention for Action Recognition	Piotr Koniusz et.al.	2110.05216v1	null
2021-10-10	Digging Into Self-Supervised Learning of Feature Descriptors	Iaroslav Melekhov et.al.	2110.04773v1	null
2021-10-04	BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion	Zhaoqun Li et.al.	2110.01179v1	link
2021-10-01	Machine learning aided noise filtration and signal classification for CREDO experiment	Łukasz Bibrzycki et.al.	2110.00297v1	null
2021-09-28	PDC-Net+: Enhanced Probabilistic Dense Correspondence Network	Prune Truong et.al.	2109.13912v2	link
2021-09-27	HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines	Fabio Bellavia et.al.	2109.12925v3	null
2021-09-24	Catadioptric Stereo on a Smartphone	Kristijan Bartol et.al.	2109.11872v1	null
2021-09-20	Semi-supervised Dense Keypoints using Unlabeled Multiview Images	Zhixuan Yu et.al.	2109.09299v1	null
2021-08-31	A Novel Dataset for Keypoint Detection of quadruped Animals from Images	Prianka Banik et.al.	2108.13958v1	link
2021-08-27	A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images	Xiaoteng Zhou et.al.	2108.12151v1	null

(back to top)

Image Matching

Publish Date	Title	Authors	PDF	Code
2025-03-06	Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis	Xingcan Hu et.al.	2503.04205v1	null
2025-03-06	Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration	Qianliang Wu et.al.	2503.04127v1	null
2025-02-28	CNSv2: Probabilistic Correspondence Encoded Neural Image Servo	Anzhe Chen et.al.	2503.00132v1	null
2025-02-27	A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization	Yejun Zhang et.al.	2502.20036v1	link
2025-02-27	RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges	Thibaut Loiseau et.al.	2502.19955v1	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242v1	link
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104v1	link
2025-02-25	Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking	Xin Tong et.al.	2502.17766v1	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779v3	null
2025-02-16	FeaKM: Robust Collaborative Perception under Noisy Pose Conditions	Jiuwu Hao et.al.	2502.11003v1	link
2025-02-24	Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation	Emanuele Mule et.al.	2502.06288v3	link
2025-02-04	Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications	William O'Donnell et.al.	2502.02624v1	null
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277v1	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299v1	null
2025-01-13	MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training	Xingyi He et.al.	2501.07556v1	null
2025-01-13	Matching Free Depth Recovery from Structured Light	Zhuohang Yu et.al.	2501.07113v1	null
2025-01-02	Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views	Yulun Wu et.al.	2501.01196v1	null
2024-12-31	Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights	Bharath Kumar Agnur et.al.	2412.20210v2	null
2024-12-27	MINIMA: Modality Invariant Image Matching	Xingyu Jiang et.al.	2412.19412v1	link
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221v1	link
2024-12-17	Bringing Multimodality to Amazon Visual Search System	Xinliang Zhu et.al.	2412.13364v1	null
2024-12-04	Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis	Siyoon Jin et.al.	2412.03150v1	null
2024-11-20	DT-LSD: Deformable Transformer-based Line Segment Detection	Sebastian Janampa et.al.	2411.13005v1	link
2024-11-15	Image Matching Filtering and Refinement by Planes and Beyond	Fabio Bellavia et.al.	2411.09484v2	link
2024-11-11	XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration	Ismail Can Yagmur et.al.	2411.07430v1	link
2024-11-07	The Impact of Semi-Supervised Learning on Line Segment Detection	Johanna Engman et.al.	2411.04596v1	link
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851v1	null
2024-10-30	Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants	Azadeh Sharafi et.al.	2410.23329v1	null
2024-11-05	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280v2	null
2024-10-30	LoFLAT: Local Feature Matching using Focused Linear Attention Transformer	Naijian Cao et.al.	2410.22710v1	null
2024-10-26	Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification	Yue Su et.al.	2410.20097v1	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848v1	null
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673v1	null
2024-09-25	Game4Loc: A UAV Geo-Localization Benchmark from Game Data	Yuxiang Ji et.al.	2409.16925v1	link
2024-09-24	Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge	Marek Wodzinski et.al.	2409.15931v1	null
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471v1	link
2024-09-05	Enabling Practical and Privacy-Preserving Image Processing	Chao Wang et.al.	2409.03568v1	null
2024-09-20	A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering	Shuang Song et.al.	2409.03032v2	link
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445v2	link
2024-08-26	Affine steerers for structured keypoint description	Georg Bökman et.al.	2408.14186v1	link
2024-09-11	Coarse-to-fine Alignment Makes Better Speech-image Retrieval	Lifeng Zhou et.al.	2408.13119v2	null
2024-08-19	BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval	Zhenyu Lu et.al.	2408.10383v1	null
2024-08-14	RSD-DOG: A New Image Descriptor based on Second Order Derivatives	Darshan Venkatrayappa et.al.	2408.07687v1	null
2024-08-07	PRISM: PRogressive dependency maxImization for Scale-invariant image Matching	Xudong Cai et.al.	2408.03598v1	null
2024-08-05	ConDL: Detector-Free Dense Image Matching	Monika Kwiatkowski et.al.	2408.02766v1	null
2024-08-04	Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image	Xinlin Ren et.al.	2408.02079v1	link
2024-07-29	Image-text matching for large-scale book collections	Artemis Llabrés et.al.	2407.19812v1	link
2024-07-26	PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis	Sohyeong Kim et.al.	2407.18695v1	null
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791v1	null
2024-07-16	REMM: Rotation-Equivariant Framework for End-to-End Multimodal Image Matching	Han Nie et.al.	2407.11637v1	link
2024-07-16	A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation	Chengsheng Li et.al.	2407.11287v1	null
2024-07-14	Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching	Xiaoyong Lu et.al.	2407.07789v2	null
2024-07-10	Mutual Information calculation on different appearances	Jiecheng Liao et.al.	2407.07410v1	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939v3	null
2024-07-03	IMC 2024 Methods & Solutions Review	Shyam Gupta et.al.	2407.03172v1	null
2024-06-21	High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method	F. S. Mortazavi et.al.	2406.15121v1	null
2024-06-16	Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models	Yikai Zhang et.al.	2406.10902v1	link
2024-06-14	Grounding Image Matching in 3D with MASt3R	Vincent Leroy et.al.	2406.09756v1	link
2024-05-22	Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching	Hongkai Chen et.al.	2405.13874v1	null
2024-05-21	OmniGlue: Generalizable Feature Matching with Foundation Model Guidance	Hanwen Jiang et.al.	2405.12979v1	link
2024-07-09	Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation	Rezkellah Noureddine Khiati et.al.	2405.08556v2	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434v1	null
2024-05-13	Authentic Hand Avatar from a Phone Scan via Universal Hand Model	Gyeongsik Moon et.al.	2405.07933v1	null
2024-04-30	A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images	Wang Zhang et.al.	2404.19311v1	null
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174v1	null
2024-06-10	MinBackProp -- Backpropagating through Minimal Solvers	Diana Sungatullina et.al.	2404.17993v2	link
2024-04-23	FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction	Hang Hua et.al.	2404.14715v1	null
2024-05-23	A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching	Francesco Pro et.al.	2404.11302v2	link
2024-04-16	Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction	John Francis et.al.	2404.10626v1	null
2024-04-15	XoFTR: Cross-modal Feature Matching Transformer	Önder Tuzcuoğlu et.al.	2404.09692v1	link
2024-04-13	DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector	Johan Edstedt et.al.	2404.08928v1	link
2024-04-09	Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences	Axel Barroso-Laguna et.al.	2404.06337v1	link
2024-04-01	Marrying NeRF with Feature Matching for One-step Pose Estimation	Ronghan Chen et.al.	2404.00891v1	null
2024-04-01	3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching	Yibin Ye et.al.	2404.00838v1	null
2024-03-31	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546v1	null
2024-03-30	Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation	Yuan Wang et.al.	2404.00262v1	null
2024-03-26	Staircase Localization for Autonomous Exploration in Urban Environments	Jinrae Kim et.al.	2403.17330v1	null
2024-03-23	MatchSeg: Towards Better Segmentation via Reference Image Matching	Ruiqiang Xiao et.al.	2403.15901v1	link
2024-03-19	HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching	Ying Chen et.al.	2403.12543v1	null
2024-03-16	Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	Shunsuke Tsubaki et.al.	2403.10756v1	null
2024-03-16	Vector search with small radiuses	Gergely Szilvasy et.al.	2403.10746v1	null
2024-03-15	Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline	Fangming Yuan et.al.	2403.10283v1	null
2024-03-15	Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning	Meixuan Li et.al.	2403.10252v1	null
2024-03-14	Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning	Xilin Yang et.al.	2403.09100v1	null
2024-03-18	Matching Non-Identical Objects	Yusuke Marumo et.al.	2403.08227v2	null
2024-03-07	Scene Depth Estimation from Traditional Oriental Landscape Paintings	Sungho Kang et.al.	2403.03408v2	null
2024-02-21	Visual Style Prompting with Swapping Self-Attention	Jaeseok Jeong et.al.	2402.12974v2	link
2024-02-16	GIM: Learning Generalizable Image Matcher From Internet Videos	Xuelun Shen et.al.	2402.11095v1	link
2024-02-13	Are Semi-Dense Detector-Free Methods Good at Matching Local Features?	Matthieu Vilain et.al.	2402.08671v1	null
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359v1	link
2024-01-24	Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry	Qi Cai et.al.	2401.13357v1	null
2024-01-18	Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation	Songhe Deng et.al.	2401.09883v1	link
2024-01-26	RomniStereo: Recurrent Omnidirectional Stereo Matching	Hualie Jiang et.al.	2401.04345v2	link
2024-01-05	CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs	Daoan Zhang et.al.	2401.02582v1	null
2024-01-03	Local Adaptive Clustering Based Image Matching for Automatic Visual Identification	Zhizhen Wang et.al.	2401.01720v1	null
2024-01-03	A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization	Shishen Li et.al.	2401.01574v1	null
2023-12-23	BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation	Tavis Shore et.al.	2312.15363v1	link
2023-12-22	Harnessing Diffusion Models for Visual Perception with Meta Prompts	Qiang Wan et.al.	2312.14733v1	link
2024-01-05	MatchDet: A Collaborative Framework for Image Matching and Object Detection	Jinxiang Lai et.al.	2312.10983v2	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563v1	null
2023-12-04	Steerers: A framework for rotation equivariant keypoint descriptors	Georg Bökman et.al.	2312.02152v1	link
2023-11-30	DSeg: Direct Line Segments Detection	Berger Cyrille et.al.	2311.18344v1	null
2023-11-30	Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications	Sahar Almahfouz Nasser et.al.	2311.18281v1	null
2023-11-29	LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching	Wenhao Zhong et.al.	2311.17571v1	link
2023-11-08	Zero-shot Translation of Attention Patterns in VQA Models to Natural Language	Leonard Salewski et.al.	2311.05043v1	link
2023-11-06	An invariant feature extraction for multi-modal images matching	Chenzhong Gao et.al.	2311.02842v1	null
2023-10-23	RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments	Jinyu Li et.al.	2310.15072v1	link
2023-10-23	Player Re-Identification Using Body Part Appearences	Mahesh Bhosale et.al.	2310.14469v1	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605v1	null
2023-10-07	UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation	Shuai Yuan et.al.	2310.04712v1	null
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092v1	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992v1	link
2023-09-27	KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping	Renlang Huang et.al.	2309.15394v1	null
2023-10-13	A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo	Debao Huang et.al.	2309.09379v2	null
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438v1	link
2023-09-09	Neural Semantic Surface Maps	Luca Morreale et.al.	2309.04836v1	null
2023-09-05	Doppelgangers: Learning to Disambiguate Images of Similar Structures	Ruojin Cai et.al.	2309.02420v1	link
2023-08-14	Occ$^2$Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions	Miao Fan et.al.	2308.16160v1	null
2023-08-22	Scene-Aware Feature Matching	Xiaoyong Lu et.al.	2308.09949v2	null
2023-08-02	ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation	Bo Zhang et.al.	2308.00400v2	link
2023-07-28	Cross-Modal Concept Learning and Inference for Vision-Language Models	Yi Zhang et.al.	2307.15460v1	null
2023-07-22	CryptoMask: Privacy-preserving Face Recognition	Jianli Bai et.al.	2307.12010v1	null
2023-07-22	A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration	Jing Hao et.al.	2307.11997v1	null
2023-07-21	Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data	Sahar Almahfouz Nasser et.al.	2307.10698v2	link
2023-08-08	Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education	Neel Kanwal et.al.	2307.09426v2	null
2023-08-01	Unsupervised Deep Graph Matching Based on Cycle Consistency	Siddharth Tourani et.al.	2307.08930v4	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763v1	null
2023-07-09	Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion	Jie S. Li et.al.	2307.05564v1	null
2023-07-11	TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation	Paul Grimal et.al.	2307.05134v1	link
2023-07-02	TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching	Khang Truong Giang et.al.	2307.00485v1	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669v1	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667v2	null
2023-06-25	Enhancing Dynamic Image Advertising with Vision-Language Pre-training	Zhoufutu Wen et.al.	2306.14112v1	null
2023-06-19	Graph Self-Supervised Learning for Endoscopic Image Matching	Manel Farhat et.al.	2306.11141v1	link
2023-06-07	A2B: Anchor to Barycentric Coordinate for Robust Correspondence	Weiyue Zhao et.al.	2306.02760v2	null
2023-05-27	Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation	Yueh-Cheng Huang et.al.	2305.17463v1	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036v1	link
2023-05-18	LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation	Yujie Lu et.al.	2305.11116v1	link
2023-05-16	A Method for Training-free Person Image Picture Generation	Tianyu Chen et.al.	2305.09817v1	null
2023-05-15	Image Matching by Bare Homography	Fabio Bellavia et.al.	2305.08946v1	null
2023-05-12	CLIP-Count: Towards Text-Guided Zero-Shot Object Counting	Ruixiang Jiang et.al.	2305.07304v1	link
2023-05-10	SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking	Adam Schmidt et.al.	2305.06477v1	null
2023-05-10	Level-line Guided Edge Drawing for Robust Line Segment Detection	Xinyu Lin et.al.	2305.05883v1	link
2023-05-09	ColonMapper: topological mapping and localization for colonoscopy	Javier Morlana et.al.	2305.05546v1	null
2023-04-29	A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges	Xinyu Lin et.al.	2305.00264v1	link
2023-04-28	SFD2: Semantic-guided Feature Detection and Description	Fei Xue et.al.	2304.14845v1	link
2023-04-17	DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching	Mohamed Ali Chebbi et.al.	2304.08056v1	link
2023-04-16	Long-term Visual Localization with Mobile Sensors	Shen Yan et.al.	2304.07691v1	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194v1	link
2023-04-16	ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation	Xiaoming Zhao et.al.	2304.03608v2	link
2023-04-04	GlueStick: Robust Image Matching by Sticking Points and Lines Together	Rémi Pautrat et.al.	2304.02008v1	link
2023-04-03	PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching	Pedro Castro et.al.	2304.01382v1	null
2023-04-02	Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints	Guilherme Potje et.al.	2304.00583v1	link
2023-04-13	Structured Epipolar Matcher for Local Feature Matching	Jiahao Chang et.al.	2303.16646v3	null
2023-03-28	ASIC: Aligning Sparse in-the-wild Image Collections	Kamal Gupta et.al.	2303.16201v1	null
2023-03-25	Learning Rotation-Equivariant Features for Visual Correspondence	Jongmin Lee et.al.	2303.15472v1	null
2023-03-27	Learnable Graph Matching: A Practical Paradigm for Data Association	Jiawei He et.al.	2303.15414v1	link
2023-03-24	Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance	Hongjian Song et.al.	2303.13794v1	null
2023-03-15	Rethinking Optical Flow from Geometric Matching Consistent Perspective	Qiaole Dong et.al.	2303.08384v1	link
2023-03-07	Parsing Line Segments of Floor Plan Images Using Graph Neural Networks	Mingxiang Chen et.al.	2303.03851v1	null
2023-03-06	Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints	Chenjie Cao et.al.	2303.02885v1	link
2023-03-10	ParaFormer: Parallel Attention Transformer for Efficient Feature Matching	Xiaoyong Lu et.al.	2303.00941v2	null
2023-03-01	RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique	Jiayuan Li et.al.	2303.00319v1	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239v1	link
2023-02-25	BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI	Yulong Liu et.al.	2302.12971v1	link
2023-02-24	Classification of structural building damage grades from multi-temporal photogrammetric point clouds using a machine learning model trained on virtual laser scanning data	Vivien Zahs et.al.	2302.12591v1	null
2023-02-20	A Large Scale Homography Benchmark	Daniel Barath et.al.	2302.09997v1	link
2023-02-10	General, Single-shot, Target-less, and Automatic LiDAR-Camera Extrinsic Calibration Toolbox	Kenji Koide et.al.	2302.05094v1	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572v1	link
2023-01-27	Harmonizing Flows: Unsupervised MR harmonization based on normalizing flows	Farzad Beizaee et.al.	2301.11551v1	link
2023-01-24	Feature-based Image Matching for Identifying Individual Kākā	Fintan O'Sullivan et.al.	2301.06678v2	null
2023-01-18	Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images	Johannes Bayer et.al.	2301.03155v2	null
2023-01-07	Deep Learning-Based UAV Aerial Triangulation without Image Control Points	Jiageng Zhong et.al.	2301.02869v1	null
2023-01-06	The UNCOVER Survey: A first-look HST+JWST catalog of 50,000 galaxies near Abell 2744 and beyond	John R. Weaver et.al.	2301.02671v1	link
2023-02-13	Translating Text Synopses to Video Storyboards	Xu Gu et.al.	2301.00135v2	link
2022-12-23	SuperGF: Unifying Local and Global Features for Visual Localization	Wenzheng Song et.al.	2212.13105v1	null
2022-12-26	Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images	Weizhi Du et.al.	2212.13068v1	null
2022-12-20	Seafloor-Invariant Caustics Removal from Underwater Imagery	Panagiotis Agrafiotis et.al.	2212.10167v1	null
2022-12-15	DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients	Rémi Pautrat et.al.	2212.07766v1	link
2022-12-14	Shared Coupling-bridge for Weakly Supervised Local Feature Learning	Jiayuan Sun et.al.	2212.07047v1	link
2022-12-05	Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter	Suleyman Melih Portakal et.al.	2212.02302v1	null
2022-12-05	ObjectMatch: Robust Registration using Canonical Object Correspondences	Can Gümeli et.al.	2212.01985v1	null
2022-12-07	Universe Points Representation Learning for Partial Multi-Graph Matching	Zhakshylyk Nurlanov et.al.	2212.00780v2	null
2022-11-30	Self-Supervised Feature Learning for Long-Term Metric Visual Localization	Yuxuan Chen et.al.	2212.00122v1	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069v1	link
2022-11-19	Person Text-Image Matching via Text-Feature Interpretability Embedding and External Attack Node Implantation	Fan Li et.al.	2211.08657v2	link
2022-11-20	Detecting Line Segments in Motion-blurred Images with Events	Huai Yu et.al.	2211.07365v2	link
2022-11-15	Fast Key Points Detection and Matching for Tree-Structured Images	Hao Wang et.al.	2211.03242v2	null
2022-10-25	A Comparative Study on Deep-Learning Methods for Dense Image Matching of Multi-angle and Multi-date Remote Sensing Stereo Images	Hessah Albanwan et.al.	2210.14031v1	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517v1	null
2022-10-07	Mars Rover Localization Based on A2G Obstacle Distribution Pattern Matching	Lang Zhou et.al.	2210.03398v1	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586v1	link
2022-09-25	ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement	Dongli Tan et.al.	2209.12213v1	null
2022-09-22	DRKF: Distilled Rotated Kernel Fusion for Efficiently Boosting Rotation Invariance in Image Matching	Chao Li et.al.	2209.10907v1	null
2022-11-15	Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology	Arpan Kusari et.al.	2209.09090v2	null
2022-09-16	SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence	Lei Li et.al.	2209.07806v1	link
2022-08-30	ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer	Hongkai Chen et.al.	2208.14201v1	link
2022-08-25	A Gis Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning	Jianli Wei et.al.	2208.12251v1	link
2022-08-25	UAS Navigation in the Real World Using Visual Observation	Yuci Han et.al.	2208.12125v1	null
2022-08-24	Self-Supervised Endoscopic Image Key-Points Matching	Manel Farhat et.al.	2208.11424v1	link
2022-08-22	Equivariant Hypergraph Neural Networks	Jinwoo Kim et.al.	2208.10428v1	link
2022-09-22	Understanding Attention for Vision-and-Language Tasks	Feiqi Cao et.al.	2208.08104v2	link
2022-08-16	Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning	Dongwoo Park et.al.	2208.07039v2	link
2022-08-04	Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification	Xinyu Lin et.al.	2208.02450v1	link
2022-08-04	OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images	Weijia Li et.al.	2208.00928v2	null
2022-07-29	Testing Relational Understanding in Text-Guided Image Generation	Colin Conwell et.al.	2208.00005v1	null
2022-07-21	Pose for Everything: Towards Category-Agnostic Pose Estimation	Lumin Xu et.al.	2207.10387v1	link
2022-07-20	Explaining Deepfake Detection by Analysing Image Matching	Shichao Dong et.al.	2207.09679v1	link
2022-07-18	Adaptive Assignment for Geometry Aware Local Feature Matching	Dihe Huang et.al.	2207.08427v1	link
2022-07-16	Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching	Jiazhen Liu et.al.	2207.07932v1	link
2022-07-06	Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks	Yijie Zhang et.al.	2207.02946v1	null
2022-07-01	TopicFM: Robust and Interpretable Feature Matching with Topic-assisted	Khang Truong Giang et.al.	2207.00328v1	link
2022-06-16	Virtual Correspondence: Humans as a Cue for Extreme-View Geometry	Wei-Chiu Ma et.al.	2206.08365v1	null
2022-06-15	Self-Supervised Learning of Image Scale and Orientation	Jongmin Lee et.al.	2206.07259v1	link
2022-05-27	Image Keypoint Matching using Graph Neural Networks	Nancy Xu et.al.	2205.14275v1	null
2022-05-27	Fine-tuning deep learning models for stereo matching using results from semi-global matching	Hessah Albanwan et.al.	2205.14051v1	null
2022-05-23	TransforMatcher: Match-to-Match Attention for Semantic Correspondence	Seungwook Kim et.al.	2205.11634v1	link
2022-05-16	ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning	Yuxin Deng et.al.	2205.07439v1	null
2022-05-06	BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching	Jingwei Song et.al.	2205.03133v1	link
2022-05-10	AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching	Khanh Nguyen et.al.	2205.02849v2	link
2022-04-27	Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points	Chao Li et.al.	2204.12884v1	null
2022-04-22	SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite	Runzhe Zhu et.al.	2204.10704v1	link
2022-04-20	Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Leila Pishdad et.al.	2204.09268v1	null
2022-04-19	OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching	Ostap Viniavskyi et.al.	2204.08870v1	link
2022-04-19	Self-Supervised Equivariant Learning for Oriented Keypoint Detection	Jongmin Lee et.al.	2204.08613v1	link
2022-04-22	Efficient Linear Attention for Fast and Accurate Keypoint Matching	Suwichaya Suwanwimolkul et.al.	2204.07731v3	null
2022-04-08	Lightweight starshade position sensing with convolutional neural networks and simulation-based inference	Andrew Chen et.al.	2204.03853v1	link
2022-03-30	AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift	Burak Yildiz et.al.	2203.16291v1	link
2022-03-29	Photographic Visualization of Weather Forecasts with Generative Adversarial Networks	Christian Sigg et.al.	2203.15601v1	link
2022-03-29	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots	Pranay Mathur et.al.	2203.15272v1	null
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901v1	link
2022-03-28	S2-Net: Self-supervision Guided Feature Representation Learning for Cross-Modality Images	Shasha Mei et.al.	2203.14581v1	null
2022-03-26	Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching	Yujiao Shi et.al.	2203.14148v1	link
2022-03-24	Keypoints Tracking via Transformer Networks	Oleksii Nasypanyi et.al.	2203.12848v1	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645v2	link
2022-03-14	There's no difference: Convolutional Neural Networks for transient detection without template subtraction	Tatiana Acero-Cuellar et.al.	2203.07390v1	link
2022-03-25	Cross Language Image Matching for Weakly Supervised Semantic Segmentation	Jinheng Xie et.al.	2203.02668v2	link
2022-03-01	CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP	Zihao Wang et.al.	2203.00386v1	null
2022-03-09	Time-resolved Imaging of Stochastic Cascade Reactions over a Submillisecond to Second Time Range at the Angstrom Level	Toshiki Shimizu et.al.	2202.13332v2	null
2022-02-16	Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images	Matheus M. Dos Santos et.al.	2202.07817v1	null
2022-02-14	CATs++: Boosting Cost Aggregation with Convolutions and Transformers	Seokju Cho et.al.	2202.06817v1	link
2022-02-11	Improving Image-recognition Edge Caches with a Generative Adversarial Network	Guilherme B. Souza et.al.	2202.05929v1	null
2022-02-08	Learning Optical Flow with Adaptive Graph Reasoning	Ao Luo et.al.	2202.03857v1	link
2022-02-03	Sim2Real Object-Centric Keypoint Detection and Description	Chengliang Zhong et.al.	2202.00448v2	null
2022-01-27	Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context	Jie Shao et.al.	2201.11296v1	null
2021-12-24	Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation	Zhiwei Liu et.al.	2112.12917v1	null
2021-12-20	Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching	Yujie Fu et.al.	2112.10485v1	null
2021-12-19	GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor	Jean-Baptiste Carluer et.al.	2112.10258v1	link
2021-12-14	More Control for Free! Image Synthesis with Semantic Diffusion Guidance	Xihui Liu et.al.	2112.05744v2	null
2021-12-08	Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning	Bijie Bai et.al.	2112.05240v1	null
2021-12-01	FaSS-MVS -- Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery	Boitumelo Ruf et.al.	2112.00821v1	null
2021-12-01	CLIPstyler: Image Style Transfer with a Single Text Condition	Gihyun Kwon et.al.	2112.00374v1	link
2021-11-29	Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution Features	Xiaoteng Zhou et.al.	2111.15514v1	null
2021-11-29	Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic	Yoad Tewel et.al.	2111.14447v1	link
2021-11-29	Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality Discriminator	Usman Cheema et.al.	2111.14339v1	null
2021-11-17	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network	Xiaoming Zhao et.al.	2111.09006v2	null
2021-11-17	Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features	Xiaoteng Zhou et.al.	2111.08994v3	null
2021-10-30	A Deep Search for Faint Chandra X-ray Sources, Radio Sources, and Optical Counterparts in NGC 6752	Haldan N. Cohn et.al.	2111.00357v1	null
2021-10-01	Robustly Removing Deep Sea Lighting Effects for Visual Mapping of Abyssal Plains	Kevin Köser et.al.	2110.00480v1	null
2021-09-29	Visually Grounded Concept Composition	Bowen Zhang et.al.	2109.14115v1	null
2021-09-27	HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines	Fabio Bellavia et.al.	2109.12925v3	null
2021-09-20	Viewpoint Invariant Dense Matching for Visual Geolocalization	Gabriele Berton et.al.	2109.09827v1	link
2021-09-20	Image Subtraction in Fourier Space	Lei Hu et.al.	2109.09334v1	link
2021-09-10	Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization	Sungho Yoon et.al.	2109.04753v1	link
2021-09-08	Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes	Wenzheng Song et.al.	2109.03585v2	null
2021-08-27	A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images	Xiaoteng Zhou et.al.	2108.12151v1	null
2021-08-27	Matching Underwater Sonar Images by the Learned Descriptor Based on Style Transfer Method	Xiaoteng Zhou et.al.	2108.12072v1	null
2021-08-26	Efficient Joint Object Matching via Linear Programming	Antonio De Rosa et.al.	2108.11911v1	null

(back to top)

NeRF

Publish Date	Title	Authors	PDF	Code
2025-03-06	Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering	Idris O. Sunmola et.al.	2503.04079v1	null
2025-03-05	LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation	Qian Feng et.al.	2503.03890v1	null
2025-03-04	Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries	Zeqing Wang et.al.	2503.02558v1	null
2025-03-04	2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting	Qipeng Yan et.al.	2503.02452v1	null
2025-03-04	Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views	Yingji Zhong et.al.	2503.02230v1	null
2025-03-04	Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints	Yan Miao et.al.	2503.02198v1	null
2025-03-03	Data Augmentation for NeRFs in the Low Data Limit	Ayush Gaggar et.al.	2503.02092v1	null
2025-03-03	Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models	Jay Zhangjie Wu et.al.	2503.01774v1	null
2025-03-05	Category-level Meta-learned NeRF Priors for Efficient Object Mapping	Saad Ejaz et.al.	2503.01582v2	null
2025-03-03	LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training	Kaimin Liao et.al.	2503.01199v1	null
2025-03-02	DreamPrinting: Volumetric Printing Primitives for High-Fidelity 3D Printing	Youjia Wang et.al.	2503.00887v1	null
2025-03-01	Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups	Nicholas Pfaff et.al.	2503.00370v1	null
2025-02-27	Identity-preserving Distillation Sampling by Fixed-Point Iterator	SeonHwa Kim et.al.	2502.19930v1	null
2025-02-27	NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission	Weijie Yue et.al.	2502.19873v1	null
2025-02-26	Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions	Muhammad Salman Ali et.al.	2502.19457v1	null
2025-02-26	Does 3D Gaussian Splatting Need Accurate Volumetric Rendering?	Adam Celarek et.al.	2502.19318v1	link
2025-02-26	The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields	Ziyuan Luo et.al.	2502.19125v1	null
2025-02-24	Semantic Neural Radiance Fields for Multi-Date Satellite Data	Valentin Wagner et.al.	2502.16992v1	link
2025-02-22	AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal	Luca Gough et.al.	2502.16351v1	null
2025-02-22	DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation	Yuxuan Xiong et.al.	2502.16302v1	null
2025-02-24	Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis	Ziqian Ni et.al.	2502.15635v2	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931v1	null
2025-02-20	NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis	Xiaoxing Liu et.al.	2502.14178v1	null
2025-02-19	GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian	Bang Du et.al.	2502.14129v1	null
2025-02-18	Geometry-Aware Diffusion Models for Multiview Scene Inpainting	Ahmad Salimi et.al.	2502.13335v1	null
2025-02-18	GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis	Pedro Martin et.al.	2502.13196v1	null
2025-02-18	ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition	Quoc-Anh Bui et.al.	2502.12673v1	null
2025-02-21	HumanGif: Single-View Human Diffusion with Generative Prior	Shoukang Hu et.al.	2502.12080v2	link
2025-02-17	3D Gaussian Inpainting with Depth-Guided Cross-View Consistency	Sheng-Yu Huang et.al.	2502.11801v1	null
2025-02-13	Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures	Francesco Ballerini et.al.	2502.09623v1	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111v1	null
2025-02-12	Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision	Tianle Liu et.al.	2502.08352v1	null
2025-02-10	PrismAvatar: Real-time animated 3D neural head avatars on edge devices	Prashant Raina et.al.	2502.07030v1	null
2025-02-10	Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC	Siwei Meng et.al.	2502.07007v1	null
2025-02-08	GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling	Kang Yang et.al.	2502.05708v1	null
2025-02-05	VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning	Jayram Palamadai et.al.	2502.05222v1	null
2025-02-11	PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression	Feifei Li et.al.	2502.04843v2	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657v1	null
2025-02-04	MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning	Shengbo Gu et.al.	2502.02372v1	null
2025-02-03	FourieRF: Few-Shot NeRFs via Progressive Fourier Frequency Control	Diego Gomez et.al.	2502.01405v1	null
2025-01-31	VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting	Mateusz Nowak et.al.	2501.17978v2	null
2025-01-28	LinPrim: Linear Primitives for Differentiable Volumetric Rendering	Nicolas von Lützow et.al.	2501.16312v2	null
2025-01-24	SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation	Yujian Liu et.al.	2501.14646v1	null
2025-02-05	GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting	Junzhe Jiang et.al.	2501.13971v2	link
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402v1	null
2025-01-22	Neural Radiance Fields for the Real World: A Survey	Wenhui Xiao et.al.	2501.13104v1	null
2025-02-02	DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform	Hung Nguyen et.al.	2501.12637v2	null
2025-01-21	DNRSelect: Active Best View Selection for Deferred Neural Rendering	Dongli Wu et.al.	2501.12150v1	null
2025-01-21	Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging	Shuyi Hu et.al.	2501.11884v1	null
2025-01-16	Poxel: Voxel Reconstruction for 3D Printing	Ruixiang Cao et.al.	2501.10474v1	null
2025-01-17	Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation	Xiaoyun Zheng et.al.	2501.09947v1	link
2025-01-16	Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes	Ji Shi et.al.	2501.09460v1	link
2025-01-15	SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM	Yuhang Ming et.al.	2501.08880v1	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286v1	null
2025-01-13	Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes	Yuhang Zhang et.al.	2501.08072v1	null
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015v2	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927v1	link
2025-01-12	ActiveGAMER: Active GAussian Mapping through Efficient Rendering	Liyan Chen et.al.	2501.06897v1	null
2025-01-17	SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis	Peng Zheng et.al.	2501.06770v2	null
2025-01-11	NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References	Qiang Qu et.al.	2501.06488v1	link
2025-01-10	UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping	Yanjie Li et.al.	2501.05783v1	null
2025-01-13	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226v2	null
2025-01-07	NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives	Leif Van Holland et.al.	2501.04074v1	link
2025-01-07	NeuralSVG: An Implicit Representation for Text-to-Vector Generation	Sagi Polaczek et.al.	2501.03992v1	null
2025-01-14	DehazeGS: Seeing Through Fog with 3D Gaussian Splatting	Jinze Yu et.al.	2501.03659v2	null
2025-01-07	ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting	Yifeng Yang et.al.	2501.03605v1	link
2025-01-07	AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene	Chaoran Feng et.al.	2501.02807v2	null
2024-12-29	Bringing Objects to Life: 4D generation from 3D objects	Ohad Rahamim et.al.	2412.20422v1	null
2024-12-27	Learning Radiance Fields from a Single Snapshot Compressive Image	Yunhao Li et.al.	2412.19483v1	null
2025-01-05	BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream	Gopi Raju Matta et.al.	2412.19370v2	null
2024-12-26	Generating Editable Head Avatars with 3D Gaussian GANs	Guohao Li et.al.	2412.19149v1	link
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130v1	null
2024-12-26	Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos	Changwoon Choi et.al.	2412.19089v1	null
2024-12-23	Editing Implicit and Explicit Representations of Radiance Fields: A Survey	Arthur Hubert et.al.	2412.17628v1	null
2024-12-23	Exploring Dynamic Novel View Synthesis Technologies for Cinematography	Adrian Azzarelli et.al.	2412.17532v1	null
2024-12-21	LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo	Fotios Logothetis et.al.	2412.16737v1	null
2024-12-20	NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems	Laura Weihl et.al.	2412.16141v1	null
2024-12-20	NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images	Yue Guo et.al.	2412.15890v1	null
2024-12-19	LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction	Pou-Chun Kung et.al.	2412.15447v1	null
2024-12-18	DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields	Xingyu Zhu et.al.	2412.15278v1	null
2024-12-19	GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting	Qianpu Sun et.al.	2412.14579v1	null
2024-12-19	Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images	Min Wang et.al.	2412.14547v1	null
2024-12-18	GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians	Xiaobao Wei et.al.	2412.13983v1	**[link](https://github.com/ucwxb/graphav
