Ther-nullptr/circult-eda-mlsys-tinyml-arxiv-daily: 🎓 Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)

Updated on 2025.05.04

Usage instructions: here

Table of Contents

Quantization
Pruning
Hardware-Software Co-Design
TinyML
Domain Specific Accelerator
Low-Rank Adaptation
Model Compression

Quantization

Publish Date	Title	Authors	PDF	Code
2025-05-01	Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction	Changjun Li et.al.	2505.00259	null
2025-04-24	Precision Neural Network Quantization via Learnable Adaptive Modules	Wenqiang Zhou et.al.	2504.17263	null
2025-04-21	StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models	Yeona Hong et.al.	2504.14915	null
2025-04-14	Enhancing Ultra-Low-Bit Quantization of Large Language Models Through Saliency-Aware Partial Retraining	Deyu Cao et.al.	2504.13932	null
2025-04-13	Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization	Yamato Arai et.al.	2504.09629	null
2025-04-12	DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models	Wenjin Ke et.al.	2504.09223	null
2025-04-10	Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression	Hanqi Xiao et.al.	2504.07389	link
2025-04-09	Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation	Sirine Arfa et.al.	2504.06748	null
2025-04-07	Achieving binary weight and activation for LLMs using Post-Training Quantization	Siqing Song et.al.	2504.05352	null
2025-03-29	RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm	Yongyi Yang et.al.	2504.03717	null
2025-04-04	Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency	Erik Johannes Husom et.al.	2504.03360	null
2025-04-03	APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers	Zhuguanyu Wu et.al.	2504.02508	link
2025-04-02	LLMPi: Optimizing LLMs for High-Throughput on Raspberry Pi	Mahsa Ardakani et.al.	2504.02118	null
2025-04-03	Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models	Hung-Yueh Chiang et.al.	2503.22879	link
2025-03-24	Wireless Hearables With Programmable Speech AI Accelerators	Malek Itani et.al.	2503.18698	null
2025-03-24	GranQ: Granular Zero-Shot Quantization with Unified Layer-Channel Awareness	Inpyo Hong et.al.	2503.18339	null
2025-03-20	QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge	Xuan Shen et.al.	2503.16709	null
2025-03-22	Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation	Yuqing Wang et.al.	2503.16430	null
2025-03-19	PARQ: Piecewise-Affine Regularized Quantization	Lisa Jin et.al.	2503.15748	null
2025-03-19	FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers	Ruichen Chen et.al.	2503.15465	link
2025-03-14	Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix	Junbiao Pang et.al.	2503.11159	null
2025-03-13	OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models	Akshat Ramachandran et.al.	2503.10959	null
2025-03-12	Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks	Idris Zakariyya et.al.	2503.08973	null
2025-03-10	QuantU-Net: Efficient Wearable Medical Imaging Using Bitwidth as a Trainable Parameter	Christiaan Boerkamp et.al.	2503.08719	null
2025-03-10	Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping	Ning Ding et.al.	2503.06930	null
2025-03-09	SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model	Jing Zhang et.al.	2503.06515	null
2025-03-05	AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model	Wenlun Zhang et.al.	2503.03088	null
2025-03-04	Q&C: When Quantization Meets Cache in Efficient Image Generation	Xin Ding et.al.	2503.02508	null
2025-02-28	Identifying Sensitive Weights via Post-quantization Integral	Yuezhou Hu et.al.	2503.01901	null
2025-03-03	KurTail: Kurtosis-based LLM Quantization	Mohammad Sadegh Akhondzadeh et.al.	2503.01483	null
2025-03-05	Regularization-based Framework for Quantization-, Fault- and Variability-Aware Training	Anmol Biswas et.al.	2503.01297	null
2025-02-27	HALO: Hardware-aware quantization with low critical-path-delay weights for LLM acceleration	Rohan Juneja et.al.	2502.19662	null
2025-02-26	Binary Neural Networks for Large Language Model: A Survey	Liangdong Liu et.al.	2502.19008	null
2025-02-23	Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression	Xiaoyi Qu et.al.	2502.16638	link
2025-02-17	Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer	Euntae Choi et.al.	2502.15779	null
2025-02-21	Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection	Jiangyong Yu et.al.	2502.15488	null
2025-02-21	CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution	Kai Liu et.al.	2502.15478	link
2025-02-21	LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design	Renjie Wei et.al.	2502.15260	null
2025-02-20	Hardware-Friendly Static Quantization Method for Video Diffusion Transformers	Sanghyun Yi et.al.	2502.15077	null
2025-02-18	PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models	Jiaqi Zhao et.al.	2502.13179	link
2025-02-18	Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis	Jiaqi Zhao et.al.	2502.13178	null
2025-02-17	Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?	Jacob Nielsen et.al.	2502.11895	null
2025-02-17	On Quantizing Neural Representation for Variable-Rate Video Coding	Junqi Shi et.al.	2502.11729	link
2025-02-14	Can Post-Training Quantization Benefit from an Additional QLoRA Integration?	Xiliang Zhu et.al.	2502.10202	null
2025-02-13	NestQuant: Nested Lattice Quantization for Matrix Products and LLMs	Semyon Savkin et.al.	2502.09720	null
2025-02-13	RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models	Quan Wei et.al.	2502.09003	null
2025-02-12	Compression of Site-Specific Deep Neural Networks for Massive MIMO Precoding	Ghazal Kasalaee et.al.	2502.08758	null
2025-02-06	Exploring Model Invariance with Discrete Search for Ultra-Low-Bit Quantization	Yuqiao Wen et.al.	2502.06844	null
2025-02-07	BCQ: Block Clustered Quantization for 4-bit (W4A4) LLM Inference	Reena Elangovan et.al.	2502.05376	null
2025-02-07	QuEST: Stable Training of LLMs with 1-Bit Weights and Activations	Andrei Panferov et.al.	2502.05003	link
2025-02-07	AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers	Runqing Jiang et.al.	2502.04628	null
2025-02-04	Survey of Quantization Techniques for On-Device Vision-based Crack Detection	Yuxuan Zhang et.al.	2502.02269	null
2025-02-03	Nearly Lossless Adaptive Bit Switching	Haiduo Huang et.al.	2502.01199	link
2025-02-03	On the impact of the parametrization of deep convolutional neural networks on post-training quantization	Samy Houache et.al.	2502.01156	null
2025-02-01	Oscillations Make Neural Networks Robust to Quantization	Jonathan Wenshøj et.al.	2502.00490	null
2025-02-01	MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization	JiangYong Yu et.al.	2502.00425	null
2025-01-30	Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models	Wanlong Liu et.al.	2501.18154	null
2025-01-28	Post-Training Quantization for 3D Medical Image Segmentation: A Practical Study on Real Inference Engines	Chongyu Qu et.al.	2501.17343	null
2025-01-28	Post-Training Quantization for Vision Mamba with k-Scaled Quantization and Reparameterization	Bo-Yun Shi et.al.	2501.16738	null
2025-01-24	End-to-end workflow for machine learning-based qubit readout with QICK and hls4ml	Giuseppe Di Guglielmo et.al.	2501.14663	null
2025-01-24	On Hardening DNNs against Noisy Computations	Xiao Wang et.al.	2501.14531	null
2025-01-23	OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting	Xing Hu et.al.	2501.13987	link
2025-01-23	QMamba: Post-Training Quantization for Vision State Space Models	Yinglong Li et.al.	2501.13624	null
2025-01-23	MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods	Zukang Xu et.al.	2501.13484	link
2025-01-21	UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model	Branislava Jankovic et.al.	2501.12087	null
2025-01-15	Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach	Alireza Ghaffari et.al.	2501.09107	null
2025-01-14	D$^2$-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models	Qian Zeng et.al.	2501.08180	link
2025-01-10	Mix-QViT: Mixed-Precision Vision Transformer Quantization Driven by Layer Importance and Quantization Sensitivity	Navin Ranjan et.al.	2501.06357	null
2025-01-09	Neural Architecture Codesign for Fast Physics Applications	Jason Weitz et.al.	2501.05515	link
2025-01-09	JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration	Mingzi Wang et.al.	2501.05339	null
2025-01-09	Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning	Dmytro Kuzmenko et.al.	2501.05329	null
2025-01-06	The Power of Negative Zero: Datatype Customization for Quantized Large Language Models	Yuzong Chen et.al.	2501.04052	null
2025-01-05	HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning	Saleh Ashkboos et.al.	2501.02625	link
2024-12-30	PQD: Post-training Quantization for Efficient Diffusion Models	Jiaojiao Ye et.al.	2501.00124	null
2024-12-30	Improving Acoustic Scene Classification in Low-Resource Conditions	Zhi Chen et.al.	2412.20722	null
2024-12-29	PTQ4VM: Post-Training Quantization for Visual Mamba	Younghyun Cho et.al.	2412.20386	link
2024-12-28	IMSSA: Deploying modern state-space models on memristive in-memory compute hardware	Sebastian Siegel et.al.	2412.20215	null
2024-12-27	Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales	Shuokai Pan et.al.	2412.19867	null
2024-12-27	MBQ: Modality-Balanced Quantization for Large Vision-Language Models	Shiyao Li et.al.	2412.19509	link
2024-12-24	Unified Stochastic Framework for Neural Network Quantization and Pruning	Haoyu Zhang et.al.	2412.18184	null
2024-12-21	TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models	Haocheng Huang et.al.	2412.16700	null
2024-12-20	Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart	Chengting Yu et.al.	2412.15846	null
2024-12-19	Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers	Rui Ding et.al.	2412.14633	null
2024-12-19	Qua$^2$SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models	Keith G. Mills et.al.	2412.14628	null
2024-12-18	ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals	Utkarsh Saxena et.al.	2412.14363	link
2024-12-15	Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment	Haisheng Lu et.al.	2412.11186	link
2024-12-13	TTAQ: Towards Stable Post-training Quantization in Continuous Domain Adaptation	Junrui Xiao et.al.	2412.09899	null
2024-12-12	CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs	Yuzhuang Xu et.al.	2412.09282	null
2024-12-10	Post-Training Non-Uniform Quantization for Convolutional Neural Networks	Ahmed Luqman et.al.	2412.07391	null
2024-12-09	FP=xINT: A Low-Bit Series Expansion Algorithm for Post-Training Quantization	Boyang Zhang et.al.	2412.06865	null
2024-12-09	Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion	Shuaiting Li et.al.	2412.06661	null
2024-12-07	GAQAT: gradient-adaptive quantization-aware training for domain generalization	Jiacheng Jiang et.al.	2412.05551	null
2024-12-07	SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization	Runsheng Bai et.al.	2412.04180	null
2024-12-05	Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task	Alireza Maleki et.al.	2412.03915	null
2024-12-03	CPTQuant - A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models	Amitash Nanda et.al.	2412.03599	null
2024-11-26	Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2411.17543	null
2024-12-03	PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution	Libo Zhu et.al.	2411.17106	link
2024-11-23	freePruner: A Training-free Approach for Large Multimodal Model Acceleration	Bingxin Xu et.al.	2411.15446	null
2024-11-22	FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration	Donghyeon Yi et.al.	2411.14733	null
2024-11-17	EfQAT: An Efficient Framework for Quantization-Aware Training	Saleh Ashkboos et.al.	2411.11038	null
2024-11-12	ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization	Weibo Zhao et.al.	2411.07762	null
2024-11-09	Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques	Jahid Hasan et.al.	2411.06084	null
2024-11-08	SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-30	Scaling Laws for Precision	Tanishq Kumar et.al.	2411.04330	null
2024-11-06	Interactions Across Blocks in Post-Training Quantization of Large Language Models	Khasmamad Shabanovi et.al.	2411.03934	null
2024-11-06	An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging	Públio Elon Correa da Silva et.al.	2411.03835	link
2024-11-06	TATAA: Programmable Mixed-Precision Transformer Acceleration with a Transformable Arithmetic Architecture	Jiajun Wu et.al.	2411.03697	null
2024-10-29	Data Generation for Hardware-Friendly Post-Training Quantization	Lior Dikstein et.al.	2410.22110	link
2024-10-30	IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models	Hang Guo et.al.	2410.21759	link
2024-10-26	DQRM: Deep Quantized Recommendation Models	Yang Zhou et.al.	2410.20046	link
2024-10-14	Real-Time Stress Detection via Photoplethysmogram Signals: Implementation of a Combined Continuous Wavelet Transform and Convolutional Neural Network on Resource-Constrained Microcontrollers	Yasin Hasanpoor et.al.	2410.19776	null
2024-10-24	TesseraQ: Ultra Low-Bit LLM Post-Training Quantization with Block Reconstruction	Yuhang Li et.al.	2410.19103	null
2024-10-18	Understanding the difficulty of low-precision post-training quantization of large language models	Zifei Xu et.al.	2410.14570	null
2024-10-17	Quamba: A Post-Training Quantization Recipe for Selective State Space Models	Hung-Yueh Chiang et.al.	2410.13229	link
2024-10-17	Scaling laws for post-training quantized large language models	Zifei Xu et.al.	2410.12119	null
2024-10-15	Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks	Alireza Khodamoradi et.al.	2410.11203	link
2024-10-06	Continuous Approximations for Improving Quantization Aware Training of LLMs	He Li et.al.	2410.10849	null
2024-10-12	SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs	Mohammad Mozaffari et.al.	2410.09615	link
2024-10-12	FlatQuant: Flatness Matters for LLM Quantization	Yuxuan Sun et.al.	2410.09426	link
2024-10-10	Q-VLM: Post-training Quantization for Large Vision-Language Models	Changyuan Wang et.al.	2410.08119	link
2024-10-10	Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection	Hubert Cecotti et.al.	2410.07920	null
2024-10-10	CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression	Wenyuan Liu et.al.	2410.07505	null
2024-10-09	Scaling Laws for Mixed quantization in Large Language Models	Zeyu Cao et.al.	2410.06722	null
2024-10-08	QERA: an Analytical Framework for Quantization Error Reconstruction	Cheng Zhang et.al.	2410.06040	null
2024-10-08	QT-DoG: Quantization-aware Training for Domain Generalization	Saqib Javed et.al.	2410.06020	link
2024-10-10	ARB-LLM: Alternating Refined Binarizations for Large Language Models	Zhiteng Li et.al.	2410.03129	link
2024-10-03	Lightweight Diffusion Models for Resource-Constrained Semantic Communication	Giovanni Pignata et.al.	2410.02491	link
2024-10-01	Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging	Ismail Erbas et.al.	2410.00948	null
2024-09-30	Constraint Guided Model Quantization of Neural Networks	Quinten Van Baelen et.al.	2409.20138	null
2024-09-26	P4Q: Learning to Prompt for Quantization in Visual-language Models	Huixin Sun et.al.	2409.17634	null
2024-09-25	Accumulator-Aware Post-Training Quantization	Ian Colbert et.al.	2409.17092	null
2024-09-25	VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models	Yifei Liu et.al.	2409.17066	link
2024-09-25	PTQ4RIS: Post-Training Quantization for Referring Image Segmentation	Xiaoyan Jiang et.al.	2409.17020	link
2024-09-26	INT-FlashAttention: Enabling Flash Attention for INT8 Quantization	Shimao Chen et.al.	2409.16997	link
2024-09-20	PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models	Jayneel Vora et.al.	2409.13894	null
2024-09-18	Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview	Yanshu Wang et.al.	2409.11650	null
2024-09-12	LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs	Han Xu et.al.	2409.11424	null
2024-09-12	DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing	Zhenyuan Dong et.al.	2409.07756	link
2024-08-31	Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization	Vage Egiazarian et.al.	2409.00492	null
2024-08-29	A machine learning approach for computing solar flare locations in X-rays on-board Solar Orbiter/STIX	Paolo Massa et.al.	2408.16642	link
2024-08-29	On-device AI: Quantization-aware Training of Transformers in Time-Series	Tianheng Ling et.al.	2408.16495	null
2024-08-27	The Uniqueness of LLaMA3-70B with Per-Channel Quantization: An Empirical Study	Minghai Qin et.al.	2408.15301	null
2024-08-25	MobileQuant: Mobile-friendly Quantization for On-device Language Models	Fuwen Tan et.al.	2408.13933	link
2024-08-25	Infrared Domain Adaptation with Zero-Shot Quantization	Burak Sevsay et.al.	2408.13925	null
2024-08-23	ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models	Chao Zeng et.al.	2408.08554	link
2024-08-14	Analog Spiking Neuron in CMOS 28 nm Towards Large-Scale Neuromorphic Processors	Marwan Besrour et.al.	2408.07734	null
2024-08-13	Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models	Cheng Chen et.al.	2408.06995	null
2024-08-11	RTF-Q: Unsupervised domain adaptation based retraining-free quantization network	Nanyang Du et.al.	2408.05752	null
2024-08-16	DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers	Lianwei Yang et.al.	2408.03291	null
2024-08-05	HQOD: Harmonious Quantization for Object Detection	Long Huang et.al.	2408.02561	link
2024-08-01	Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization	Róisín Luo et.al.	2408.00923	null
2024-08-07	Temporal Feature Matters: A Framework for Diffusion Model Quantization	Yushi Huang et.al.	2407.19547	null
2024-07-25	Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models	Sanae Lotfi et.al.	2407.18158	null
2024-07-27	MetaAug: Meta-Data Augmentation for Post-Training Quantization	Cuong Pham et.al.	2407.14726	link
2024-07-17	AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer	Zhuguanyu Wu et.al.	2407.12951	link
2024-07-17	Mamba-PTQ: Outlier Channels in Recurrent Large Language Models	Alessandro Pierro et.al.	2407.12397	null
2024-07-17	StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators	Ethan G Rogers et.al.	2407.12378	null
2024-07-17	Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models	Ayush Kaushal et.al.	2407.12327	link
2024-07-17	QVD: Post-training Quantization for Video Diffusion Models	Shilong Tian et.al.	2407.11585	null
2024-07-16	LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices	Jung Hyun Lee et.al.	2407.11534	link
2024-07-11	Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients	Zhenyu Zhang et.al.	2407.08296	link
2024-07-10	RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization	Xijie Huang et.al.	2407.08044	link

Pruning

Publish Date	Title	Authors	PDF	Code
2025-05-01	FineScope: Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation	Chaitali Bhattacharyya et.al.	2505.00624	null
2025-04-30	TinyMA-IEI-PPO: Exploration Incentive-Driven Multi-Agent DRL with Self-Adaptive Pruning for Vehicular Embodied AI Agent Twins Migration	Zhuoqi Zeng et.al.	2505.00055	null
2025-04-29	Efficient LLMs with AMP: Attention Heads and MLP Pruning	Leandro Giusti Mugnaini et.al.	2504.21174	null
2025-04-28	Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs	Muhammad Sabih et.al.	2504.19659	null
2025-04-25	Study on Real-Time Road Surface Reconstruction Using Stereo Vision	Deepak Ghimire et.al.	2504.18112	null
2025-04-20	NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models	Lawrence Liu et.al.	2504.14569	link
2025-04-19	Diffusion-based Dynamic Contract for Federated AI Agent Construction in Mobile Metaverses	Jinbo Wen et.al.	2504.14326	null
2025-04-19	A Real-time and Hardware Efficient Artfecat-free Spike Sorting Using Deep Spike Detection	Xiaoyu Jiang et.al.	2504.14279	null
2025-04-17	Enhanced Pruning Strategy for Multi-Component Neural Architectures Using Component-Aware Graph Analysis	Ganesh Sundaram et.al.	2504.13296	null
2025-04-12	Sparse Hybrid Linear-Morphological Networks	Konstantinos Fotopoulos et.al.	2504.09289	null
2025-04-08	Mosaic: Composite Projection Pruning for Resource-efficient LLMs	Bailey J. Eccles et.al.	2504.06323	null
2025-04-06	Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression	Ivan Ilin et.al.	2504.05346	link
2025-04-05	The Effects of Grouped Structural Global Pruning of Vision Transformers on Domain Generalisation	Hamza Riaz et.al.	2504.04196	null
2025-04-02	MDP: Multidimensional Vision Model Pruning with Latency Constraint	Xinglong Sun et.al.	2504.02168	null
2025-04-01	FedPaI: Achieving Extreme Sparsity in Federated Learning via Pruning at Initialization	Haonan Wang et.al.	2504.00308	null
2025-03-28	Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning	Yupei Li et.al.	2503.21419	null
2025-03-19	Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure	Fatemeh Dehrouyeh et.al.	2503.14799	link
2025-03-14	Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity	Chi Xu et.al.	2503.11164	null
2025-03-18	Týr-the-Pruner: Unlocking Accurate 50% Structural Pruning for LLMs via Global Sparsity Distribution Optimization	Guanchen Li et.al.	2503.09657	null
2025-03-08	Sample-aware Adaptive Structured Pruning for Large Language Models	Jun Kong et.al.	2503.06184	null
2025-03-07	IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining	Yixiao Li et.al.	2503.05920	null
2025-03-06	How can representation dimension dominate structurally pruned LLMs?	Mingxue Xu et.al.	2503.04377	null
2025-02-24	Delta Decompression for MoE-based LLMs Compression	Hao Gu et.al.	2502.17298	link
2025-02-23	Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression	Xiaoyi Qu et.al.	2502.16638	link
2025-03-15	Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification	Arshia Kermani et.al.	2502.16627	null
2025-02-21	PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation	Tao Fan et.al.	2502.15857	null
2025-02-21	Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing	Qi Le et.al.	2502.15618	link
2025-02-19	EvoP: Robust LLM Inference via Evolutionary Pruning	Shangyu Wu et.al.	2502.14910	null
2025-02-20	Towards Efficient Automatic Self-Pruning of Large Language Models	Weizhong Huang et.al.	2502.14413	null
2025-02-19	MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures	Jiayu Qin et.al.	2502.14008	null
2025-02-19	Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Jun Zhang et.al.	2502.13533	link
2025-02-17	An Efficient Row-Based Sparse Fine-Tuning	Cen-Jhih Li et.al.	2502.11439	null
2025-02-21	DarwinLM: Evolutionary Structured Pruning of Large Language Models	Shengkun Tang et.al.	2502.07780	link
2025-02-11	Exploring Neural Network Pruning with Screening Methods	Mingyuan Wang et.al.	2502.07189	null
2025-02-11	EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models	Xingrun Xing et.al.	2502.06663	null
2025-02-09	QP-SNN: Quantized and Pruned Spiking Neural Networks	Wenjie Wei et.al.	2502.05905	null
2025-02-09	Synergistic Effects of Knowledge Distillation and Structured Pruning for Self-Supervised Speech Models	Shiva Kumar C et.al.	2502.05837	null
2025-02-06	PGB: One-Shot Pruning for BERT via Weight Grouping and Permutation	Hyemin Lim et.al.	2502.03984	null
2025-02-05	Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training	Boyao Wang et.al.	2502.03460	null
2025-02-08	Progressive Binarization with Semi-Structured Pruning for LLMs	Xianglong Yan et.al.	2502.01705	link
2025-02-02	Structural Latency Perturbation in Large Language Models Through Recursive State Induction	Michael Mangrum et.al.	2502.00758	null
2025-02-02	CoNNect: A Swiss-Army-Knife Regularizer for Pruning of Neural Networks	Christian Franssen et.al.	2502.00744	null
2025-02-01	ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs	Hongyi Liu et.al.	2502.00258	null
2025-01-31	Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models	Jialin Zhao et.al.	2501.19090	null
2025-01-29	2SSP: A Two-Stage Framework for Structured Pruning of LLMs	Fabrizio Sandri et.al.	2501.17771	link
2025-01-28	B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning	Nikolaos Kaparinos et.al.	2501.16917	null
2025-01-25	ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning	Shangqian Gao et.al.	2501.15316	null
2025-01-25	PIP: Perturbation-based Iterative Pruning for Large Language Models	Yi Cao et.al.	2501.15278	null
2025-01-25	Lightweight and Post-Training Structured Pruning for On-Device Large Lanaguage Models	Zihuai Xu et.al.	2501.15255	null
2025-01-23	One-cycle Structured Pruning with Stability Driven Structure Search	Deepak Ghimire et.al.	2501.13439	null
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	FASP: Fast and Accurate Structured Pruning of Large Language Models	Hanyu Hu et.al.	2501.09412	null
2025-01-15	SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization	Waqwoya Abebe et.al.	2501.08504	link
2025-01-14	PolyLUT: Ultra-low Latency Polynomial Inference with Hardware-Aware Structured Pruning	Marta Andronic et.al.	2501.08043	null
2025-01-09	Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning	Laura Puccioni et.al.	2501.05248	null
2025-01-09	A 1Mb mixed-precision quantized encoder for image classification and patch-based compression	Van Thien Nguyen et.al.	2501.05097	null
2025-01-05	Efficient Deployment of Large Language Models on Resource-constrained Devices	Zhiwei Yao et.al.	2501.02438	null
2025-01-04	Optimizing Small Language Models for In-Vehicle Function-Calling	Yahya Sowti Khiabani et.al.	2501.02342	null
2025-01-07	Instruction-Following Pruning for Large Language Models	Bairu Hou et.al.	2501.02086	null
2024-12-24	SlimGPT: Layer-wise Structured Pruning for Large Language Models	Gui Ling et.al.	2412.18110	null
2024-12-23	GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference	Chao Zeng et.al.	2412.17560	null
2024-12-28	Lillama: Large Language Models Compression via Low-Rank Feature Distillation	Yaya Sy et.al.	2412.16719	null
2024-12-21	V"Mean"ba: Visual State Space Models only need 1 hidden dimension	Tien-Yu Chi et.al.	2412.16602	null
2024-12-20	Less is More: Towards Green Code Large Language Models via Unified Structural Pruning	Guang Yang et.al.	2412.15921	null
2024-12-20	All-in-One Tuning and Structural Pruning for Domain-Specific LLMs	Lei Lu et.al.	2412.14426	null
2024-12-17	Learning Coarse-to-Fine Pruning of Graph Convolutional Networks for Skeleton-based Recognition	Hichem Sahbi et.al.	2412.12887	null
2024-12-17	A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting	Nicholas Kiefer et.al.	2412.12883	null
2024-12-17	Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation	Dongyue Wu et.al.	2412.12672	link
2024-12-19	RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification	Guangwenjie Zou et.al.	2412.12603	link
2024-12-16	Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition	Hichem Sahbi et.al.	2412.11813	null
2024-12-16	QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models	Changhai Zhou et.al.	2412.11629	null
2024-12-09	LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation	Haihang Wu et.al.	2412.06419	null
2024-12-03	Effortless Efficiency: Low-Cost Pruning of Diffusion Models	Yang Zhang et.al.	2412.02852	null
2024-11-25	Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization	Abdesselam Ferdi et.al.	2411.16901	null
2024-11-21	FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers	Zehua Pei et.al.	2411.14507	null
2024-11-21	Layer Pruning with Consensus: A Triple-Win Solution	Leandro Giusti Mugnaini et.al.	2411.14345	link
2024-11-21	DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization	Hexuan Deng et.al.	2411.14055	link
2024-11-19	FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning	Qingsong Lv et.al.	2411.12781	link
2024-11-17	Electrostatic Force Regularization for Neural Structured Pruning	Abdesselam Ferdi et.al.	2411.11079	null
2024-11-15	Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems	Pedro Palacios et.al.	2411.10285	null
2024-12-16	P$^2$ Law: Scaling Law for Post-Training After Model Pruning	Xiaodong Chen et.al.	2411.10272	null
2024-11-10	RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration	Boyao Wang et.al.	2411.06463	link
2024-11-05	Layer-Adaptive State Pruning for Deep State Space Models	Minseon Gwak et.al.	2411.02824	link
2024-11-04	Automatic Structured Pruning for Efficient Architecture in Federated Learning	Thai Vu Nguyen et.al.	2411.01759	link
2024-10-31	Mutual Information Preserving Neural Network Pruning	Charles Westphal et.al.	2411.00147	null
2024-10-24	Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts	Danyal Aftab et.al.	2410.19185	null
2024-10-18	EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search	Oliver Sieberling et.al.	2410.14649	link
2024-11-04	DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models	Shangqian Gao et.al.	2410.11988	link
2024-11-12	Self-Data Distillation for Recovering Quality in Pruned Large Language Models	Vithursan Thangarasa et.al.	2410.09982	null
2024-10-11	Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients	Yan Li et.al.	2410.08457	null
2024-10-11	Chip-Tuning: Classify Before Language Models Say	Fangwei Zhu et.al.	2410.06541	link
2024-11-04	Large Language Model Compression with Neural Architecture Search	Rhea Sanjay Sukthanker et.al.	2410.06479	null
2024-09-29	Investigating the Effect of Network Pruning on Performance and Interpretability	Jonathan von Rad et.al.	2409.19727	link
2024-10-30	Search for Efficient Large Language Models	Xuan Shen et.al.	2409.17372	link
2024-09-22	SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms	Niraj Pudasaini et.al.	2409.14515	null
2024-09-20	CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information	Yuxin Wang et.al.	2409.13199	link
2024-09-17	KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models	Bo Lv et.al.	2409.11057	null
2024-09-11	HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning	Tianyi Chen et.al.	2409.09085	link
2024-09-12	Structured Pruning for Efficient Visual Place Recognition	Oliver Grainge et.al.	2409.07834	null
2024-09-10	STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning	Jaeseong Lee et.al.	2409.06211	null
2024-09-05	TropNNC: Structured Neural Network Compression Using Tropical Geometry	Konstantinos Fotopoulos et.al.	2409.03945	null
2024-09-02	Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks	Samer Francy et.al.	2409.02134	null
2024-08-27	PAT: Pruning-Aware Tuning for Large Language Models	Yijiang Liu et.al.	2408.14721	link
2024-08-15	PQV-Mobile: A Combined Pruning and Quantization Toolkit to Optimize Vision Transformers for Mobile Applications	Kshitij Bhardwaj et.al.	2408.08437	link
2024-08-13	Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models	Chenqian Yan et.al.	2408.06646	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	link
2024-08-02	Sustainable Diffusion-based Incentive Mechanism for Generative AI-driven Digital Twins in Industrial Cyber-Physical Systems	Jinbo Wen et.al.	2408.01173	null
2024-08-22	Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models	Jiang Hao et.al.	2407.21316	link
2024-07-26	Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining	Jianwei Li et.al.	2407.19126	null
2024-07-17	MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs	Junfeng Gong et.al.	2407.18267	null
2024-07-24	(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork	Tianjin Huang et.al.	2407.17412	null
2024-07-22	Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models	Aayush Saxena et.al.	2407.15904	null
2024-07-19	Shapley Pruning for Neural Network Compression	Kamil Adamczewski et.al.	2407.15875	null
2024-07-22	A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism	Yu Xue et.al.	2407.15600	null
2024-07-19	Straightforward Layer-wise Pruning for More Efficient Visual Adaptation	Ruizi Han et.al.	2407.14330	null
2024-07-18	Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset	Yi Sheng et.al.	2407.13896	null
2024-07-18	Reconstruct the Pruned Model without Any Retraining	Pingjie Wang et.al.	2407.13331	null
2024-07-18	MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets	Peng Liao et.al.	2407.13122	null
2024-07-16	MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models	Hongrong Cheng et.al.	2407.11681	null
2024-07-15	DDFAD: Dataset Distillation Framework for Audio Data	Wenbo Jiang et.al.	2407.10446	null

(back to top)

Hardware-Software Co-Design

Publish Date	Title	Authors	PDF	Code
2025-05-01	Emergent Synaptic Plasticity from Tunable Dynamics of Probabilistic Bits	Sagnik Banerjee et.al.	2505.00252	null
2025-04-30	Low latency FPGA implementation of twisted Edward curve cryptography hardware accelerator over prime field	Md Rownak Hossain et.al.	2504.21342	null
2025-04-28	Systematic Hardware Integration Testing for Smart Video-based Medical Device Prototypes	Oliver Bause et.al.	2504.19533	null
2025-04-28	From Cluster to Desktop: A Cache-Accelerated INR framework for Interactive Visualization of Tera-Scale Data	Daniel Zavorotny et.al.	2504.18001	null
2025-04-25	RapidPIV: Full Flow-Field kHz PIV for Real-Time Display and Control	Scott A. Bollt et.al.	2504.17987	null
2025-04-24	ApproXAI: Energy-Efficient Hardware Acceleration of Explainable AI using Approximate Computing	Ayesha Siddique et.al.	2504.17929	null
2025-04-24	Energy Considerations of Large Language Model Inference and Efficiency Optimizations	Jared Fernandez et.al.	2504.17674	null
2025-04-24	On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration	Maoyang Xiang et.al.	2504.17376	null
2025-04-24	Fine-Grained Fusion: The Missing Piece in Area-Efficient State Space Model Acceleration	Robin Geens et.al.	2504.17333	null
2025-04-21	SCALE-Sim v3: A modular cycle-accurate systolic accelerator simulator for end-to-end system analysis	Ritik Raj et.al.	2504.15377	null
2025-04-21	To Offload or Not To Offload: Model-driven Comparison of Edge-native and On-device Processing	Nathan Ng et.al.	2504.15162	null
2025-04-22	GainSight: Application-Guided Profiling for Composing Heterogeneous On-Chip Memories in AI Hardware Accelerators	Peijing Li et.al.	2504.14866	null
2025-04-26	vApps: Verifiable Applications at Internet Scale	Isaac Zhang et.al.	2504.14809	null
2025-04-19	FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference	Coleman Hooper et.al.	2504.14152	null
2025-04-25	HyDra: SOT-CAM Based Vector Symbolic Macro for Hyperdimensional Computing	Md Mizanur Rahaman Nayan et.al.	2504.14020	null
2025-04-18	MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework	Zhenkai Qin et.al.	2504.13574	null
2025-04-17	CardioFit: A WebGL-Based Tool for Fast and Efficient Parameterization of Cardiac Action Potential Models to Fit User-Provided Data	Darby I. Cairns et.al.	2504.13274	null
2025-04-15	A Unified Hardware Accelerator for Fast Fourier Transform and Number Theoretic Transform	Rishabh Shrivastava et.al.	2504.11124	null
2025-04-14	Adaptive Synaptogenesis Implemented on a Nanomagnetic Platform	Faiyaz Elahi Mullick et.al.	2504.10767	null
2025-04-14	FPGA-Optimized Hardware Accelerator for Fast Fourier Transform and Singular Value Decomposition in AI	Hong Ding et.al.	2504.10411	null
2025-04-14	Carbon-Efficient 3D DNN Acceleration: Optimizing Performance and Sustainability	Aikaterini Maria Panteleaki et.al.	2504.09851	null
2025-04-11	ML For Hardware Design Interpretability: Challenges and Opportunities	Raymond Baartmans et.al.	2504.08852	null
2025-04-11	TensorNEAT: A GPU-accelerated Library for NeuroEvolution of Augmenting Topologies	Lishuang Wang et.al.	2504.08339	link
2025-04-14	Improving Multiresource Job Scheduling with Markovian Service Rate Policies	Zhongrui Chen et.al.	2504.08094	link
2025-04-20	Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks	Erin Carson et.al.	2504.07835	link
2025-04-09	Rapid inference and comparison of gravitational-wave population models with neural variational posteriors	Matthew Mould et.al.	2504.07197	null
2025-04-08	Accelerating Hybrid XOR-CNF SAT Problems Natively with In-Memory Computing	Haesol Im et.al.	2504.06476	null
2025-04-08	FETTA: Flexible and Efficient Hardware Accelerator for Tensorized Neural Network Training	Jinming Lu et.al.	2504.06474	null
2025-04-06	Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression	Ivan Ilin et.al.	2504.05346	link
2025-04-07	3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization	Isha Sharma et.al.	2504.04857	null
2025-04-07	A High-Performance Curve25519 and Curve448 Unified Elliptic Curve Cryptography Accelerator	Aniket Banerjee et.al.	2504.04731	null
2025-04-06	pc-COP: An Efficient and Configurable 2048-p-Bit Fully-Connected Probabilistic Computing Accelerator for Combinatorial Optimization	Kiran Magar et.al.	2504.04543	null
2025-04-04	Efficient FPGA-accelerated Convolutional Neural Networks for Cloud Detection on CubeSats	Angela Cratere et.al.	2504.03891	null
2025-04-01	Enhancing Biologically Inspired Hierarchical Temporal Memory with Hardware-Accelerated Reflex Memory	Pavia Bera et.al.	2504.03746	null
2025-03-31	PIM-LLM: A High-Throughput Hybrid PIM Architecture for 1-bit LLMs	Jinendra Malekar et.al.	2504.01994	null
2025-04-01	SCRec: A Scalable Computational Storage System with Statistical Sharding and Tensor-train Decomposition for Recommendation Models	Jinho Yang et.al.	2504.00520	null
2025-03-31	Single-Shot Matrix-Matrix Multiplication Optical Tensor Processor for Deep Learning	Chao Luan et.al.	2503.24356	null
2025-03-30	FlexMem: High-Parallel Near-Memory Architecture for Flexible Dataflow in Fully Homomorphic Encryption	Shangyi Shi et.al.	2503.23496	null
2025-04-03	Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models	Hung-Yueh Chiang et.al.	2503.22879	link
2025-03-31	Residual-based Chebyshev filtered subspace iteration for sparse Hermitian eigenvalue problems tolerant to inexact matrix-vector products	Nikhil Kodali et.al.	2503.22652	null
2025-03-27	An Efficient Training Algorithm for Models with Block-wise Sparsity	Ding Zhu et.al.	2503.21928	null
2025-03-27	Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization	Zhenyu Liang et.al.	2503.20286	link
2025-03-26	VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers	Ching-Yao Chen et.al.	2503.20246	null
2025-03-25	Hardware Efficient Accelerator for Spiking Transformer With Reconfigurable Parallel Time Step Computing	Bo-Yu Chen et.al.	2503.19643	null
2025-03-25	An Efficient Data Reuse with Tile-Based Adaptive Stationary for Transformer Accelerators	Tseng-Jen Li et.al.	2503.19640	null
2025-03-23	Reliable Replication Protocols on SmartNICs	M. R. Siavash Katebzadeh et.al.	2503.18093	null
2025-03-21	Hardware Acceleration for HPS Algorithms in Two and Three Dimensions	Owen Melia et.al.	2503.17535	null
2025-03-20	QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge	Xuan Shen et.al.	2503.16709	null
2025-03-20	Accelerating Transformer Inference and Training with 2:4 Activation Sparsity	Daniel Haziza et.al.	2503.16672	null
2025-03-20	Explainable AI-Guided Efficient Approximate DNN Generation for Multi-Pod Systolic Arrays	Ayesha Siddique et.al.	2503.16583	null
2025-03-19	QEA: An Accelerator for Quantum Circuit Simulation with Resources Efficiency and Flexibility	Van Duy Tran et.al.	2503.14951	link
2025-03-17	Performance Analysis and Industry Deployment of Post-Quantum Cryptography Algorithms	Elif Dicle Demir et.al.	2503.12952	null
2025-03-12	EDEA: Efficient Dual-Engine Accelerator for Depthwise Separable Convolution with Direct Data Transfer	Yi Chen et.al.	2503.11707	null
2025-03-13	Bridging Machine Learning and Cosmological Simulations: Using Neural Operators to emulate Chemical Evolution	Pelle van de Bor et.al.	2503.10736	null
2025-03-12	Hardware.jl - An MLIR-based Julia HLS Flow (Work in Progress)	Benedict Short et.al.	2503.09463	null
2025-03-11	SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting	Shuaiting Li et.al.	2503.08668	null
2025-03-11	V-Max: Making RL practical for Autonomous Driving	Valentin Charraut et.al.	2503.08388	link
2025-03-10	Hardware acceleration for next-to-leading order event generation within MadGraph5_aMC@NLO	Zenny Wettersten et.al.	2503.07439	null
2025-03-09	Hardware-Accelerated Event-Graph Neural Networks for Low-Latency Time-Series Classification on SoC FPGA	Hiroshi Nakano et.al.	2503.06629	null
2025-03-17	Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models	Xubin Wang et.al.	2503.06027	null
2025-03-06	FORTALESA: Fault-Tolerant Reconfigurable Systolic Array for DNN Inference	Natalia Cherezova et.al.	2503.04426	null
2025-03-06	DiRe-JAX: A JAX based Dimensionality Reduction Algorithm for Large-scale Data	Alexander Kolpakov et.al.	2503.03156	link
2025-02-26	Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies	Shaibal Saha et.al.	2503.02891	null
2025-03-04	TFHE-SBC: Software Designs for Fully Homomorphic Encryption over the Torus on Single Board Computers	Marin Matsumoto et.al.	2503.02559	null
2025-03-04	POPGym Arcade: Parallel Pixelated POMDPs	Zekang Wang et.al.	2503.01450	link
2025-02-28	Supporting the development of Machine Learning for fundamental science in a federated Cloud with the AI_INFN platform	Lucio Anderlini et.al.	2502.21266	null
2025-03-07	GreenDFL: a Framework for Assessing the Sustainability of Decentralized Federated Learning Systems	Chao Feng et.al.	2502.20242	null
2025-02-24	Evaluating IOMMU-Based Shared Virtual Addressing for RISC-V Embedded Heterogeneous SoCs	Cyril Koenig et.al.	2502.17398	link
2025-02-24	APINT: A Full-Stack Framework for Acceleration of Privacy-Preserving Inference of Transformers based on Garbled Circuits	Hyunjun Cho et.al.	2502.16877	null
2025-02-22	A Hybrid Neural Network for High-Throughput Attosecond Resolution Single-shot X-ray Pulse Characterization	Jack Hirschman et.al.	2502.16141	null
2025-02-20	Micro Blossom: Accelerated Minimum-Weight Perfect Matching Decoding for Quantum Error Correction	Yue Wu et.al.	2502.14787	null
2025-02-18	RTPD: Penetration Depth calculation using Hardware accelerated Ray-Tracing	YoungWoo Kim et.al.	2502.12463	null
2025-02-20	TherAIssist: Assisting Art Therapy Homework and Client-Practitioner Collaboration through Human-AI Interaction	Di Liu et.al.	2502.12443	null
2025-02-17	Gem5-AcceSys: Enabling System-Level Exploration of Standard Interconnects for Novel Accelerators	Qunyou Liu et.al.	2502.12273	null
2025-02-17	SFTs: a scalable data-analysis framework for long-duration gravitational-wave signals	Rodrigo Tenorio et.al.	2502.11823	null
2025-02-15	Pushing up to the Limit of Memory Bandwidth and Capacity Utilization for Efficient LLM Decoding on Embedded FPGA	Jindong Li et.al.	2502.10659	null
2025-02-13	Recipe: Hardware-Accelerated Replication Protocols	Dimitra Giantsidi et.al.	2502.09251	null
2025-02-12	Scalable Thermodynamic Second-order Optimization	Kaelan Donatella et.al.	2502.08603	null
2025-02-10	Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs	Tousif Rahman et.al.	2502.07823	null
2025-02-10	Accelerating Berends-Giele recursion for gluons in arbitrary dimensions over finite fields	Juan M. Cruz-Martinez et.al.	2502.07060	link
2025-02-07	TNIC: A Trusted NIC Architecture	Dimitra Giantsidi et.al.	2502.05338	null
2025-02-07	Gaussian Models to Non-Gaussian Realms of Quantum Photonic Simulators	Dennis Delali Kwesi Wayo et.al.	2502.05245	null
2025-02-04	SpinGlassPEPS.jl: Tensor-network package for Ising-like optimization on quasi-two-dimensional graphs	Tomasz Śmierzchalski et.al.	2502.02317	null
2025-02-01	Life-Cycle Emissions of AI Hardware: A Cradle-To-Grave Approach and Generational Trends	Ian Schneider et.al.	2502.01671	null
2025-02-01	A Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression	Shupeng Ning et.al.	2502.01670	null
2025-02-02	A Flexible Precision Scaling Deep Neural Network Accelerator with Efficient Weight Combination	Liang Zhao et.al.	2502.00687	null
2025-02-01	Late Breaking Results: Leveraging Approximate Computing for Carbon-Aware DNN Accelerators	Aikaterini Maria Panteleaki et.al.	2502.00286	null
2025-01-31	StruM: Structured Mixed Precision for Efficient Deep Learning Hardware Codesign	Michael Wu et.al.	2501.18953	null
2025-01-30	REDACTOR: eFPGA Redaction for DNN Accelerator Security	Yazan Baddour et.al.	2501.18740	link
2025-01-30	FLASH-FHE: A Heterogeneous Architecture for Fully Homomorphic Encryption Acceleration	Junxue Zhang et.al.	2501.18371	null
2025-01-24	HWPQ: Hessian-free Weight Pruning-Quantization For LLM Compression And Acceleration	Yuhan Kang et.al.	2501.16376	null
2025-01-24	Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel	Zhuoran Liu et.al.	2501.14512	null
2025-02-02	SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility	Guobin Shen et.al.	2501.14484	null
2025-01-22	Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators	Filip Masar et.al.	2501.12818	link
2025-01-22	HEPPO: Hardware-Efficient Proximal Policy Optimization -- A Universal Pipelined Architecture for Generalized Advantage Estimation	Hazem Taha et.al.	2501.12703	null
2025-01-20	Hybrid Photonic-digital Accelerator for Attention Mechanism	Huize Li et.al.	2501.11286	null
2025-01-20	Ditto: Accelerating Diffusion Model via Temporal Value Similarity	Sungbin Kim et.al.	2501.11211	null
2025-01-18	LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator	Guoyu Li et.al.	2501.10658	null
2025-01-17	Optimizing Structured-Sparse Matrix Multiplication in RISC-V Vector Processors	Vasileios Titopoulos et.al.	2501.10189	null
2025-01-17	AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations	Jamin Seo et.al.	2501.09954	link
2025-01-15	RouteNet-Gauss: Hardware-Enhanced Network Modeling with Machine Learning	Carlos Güemes-Palau et.al.	2501.08848	null
2025-01-15	Detecting Wildfire Flame and Smoke through Edge Computing using Transfer Learning Enhanced Deep Learning Models	Giovanny Vazquez et.al.	2501.08639	null
2025-01-14	An Efficient Sparse Hardware Accelerator for Spike-Driven Transformer	Zhengke Li et.al.	2501.07825	null
2025-01-13	fastrerandomize: An R Package for Fast Rerandomization Using Accelerated Computing	Rebecca Goldstein et.al.	2501.07642	link
2025-01-12	Turing-Completeness and Undecidability in Coupled Nonlinear Optical Resonators	Gordon Li et.al.	2501.06966	null
2025-01-10	Axon: A novel systolic array architecture for improved run time and energy efficient GeMM and Conv operation with on-chip im2col	Md Mizanur Rahaman Nayan et.al.	2501.06043	null
2025-01-10	EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration	Zhifan Song et.al.	2501.05885	link
2025-01-16	TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response Scenarios	Daniel Rossi et.al.	2501.05880	link
2025-01-09	JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration	Mingzi Wang et.al.	2501.05339	null
2025-01-08	IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX	Erik Recio-Armengol et.al.	2501.04776	link
2025-01-08	Probabilistic Greedy Algorithm Solver Using Magnetic Tunneling Junctions for Traveling Salesman Problem	Ran Zhang et.al.	2501.04447	null
2025-01-04	Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies	Xubin Wang et.al.	2501.03265	link
2025-01-04	Optimizing Small Language Models for In-Vehicle Function-Calling	Yahya Sowti Khiabani et.al.	2501.02342	null
2025-01-03	DSLR-CNN: Efficient CNN Acceleration using Digit-Serial Left-to-Right Arithmetic	Malik Zohaib Nisar et.al.	2501.01737	null
2025-01-02	Harnessing Hardware Acceleration in High-Energy Physics through High-Level Synthesis Techniques	Pelayo Leguina López et.al.	2501.01338	null
2024-12-30	DeepLL: Considering Linear Logic for the Analysis of Deep Learning Experiments	Nick Papoulias et.al.	2501.00169	null
2024-12-29	A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier	Amit Sarkar et.al.	2412.20393	null
2024-12-29	Open-Source Heterogeneous SoCs for AI: The PULP Platform Experience	Francesco Conti et.al.	2412.20391	null
2024-12-27	HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models	Ze Yang et.al.	2412.19925	null
2024-12-26	Evolution, Challenges, and Optimization in Computer Architecture: The Role of Reconfigurable Systems	Jefferson Ederhion et.al.	2412.19234	null
2024-12-24	GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks	Christodoulos Peltekis et.al.	2412.18534	null
2024-12-23	Advantages of density in tensor network geometries for gradient based training	Sergi Masot-Llima et.al.	2412.17497	null
2024-12-20	Chorba: A novel CRC32 implementation	Sam Russell et.al.	2412.16398	null
2024-12-20	Designing Visual Explanations and Learner Controls to Engage Adolescents in AI-Supported Exercise Selection	Jeroen Ooge et.al.	2412.16034	null
2024-12-20	A survey on FPGA-based accelerator for ML models	Feng Yan et.al.	2412.15666	null
2024-12-19	LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation	Chenxu Zhou et.al.	2412.15199	null
2024-12-18	Pattern Matching in AI Compilers and its Formalization (Extended Version)	Joseph W. Cutler et.al.	2412.13398	null
2024-12-17	if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs	Shahzad Ahmad Butt et.al.	2412.12481	null
2024-12-13	Strong Structural Bounds for MaxSAT: The Fine Details of Using Neuromorphic and Quantum Hardware Accelerators	Max Bannach et.al.	2412.10289	null
2024-12-16	MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization	Shuaiting Li et.al.	2412.10261	null
2024-12-12	MPAX: Mathematical Programming in JAX	Haihao Lu et.al.	2412.09734	link
2024-12-12	Evaluating the Potential of In-Memory Processing to Accelerate Homomorphic Encryption	Mpoki Mwaisela et.al.	2412.09144	null
2024-12-12	Analyzing Practical Policies for Multiresource Job Scheduling	Zhongrui Chen et.al.	2412.08915	null
2024-12-09	LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation	Haihang Wu et.al.	2412.06419	null
2024-12-03	Demonstrating the Advantages of Analog Wafer-Scale Neuromorphic Hardware	Hartmut Schmidt et.al.	2412.02619	null
2024-12-03	Multi-timescale synaptic plasticity on analog neuromorphic hardware	Amani Atoui et.al.	2412.02515	null
2024-11-27	Deterministic and Probabilistic Rounding Error Analysis for Mixed-Precision Arithmetic on Modern Computing Units	Sahil Bhola et.al.	2411.18747	null
2024-11-26	Scalable iterative pruning of large language and vision models using block coordinate descent	Gili Rosenberg et.al.	2411.17796	null
2024-11-25	Limitations of tensor network approaches for optimization and sampling: A comparison against quantum and classical Ising machines	Anna Maria Dziubyna et.al.	2411.16431	link
2024-11-25	MixPE: Quantization and Hardware Co-design for Efficient LLM Inference	Yu Zhang et.al.	2411.16158	null
2024-11-20	Hardware Accelerators for Artificial Intelligence	S M Mojahidul Ahsan et.al.	2411.13717	null
2024-11-20	Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training	Jared Fernandez et.al.	2411.13055	null
2024-11-19	FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning	Qingsong Lv et.al.	2411.12781	link
2024-11-19	Design of an FPGA-Based Neutral Atom Rearrangement Accelerator for Quantum Computing	Xiaorang Guo et.al.	2411.12401	null
2024-11-18	SILVIA: Automated Superword-Level Parallelism Exploitation via HLS-Specific LLVM Passes for Compute-Intensive FPGA Accelerators	Giovanni Brignone et.al.	2411.11384	link
2024-12-01	InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma	Xiaoxuan Hou et.al.	2411.09856	link
2024-11-21	OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling	Xiaoling Yi et.al.	2411.09543	link
2024-11-15	Communication Compression for Tensor Parallel LLM Inference	Jan Hansen-Palmus et.al.	2411.09510	null
2024-11-18	RPCAcc: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator	Jie Zhang et.al.	2411.07632	null
2024-11-11	Spiking Transformer Hardware Accelerators in 3D Integration	Boxun Xu et.al.	2411.07397	null
2024-11-10	AMAZE: Accelerated MiMC Hardware Architecture for Zero-Knowledge Applications on the Edge	Anees Ahmed et.al.	2411.06350	link
2024-11-03	Stochastic Communication Avoidance for Recommendation Systems	Lutfi Eren Erdogan et.al.	2411.01611	null
2024-11-01	Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks	David A. Danhofer et.al.	2411.00288	null
2024-10-31	LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators	Krishna Teja Chitty-Venkata et.al.	2411.00136	link
2024-10-30	Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks	Michael Matthews et.al.	2410.23208	link
2024-10-24	Watermarking Large Language Models and the Generated Content: Opportunities and Challenges	Ruisi Zhang et.al.	2410.19096	null
2024-10-21	Hacking the Fabric: Targeting Partial Reconfiguration for Fault Injection in FPGA Fabrics	Jayeeta Chaudhuri et.al.	2410.16497	null
2024-10-21	Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure	Juan Cruz-Martinez et.al.	2410.16248	null
2024-10-20	A Remedy to Compute-in-Memory with Dynamic Random Access Memory: 1FeFET-1C Technology for Neuro-Symbolic AI	Xunzhao Yin et.al.	2410.15296	null
2024-10-18	Self-Satisfied: An end-to-end framework for SAT generation and prediction	Christopher R. Serrano et.al.	2410.14888	null
2024-10-17	Quamba: A Post-Training Quantization Recipe for Selective State Space Models	Hung-Yueh Chiang et.al.	2410.13229	link
2024-10-16	Mixed-precision finite element kernels and assembly: Rounding error analysis and hardware acceleration	M. Croci et.al.	2410.12614	link
2024-10-15	Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination	Arturo Salmi et.al.	2410.11625	null
2024-10-15	Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy	Shuntaro Takahashi et.al.	2410.11553	link
2024-10-14	Differentiable Weightless Neural Networks	Alan T. L. Bacellar et.al.	2410.11112	link
2024-10-14	SLaNC: Static LayerNorm Calibration	Mahsa Salmani et.al.	2410.10553	null
2024-10-11	MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices	Mohamed Amine Hamdi et.al.	2410.08855	link
2024-10-09	Optimized Spatial Architecture Mapping Flow for Transformer Accelerators	Haocheng Xu et.al.	2410.07407	null
2024-10-09	Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing	Ismail Erbas et.al.	2410.07364	null
2024-10-03	CAX: Cellular Automata Accelerated in JAX	Maxence Faldor et.al.	2410.02651	link
2024-10-03	Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition	Hansheng Wang et.al.	2410.02170	null
2024-10-01	Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging	Ismail Erbas et.al.	2410.00948	null
2024-09-26	Leader Selection and Follower Association for UE-centric Distributed Learning in Future Wireless Networks	Saeedeh Parsaeefard et.al.	2409.18268	null
2024-09-26	A 5T-2MTJ STT-assisted Spin Orbit Torque based Ternary Content Addressable Memory for Hardware Accelerators	Siri Narla et.al.	2409.17863	null
2024-09-24	Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware	Luca Scomparin et.al.	2409.16177	null
2024-09-25	Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA	Lorenzo Borella et.al.	2409.16075	null
2024-09-19	Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention	Rengan Xu et.al.	2409.15373	null
2024-09-23	Efficient Tabular Data Preprocessing of ML Pipelines	Yu Zhu et.al.	2409.14912	null
2024-09-21	FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs	Ehsan Kabir et.al.	2409.14023	null
2024-09-21	ProTEA: Programmable Transformer Encoder Acceleration on FPGA	Ehsan Kabir et.al.	2409.13975	null
2024-09-23	Towards Efficient Neuro-Symbolic AI: From Workload Characterization to Hardware Architecture	Zishen Wan et.al.	2409.13153	null
2024-09-20	Learning to Compare Hardware Designs for High-Level Synthesis	Yunsheng Bai et.al.	2409.13138	null
2024-09-19	Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAML	Chelsea Maria John et.al.	2409.12994	link
2024-09-19	CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications	Vladimir Frolov et.al.	2409.12617	null
2024-09-15	Pack my weights and run! Minimizing overheads for in-memory computing accelerators	Pouya Houshmand et.al.	2409.11437	null
2024-09-11	Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-design	Tathagata Srimani et.al.	2409.11422	null
2024-09-09	Hardware Acceleration of Kolmogorov-Arnold Network (KAN) for Lightweight Edge Inference	Wei-Hsing Huang et.al.	2409.11418	null
2024-09-17	Dynamic Range Reduction via Branch-and-Bound	Thore Gerlach et.al.	2409.10863	null
2024-09-16	Count2Multiply: Reliable In-memory High-Radix Counting	João Paulo Cardoso de Lima et.al.	2409.10136	null
2024-09-16	Hardware-Accelerated Ray Tracing for Discrete and Continuous Collision Detection on GPUs	Sizhe Sui et.al.	2409.09918	null
2024-09-13	Distributed Binary Optimization with In-Memory Computing: An Application for the SAT Problem	Xiangyi Zhang et.al.	2409.09152	null
2024-09-13	Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators	Konstantin Lübeck et.al.	2409.08595	null
2024-09-17	Foragax: An Agent-Based Modelling Framework Based on JAX	Siddharth Chaturvedi et.al.	2409.06345	link
2024-09-10	PIM-MMU: A Memory Management Unit for Accelerating Data Transfers in Commercial PIM Systems	Dongjae Lee et.al.	2409.06204	null
2024-09-06	Towards Narrowing the Generalization Gap in Deep Boolean Networks	Youngsung Kim et.al.	2409.05905	null
2024-09-09	Supervised Learning for Stochastic Optimal Control	Vince Kurtz et.al.	2409.05792	null
2024-09-08	BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration	Yuzong Chen et.al.	2409.05227	link
2024-09-05	Libra: Architectural Support For Principled, Secure And Efficient Balanced Execution On High-End Processors (Extended Version)	Hans Winderix et.al.	2409.03743	null
2024-09-05	Hardware Acceleration of LLMs: A comprehensive survey and comparison	Nikoletta Koilia et.al.	2409.03384	null
2024-09-05	Towards training digitally-tied analog blocks via hybrid gradient computation	Timothy Nest et.al.	2409.03306	null
2024-08-30	The picasso gas model: Painting intracluster gas on gravity-only simulations	F. Kéruzoré et.al.	2408.17445	link
2024-08-29	Serial and Parallel Two-Column Probing for Mixed-Integer Programming	Yongzheng Dai et.al.	2408.16927	link
2024-08-29	On-device AI: Quantization-aware Training of Transformers in Time-Series	Tianheng Ling et.al.	2408.16495	null
2024-08-29	Accelerating Image-based Pest Detection on a Heterogeneous Multi-core Microcontroller	Luca Bompani et.al.	2408.15911	link
2024-08-28	FireFly-S: Exploiting Dual-Side Sparsity for Spiking Neural Networks Acceleration with Reconfigurable Spatial Architecture	Tenglong Li et.al.	2408.15578	null
2024-08-29	CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific Edge Computing	G Abarajithan et.al.	2408.15561	null
2024-08-27	SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search	Hung-Yueh Chiang et.al.	2408.15395	null
2024-08-27	SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration	Runzhen Xue et.al.	2408.15089	null
2024-08-26	On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise	M. Reza Eslami et.al.	2408.14680	null
2024-08-26	HAPM -- Hardware Aware Pruning Method for CNN hardware accelerators in resource constrained devices	Federico Nicolas Peccia et.al.	2408.14055	null
2024-08-22	Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments	Maciej Besta et.al.	2408.12173	null
2024-08-21	Floating-Point Multiply-Add with Approximate Normalization for Low-Cost Matrix Engines	Kosmas Alexandridis et.al.	2408.11997	null
2024-08-21	Cage: Hardware-Accelerated Safe WebAssembly	Martin Fink et.al.	2408.11456	null
2024-08-20	Tapping in a Remote Vehicle's onboard LLM to Complement the Ego Vehicle's Field-of-View	Malsha Ashani Mahawatta Dona et.al.	2408.10794	null
2024-08-16	Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers	Zihang Song et.al.	2408.08794	null
2024-08-16	Cross-Chip Partial Reconfiguration for the Initialisation of Modular and Scalable Heterogeneous Systems	Marvin Fuchs et.al.	2408.08626	null
2024-08-13	HLSPilot: LLM-based High-Level Synthesis	Chenwei Xiong et.al.	2408.06810	link
2024-08-12	Hardware Architecture Design of Model-Based Image Reconstruction Towards Palm-size Photoacoustic Tomography	Yuwei Zheng et.al.	2408.06049	null
2024-08-12	SZKP: A Scalable Accelerator Architecture for Zero-Knowledge Proofs	Alhad Daftardar et.al.	2408.05890	null
2024-08-10	LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale	Jaehong Cho et.al.	2408.05499	link
2024-08-08	Noise-augmented Chaotic Ising Machines for Combinatorial Optimization and Sampling	Kyle Lee et.al.	2408.04744	null
2024-08-07	Hardware-Assisted Virtualization of Neural Processing Units for Cloud Platforms	Yuqi Xue et.al.	2408.04104	null
2024-08-07	Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration	Zhongyao Luo et.al.	2408.03647	link
2024-08-06	LLM-Aided Compilation for Tensor Accelerators	Charles Hong et.al.	2408.03408	null
2024-08-06	HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration	Pratyush Dhingra et.al.	2408.03397	null
2024-08-05	PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy	Rachmad Vidya Wicaksana Putra et.al.	2408.02412	null
2024-08-02	Digitized Phase Change Material Heterostack for Diffractive Optical Neural Network	Ruiyang Chen et.al.	2408.01404	null
2024-08-02	Search-in-Memory (SiM): Reliable, Versatile, and Efficient Data Matching in SSD's NAND Flash Memory Chip for Data Indexing Acceleration	Yun-Chih Chen et.al.	2408.00327	null
2024-08-07	Temporal Feature Matters: A Framework for Diffusion Model Quantization	Yushi Huang et.al.	2407.19547	null
2024-07-16	Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)	Seyed Nima Omidsajedi et.al.	2407.18264	null
2024-07-22	KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer	Aness Al-Qawlaq et.al.	2407.16026	null
2024-07-18	Integrated Hardware Architecture and Device Placement Search	Irene Wang et.al.	2407.13143	link
2024-07-17	ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks	Salma Afifi et.al.	2407.12638	null
2024-07-17	StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators	Ethan G Rogers et.al.	2407.12378	null
2024-07-16	Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment	Yuhao Ji et.al.	2407.12070	null
2024-07-16	Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads	Aritra Dhar et.al.	2407.11888	null
2024-07-15	Hierarchical search method for gravitational waves from stellar-mass binary black holes in noisy space-based detector data	Yao Fu et.al.	2407.10797	null
2024-07-14	Accelerator-as-a-Service in Public Clouds: An Intra-Host Traffic Management View for Performance Isolation in the Wild	Jiechen Zhao et.al.	2407.10098	null
2024-07-12	68-Channel Highly-Integrated Neural Signal Processing PSoC with On-Chip Feature Extraction, Compression, and Hardware Accelerators for Neuroprosthetics in 22nm FDSOI	Liyuan Guo et.al.	2407.09166	null
2024-07-12	Hybrid Temporal Computing for Lower Power Hardware Accelerators	Maliha Tasnim et.al.	2407.08975	null

TinyML

Publish Date	Title	Authors	PDF	Code
2025-05-01	Large Language Models as AI Agents for Digital Atoms and Molecules: Catalyzing a New Era in Computational Biophysics	Yijie Xia et.al.	2505.00270	null
2025-04-30	Smart Environmental Monitoring of Marine Pollution using Edge AI	Mohamed Moursi et.al.	2504.21759	null
2025-04-27	Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality	Majid Behravan et.al.	2504.21033	null
2025-04-29	DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs	Hao Luan et.al.	2504.20754	null
2025-04-29	CarbonCall: Sustainability-Aware Function Calling for Large Language Models on Edge Devices	Varatheepan Paramanayakam et.al.	2504.20348	null
2025-04-27	Personalized Artificial General Intelligence (AGI) via Neuroscience-Inspired Continuous Learning Systems	Rajeev Gupta et.al.	2504.20109	null
2025-04-28	Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs	Muhammad Sabih et.al.	2504.19659	null
2025-04-22	TinyML for Speech Recognition	Andrew Barovic et.al.	2504.16213	null
2025-04-21	Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT	Stanley Mugisha et.al.	2504.16128	null
2025-04-23	SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems	Abhishek Tyagi et.al.	2504.15305	null
2025-04-21	Time-Series Analysis on Edge-AI Hardware for Healthcare Monitoring	Jinhai Hu et.al.	2504.15178	null
2025-04-20	Explainability for Embedding AI: Aspirations and Actuality	Thomas Weber et.al.	2504.14631	null
2025-04-03	Edge Intelligence for Wildlife Conservation: Real-Time Hornbill Call Classification Using TinyML	Kong Ka Hing et.al.	2504.12272	null
2025-04-19	MultiCore+TPU Accelerated Multi-Modal TinyML for Livestock Behaviour Recognition	Qianxue Zhang et.al.	2504.11467	null
2025-04-14	VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification	Lucas Heublein et.al.	2504.10556	null
2025-04-13	Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?	Christophe El Zeinaty et.al.	2504.09685	null
2025-04-20	MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications	Aashaka Shah et.al.	2504.09014	link
2025-04-11	Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices	Shengyuan Ye et.al.	2504.08242	null
2025-04-09	Neural Signal Compression using RAMAN tinyML Accelerator for BCI Applications	Adithya Krishna et.al.	2504.06996	null
2025-04-08	Enhanced Anomaly Detection for Capsule Endoscopy Using Ensemble Learning Strategies	Julia Werner et.al.	2504.06039	null
2025-04-03	Advancing Air Quality Monitoring: TinyML-Based Real-Time Ozone Prediction with Cost-Effective Edge Devices	Huam Ming Ken et.al.	2504.03776	null
2025-04-02	Efficient Calibration for RRAM-based In-Memory Computing using DoRA	Weirong Dong et.al.	2504.03763	null
2025-04-04	Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency	Erik Johannes Husom et.al.	2504.03360	null
2025-04-02	Satellite Edge Artificial Intelligence with Large Models: Architectures and Technologies	Yuanming Shi et.al.	2504.01676	null
2025-04-02	HH-PIM: Dynamic Optimization of Power and Performance with Heterogeneous-Hybrid PIM for Edge AI Devices	Sangmin Jeon et.al.	2504.01468	null
2025-04-01	Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems	Rachmad Vidya Wicaksana Putra et.al.	2504.00957	null
2025-04-01	IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval	Bangwei Liu et.al.	2504.00954	null
2025-04-01	QSViT: A Methodology for Quantizing Spiking Vision Transformers	Rachmad Vidya Wicaksana Putra et.al.	2504.00948	null
2025-03-19	Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI	Jianyi Zhang et.al.	2503.18958	null
2025-03-12	Intanify AI Platform: Embedded AI for Automated IP Audit and Due Diligence	Viktor Dorfler et.al.	2503.17374	null
2025-03-21	Replay4NCL: An Efficient Memory Replay-based Methodology for Neuromorphic Continual Learning in Embedded AI Systems	Mishal Fatima Minhas et.al.	2503.17061	null
2025-03-21	On-Sensor Convolutional Neural Networks with Early-Exits	Hazem Hesham Yousef Shalby et.al.	2503.16939	null
2025-03-20	Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions	Hadi Amini et.al.	2503.16585	link
2025-03-19	Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure	Fatemeh Dehrouyeh et.al.	2503.14799	link
2025-03-17	Semantic-Relevance Based Sensor Selection for Edge-AI Empowered Sensing Systems	Zhiyan Liu et.al.	2503.12785	null
2025-03-15	End-to-End Edge AI Service Provisioning Framework in 6G ORAN	Yun Tang et.al.	2503.11933	null
2025-03-04	CORDIC Is All You Need	Omkar Kokane et.al.	2503.11685	null
2025-03-12	BioSpark: Beyond Analogical Inspiration to LLM-augmented Transfer	Hyeonsu Kang et.al.	2503.09838	null
2025-03-19	Edge AI for Real-time Fetal Assessment in Rural Guatemala	Nasim Katebi et.al.	2503.09659	null
2025-03-12	Edge AI-Powered Real-Time Decision-Making for Autonomous Vehicles in Adverse Weather Conditions	Milad Rahmati et.al.	2503.09638	null
2025-03-12	Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks	Idris Zakariyya et.al.	2503.08973	null
2025-03-07	SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs	Jaewoo Song et.al.	2503.07657	null
2025-03-07	Compliance of AI Systems	Julius Schöning et.al.	2503.05571	null
2025-03-06	Dynamic Pricing for On-Demand DNN Inference in the Edge-AI Market	Songyuan Li et.al.	2503.04521	null
2025-03-03	Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective	Rakshit Aralimatti et.al.	2503.01933	null
2025-03-03	Dendron: Enhancing Human Activity Recognition with On-Device TinyML Learning	Hazem Hesham Yousef Shalby et.al.	2503.01353	null
2025-03-05	Regularization-based Framework for Quantization-, Fault- and Variability-Aware Training	Anmol Biswas et.al.	2503.01297	null
2025-02-28	Transforming Cyber Defense: Harnessing Agentic and Frontier AI for Proactive, Ethical Threat Intelligence	Krti Tallam et.al.	2503.00164	null
2025-02-26	AI and Semantic Communication for Infrastructure Monitoring in 6G-Driven Drone Swarms	Tasnim Ahmed et.al.	2503.00053	null
2025-02-25	On-device edge learning for IoT data streams: a survey	Afonso Lourenço et.al.	2502.17788	null
2025-02-22	A Hybrid Neural Network for High-Throughput Attosecond Resolution Single-shot X-ray Pulse Characterization	Jack Hirschman et.al.	2502.16141	null
2025-02-19	Qwen2.5-VL Technical Report	Shuai Bai et.al.	2502.13923	null
2025-02-19	AnDB: Breaking Boundaries with an AI-Native Database for Universal Semantic Analysis	Tianqing Wang et.al.	2502.13805	link
2025-02-19	Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency	Jiangrong Shen et.al.	2502.13572	null
2025-02-18	Fast Data Aware Neural Architecture Search via Supernet Accelerated Evaluation	Emil Njor et.al.	2502.12690	null
2025-02-13	nanoML for Human Activity Recognition	Alan T. L. Bacellar et.al.	2502.12173	null
2025-02-17	InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems	Habib Larian et.al.	2502.11644	link
2025-02-17	Biases in Edge Language Models: Detection, Analysis, and Mitigation	Vinamra Sharma et.al.	2502.11349	null
2025-02-14	A Hybrid Edge Classifier: Combining TinyML-Optimised CNN with RRAM-CMOS ACAM for Energy-Efficient Inference	Kieran Woodward et.al.	2502.10089	null
2025-02-13	SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest	Jack Erhardt et.al.	2502.09528	null
2025-02-10	Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs	Tousif Rahman et.al.	2502.07823	null
2025-02-18	XAMBA: Enabling Efficient State Space Models on Resource-Constrained Neural Processing Units	Arghadip Das et.al.	2502.06924	link
2025-02-08	ETHEREAL: Energy-efficient and High-throughput Inference using Compressed Tsetlin Machine	Shengyu Duan et.al.	2502.05640	null
2025-02-07	Demonstrating CavePI: Autonomous Exploration of Underwater Caves by Semantic Guidance	Alankrit Gupta et.al.	2502.05384	null
2025-02-08	Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models	Haoran Ye et.al.	2502.02444	null
2025-02-03	EdgeMark: An Automation and Benchmarking System for Embedded Artificial Intelligence Tools	Mohammad Amin Hasanpour et.al.	2502.01700	null
2025-02-01	Enhancing Field-Oriented Control of Electric Drives with Tiny Neural Network Optimized for Micro-controllers	Martin Joel Mouk Elele et.al.	2502.00532	null
2025-01-31	Infer-EDGE: Dynamic DNN Inference Optimization in 'Just-in-time' Edge-AI Implementations	Motahare Mounesan et.al.	2501.18842	null
2025-01-30	Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization	Kevin Cooper et.al.	2501.18174	null
2025-01-28	On Accelerating Edge AI: Optimizing Resource-Constrained Environments	Jacob Sander et.al.	2501.15014	null
2025-02-06	SplitQuant: Layer Splitting for Low-Bit Neural Network Quantization	Jaewoo Song et.al.	2501.12428	null
2025-01-20	Consolidating TinyML Lifecycle with Large Language Models: Reality, Illusion, or Opportunity?	Guanghan Wu et.al.	2501.12420	null
2025-01-17	Michscan: Black-Box Neural Network Integrity Checking at Runtime Through Power Analysis	Robi Paul et.al.	2501.10174	null
2025-01-13	QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications	Jeongseok Kim et.al.	2501.07161	null
2025-01-12	Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G	Zhiyan Liu et.al.	2501.06726	null
2025-01-09	Towards smart and adaptive agents for active sensing on edge devices	Devendra Vyas et.al.	2501.06262	null
2025-01-21	Distilling Calibration via Conformalized Credal Inference	Jiayi Huang et.al.	2501.06066	null
2025-01-08	Decentralised Resource Sharing in TinyML: Wireless Bilayer Gossip Parallel SGD for Collaborative Learning	Ziyuan Bao et.al.	2501.04817	null
2025-01-07	ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono	Jingquan Wang et.al.	2501.04062	null
2025-01-04	Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies	Xubin Wang et.al.	2501.03265	link
2025-01-01	AI-ANNE: (A) (N)eural (N)et for (E)xploration: Transferring Deep Learning Models onto Microcontrollers and Embedded Systems	Dennis Klinkhammer et.al.	2501.03256	null
2025-01-01	Communication Efficient Cooperative Edge AI via Event-Triggered Computation Offloading	You Zhou et.al.	2501.02001	null
2024-12-25	Tempus Core: Area-Power Efficient Temporal-Unary Convolution Core for Low-Precision Edge DLAs	Prabhu Vellaisamy et.al.	2412.19002	null
2024-12-23	Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings	Harsh Joshi et.al.	2412.18635	null
2024-12-23	tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI	Harideep Nair et.al.	2412.17966	null
2024-12-22	Fatigue Monitoring Using Wearables and AI: Trends, Challenges, and Future Opportunities	Kourosh Kakhi et.al.	2412.16847	null
2024-12-19	ElectraSight: Smart Glasses with Fully Onboard Non-Invasive Eye Tracking Using Hybrid Contact and Contactless EOG	Nicolas Schärer et.al.	2412.14848	null
2025-01-05	Overview of AI and Communication for 6G Network: Fundamentals, Challenges, and Future Research Opportunities	Qimei Cui et.al.	2412.14538	null
2024-12-17	Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation	Abraham G. Taye et.al.	2412.14206	null
2024-12-16	Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads	Mukul Lokhande et.al.	2412.11702	link
2024-12-13	Edge AI-based Radio Frequency Fingerprinting for IoT Networks	Ahmed Mohamed Hussain et.al.	2412.10553	null
2024-12-13	EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models	Hanchu Zhou et.al.	2412.09782	null
2024-12-12	Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices	Thanaphon Suwannaphong et.al.	2412.09289	null
2024-12-10	Performance Evaluation of ROS2-DDS middleware implementations facilitating Cooperative Driving in Autonomous Vehicle	Sumit Paul et.al.	2412.07485	null
2024-12-07	Innovative Sentiment Analysis and Prediction of Stock Price Using FinBERT, GPT-4 and Logistic Regression: A Data-Driven Approach	Olamilekan Shobayo et.al.	2412.06837	null
2024-12-09	DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators	Taesik Gong et.al.	2412.06566	link
2024-12-09	Sequential Printed MLP Circuits for Super TinyML Multi-Sensory Applications	Gurol Saglam et.al.	2412.06542	null
2024-12-02	Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping	Marla Grunewald et.al.	2412.01609	null
2024-12-01	Toward Real-Time Edge AI: Model-Agnostic Task-Oriented Communication with Visual Feature Alignment	Songjie Xie et.al.	2412.00862	link
2024-11-28	Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras	Jicheng Yuan et.al.	2411.19143	null
2024-11-28	Towards an Implementation of the Knowledge-Based Control Plane for Intelligent Swarm Networks	Xuanchi Guo et.al.	2411.19068	null
2024-11-24	Space-ground Fluid AI for 6G Edge Intelligence	Qian Chen et.al.	2411.15845	null
2024-11-20	Federated Continual Learning for Edge-AI: A Comprehensive Survey	Zi Wang et.al.	2411.13740	null
2024-11-16	Enhanced FIWARE-Based Architecture for Cyberphysical Systems With Tiny Machine Learning and Machine Learning Operations: A Case Study on Urban Mobility Systems	Javier Conde et.al.	2411.13583	null
2024-11-19	Signformer is all you need: Towards Edge AI for Sign Language	Eta Yang et.al.	2411.12901	link
2024-11-16	DEBUG-HD: Debugging TinyML models on-device using Hyper-Dimensional computing	Nikhil P Ghanathe et.al.	2411.10692	null
2024-11-14	ABCI 3.0: Evolution of the leading AI infrastructure in Japan	Ryousei Takano et.al.	2411.09134	null
2024-11-13	A Cost-effective, Stand-alone, and Real-time TinyML-Based Gait Diagnosis Unit Aimed at Lower-limb Robotic Prostheses and Exoskeletons	Zarin Anjum Madhiha et.al.	2411.08474	null
2024-11-12	Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge	Emmanuel Azuh Mensah et.al.	2411.07834	null
2024-11-16	Enhancing Predictive Maintenance in Mining Mobile Machinery through a TinyML-enabled Hierarchical Inference Network	Raúl de la Fuente et.al.	2411.07168	null
2024-11-11	A Primer on Word Embeddings: AI Techniques for Text Analysis in Social Work	Brian E. Perron et.al.	2411.07156	null
2024-11-11	TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems	Jacob Huckelberry et.al.	2411.07114	null
2024-11-10	Activation Map Compression through Tensor Decomposition for Deep Learning	Le-Trung Nguyen et.al.	2411.06346	link
2024-11-09	TinyML NLP Approach for Semantic Wireless Sentiment Classification	Ahmed Y. Radwan et.al.	2411.06291	link
2024-11-03	Energy-Aware FPGA Implementation of Spiking Neural Network with LIF Neurons	Asmer Hamid Ali et.al.	2411.01628	null
2024-11-01	On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance	Jaskirat Singh et.al.	2411.00907	null
2024-10-30	Profiling AI Models: Towards Efficient Computation Offloading in Heterogeneous Edge AI Systems	Juan Marcelo Parra-Ullauri et.al.	2411.00859	null
2024-11-01	GPT for Games: An Updated Scoping Review (2020-2024)	Daijin Yang et.al.	2411.00308	null
2024-10-31	Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge	Stefano Albini et.al.	2410.24066	link
2024-10-28	FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge	Sifat Ut Taki et.al.	2410.21120	link
2024-10-28	Edge Perception: Intelligent Wireless Sensing at Network Edge	Yuanhao Cui et.al.	2410.21017	null
2024-10-25	Neuromorphic IoT Architecture for Efficient Water Management: A Smart Village Case Study	Mugdim Bublin et.al.	2410.19562	null
2024-10-17	SouLLMate: An Application Enhancing Diverse Mental Health Support with Adaptive LLMs, Prompt Engineering, and RAG Techniques	Qiming Guo et.al.	2410.16322	null
2024-10-21	P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving	Mohamed R. Elshamy et.al.	2410.15602	null
2024-10-15	SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments	Syed Abdul Gaffar Shakhadri et.al.	2410.11331	null
2024-10-14	ABBA-VSM: Time Series Classification using Symbolic Representation on the Edge	Meerzhan Kanatbekova et.al.	2410.10285	null
2024-10-12	Token Pruning using a Lightweight Background Aware Vision Transformer	Sudhakar Sah et.al.	2410.09324	null
2024-10-11	MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices	Mohamed Amine Hamdi et.al.	2410.08855	link
2024-10-11	Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation	Gleb Radchenko et.al.	2410.08651	null
2024-10-10	Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices	Yiwei Zhao et.al.	2410.08326	null
2024-10-10	L-VITeX: Light-weight Visual Intuition for Terrain Exploration	Antar Mazumder et.al.	2410.07872	null
2024-10-10	Towards Robust IoT Defense: Comparative Statistics of Attack Detection in Resource-Constrained Scenarios	Zainab Alwaisi et.al.	2410.07810	null
2024-10-10	vCLIC: Towards Fast Interrupt Handling in Virtualized RISC-V Mixed-criticality Systems	Enrico Zelioli et.al.	2410.07798	null
2024-10-07	SoK: Towards Security and Safety of Edge AI	Tatjana Wingarz et.al.	2410.05349	null
2024-10-10	SONAR: A Synthetic AI-Audio Detection Framework and Benchmark	Xiang Li et.al.	2410.04324	link
2024-09-28	MicroFlow: An Efficient Rust-Based Inference Engine for TinyML	Matteo Carnelos et.al.	2409.19432	link
2024-09-27	Analog fast Fourier transforms for scalable and efficient signal processing	T. Patrick Xiao et.al.	2409.19071	null
2024-09-26	Development of an Edge Resilient ML Ensemble to Tolerate ICS Adversarial Attacks	Likai Yao et.al.	2409.18244	null
2024-09-25	Susceptibility Formulation of Density Matrix Perturbation Theory	Anders M. N. Niklasson et.al.	2409.17033	null
2024-09-25	Ethical and Scalable Automation: A Governance and Compliance Framework for Business Applications	Haocheng Lin et.al.	2409.16872	null
2024-09-25	Accelerating TinyML Inference on Microcontrollers through Approximate Kernels	Giorgos Armeniakos et.al.	2409.16815	link
2024-09-23	Benchmarking Edge AI Platforms for High-Performance ML Inference	Rakshith Jayanth et.al.	2409.14803	null
2024-09-24	CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks	Zhaozhi Qian et.al.	2409.12623	null
2024-09-17	AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances	Dhruv Agarwal et.al.	2409.11360	null
2024-09-17	Optimizing TinyML: The Impact of Reduced Data Acquisition Rates for Time Series Classification on Microcontrollers	Riya Samanta et.al.	2409.10942	null
2024-09-13	Pushing the boundaries of event subsampling in event-based video classification using CNNs	Hesam Araghi et.al.	2409.08953	link
2024-09-12	E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning	Le Zhang et.al.	2409.08369	link
2024-09-12	DiReDi: Distillation and Reverse Distillation for AIoT Applications	Chen Sun et.al.	2409.08308	null
2024-09-11	A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption	Marcus Rüb et.al.	2409.07114	null
2024-09-08	Transformer with Leveraged Masked Autoencoder for video-based Pain Assessment	Minh-Duc Nguyen et.al.	2409.05088	null
2024-09-02	Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks	Samer Francy et.al.	2409.02134	null
2024-09-01	Research on LLM Acceleration Using the High-Performance RISC-V Processor "Xiangshan" (Nanhu Version) Based on the Open-Source Matrix Instruction Set Extension (Vector Dot Product)	Xu-Hao Chen et.al.	2409.00661	null
2024-08-26	Towards Sustainable Personalized On-Device Human Activity Recognition with TinyML and Cloud-Enabled Auto Deployment	Bidyut Saha et.al.	2409.00093	null
2024-08-29	TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification	Bidyut Saha et.al.	2408.16535	link
2024-08-08	An Edge AI System Based on FPGA Platform for Railway Fault Detection	Jiale Li et.al.	2408.15245	null
2024-08-23	S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis	Kamal Basha S et.al.	2408.12833	link
2024-08-20	Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning	Bei Ouyang et.al.	2408.10746	null
2024-08-21	Challenges and Responses in the Practice of Large Language Models	Hongyin Zhu et.al.	2408.09416	null
2024-08-15	Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices	Tess Watt et.al.	2408.08215	null
2024-08-14	Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator	Federico Nicolas Peccia et.al.	2408.07404	null
2024-08-13	Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach	Haowei Ni et.al.	2408.06634	null
2024-08-06	Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW	Elia Cereda et.al.	2408.03168	null
2024-08-05	Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow	Philip Wiese et.al.	2408.02473	null
2024-08-05	PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy	Rachmad Vidya Wicaksana Putra et.al.	2408.02412	null
2024-08-02	A Tiny Supervised ODL Core with Auto Data Pruning for Human Activity Recognition	Hiroki Matsutani et.al.	2408.01283	null
2024-07-29	HOAA: Hybrid Overestimating Approximate Adder for Enhanced Performance Processing Engine	Omkar Kokane et.al.	2408.00806	link
2024-07-31	TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors	Zhaolan Huang et.al.	2407.21453	link
2024-07-31	SHA-CNN: Scalable Hierarchical Aware Convolutional Neural Network for Edge AI	Narendra Singh Dhakad et.al.	2407.21370	null
2024-07-30	On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds	Xu Chen et.al.	2407.20710	null
2024-07-29	Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference	Claudio Angione et.al.	2407.19775	null
2024-07-25	A Sensitivity Analysis of Cellular Automata and Heterogeneous Topology Networks: Partially-Local Cellular Automata and Homogeneous Homogeneous Random Boolean Networks	Tom Eivind Glover et.al.	2407.18017	null
2024-07-22	StreamTinyNet: video streaming analysis with spatial-temporal TinyML	Hazem Hesham Yousef Shalby et.al.	2407.17524	null
2024-07-22	KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer	Aness Al-Qawlaq et.al.	2407.16026	null
2024-07-18	Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence	Shubha R. Kharel et.al.	2407.14560	null
2024-07-18	Ultra-Low-Latency Edge Inference for Distributed Sensing	Zhanwei Wang et.al.	2407.13360	null
2024-07-17	Computing: Looking Back and Moving Forward	Muhammed Golec et.al.	2407.12558	null
2024-07-16	XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach	Truong Thanh Hung Nguyen et.al.	2407.11771	link
2024-07-18	Enhancing TinyML Security: Study of Adversarial Attack Transferability	Parin Shah et.al.	2407.11599	null
2024-07-13	Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks	Zhenyu Wang et.al.	2407.10016	null
2024-07-11	Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware	James Seekings et.al.	2407.08704	null

Domain Specific Accelerator

Publish Date	Title	Authors	PDF	Code
2025-04-23	Trends in AI Supercomputers	Konstantin F. Pilz et.al.	2504.16026	null
2025-04-22	GainSight: Application-Guided Profiling for Composing Heterogeneous On-Chip Memories in AI Hardware Accelerators	Peijing Li et.al.	2504.14866	null
2025-04-16	HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks	Stefan Abi-Karam et.al.	2504.12268	null
2025-04-14	Carbon-Efficient 3D DNN Acceleration: Optimizing Performance and Sustainability	Aikaterini Maria Panteleaki et.al.	2504.09851	null
2025-03-21	Fused-Tiled Layers: Minimizing Data Movement on RISC-V SoCs with Software-Managed Caches	Victor J. B. Jung et.al.	2504.03676	null
2025-03-31	DiffuSE: Cross-Layer Design Space Exploration of DNN Accelerator via Diffusion-Driven Optimization	Yi Ren et.al.	2503.23945	null
2025-03-17	LIMCA: LLM for Automating Analog In-Memory Computing Architecture Design Exploration	Deepak Vungarala et.al.	2503.13301	null
2025-03-06	FORTALESA: Fault-Tolerant Reconfigurable Systolic Array for DNN Inference	Natalia Cherezova et.al.	2503.04426	null
2025-02-13	GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units	Arghadip Das et.al.	2502.06921	link
2025-02-09	MetaML-Pro: Cross-Stage Design Flow Automation for Efficient Deep Learning Acceleration	Zhiqiang Que et.al.	2502.05850	null
2025-02-06	Systolic Sparse Tensor Slices: FPGA Building Blocks for Sparse and Dense AI Acceleration	Endri Taka et.al.	2502.03763	null
2025-02-01	Late Breaking Results: Leveraging Approximate Computing for Carbon-Aware DNN Accelerators	Aikaterini Maria Panteleaki et.al.	2502.00286	null
2025-01-31	StruM: Structured Mixed Precision for Efficient Deep Learning Hardware Codesign	Michael Wu et.al.	2501.18953	null
2025-01-30	REDACTOR: eFPGA Redaction for DNN Accelerator Security	Yazan Baddour et.al.	2501.18740	link
2025-01-22	SoMa: Identifying, Exploring, and Understanding the DRAM Communication Scheduling Space for DNN Accelerators	Jingwei Cai et.al.	2501.12634	link
2025-01-17	AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations	Jamin Seo et.al.	2501.09954	link
2025-01-13	Leveraging ASIC AI Chips for Homomorphic Encryption	Jianming Tong et.al.	2501.07047	link
2025-01-12	COMPASS: A Compiler Framework for Resource-Constrained Crossbar-Array Based In-Memory Deep Learning Accelerators	Jihoon Park et.al.	2501.06780	null
2024-12-21	Leveraging Highly Approximated Multipliers in DNN Inference	Georgios Zervakis et.al.	2412.16757	null
2024-12-13	Panacea: Novel DNN Accelerator using Accuracy-Preserving Asymmetric Quantization and Energy-Saving Bit-Slice Sparsity	Dongyun Kam et.al.	2412.10059	null
2024-12-06	HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design	Jinwei Tang et.al.	2412.05393	null
2024-12-06	MC3: Memory Contention based Covert Channel Communication on Shared DRAM System-on-Chips	Ismet Dagli et.al.	2412.05228	null
2024-11-28	PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers	Gwangoo Yeo et.al.	2411.19114	null
2024-12-06	FAMES: Fast Approximate Multiplier Substitution for Mixed-Precision Quantized DNNs--Down to 2 Bits!	Yi Ren et.al.	2411.18055	null
2024-11-19	Travel Time Based Task Mapping for NoC-Based DNN Accelerator	Yizhi Chen et.al.	2411.12710	null
2024-10-29	Systolic Array Data Flows for Efficient Matrix Multiplication in Deep Neural Networks	Tejas Raja et.al.	2410.22595	null
2024-10-21	Adventures with Grace Hopper AI Super Chip and the National Research Platform	J. Alex Hurt et.al.	2410.16487	null
2024-10-17	Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting	Mikael Rinkinen et.al.	2410.13415	null
2024-10-11	MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices	Mohamed Amine Hamdi et.al.	2410.08855	link
2024-09-23	MESC: Re-thinking Algorithmic Priority and/or Criticality Inversions for Heterogeneous MCSs	Jiapeng Guan et.al.	2409.14837	null
2024-10-14	LoopTree: Exploring the Fused-layer Dataflow Accelerator Design Space	Michael Gilbert et.al.	2409.13625	link
2024-09-13	Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators	Konstantin Lübeck et.al.	2409.08595	null
2024-09-08	BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration	Yuzong Chen et.al.	2409.05227	link
2024-09-08	HYDRA: Hybrid Data Multiplexing and Run-time Layer Configurable DNN Accelerator	Sonu Kumar et.al.	2409.04976	null
2024-08-27	SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration	Runzhen Xue et.al.	2408.15089	null
2024-08-24	SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision Deep Neural Networks	Niharika Thakuria et.al.	2408.13617	null
2024-08-13	Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture	Yu Feng et.al.	2408.06608	null
2024-09-24	Scaling Deep Learning Computation over the Inter-Core Connected Intelligence Processor with T10	Yiqi Liu et.al.	2408.04808	null
2024-07-30	Optical Computing for Deep Neural Network Acceleration: Foundations, Recent Developments, and Emerging Directions	Sudeep Pasricha et.al.	2407.21184	null
2024-07-29	Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices	Hayun Lee et.al.	2407.19644	null
2024-07-24	The Magnificent Seven Challenges and Opportunities in Domain-Specific Accelerator Design for Autonomous Systems	Sabrina M. Neuman et.al.	2407.17311	null
2024-07-17	StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators	Ethan G Rogers et.al.	2407.12378	null
2024-07-11	NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2	Tengfei Xue et.al.	2407.12057	null
2024-07-22	ARCO: Adaptive Multi-Agent Reinforcement Learning-Based Hardware/Software Co-Optimization Compiler for Improved Performance in DNN Accelerator Design	Arya Fayyazi et.al.	2407.08192	null
2024-06-20	SWANN: Shuffling Weights in Crossbar Arrays for Enhanced DNN Accuracy in Deeply Scaled Technologies	Jeffry Victor et.al.	2406.14706	null
2024-06-14	CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories	Man Shi et.al.	2406.14574	null
2024-06-15	Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training	Akul Malhotra et.al.	2406.10528	null
2024-07-17	Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis	Zongyue Qin et.al.	2406.09606	null
2024-06-05	HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator	Zhewen Yu et.al.	2406.03088	link
2024-06-03	A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing	P. J. Zhou et.al.	2406.01151	null

(back to top)

Low-Rank Adaptation

Publish Date	Title	Authors	PDF	Code
2025-05-01	Block Circulant Adapter for Large Language Models	Xinyu Ding et.al.	2505.00582	null
2025-05-01	Communication-Efficient Wireless Federated Fine-Tuning for Large-Scale AI Models	Bumjun Kim et.al.	2505.00333	null
2025-05-01	AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care	Md Asaduzzaman Jabin et.al.	2505.00275	null
2025-04-30	SAM4EM: Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks	Uzair Shah et.al.	2504.21544	null
2025-04-29	TT-LoRA MoE: Unifying Parameter-Efficient Fine-Tuning and Sparse Mixture-of-Experts	Pradip Kunwar et.al.	2504.21190	null
2025-04-29	X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation	Guy Hadad et.al.	2504.20859	null
2025-04-29	Reinforcement Learning for LLM Reasoning Under Memory Constraints	Alan Lee et.al.	2504.20834	null
2025-04-29	In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer	Zechuan Zhang et.al.	2504.20690	null
2025-04-29	What Causes Knowledge Loss in Multilingual Language Models?	Maria Khelli et.al.	2504.20356	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	null
2025-04-27	Low-Rank Adaptive Structural Priors for Generalizable Diabetic Retinopathy Grading	Yunxuan Wang et.al.	2504.19362	null
2025-04-25	TLoRA: Tri-Matrix Low-Rank Adaptation of Large Language Models	Tanvir Islam et.al.	2504.18735	null
2025-04-25	Pushing the boundary on Natural Language Inference	Pablo Miralles-González et.al.	2504.18376	null
2025-04-25	Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding	Kun Li et.al.	2504.18204	null
2025-04-25	NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation	Rob Romijnders et.al.	2504.18147	null
2025-04-25	Automating Function-Level TARA for Automotive Full-Lifecycle Security	Yuqiao Yang et.al.	2504.18083	null
2025-04-24	Replay to Remember: Retaining Domain Knowledge in Streaming Language Models	Sneh Pillai et.al.	2504.17780	null
2025-04-23	Federated Learning of Low-Rank One-Shot Image Detection Models in Edge Devices with Scalable Accuracy and Compute Complexity	Abdul Hannaan et.al.	2504.16515	null
2025-04-23	EMRModel: A Large Language Model for Extracting Medical Consultation Dialogues into Structured Medical Records	Shuguang Zhao et.al.	2504.16448	null
2025-04-22	PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning	Song Wang et.al.	2504.16023	null
2025-04-22	Low-Rank Adaptation of Neural Fields	Anh Truong et.al.	2504.15933	null
2025-04-22	Tina: Tiny Reasoning Models via LoRA	Shangshang Wang et.al.	2504.15777	null
2025-04-23	A LoRA-Based Approach to Fine-Tuning LLMs for Educational Guidance in Resource-Constrained Settings	Md Millat Hosen et.al.	2504.15610	link
2025-04-21	SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation	Yue Li et.al.	2504.15035	null
2025-04-21	What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale	Xiaoyong Yuan et.al.	2504.14815	null
2025-04-21	When Cloud Removal Meets Diffusion Model in Remote Sensing	Zhenyu Yu et.al.	2504.14785	null
2025-04-20	Efficient Federated Split Learning for Large Language Models over Communication Networks	Kai Zhao et.al.	2504.14667	null
2025-04-20	TrustLoRA: Low-Rank Adaptation for Failure Detection under Out-of-distribution Data	Fei Zhu et.al.	2504.14545	null
2025-04-19	Cross-attention for State-based model RWKV-7	Liu Xiao et.al.	2504.14260	link
2025-04-18	6G WavesFM: A Foundation Model for Sensing, Communication, and Localization	Ahmed Aboulfotouh et.al.	2504.14100	null
2025-04-18	ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis	Andrea Rigo et.al.	2504.13745	null
2025-04-18	Efficient Parameter Adaptation for Multi-Modal Medical Image Segmentation and Prognosis	Numan Saeed et.al.	2504.13645	null
2025-04-18	LoRA-Based Continual Learning with Constraints on Critical Parameter Changes	Shimou Ling et.al.	2504.13407	link
2025-04-17	Mirror, Mirror of the Flow: How Does Regularization Shape Implicit Bias?	Tom Jacobs et.al.	2504.12883	null
2025-04-17	Chinese-Vicuna: A Chinese Instruction-following Llama-based Model	Chenghao Fan et.al.	2504.12737	null
2025-04-17	Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models	Zhenyu Yu et.al.	2504.12574	null
2025-04-19	Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex	Azadeh Beiranvand et.al.	2504.12474	null
2025-04-16	You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models	Shiwei Ding et.al.	2504.12471	null
2025-04-16	Activated LoRA: Fine-tuned LLMs for Intrinsics	Kristjan Greenewald et.al.	2504.12397	null
2025-04-16	Super-LoRa: Enhancing LoRa Throughput via Payload Superposition	Salah Abdeljabar et.al.	2504.11927	null
2025-04-16	ACE: Attentional Concept Erasure in Diffusion Models	Finn Carter et.al.	2504.11850	null
2025-04-16	Résumé abstractif à partir d'une transcription audio (Abstractive Summarization from an Audio Transcription)	Ilia Derkach et.al.	2504.11803	null
2025-04-16	A Library of LLM Intrinsics for Retrieval-Augmented Generation	Marina Danilevsky et.al.	2504.11704	null
2025-04-15	Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models	Nicolas Baumann et.al.	2504.11514	link
2025-04-15	UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer	Xiang Wang et.al.	2504.11289	link
2025-04-15	Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution	Xinning Chai et.al.	2504.11271	link
2025-04-15	FHBench: Towards Efficient and Personalized Federated Learning for Multimodal Healthcare	Penghao Wang et.al.	2504.10817	link
2025-04-14	CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation	Junchen Fu et.al.	2504.10307	link
2025-04-14	UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval	Yating Liu et.al.	2504.10084	link
2025-04-13	AeroLite: Tag-Guided Lightweight Generation of Aerial Image Captions	Xing Zi et.al.	2504.09528	null
2025-04-13	CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models	Pooja Guhan et.al.	2504.09472	null
2025-04-13	Vision Transformers Exhibit Human-Like Biases: Evidence of Orientation and Color Selectivity, Categorical Perception, and Phase Transitions	Nooshin Bahador et.al.	2504.09393	null
2025-04-12	FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment	Sijing Wu et.al.	2504.09255	null
2025-04-12	DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models	Wenjin Ke et.al.	2504.09223	null
2025-04-11	Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models	Jiahuan Long et.al.	2504.08915	null
2025-04-11	Spatial Audio Processing with Large Language Model on Wearable Devices	Ayushi Mishra et.al.	2504.08907	null
2025-04-11	AI-University: An LLM-based platform for instructional alignment to scientific classrooms	Mostafa Faghih Shojaei et.al.	2504.08846	link
2025-04-10	LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution	Danielle Sullivan-Pao et.al.	2504.08149	link
2025-04-08	CDM-QTA: Quantized Training Acceleration for Efficient LoRA Fine-Tuning of Diffusion Model	Jinming Lu et.al.	2504.07998	null
2025-04-10	LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	Juzheng Zhang et.al.	2504.07448	link
2025-04-09	TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling	Liang-Hsuan Tseng et.al.	2504.07053	link
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	null
2025-04-08	Can you Finetune your Binoculars? Embedding Text Watermarks into the Weights of Large Language Models	Fay Elhassan et.al.	2504.06446	null
2025-04-08	S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning	Hanqing Zeng et.al.	2504.06426	null
2025-04-08	Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images	Hicham Talaoubrid et.al.	2504.06330	null
2025-04-11	Optuna vs Code Llama: Are LLMs a New Paradigm for Hyperparameter Tuning?	Roman Kochnev et.al.	2504.06006	null
2025-04-06	AROMA: Autonomous Rank-one Matrix Adaptation	Hao Nan Sheng et.al.	2504.05343	link
2025-04-07	Enhancing Smart Contract Vulnerability Detection in DApps Leveraging Fine-Tuned LLM	Jiuyang Bu et.al.	2504.05006	null
2025-04-07	TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment	Adnan Khan et.al.	2504.04722	null
2025-04-07	LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts	Yimu Wang et.al.	2504.04653	null
2025-04-06	KnowsLM: A framework for evaluation of small language models for knowledge augmentation and humanised conversations	Chitranshu Harbola et.al.	2504.04569	null
2025-04-05	FISH-Tuning: Enhancing PEFT Methods with Fisher Information	Kang Xue et.al.	2504.04050	null
2025-04-03	The Self-Learning Agent with a Progressive Neural Network Integrated Transformer	Ajay Sivakumar et.al.	2504.02489	null
2025-04-03	Cognitive Memory in Large Language Models	Lianlei Shan et.al.	2504.02441	null
2025-04-03	AC-LoRA: Auto Component LoRA for Personalized Artistic Style Image Generation	Zhipu Cui et.al.	2504.02231	null
2025-04-02	CLIP-SLA: Parameter-Efficient CLIP Adaptation for Continuous Sign Language Recognition	Sarah Alyami et.al.	2504.01666	link
2025-04-02	Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning	Yiting Lu et.al.	2504.01655	link
2025-04-01	Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations	Chongjie Si et.al.	2504.00851	null
2025-04-01	DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism	Dengchun Li et.al.	2504.00661	link
2025-04-01	Next Generation LoRaWAN: Integrating Multi-Hop Communications at 2.4 GHz	Riccardo Marini et.al.	2504.00489	null
2025-04-01	Exploring the Collaborative Advantage of Low-level Information on Generalizable AI-Generated Image Detection	Ziyin Zhou et.al.	2504.00463	null
2025-04-01	MetaLoRA: Tensor-Enhanced Adaptive Low-Rank Fine-tuning	Maolin Wang et.al.	2504.00460	null
2025-03-31	ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning	Huandong Chang et.al.	2504.00254	null
2025-03-31	ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion	Rana Muhammad Shahroz Khan et.al.	2503.24354	null
2025-03-31	JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation	Fangda Chen et.al.	2503.23951	null
2025-03-31	Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation	Yongle Li et.al.	2503.23869	null
2025-04-01	Evaluating small vision-language models as AI assistants for radio astronomical source analysis tasks	S. Riggi et.al.	2503.23859	link
2025-03-30	Mixture of Routers	Jia-Chen Zhang et.al.	2503.23362	null
2025-03-30	Not All LoRA Parameters Are Essential: Insights on Inference Necessity	Guanhua Chen et.al.	2503.23360	null
2025-03-29	Efficient Adaptation For Remote Sensing Visual Grounding	Hasan Moughnieh et.al.	2503.23083	null
2025-03-29	InkFM: A Foundational Model for Full-Page Online Handwritten Note Understanding	Anastasiia Fadeeva et.al.	2503.23081	null
2025-03-29	Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning	Xinlei Shao et.al.	2503.23012	link
2025-03-29	Multimodal machine learning with large language embedding model for polymer property prediction	Tianren Zhang et.al.	2503.22962	null
2025-03-28	ActionStudio: A Lightweight Framework for Data and Training of Action Models	Jianguo Zhang et.al.	2503.22673	link
2025-03-28	Shadow and gravitational lensing produced by the nonlinear accretion of a scalar field onto a black hole	J. C. Acevedo-Muñoz et.al.	2503.22624	null
2025-03-28	Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities	Raman Dutt et.al.	2503.22517	null
2025-03-28	Fighting Fire with Fire: Channel-Independent RF Fingerprinting via the Ratio of Linear to Logarithmic Differential Spectrum	Tianshu Chen et.al.	2503.22378	null
2025-03-28	Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization	Barış Batuhan Topal et.al.	2503.22352	null
2025-03-28	Make Some Noise: Towards LLM audio reasoning and generation using sound tokens	Shivam Mehta et.al.	2503.22275	null
2025-03-28	Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation	Minho Park et.al.	2503.22172	null
2025-03-27	RocketPPA: Ultra-Fast LLM-Based PPA Estimator at Code-Level Abstraction	Armin Abdollahi et.al.	2503.21971	null
2025-03-27	VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models	Chi-Pin Huang et.al.	2503.21781	null
2025-03-27	Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation	Reza Qorbani et.al.	2503.21780	link
2025-03-27	Resource-Efficient Federated Fine-Tuning Large Language Models for Heterogeneous Data	Jun Liu et.al.	2503.21213	null
2025-03-27	Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing	Fan Qi et.al.	2503.21069	null
2025-03-26	Vision as LoRA	Han Wang et.al.	2503.20680	link
2025-03-26	TeleLoRA: Teleporting Model-Specific Alignment Across LLMs	Xiao Lin et.al.	2503.20228	null
2025-03-26	ProtoBERT-LoRA: Parameter-Efficient Prototypical Finetuning for Immunotherapy Study Identification	Shijia Zhang et.al.	2503.20179	null
2025-03-25	iNatAg: Multi-Class Classification Models Enabled by a Large-Scale Benchmark Dataset with 4.7M Images of 2,959 Crop and Weed Species	Naitik Jain et.al.	2503.20068	link
2025-03-25	An Overview of Low-Rank Structures in the Training and Adaptation of Large Models	Laura Balzano et.al.	2503.19859	null
2025-03-25	fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models	Saurav Sharma et.al.	2503.19670	null
2025-03-25	Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion	Haim Sawdayee et.al.	2503.19557	null
2025-03-24	A Shared Low-Rank Adaptation Approach to Personalized RLHF	Renpu Liu et.al.	2503.19201	null
2025-03-24	Efficient Self-Supervised Adaptation for Medical Image Analysis	Moein Sorkhei et.al.	2503.18873	link
2025-03-24	Advancing Cross-Organ Domain Generalization with Test-Time Style Transfer and Diversity Enhancement	Biwen Meng et.al.	2503.18567	null
2025-03-24	Hiding Images in Diffusion Models by Editing Learned Score Functions	Haoyu Chen et.al.	2503.18459	null
2025-03-24	Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners	Wen Zheng Terence Ng et.al.	2503.18347	null
2025-03-24	Surgical Action Planning with Large Language Models	Mengya Xu et.al.	2503.18296	null
2025-03-23	Decoupling Angles and Strength in Low-rank Adaptation	Massimo Bini et.al.	2503.18225	link
2025-03-23	The Power of Small LLMs in Geometry Generation for Physical Simulations	Ossama Shafiq et.al.	2503.18178	null
2025-03-23	$D^2LoRA$: Data-Driven LoRA Initialization for Low Resource Tasks	Javad SeraJ et.al.	2503.18089	null
2025-03-23	Investigating Recent Large Language Models for Vietnamese Machine Reading Comprehension	Anh Duc Nguyen et.al.	2503.18062	null
2025-03-22	Serial Low-rank Adaptation of Vision Transformer	Houqiang Zhong et.al.	2503.17750	null
2025-03-21	Revisiting End To End Sparse Autoencoder Training -- A Short Finetune is All You Need	Adam Karvonen et.al.	2503.17272	link
2025-03-21	TRACE: Time SeRies PArameter EffiCient FinE-tuning	Yuze Li et.al.	2503.16991	null
2025-03-21	HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis	Mengtian Li et.al.	2503.16944	null
2025-03-21	LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models	Jian Liang et.al.	2503.16843	null
2025-03-20	LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates	Ying Shen et.al.	2503.16334	null
2025-03-20	Ultra-Resolution Adaptation with Ease	Ruonan Yu et.al.	2503.16322	link
2025-03-20	SALT: Singular Value Adaptation with Low-Rank Transformation	Abdelrahman Elsayed et.al.	2503.16055	link
2025-03-20	Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras	Beilei Cui et.al.	2503.15917	null
2025-03-19	Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices	Ziyao Wang et.al.	2503.14932	null
2025-03-18	MusicInfuser: Making Video Diffusion Listen and Dance	Susung Hong et.al.	2503.14505	null
2025-03-17	Atyaephyra at SemEval-2025 Task 4: Low-Rank NPO	Jan Bronec et.al.	2503.13690	link
2025-03-17	Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model	Kai Tong et.al.	2503.13575	null
2025-03-17	VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning	Ye Liu et.al.	2503.13444	link
2025-03-17	Edit Transfer: Learning Image Editing via Vision In-Context Relations	Lan Chen et.al.	2503.13327	null
2025-03-17	MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Portrait Few-Step Synthesis	Shitong Shao et.al.	2503.13319	null
2025-03-17	Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation	Henghui Du et.al.	2503.13068	null
2025-03-17	ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM	Wenqiang Wang et.al.	2503.12988	null
2025-03-17	Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction	Zheyuan Liu et.al.	2503.12953	null
2025-03-17	Quantum-Enhanced LLM Efficient Fine Tuning	Xiaofei Kong et.al.	2503.12790	null
2025-03-16	RaSA: Rank-Sharing Low-Rank Adaptation	Zhiwei He et.al.	2503.12576	null
2025-03-16	Towards Suturing World Models: Learning Predictive Models for Robotic Surgical Tasks	Mehmet Kerem Turkcan et.al.	2503.12531	null
2025-03-16	Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation	Byung Hyun Lee et.al.	2503.12356	link
2025-03-14	Multi-Stage Generative Upscaler: Reconstructing Football Broadcast Images via Diffusion Models	Luca Martini et.al.	2503.11181	null
2025-03-13	Phishsense-1B: A Technical Perspective on an AI-Powered Phishing Detection Model	SE Blake et.al.	2503.10944	null
2025-03-14	Distilling Diversity and Control in Diffusion Models	Rohit Gandikota et.al.	2503.10637	null
2025-03-16	Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models	Andy Zhou et.al.	2503.10617	null
2025-03-13	ConsisLoRA: Enhancing Content and Style Consistency for LoRA-based Style Transfer	Bolin Chen et.al.	2503.10614	null
2025-03-13	Piece it Together: Part-Based Concepting with IP-Priors	Elad Richardson et.al.	2503.10365	null
2025-03-13	A Hybrid Architecture with Efficient Fine Tuning for Abstractive Patent Document Summarization	Nevidu Jayatilleke et.al.	2503.10354	null
2025-03-13	Singular Value Fine-tuning for Few-Shot Class-Incremental Learning	Zhiwu Wang et.al.	2503.10214	null
2025-03-13	PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation	Sen Wang et.al.	2503.09938	null
2025-03-12	Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection	Romain Thoreau et.al.	2503.09493	null
2025-03-12	SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery	Jiayuan Huang et.al.	2503.09474	null
2025-03-12	UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer	Haoxuan Wang et.al.	2503.09277	null
2025-03-12	Fine-Tuning Large Language Models for Educational Support: Leveraging Gagne's Nine Events of Instruction for Lesson Planning	Linzhao Jia et.al.	2503.09276	null
2025-03-12	InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images	Jiun Tian Hoe et.al.	2503.09130	null
2025-03-11	OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models	Jialv Zou et.al.	2503.08686	link
2025-03-11	Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation	Mingkang Zhu et.al.	2503.08575	null
2025-03-11	1LoRA: Summation Compression for Very Low-Rank Adaptation	Alessio Quercia et.al.	2503.08333	null
2025-03-11	MGHanD: Multi-modal Guidance for authentic Hand Diffusion	Taehyeon Eum et.al.	2503.08133	null
2025-03-11	Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection	Ying Fu Lim et.al.	2503.08045	null
2025-03-11	MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models	Han Zhao et.al.	2503.08007	null
2025-03-11	A Study to Evaluate the Impact of LoRA Fine-tuning on the Performance of Non-functional Requirements Classification	Xia Li et.al.	2503.07927	null
2025-03-10	AdaptSR: Low-Rank Adaptation for Efficient and Scalable Real-World Super-Resolution	Cansu Korkmaz et.al.	2503.07748	null
2025-03-10	DreamRelation: Relation-Centric Video Customization	Yujie Wei et.al.	2503.07602	null
2025-03-10	Balanced Image Stylization with Style Matching Score	Yuxin Jiang et.al.	2503.07601	null
2025-03-10	TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision	Shaobin Zhuang et.al.	2503.07416	null
2025-03-10	FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates	Sangwoo Park et.al.	2503.07216	null
2025-03-10	EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer	Yuxuan Zhang et.al.	2503.07027	null
2025-03-10	Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization	Ziqing Xu et.al.	2503.06982	null
2025-03-10	Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation	Pengchen Liang et.al.	2503.06976	null
2025-03-10	A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis	Xiang Liu et.al.	2503.06973	link
2025-03-09	Conceptrol: Concept Control of Zero-shot Personalized Image Generation	Qiyuan He et.al.	2503.06568	link
2025-03-09	Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs	Umberto Cappellazzo et.al.	2503.06362	null
2025-03-08	X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation	Jian Ma et.al.	2503.06134	link
2025-03-08	A Novel Trustworthy Video Summarization Algorithm Through a Mixture of LoRA Experts	Wenzhuo Du et.al.	2503.06064	null
2025-03-07	Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints	Parameswaran Kamalaruban et.al.	2503.05684	null
2025-03-07	Nuanced Safety for Generative AI: How Demographics Shape Responsiveness to Severity	Pushkar Mishra et.al.	2503.05609	null
2025-03-07	Quantum-PEFT: Ultra parameter-efficient fine-tuning	Toshiaki Koike-Akino et.al.	2503.05431	null
2025-03-07	LoRACode: LoRA Adapters for Code Embeddings	Saumya Chaturvedi et.al.	2503.05315	null
2025-03-06	Wanda++: Pruning Large Language Models via Regional Gradients	Yifan Yang et.al.	2503.04992	null
2025-03-06	Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach	Soumyadeep Ro et.al.	2503.04918	null
2025-03-05	Enhancing Collective Intelligence in Large Language Models Through Emotional Integration	Likith Kadiyala et.al.	2503.04849	null
2025-03-06	TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models	Xinyi He et.al.	2503.04396	null
2025-03-07	GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI	Cecilia Diana-Albelda et.al.	2503.04325	link
2025-03-06	Continual Optimization with Symmetry Teleportation for Multi-Task Learning	Zhipeng Zhou et.al.	2503.04046	null
2025-03-05	Personalized Federated Fine-tuning for Heterogeneous Data: An Automatic Rank Learning Approach via Two-Level LoRA	Jie Hao et.al.	2503.03920	null
2025-03-05	Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset	Jessica Hoffmann et.al.	2503.03654	null
2025-03-05	WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models	Tao Feng et.al.	2503.03110	null
2025-03-04	LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models	Pengwei Tang et.al.	2503.02659	null
2025-03-04	Efficient Long Sequential Low-rank Adaptive Attention for Click-through rate Prediction	Xin Song et.al.	2503.02542	null
2025-03-04	AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking	Iraklis Premptis et.al.	2503.02443	null
2025-03-04	Measuring Intrinsic Dimension of Token Embeddings	Takuya Kataiwa et.al.	2503.02142	null
2025-03-03	CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom	Yisen Li et.al.	2503.01836	link
2025-03-03	ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition	Nastaran Mansourian et.al.	2503.01750	null
2025-03-03	Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs	Abdelrahman Abouelenin et.al.	2503.01743	null
2025-03-03	CoPL: Collaborative Preference Learning for Personalizing LLMs	Youngbin Choi et.al.	2503.01658	null
2025-03-03	Liger: Linearizing Large Language Models to Gated Recurrent Structures	Disen Lan et.al.	2503.01496	null
2025-03-03	Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace	Jia-Chen Zhang et.al.	2503.01419	null
2025-02-28	Unsupervised Parameter Efficient Source-free Post-pretraining	Abhishek Jha et.al.	2502.21313	null
2025-02-28	RuCCoD: Towards Automated ICD Coding in Russian	Aleksandr Nesterov et.al.	2502.21263	link
2025-02-28	Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs	Weixiang Zhao et.al.	2502.20968	null
2025-02-28	Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content	Hongyuan Shen et.al.	2502.20952	null
2025-02-28	Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA	Ojonugwa Oluwafemi Ejiga Peter et.al.	2502.20667	null
2025-02-27	AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs	Xuyang Wei et.al.	2502.20035	link
2025-02-27	Image Referenced Sketch Colorization Based on Animation Creation Workflow	Dingkun Yan et.al.	2502.19937	link
2025-03-04	HaLoRA: Hardware-aware Low-Rank Adaptation for Large Language Models Based on Hybrid Compute-in-Memory Architecture	Taiqiang Wu et.al.	2502.19747	null
2025-02-26	Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing	Akshat Gupta et.al.	2502.19416	null
2025-02-26	CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning	Ping Zhang et.al.	2502.18910	null
2025-02-25	K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs	Ziheng Ouyang et.al.	2502.18461	null
2025-02-25	VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention	Adnan Iltaf et.al.	2502.18185	link
2025-02-27	SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models	Yuxuan Zhang et.al.	2502.18168	null
2025-02-25	C-LoRA: Continual Low-Rank Adaptation for Pre-trained Models	Xin Zhang et.al.	2502.17920	null
2025-02-24	Function-Space Learning Rates	Edward Milsom et.al.	2502.17405	link
2025-02-24	UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings	Layba Fiaz et.al.	2502.16961	null
2025-02-24	Design of a communication system Images for identification of vehicle plates	Fabrizio Andre Farfán Prado et.al.	2502.16909	null
2025-02-26	Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment	Chenghao Fan et.al.	2502.16894	null
2025-02-23	Efficient 4D Gaussian Stream with Low Rank Adaptation	Zhenhuan Liu et.al.	2502.16575	null
2025-02-22	Orthogonality Analysis in LoRa Uplink Satellite Communications Affected by Doppler Effect	Jikang Deng et.al.	2502.16179	null
2025-02-22	MedForge: Building Medical Foundation Models Like Open Source Software Development	Zheling Tan et.al.	2502.16055	link
2025-02-21	Sparsity May Be All You Need: Sparse Random Parameter Adaptation	Jesus Rios et.al.	2502.15975	null
2025-02-21	Pastiche Novel Generation Creating: Fan Fiction You Love in Your Favorite Author's Style	Xueran Han et.al.	2502.15616	null
2025-02-21	R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning	Jinda Liu et.al.	2502.15455	link
2025-02-21	Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning	Raghav Singhal et.al.	2502.15436	link
2025-02-21	On Performance of LoRa Fluid Antenna Systems	Gaoze Mu et.al.	2502.15258	null
2025-02-21	M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment	Chuan Cui et.al.	2502.15167	null
2025-02-20	Dynamic Concepts Personalization from Single Videos	Rameen Abdal et.al.	2502.14844	null
2025-02-20	Dynamic Low-Rank Sparse Adaptation for Large Language Models	Weizhong Huang et.al.	2502.14816	link
2025-02-20	Beyond Performance Scores: Directed Functional Connectivity as a Brain-Based Biomarker for Motor Skill Learning and Retention	Anil Kamat et.al.	2502.14731	null
2025-02-20	LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization	Yupeng Chang et.al.	2502.14538	link
2025-02-20	How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?	Sergey Pletenev et.al.	2502.14502	link
2025-02-20	NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models	Chenlu Guo et.al.	2502.14482	link
2025-02-19	PitVQA++: Vector Matrix-Low-Rank Adaptation for Open-Ended Visual Question Answering in Pituitary Surgery	Runlong He et.al.	2502.14149	link
2025-02-19	On the Duality between Gradient Transformations and Adapters	Lucas Torroba-Hennigen et.al.	2502.13811	null
2025-02-19	Adapting Large Language Models for Time Series Modeling via a Novel Parameter-efficient Adaptation Method	Juyuan Zhang et.al.	2502.13725	null
2025-02-19	BeamLoRA: Beam-Constraint Low-Rank Adaptation	Naibin Gu et.al.	2502.13604	null
2025-02-19	LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation	Xin Li et.al.	2502.13568	null
2025-02-19	Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Jun Zhang et.al.	2502.13533	link
2025-02-19	Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models	Chenyu Zhu et.al.	2502.13474	null
2025-02-19	Dynamic directed functional connectivity as a neural biomarker for objective motor skill assessment	Anil Kamat et.al.	2502.13362	null
2025-02-18	Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models	Soumi Das et.al.	2502.13313	null
2025-02-18	GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning	Sifan Zhou et.al.	2502.12913	null
2025-02-18	Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation	Kounianhua Du et.al.	2502.12492	null
2025-02-16	Efficient and Effective Prompt Tuning via Prompt Decomposition and Compressed Outer Product	Pengxiang Lan et.al.	2502.12200	null
2025-02-17	Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA	Patryk Marszałek et.al.	2502.12122	link
2025-02-17	Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis	Xu Wang et.al.	2502.11812	null
2025-02-17	DATA: Decomposed Attention-based Task Adaptation for Rehearsal-Free Continual Learning	Huanxuan Liao et.al.	2502.11482	link
2025-02-17	An Efficient Row-Based Sparse Fine-Tuning	Cen-Jhih Li et.al.	2502.11439	null
2025-02-16	Integrating Language Models for Enhanced Network State Monitoring in DRL-Based SFC Provisioning	Parisa Fard Moshiri et.al.	2502.11298	null
2025-02-18	AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks	Ming Xie et.al.	2502.11158	null
2025-02-15	Generalizable speech deepfake detection via meta-learned LoRA	Janne Laakkonen et.al.	2502.10838	null
2025-02-15	Code-Mixed Telugu-English Hate Speech Detection	Santhosh Kakarla et.al.	2502.10632	null
2025-02-14	Hallucinations and Truth: A Comprehensive Accuracy Evaluation of RAG, LoRA and DoRA	Mohammad Baqar et.al.	2502.10497	null
2025-02-14	Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages	Daniil Gurgurov et.al.	2502.10140	null
2025-02-14	Precise Parameter Localization for Textual Generation in Diffusion Models	Łukasz Staniszewski et.al.	2502.09935	null
2025-02-14	Port-LLM: A Port Prediction Method for Fluid Antenna based on Large Language Models	Yali Zhang et.al.	2502.09857	null
2025-02-14	HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation	Tianwei Lin et.al.	2502.09838	link
2025-02-13	Improving Acoustic Side-Channel Attacks on Keyboards Using Transformers and Large Language Models	Jin Hyun Park et.al.	2502.09782	null
2025-02-14	LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won't Fail)	Junsu Kim et.al.	2502.09376	null
2025-02-13	DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation	Tangyu Jiang et.al.	2502.08905	null
2025-02-13	BrainWavLM: Fine-tuning Speech Representations with Brain Responses to Language	Nishitha Vattikonda et.al.	2502.08866	null
2025-02-12	LoRa Fine Synchronization with Two-Pass Time and Frequency Offset Estimation	Joachim Tapparel et.al.	2502.08485	null
2025-02-12	LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits	Zikai Zhou et.al.	2502.08141	null
2025-02-11	Curvature Tuning: Provable Training-free Model Steering From a Single Parameter	Leyang Hu et.al.	2502.07783	link
2025-02-11	HRP: High-Rank Preheating for Superior LoRA Initialization	Yuzhu Chen et.al.	2502.07739	null
2025-02-11	LoRP-TTS: Low-Rank Personalized Text-To-Speech	Łukasz Bondaruk et.al.	2502.07562	null
2025-02-11	LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!	Dacheng Li et.al.	2502.07374	link
2025-02-10	Hyper Compressed Fine-Tuning of Large Foundation Models with Quantum Inspired Adapters	Snehal Raj et.al.	2502.06916	null
2025-02-10	CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers	D. She et.al.	2502.06527	null
2025-02-10	Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis	Sanket Jantre et.al.	2502.06173	null
2025-02-09	DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations	Krishna Sri Ipsit Mantri et.al.	2502.06029	link
2025-02-11	VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer	Xinyu Liu et.al.	2502.05979	null
2025-02-09	Skill Expansion and Composition in Parameter Space	Tenglong Liu et.al.	2502.05932	link
2025-02-08	Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning	Beining Zhang et.al.	2502.05573	null
2025-02-08	SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation	Yixian Shen et.al.	2502.05539	null
2025-02-07	Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs	Thierry Bossy et.al.	2502.05087	link
2025-02-07	SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model	Jiayang Yu et.al.	2502.04958	link
2025-02-07	Cached Multi-Lora Composition for Multi-Concept Image Generation	Xiandong Zou et.al.	2502.04923	link
2025-02-07	SelaFD: Seamless Adaptation of Vision Transformer Fine-tuning for Radar-based Human Activity	Yijun Wang et.al.	2502.04740	link
2025-02-07	EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference	Prakhar Kaushik et.al.	2502.04700	link
2025-02-07	Contrastive Learning-Enhanced Large Language Models for Monolith-to-Microservice Decomposition	Khaled Sellami et.al.	2502.04604	null
2025-02-05	FedP$^2$EFT: Federated Learning to Personalize Parameter Efficient Fine-Tuning for Multilingual LLMs	Royson Lee et.al.	2502.04387	null
2025-02-09	ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters	Kamer Ali Yuksel et.al.	2502.04315	link
2025-02-07	Efficient Few-Shot Continual Learning in Vision-Language Models	Aristeidis Panos et.al.	2502.04098	null
2025-02-06	Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning	Peizhuang Cong et.al.	2502.03884	null
2025-02-05	Resource-Efficient & Effective Code Summarization	Saima Afrin et.al.	2502.03617	null
2025-02-05	Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach	Abdullahi Isa Ahmed et.al.	2502.03377	null
2025-02-05	RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts	Tuan Truong et.al.	2502.03044	null
2025-02-05	SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs	Dinithi Jayasuriya et.al.	2502.02909	null
2025-02-04	Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation	Atharva Mangeshkumar Agrawal et.al.	2502.02249	null
2025-02-04	LoRA-TTT: Low-Rank Test-Time Training for Vision-Language Models	Yuto Kojima et.al.	2502.02069	null
2025-02-03	Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling	Kang Yang et.al.	2502.01826	null
2025-02-03	Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA	Shuangyi Chen et.al.	2502.01755	null
2025-02-03	Adapter-Based Multi-Agent AVSR Extension for Pre-Trained ASR Models	Christopher Simic et.al.	2502.01709	null
2025-02-03	QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning	Moses Ananta et.al.	2502.01703	link
2025-02-05	MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation	Yiren Song et.al.	2502.01572	null
2025-02-03	CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models	Guanduo Chen et.al.	2502.01378	null
2025-02-03	One-step full gradient suffices for low-rank fine-tuning, provably and efficiently	Yuanhe Zhang et.al.	2502.01235	null
2025-02-03	Joint Localization and Activation Editing for Low-Resource Fine-Tuning	Wen Lai et.al.	2502.01179	link
2025-01-31	Low-Rank Adapting Models for Sparse Autoencoders	Matthew Chen et.al.	2501.19406	link
2025-01-31	Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models	Wenzhi Fang et.al.	2501.19389	link
2025-02-03	SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions	Dominik Wagner et.al.	2501.19377	null
2025-01-31	Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification	Xiangyu Sun et.al.	2501.19086	null
2025-01-31	Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations	Dahye Kim et.al.	2501.19066	link
2025-01-31	Norm-Bounded Low-Rank Adaptation	Ruigang Wang et.al.	2501.19050	null
2025-01-31	Memory-Efficient Fine-Tuning of Transformers via Token Selection	Antoine Simoulin et.al.	2501.18824	null
2025-01-30	High-Accuracy ECG Image Interpretation using Parameter-Efficient LoRA Fine-Tuning with Multimodal LLaMA 3.2	Nandakishor M et.al.	2501.18670	null
2025-01-30	CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization	Yanxia Deng et.al.	2501.18475	null
2025-01-30	Impact of Reactive Jamming Attacks on LoRaWAN: a Theoretical and Experimental Study	Amavi Dossa et.al.	2501.18339	null
2025-01-29	Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning?	Pouya Pezeshkpour et.al.	2501.17840	link
2025-01-29	U2A: Unified Unimodal Adaptation for Robust and Efficient Multimodal Learning	Md Kaykobad Reza et.al.	2501.17823	null
2025-01-30	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-27	A Comprehensive Study on Fine-Tuning Large Language Models for Medical Question Answering Using Classification Models and Comparative Analysis	Aysegul Ucar et.al.	2501.17190	null
2025-01-28	Algorithm for Automatic Legislative Text Consolidation	Matias Etcheverry et.al.	2501.16794	null
2025-01-28	One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning	Chunpeng Zhou et.al.	2501.16720	null
2025-01-28	Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models	Huijie Liu et.al.	2501.16714	null
2025-01-27	LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation	Farzad Farhadzadeh et.al.	2501.16559	null
2025-01-27	Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width	Zheng Liu et.al.	2501.16302	null
2025-01-27	FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments	Zhiyuan Fu et.al.	2501.16029	null
2025-01-26	LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs	Peizhuo Lv et.al.	2501.15478	null
2025-01-26	InfoBFR: Real-World Blind Face Restoration via Information Bottleneck	Nan Gao et.al.	2501.15443	null
2025-01-26	Fine Tuning without Catastrophic Forgetting via Selective Low Rank Adaptation	Reza Akbarian Bafghi et.al.	2501.15377	null
2025-01-26	Decentralized Low-Rank Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2501.15361	null
2025-01-25	Exploring Primitive Visual Measurement Understanding and the Role of Output Format in Learning in Vision-Language Models	Ankit Yadav et.al.	2501.15144	null
2025-01-25	DAGPrompT: Pushing the Limits of Graph Prompting with a Distribution-aware Graph Prompt Tuning Approach	Qin Chen et.al.	2501.15142	link
2025-01-25	ABXI: Invariant Interest Adaptation for Task-Guided Cross-Domain Sequential Recommendation	Qingtian Bian et.al.	2501.15118	link
2025-01-25	Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning	Ziyu Zhao et.al.	2501.15103	null
2025-01-24	FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing	James Seale Smith et.al.	2501.14713	null
2025-01-21	ZKLoRA: Efficient Zero-Knowledge Proofs for LoRA Verification	Bidhan Roy et.al.	2501.13965	null
2025-01-23	Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models	Linh Tran et.al.	2501.13904	null
2025-01-23	Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation	Rong Shan et.al.	2501.13344	link
2025-01-23	SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network	Songge Zhang et.al.	2501.13318	null
2025-01-22	S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning	Yichen Wu et.al.	2501.13198	null
2025-01-22	LLM4WM: Adapting LLM for Wireless Multi-Tasking	Xuanyu Liu et.al.	2501.12983	null
2025-01-22	D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network	Ruiqi Wang et.al.	2501.12589	null
2025-01-21	A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data	Minh Tran et.al.	2501.12501	null
2025-01-21	EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition	Hamid Nasiri et.al.	2501.12067	link
2025-01-21	ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation	Peter Devine et.al.	2501.11929	link
2025-01-20	Recurrent Diffusion for Large-Scale Parameter Generation	Kai Wang et.al.	2501.11587	link
2025-01-17	OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning	Jinyuan Feng et.al.	2501.10062	null
2025-01-16	Practical Continual Forgetting for Pre-trained Vision Models	Hongbo Zhao et.al.	2501.09705	link
2025-01-17	SEAL: Entangled White-box Watermarks on Low-Rank Adaptation	Giyeong Oh et.al.	2501.09284	null
2025-01-15	Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models	Zerui Tao et.al.	2501.08727	null
2025-01-15	LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model	Yuxuan Hu et.al.	2501.08582	null
2025-01-14	DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models	Hyeonwoo Kim et.al.	2501.08333	null
2025-01-14	TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yao Liang et.al.	2501.08008	null
2025-01-14	GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism	Chen Tang et.al.	2501.07890	null
2025-01-14	Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques	Shobhit Ratan et.al.	2501.07853	null
2025-01-13	Implementing LoRa MIMO System for Internet of Things	Atonu Ghosh et.al.	2501.07148	null
2025-01-12	Language Fusion for Parameter-Efficient Cross-lingual Transfer	Philipp Borchert et.al.	2501.06892	link
2025-01-12	Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning	Hanwen Zhong et.al.	2501.06884	link
2025-01-12	Better Prompt Compression Without Multi-Layer Perceptrons	Edouardo Honig et.al.	2501.06730	null
2025-01-10	Aggregating Low Rank Adapters in Federated Fine-tuning	Evelyn Trautmann et.al.	2501.06332	null
2025-01-14	$\text{Transformer}^2$: Self-adaptive LLMs	Qi Sun et.al.	2501.06252	link
2025-01-10	How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters	Romina Oji et.al.	2501.06025	link
2025-01-09	LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts	Yuri Facanha Bezerra et.al.	2501.05554	link
2025-01-09	JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis	Jun-Hyeok Cha et.al.	2501.04904	null
2025-01-11	RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation	Jun Liu et.al.	2501.04315	null
2025-01-07	Spectral-Aware Low-Rank Adaptation for Speaker Verification	Zhe Li et.al.	2501.03829	link
2025-01-08	MADation: Face Morphing Attack Detection with Foundation Models	Eduarda Caldeira et.al.	2501.03800	link
2025-01-07	Extending Internet Access Over LoRa for Internet of Things and Critical Applications	Atonu Ghosh et.al.	2501.03465	null
2025-01-06	Rate-My-LoRA: Efficient and Adaptive Federated Model Tuning for Cardiac MRI Segmentation	Xiaoxiao He et.al.	2501.03223	null
2025-01-06	The Scaling Law for LoRA Base on Mutual Information Upper Bound	Jing Zhang et.al.	2501.03152	null
2025-01-06	TransPixar: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection	Guray Ozgur et.al.	2501.02892	link
2025-01-05	LoRaConnect: Unlocking HTTP Potential on LoRa Backbones for Remote Areas and Ad-Hoc Networks	Atonu Ghosh et.al.	2501.02469	null
2025-01-05	Efficient Deployment of Large Language Models on Resource-constrained Devices	Zhiwei Yao et.al.	2501.02438	null
2025-01-07	Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers	Markus J. Buehler et.al.	2501.02393	link
2025-01-04	tCURLoRA: Tensor CUR Decomposition Based Low-Rank Parameter Adaptation for Medical Image Segmentation	Guanghua He et.al.	2501.02227	null
2025-01-03	SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation	Mingjie Li et.al.	2501.01765	null
2025-01-03	MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders	Jiajun Cao et.al.	2501.01709	null
2025-01-03	Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption	Zhang Ruoyan et.al.	2501.01672	null
2025-01-02	Towards Interactive Deepfake Analysis	Lixiong Qin et.al.	2501.01164	link
2025-01-01	Alzheimer's disease detection based on large language model prompt engineering	Tian Zheng et.al.	2501.00861	null
2025-01-01	Beyond Words: AuralLLM and SignMST-C for Precise Sign Language Production and Bidirectional Accessibility	Yulong Li et.al.	2501.00765	null
2024-12-31	Low-Rank Adaptation for Foundation Models: A Comprehensive Review	Menglin Yang et.al.	2501.00365	null
2024-12-30	Adversarial Attack and Defense for LoRa Device Identification and Authentication via Deep Learning	Yalin E. Sagduyu et.al.	2412.21164	null
2024-12-30	Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring	Ehsan Latif et.al.	2412.21065	null
2024-12-30	DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models	Xiaolin Hu et.al.	2412.20891	null
2024-12-30	Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation	Shubh Singhal et.al.	2412.20838	null
2024-12-30	VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control	Shaojin Wu et.al.	2412.20800	link
2025-01-02	EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers	Daiheng Gao et.al.	2412.20413	null
2024-12-28	Multi-Modality Driven LoRA for Adverse Condition Depth Estimation	Guanglei Yang et.al.	2412.20162	null
2024-12-28	VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition	Lan Chen et.al.	2412.20064	link
2024-12-28	Adaptive Parameter-Efficient Federated Fine-Tuning on Heterogeneous Devices	Jun Liu et.al.	2412.20004	null
2024-12-27	Gradient Weight-normalized Low-rank Projection for Efficient LLM Training	Jia-Hong Huang et.al.	2412.19616	link
2024-12-27	Performance Evaluation of IoT LoRa Networks on Mars Through ns-3 Simulations	Manuele Favero et.al.	2412.19549	link
2024-12-27	KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing	Shu Zhao et.al.	2412.19417	link
2024-12-25	Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks	Jiacheng Hu et.al.	2412.18729	null
2024-12-24	Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models	Zihan Zhou et.al.	2412.18419	null
2024-12-18	Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting	Vijay Goyal et.al.	2412.17846	link
2024-12-25	DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder	Ente Lin et.al.	2412.17644	null
2024-12-23	Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing	Prakash Aryan et.al.	2412.17548	link
2024-12-21	Label Privacy in Split Learning for Large Models with Parameter-Efficient Training	Philip Zmushko et.al.	2412.16669	link
2024-12-20	Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline	Guancheng Zeng et.al.	2412.15660	null
2024-12-23	CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training	Xiuli Bi et.al.	2412.15646	link
2024-12-20	AutoRank: MCDA Based Rank Personalization for LoRA-Enabled Distributed Learning	Shuaijun Chen et.al.	2412.15553	null
2024-12-19	Knowledge Injection via Prompt Distillation	Kalle Kujanpää et.al.	2412.14964	null
2024-12-20	All-in-One Tuning and Structural Pruning for Domain-Specific LLMs	Lei Lu et.al.	2412.14426	null
2024-12-18	CoRa: A Collision-Resistant LoRa Symbol Detector of Low Complexity	José Álamos et.al.	2412.13930	null
2024-12-18	A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection	Beiqi Zhang et.al.	2412.13801	link
2024-12-18	Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration	Xuhan Zuo et.al.	2412.13551	null
2024-12-18	Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models	Xinxin Liu et.al.	2412.13488	null
2024-12-18	Transducer Tuning: Efficient Model Adaptation for Software Tasks Using Code Property Graphs	Imam Nur Bani Yusuf et.al.	2412.13467	link
2024-12-17	Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models	Elvis Nunez et.al.	2412.13328	null
2024-12-17	FineGates: LLMs Finetuning with Compression using Stochastic Gates	Jonathan Svirsky et.al.	2412.12951	null
2024-12-17	Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion	Syed Zohaib Hassan et.al.	2412.12710	null
2024-12-17	Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT	Jenny Kunz et.al.	2412.12674	link
2024-12-17	NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning	Xin Yi et.al.	2412.12497	link
2024-12-16	Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering	Jinhe Bi et.al.	2412.12359	link
2024-12-16	Can video generation replace cinematographers? Research on the cinematic language of generated video	Xiaozhe Li et.al.	2412.12223	null
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-16	The Open Source Advantage in Large Language Models (LLMs)	Jiya Manchanda et.al.	2412.12004	null
2024-12-17	No More Adam: Learning Rate Scaling at Initialization is All You Need	Minghao Xu et.al.	2412.11768	link
2024-12-16	IDEA-Bench: How Far are Generative Models from Professional Designing?	Chen Liang et.al.	2412.11767	link
2024-12-16	Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing	Anika Tabassum et.al.	2412.11381	link
2024-12-16	FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation	Dannong Wang et.al.	2412.11378	link
2024-12-15	Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models	Di Wu et.al.	2412.11041	null
2024-12-15	SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation	Hang Zhang et.al.	2412.11026	null
2024-12-14	Efficient Adaptation of Multilingual Models for Japanese ASR	Mark Bajo et.al.	2412.10705	link
2024-12-13	SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation	Runtao Liu et.al.	2412.10493	null
2024-12-13	OP-LoRA: The Blessing of Dimensionality	Piotr Teterwak et.al.	2412.10362	null
2024-12-16	ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers	Junyan Hu et.al.	2412.10135	null
2024-12-13	CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models	Dongyu Yao et.al.	2412.09936	link
2024-12-13	Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models	Changqun Li et.al.	2412.09827	null
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	null
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	null
2024-12-12	Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition	Zhisheng Zhong et.al.	2412.09501	link
2024-12-15	GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning	Abdessalam Ed-dib et.al.	2412.09250	null
2024-12-12	RAD: Region-Aware Diffusion Models for Image Inpainting	Sora Kim et.al.	2412.09191	null
2024-12-12	DECOR: Decomposition and Projection of Text Embeddings for Text-to-Image Customization	Geonhui Jang et.al.	2412.09169	null
2024-12-12	MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning	Lulu Zhao et.al.	2412.08946	null
2024-12-11	DMin: Scalable Training Data Influence Estimation for Diffusion Models	Huawei Lin et.al.	2412.08637	link
2024-12-10	Accretion onto WD 2226$-$210, the central star of the Helix Nebula	S. Estrada-Dorado et.al.	2412.07863	null
2024-12-10	PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition	Kartik Narayan et.al.	2412.07771	null
2024-12-10	LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models	Ziqi Lu et.al.	2412.07746	null
2024-12-10	ChocoLlama: Lessons Learned From Teaching Llamas Dutch	Matthieu Meeus et.al.	2412.07633	null
2024-12-10	MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning	Yufei Ma et.al.	2412.07405	null
2024-12-10	Attention Head Purification: A New Perspective to Harness CLIP for Domain Generalization	Yingfan Wang et.al.	2412.07226	null
2024-12-09	Optimal Routing and Link Configuration for Covert Heterogeneous Wireless Networks	Amna Gillani et.al.	2412.07059	null
2024-12-09	Sequential Compression Layers for Efficient Federated Learning in Foundational Models	Navyansh Mahla et.al.	2412.07021	null
2024-12-09	BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation	Qiushi Wang et.al.	2412.06441	null
2024-12-10	S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity	Xinyu Yang et.al.	2412.06289	null
2024-12-08	Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling	Kaleel Mahmood et.al.	2412.06106	null
2024-12-08	KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models	Fan Wang et.al.	2412.06071	link
2024-12-07	Training-Free Bayesianization for Low-Rank Adapters of Large Language Models	Haizhou Shi et.al.	2412.05723	link
2024-12-07	Plasmonic Electro-Optic Modulators based on Epsilon-Near-Zero Materials: Comparing the Classical Drift-Diffusion and Schrödinger-Poisson Coupling Models	Masoud Shabaninezhad et.al.	2412.05690	null
2024-12-06	QueEn: A Large Language Model for Quechua-English Translation	Junhao Chen et.al.	2412.05184	null
2024-12-06	LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation	Donald Shenaj et.al.	2412.05148	link
2024-12-05	Performance Evaluation of LoRa Technology for Rural Connectivity: An Experimental Analysis in Nepal	Atit Pokharel et.al.	2412.04563	null
2024-12-04	Prompting Large Language Models for Clinical Temporal Relation Extraction	Jianping He et.al.	2412.04512	null
2024-12-05	UnZipLoRA: Separating Content and Style from a Single Image	Chang Liu et.al.	2412.04465	null
2024-12-08	Discriminative Fine-tuning of LVLMs	Yassine Ouali et.al.	2412.04378	null
2024-12-05	Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts	Chenyang Zhu et.al.	2412.04220	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	link
2024-12-04	Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis	Davide Bucciarelli et.al.	2412.03665	null
2024-12-04	Imagine360: Immersive 360 Video Generation from Perspective Anchor	Jing Tan et.al.	2412.03552	null
2024-12-04	DIVE: Taming DINO for Subject-Driven Video Editing	Yi Huang et.al.	2412.03347	null
2024-12-04	Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach	Lingchen Sun et.al.	2412.03017	link
2024-12-03	EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras	Dmitrii Torbunov et.al.	2412.02890	link
2024-12-03	Explainable CTR Prediction via LLM Reasoning	Xiaohan Yu et.al.	2412.02588	null
2024-12-03	LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization	Ethan Smith et.al.	2412.02352	null
2024-12-03	SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models	Sabina Martyniak et.al.	2412.02332	link
2024-12-03	Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs	Zixuan Hu et.al.	2412.02220	null
2024-12-02	Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping	Marla Grunewald et.al.	2412.01609	null
2024-12-02	CellSeg1: Robust Cell Segmentation with One Training Image	Peilin Zhou et.al.	2412.01410	link
2024-12-02	Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking	Marco Federici et.al.	2412.01380	null
2024-12-02	MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost	Sen Xing et.al.	2412.01271	null
2024-12-02	RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy	Geonho Lee et.al.	2412.01129	link
2024-12-03	Adaptive Rank, Reduced Forgetting: Knowledge Retention in Continual Learning Vision-Language Models with Dynamic Rank-Selective LoRA	Haodong Lu et.al.	2412.01004	null
2024-11-29	SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks	Kim-Celine Kahl et.al.	2411.19688	link
2024-11-29	Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning	Kaustubh Ponkshe et.al.	2411.19557	link
2024-11-28	PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning	Shenghui Li et.al.	2411.19335	null
2024-11-28	Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation	Son Thai Ly et.al.	2411.19297	link
2024-11-28	LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair	Xue Song et.al.	2411.19156	null
2024-11-28	DESIRE: Dynamic Knowledge Consolidation for Rehearsal-Free Continual Learning	Haiyang Guo et.al.	2411.19154	null
2024-11-28	Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures	Yicheng Zhang et.al.	2411.19128	link
2024-11-27	Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning	Omkar Khade et.al.	2411.18571	null
2024-11-27	Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models	Minhyeok Lee et.al.	2411.18530	link
2024-11-27	Adaptive Blind All-in-One Image Restoration	David Serrano-Lozano et.al.	2411.18412	link
2024-11-27	Thai Financial Domain Adaptation of THaLLE -- Technical Report	KBTG Labs et.al.	2411.18242	null
2024-11-27	ROICtrl: Boosting Instance Control for Visual Generation	Yuchao Gu et.al.	2411.17949	null
2024-11-26	Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading	Suyeol Yun et.al.	2411.17900	link
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	null
2024-11-26	PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning	Zhen Sun et.al.	2411.17453	null
2024-11-26	CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy	Fanxu Meng et.al.	2411.17426	link
2024-11-26	Efficient Deployment of Transformer Models in Analog In-Memory Computing Hardware	Chen Li et.al.	2411.17367	link
2024-11-26	ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System	Shuiqiao Yang et.al.	2411.17058	null
2024-11-26	PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation	Hengjia Li et.al.	2411.17048	null
2024-11-25	RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks	Nazia Tasnim et.al.	2411.16870	link
2024-11-25	Parameter Efficient Instruction Tuning: An Empirical Study	Pengfei He et.al.	2411.16775	link
2024-11-23	LoBAM: LoRA-Based Backdoor Attack on Model Merging	Ming Yin et.al.	2411.16746	null
2024-11-24	Modality Alignment Meets Federated Broadcasting	Yuting Ma et.al.	2411.15837	null
2024-11-24	LoRA-Mini: Adaptation Matrices Decomposition and Selective Training	Ayush Singh et.al.	2411.15804	null
2024-11-23	Reassessing Layer Pruning in LLMs: New Insights and Methods	Yao Lu et.al.	2411.15558	link
2024-11-23	Gradient dynamics for low-rank fine-tuning beyond kernels	Arif Kerem Dayi et.al.	2411.15385	null
2024-11-22	On the Impact of Fine-Tuning on Chain-of-Thought Reasoning	Elita Lobo et.al.	2411.15382	null
2024-11-22	ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation	Junzhang Liu et.al.	2411.15281	null
2024-11-21	IterIS: Iterative Inference-Solving Alignment for LoRA Merging	Hongxu Chen et.al.	2411.15231	null
2024-11-22	Exploring Foundation Models Fine-Tuning for Cytology Classification	Manon Dausort et.al.	2411.14975	link
2024-11-22	LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement	Jieming Bian et.al.	2411.14961	null
2024-11-21	Interpreting seasonal and interannual Hadley cell descending edge migrations via the cell-mean Rossby number	Spencer A Hill et.al.	2411.14544	null
2024-11-21	Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model	Ege Kesim et.al.	2411.14064	null
2024-11-21	Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning	Ziqi Wang et.al.	2411.13949	null
2024-11-21	Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel KAN Adapter for Enhanced Feature Adaptation	Gayatri Deshmukh et.al.	2411.13901	null
2024-11-21	AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning	Changhai Zhou et.al.	2411.13814	null
2024-11-20	Unleashing the Power of Large Language Models for Group POI Recommendations	Jing Long et.al.	2411.13415	null
2024-11-20	On the Way to LLM Personalization: Learning to Remember User Conversations	Lucie Charlotte Magister et.al.	2411.13405	null
2024-11-19	Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning	Pengkun Jiao et.al.	2411.12787	null
2024-11-16	LoRA Unlearns More and Retains More (Student Abstract)	Atharv Mittal et.al.	2411.11907	link
2024-11-18	SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions	Shuo Zhang et.al.	2411.11530	null
2024-11-16	Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts	Jinqiang Long et.al.	2411.10669	link
2024-11-15	AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment	Yonggan Fu et.al.	2411.10606	link
2024-11-15	Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning	Yushen Zuo et.al.	2411.10130	null
2024-11-15	LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning	Yahe Yang et.al.	2411.09947	null
2024-11-12	Structured Pattern Expansion with Diffusion Models	Marzia Riso et.al.	2411.08930	null
2024-11-13	Dynamic Subset Tuning: Expanding the Operational Range of Parameter-Efficient Training for Large Language Models	Felix Stahlberg et.al.	2411.08610	null
2024-11-13	Machine Unlearning on Pre-trained Models by Residual Feature Alignment Using LoRA	Laiqiao Qin et.al.	2411.08443	null
2024-11-11	LoRA-BERT: a Natural Language Processing Model for Robust and Accurate Prediction of long non-coding RNAs	Nicholas Jeon et.al.	2411.08073	null
2024-11-12	FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training	Philip Zmushko et.al.	2411.07837	link
2024-11-12	Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices	Kilian Pfeiffer et.al.	2411.07826	null
2024-11-12	Federated Low-Rank Adaptation with Differential Privacy over Wireless Networks	Tianqu Kang et.al.	2411.07806	null
2024-11-12	ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization	Weibo Zhao et.al.	2411.07762	null
2024-11-11	DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning	Zecheng Zhang et.al.	2411.07239	null
2024-11-11	Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation	Ziwei Liu et.al.	2411.07021	null
2024-11-11	MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps	Xue Xia et.al.	2411.06971	link
2024-11-11	LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models	Runming Yang et.al.	2411.06839	null
2024-11-10	Federated LLMs Fine-tuned with Adaptive Importance-Aware LoRA	Yang Su et.al.	2411.06581	null
2024-11-10	Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques	Daniil Sulimov et.al.	2411.06445	null
2024-11-08	Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation	Aayush Shah et.al.	2411.05966	null
2024-11-08	Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation	Xiwen Wei et.al.	2411.05663	link
2024-11-08	SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-07	DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion	Wenqiang Sun et.al.	2411.04928	null
2024-11-07	StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration	Panwen Hu et.al.	2411.04925	null
2024-11-07	LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG	Laifa Tao et.al.	2411.04476	null
2024-11-09	Variational Low-Rank Adaptation Using IVON	Bai Cong et.al.	2411.04421	link
2024-11-08	Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation	Ayan Sengupta et.al.	2411.04358	link
2024-11-06	PyroGuardian: An IoT-Enabled System for Health and Location Monitoring in High-Risk Firefighting Environments	Berkay Kaplan et.al.	2411.03654	null
2024-11-05	LLM-based Framework for Bearing Fault Diagnosis	Laifa Tao et.al.	2411.02718	null
2024-11-04	TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network	Nouf Alabbasi et.al.	2411.02617	link
2024-11-04	Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study	André Storhaug et.al.	2411.02462	null
2024-11-04	Expanding Sparse Tuning for Low Memory Usage	Shufan Shen et.al.	2411.01800	link
2024-11-02	PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment	Dongxu Liu et.al.	2411.01245	null
2024-11-02	One Arrow, Many Targets: Probing LLMs for Multi-Attribute Controllable Text Summarization	Tathagato Roy et.al.	2411.01213	null
2024-11-02	Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models	Wonguk Cho et.al.	2411.01179	null
2024-11-02	LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding	Jian Chen et.al.	2411.01106	null
2024-11-01	V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM	Liang Mi et.al.	2411.00915	null
2024-11-01	Dual Low-Rank Adaptation for Continual Learning with Pre-Trained Models	Huancheng Chen et.al.	2411.00623	null
2024-10-31	DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion	Weicai Ye et.al.	2410.24203	link
2024-11-05	In-Context LoRA for Diffusion Transformers	Lianghua Huang et.al.	2410.23775	link
2024-10-30	Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation	Stefan Stojanovic et.al.	2410.23434	null
2024-10-31	SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation	Yining Hong et.al.	2410.23277	null
2024-10-31	Why Gradient Subspace? Identifying and Mitigating LoRA's Bottlenecks in Federated Fine-Tuning of Large Language Models	Navyansh Mahla et.al.	2410.23111	null
2024-10-30	Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation	Wei Dong et.al.	2410.22952	null
2024-10-30	CopRA: A Progressive LoRA Training Strategy	Zhan Zhuang et.al.	2410.22911	null
2024-10-30	Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients	Jabin Koo et.al.	2410.22815	null
2024-10-30	MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning	Xujia Wang et.al.	2410.22782	null
2024-10-29	Meta-Learning Adaptable Foundation Models	Jacob L. Block et.al.	2410.22264	null
2024-10-30	IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models	Hang Guo et.al.	2410.21759	link
2024-10-28	LoRA vs Full Fine-tuning: An Illusion of Equivalence	Reece Shuttleworth et.al.	2410.21228	null
2024-10-28	Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices	Hiroki Matsutani et.al.	2410.21073	null
2024-10-28	KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation	Rambod Azimi et.al.	2410.20777	link
2024-10-28	Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA	Sangmin Bae et.al.	2410.20672	null
2024-10-28	PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation	Leyao Wang et.al.	2410.20667	null
2024-10-28	Collaborative Knowledge Fusion: A Novel Approach for Multi-task Recommender Systems via LLMs	Chuang Zhao et.al.	2410.20642	null
2024-10-27	LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization	Jui-Nan Yen et.al.	2410.20625	null
2024-10-27	FoldMark: Protecting Protein Generative Models with Watermarking	Zaixi Zhang et.al.	2410.20354	link
2024-10-26	An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation	Dongdong Lin et.al.	2410.20202	null
2024-10-25	Model merging with SVD to tie the Knots	George Stoica et.al.	2410.19735	link
2024-10-25	Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs	Yifei Zhang et.al.	2410.19694	null
2024-10-25	GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing	Hosam Elgendy et.al.	2410.19552	link
2024-10-24	Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts	Danyal Aftab et.al.	2410.19185	null
2024-10-24	On the Crucial Role of Initialization for Matrix Factorization	Bingcong Li et.al.	2410.18965	null
2024-10-24	PSY: Posterior Sampling Based Privacy Enhancer in Large Language Models	Yulian Sun et.al.	2410.18824	null
2024-10-24	GeoLoRA: Geometric integration for parameter efficient fine-tuning	Steffen Schotthöfer et.al.	2410.18720	null
2024-10-24	Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model	Ali Hamza et.al.	2410.18678	null
2024-10-23	CLEAR: Character Unlearning in Textual and Visual Modalities	Alexey Dontsov et.al.	2410.18057	null
2024-10-23	MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning	Jingfan Zhang et.al.	2410.18035	null
2024-10-23	Closed-form merging of parameter-efficient modules for Federated Continual Learning	Riccardo Salami et.al.	2410.17961	null
2024-10-23	AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning	Yehonathan Refael et.al.	2410.17881	null
2024-10-23	Understanding Layer Significance in LLM Alignment	Guangyuan Shi et.al.	2410.17875	null
2024-10-23	VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning	Yifan Peng et.al.	2410.17485	link
2024-10-22	FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation	Rohan Sukumaran et.al.	2410.17358	null
2024-10-22	Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups	Charvi Rastogi et.al.	2410.17032	null
2024-10-23	GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks	Shuyang Hou et.al.	2410.17031	null
2024-10-22	LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices	Chuntao Ding et.al.	2410.16954	link
2024-10-22	Can Large Language Models Act as Ensembler for Multi-GNNs?	Hanqi Duan et.al.	2410.16822	null
2024-10-22	Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models	Yuheng Lu et.al.	2410.16801	null
2024-10-22	MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report	Samrajya Thapa et.al.	2410.16239	link
2024-10-21	Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs	Kang Zhao et.al.	2410.16135	null
2024-10-21	Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning	Arijit Das et.al.	2410.16029	link
2024-10-21	How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?	Zuojin Tang et.al.	2410.15885	null
2024-10-21	The effect of fine-tuning on language model toxicity	Will Hawkins et.al.	2410.15821	link
2024-10-21	Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization	Hanseon Joo et.al.	2410.15794	link
2024-10-21	Students Rather Than Experts: A New AI For Education Pipeline To Model More Human-Like And Personalised Early Adolescences	Yiping Ma et.al.	2410.15701	null
2024-10-20	MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models	Ahmed Elbakary et.al.	2410.15524	null
2024-10-20	EVA: An Embodied World Model for Future Video Anticipation	Xiaowei Chi et.al.	2410.15461	null
2024-10-20	LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration	Yuang Ai et.al.	2410.15385	link
2024-10-18	Fine-Tuning DeepONets to Enhance Physics-informed Neural Networks for solving Partial Differential Equations	Sidi Wu et.al.	2410.14134	null
2024-10-17	FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model	ZiDong Wang et.al.	2410.13925	link
2024-10-17	Improving Multi-modal Large Language Model through Boosting Vision Capabilities	Yanpeng Sun et.al.	2410.13733	null
2024-10-17	LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning	Yiming Shi et.al.	2410.13618	link
2024-10-18	MoR: Mixture of Ranks for Low-Rank Adaptation Tuning	Chuanyu Tang et.al.	2410.13408	null
2024-10-17	FAMSeC: A Few-shot-sample-based General AI-generated Image Detection Method	Juncong Xu et.al.	2410.13156	null
2024-10-16	LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks	Akshara Prabhakar et.al.	2410.13025	link
2024-10-16	DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving Federated Low-rank Adaptation	Meilu Zhu et.al.	2410.12926	link
2024-10-15	In-context KV-Cache Eviction for LLMs via Attention-Gate	Zihao Zeng et.al.	2410.12876	null
2024-10-16	FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction	Akriti Jain et.al.	2410.12513	null
2024-10-15	LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models	Hossein Abdi et.al.	2410.11551	null
2024-10-15	Transfer Learning with Foundational Models for Time Series Forecasting using Low-Rank Adaptations	M. Germán-Morales et.al.	2410.11539	null
2024-10-15	Energy Efficient Transmission Parameters Selection Method Using Reinforcement Learning in Distributed LoRa Networks	Ryotai Airiyoshi et.al.	2410.11270	null
2024-10-14	Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning	Bokai Hu et.al.	2410.11020	null
2024-10-14	LoLCATs: On Low-Rank Linearizing of Large Language Models	Michael Zhang et.al.	2410.10254	link
2024-10-14	Fed-piLot: Optimizing LoRA Assignment for Efficient Federated Foundation Model Fine-Tuning	Zikai Zhang et.al.	2410.10200	null
2024-10-14	Scalable Multi-Domain Adaptation of Language Models using Modular Experts	Peter Schafhalter et.al.	2410.10181	null
2024-10-14	Is Parameter Collision Hindering Continual Learning in LLMs?	Shuo Yang et.al.	2410.10179	link
2024-10-14	AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality	Peijun Qing et.al.	2410.10054	link
2024-10-13	Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning	Pengfei Jin et.al.	2410.09908	null
2024-10-13	A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning	Chen-Yu Liu et.al.	2410.09846	null
2024-10-13	Understanding Robustness of Parameter-Efficient Tuning for Image Classification	Jiacheng Ruan et.al.	2410.09845	link
2024-10-13	BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation	Peijia Qin et.al.	2410.09758	null
2024-10-13	AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model	Yuchen Li et.al.	2410.09714	null
2024-10-11	Parameter-Efficient Fine-Tuning of State Space Models	Kevin Galim et.al.	2410.09016	link
2024-10-10	Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation	Grigory Malinovsky et.al.	2410.08305	null
2024-10-10	SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture	Jiayi Han et.al.	2410.07739	null
2024-10-10	MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion	Onkar Susladkar et.al.	2410.07659	link
2024-10-09	SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers	Viktoriia Chekalina et.al.	2410.07383	link
2024-10-09	One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation	Fabian Paischer et.al.	2410.07170	link
2024-10-09	Industrial complexity and the evolution of formal employment in developing cities	Neave O'Clery et.al.	2410.06971	null
2024-10-11	Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization	Changli Tang et.al.	2410.06682	null
2024-10-08	Systematic 2.5D resistive MHD simulations with ambipolar diffusion and Hall effect for fast magnetic reconnection	Gabriela Landinez et.al.	2410.06391	null
2024-10-08	HyperDet: Generalizable Detection of Synthesized Images by Generating and Merging A Mixture of Hyper LoRAs	Huangsen Cao et.al.	2410.06044	null
2024-10-08	QERA: an Analytical Framework for Quantization Error Reconstruction	Cheng Zhang et.al.	2410.06040	null
2024-10-08	Hyper Adversarial Tuning for Boosting Adversarial Robustness of Pretrained Large Vision Models	Kangtao Lv et.al.	2410.05951	null
2024-10-07	GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting	Yukang Cao et.al.	2410.05259	null
2024-10-08	PAMLR: A Passive-Active Multi-Armed Bandit-Based Solution for LoRa Channel Allocation	Jihoon Yun et.al.	2410.05147	null
2024-10-07	HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation	Xinyu Zhou et.al.	2410.05090	link
2024-10-07	Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation	Vince Zhu et.al.	2410.04689	null
2024-10-06	Learning De-Biased Representations for Remote-Sensing Imagery	Zichen Tian et.al.	2410.04546	link
2024-10-05	Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models	Theo et.al.	2410.04207	null
2024-10-05	LoRTA: Low Rank Tensor Adaptation of Large Language Models	Ignacio Hounie et.al.	2410.04060	null
2024-10-05	Hyperbolic Fine-tuning for Large Language Models	Menglin Yang et.al.	2410.04010	link
2024-10-04	AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models	Artur Kasymov et.al.	2410.03941	link
2024-10-04	Collaborative and Efficient Personalization with Mixtures of Adaptors	Abdulla Jasem Almansoori et.al.	2410.03497	null
2024-10-03	Neutral residues: revisiting adapters for model extension	Franck Signe Talla et.al.	2410.02744	null
2024-10-03	Encryption-Friendly LLM Architecture	Donghwan Rho et.al.	2410.02486	null
2024-10-02	NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models	Yibo Zhong et.al.	2410.01870	null
2024-10-02	Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?	Xi Chen et.al.	2410.01623	link
2024-10-02	DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models	Yuxuan Zhang et.al.	2410.01497	link
2024-10-04	Selective Aggregation for Low-Rank Adaptation in Federated Learning	Pengxin Guo et.al.	2410.01463	link
2024-10-02	FlashMask: Efficient and Rich Mask Extension of FlashAttention	Guoxia Wang et.al.	2410.01359	link
2024-10-01	MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards	Sheng Wang et.al.	2410.00938	null
2024-10-02	Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models	Saurav Jha et.al.	2410.00700	null
2024-10-01	PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models	Yang Li et.al.	2410.00433	null
2024-09-30	Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models	Ji Liu et.al.	2410.00131	null
2024-09-30	UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation	Cheng Zhang et.al.	2409.20197	link
2024-09-30	BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain	Kaisi Guan et.al.	2409.20075	null
2024-09-30	HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models	Bingshen Mu et.al.	2409.19878	null
2024-09-29	Learning Attentional Mixture of LoRAs for Language Model Continual Learning	Jialin Liu et.al.	2409.19611	null
2024-09-29	Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers	Prakash Dhakal et.al.	2409.19566	null
2024-09-27	HM3: Heterogeneous Multi-Class Model Merging	Stefan Hackmann et.al.	2409.19173	null
2024-09-26	MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications	Jothi Prasanna Shanmuga Sundaram et.al.	2409.18043	null
2024-09-26	PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification	Tianfang Xie et.al.	2409.17834	null
2024-09-30	Efficient In-Domain Question Answering for Resource-Constrained Environments	Isaac Chung et.al.	2409.17648	null
2024-09-26	On the Implicit Relation Between Low-Rank Adaptation and Differential Privacy	Saber Malekmohammadi et.al.	2409.17538	null
2024-09-26	A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction	Guangyu Wang et.al.	2409.17440	link
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-09-25	Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors	Aiping Zhang et.al.	2409.17058	link
2024-09-25	PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning	Qibin Wang et.al.	2409.16722	null
2024-09-25	GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning	Zhe-Rui Yang et.al.	2409.16670	link
2024-09-25	Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models	Deepak Sridhar et.al.	2409.16535	link
2024-09-24	Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering	Ziyu Zhao et.al.	2409.16167	null
2024-09-24	Evaluation of state-of-the-art ASR Models in Child-Adult Interactions	Aditya Ashvin et.al.	2409.16135	null
2024-09-24	Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs	Yang Yuhang et.al.	2409.16005	null
2024-09-24	Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM	Fengrun Zhang et.al.	2409.15905	null
2024-09-24	Aided design of bridge aesthetics based on Stable Diffusion fine-tuning	Leye Zhang et.al.	2409.15812	link
2024-09-17	Chain-of-Thought Prompting for Speech Translation	Ke Hu et.al.	2409.11538	null
2024-09-17	Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models	Divij Gupta et.al.	2409.11302	null
2024-09-17	LoRa Communication for Agriculture 4.0: Opportunities, Challenges, and Future Directions	Lameya Aldhaheri et.al.	2409.11200	null
2024-09-17	Few-Shot Domain Adaptation for Learned Image Compression	Tianyu Zhang et.al.	2409.11111	null
2024-09-17	KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models	Bo Lv et.al.	2409.11057	null
2024-09-18	Propulsion: Steering LLM with Tiny Fine-Tuning	Md Kowsher et.al.	2409.10927	link
2024-09-16	A Bayesian Interpretation of Adaptive Low-Rank Adaptation	Haolin Chen et.al.	2409.10673	link
2024-09-16	From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs	Navya Jain et.al.	2409.10245	null
2024-09-16	Robust Bird's Eye View Segmentation by Adapting DINOv2	Merve Rabia Barın et.al.	2409.10228	null
2024-09-19	jina-embeddings-v3: Multilingual Embeddings With Task LoRA	Saba Sturua et.al.	2409.10173	null
2024-09-16	Rapid Adaptation of Earth Observation Foundation Models for Segmentation	Karthick Panner Selvam et.al.	2409.09907	null
2024-09-15	AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs	Madhusudan Ghosh et.al.	2409.09704	link
2024-09-14	COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare	Chia-Hao Li et.al.	2409.09549	null
2024-09-14	SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2	Xinrun Chen et.al.	2409.09286	link
2024-09-13	Data Efficient Child-Adult Speaker Diarization with Simulated Conversations	Anfeng Xu et.al.	2409.08881	link
2024-09-13	Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions	Lingwei Meng et.al.	2409.08596	link
2024-09-13	ATFLRec: A Multimodal Recommender System with Audio-Text Fusion and Low-Rank Adaptation via Instruction-Tuned Large Language Model	Zezheng Qin et.al.	2409.08543	null
2024-09-13	Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights	Dixi Yao et.al.	2409.08482	null
2024-09-13	Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design	Lingyao Li et.al.	2409.08459	null
2024-09-12	AudioBERT: Audio Knowledge Augmented Language Model	Hyunjong Ok et.al.	2409.08199	link
2024-09-12	Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy	Bojian Li et.al.	2409.07723	null
2024-09-11	Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region	Muhammad Akhtar Munir et.al.	2409.07585	link
2024-09-11	Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models	Xinhu Zheng et.al.	2409.07016	null
2024-09-10	SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation	Teng Hu et.al.	2409.06633	null
2024-09-09	Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models	Rohit Jena et.al.	2409.06493	null
2024-09-10	HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data	Hossein Hajipour et.al.	2409.06446	link
2024-09-10	VE: Modeling Multivariate Time Series Correlation with Variate Embedding	Shangjiong Wang et.al.	2409.06169	link
2024-09-09	FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations	Ziyao Wang et.al.	2409.05976	link
2024-09-09	SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values	Chengwei Sun et.al.	2409.05926	null
2024-09-09	TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency	Ahmed Imteaj et.al.	2409.05347	null
2024-09-08	Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation	Zhe Cao et.al.	2409.05224	link
2024-09-06	Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning	Xinyue Liu et.al.	2409.04574	null
2024-09-06	Fast Forwarding Low-Rank Training	Adir Rahamim et.al.	2409.04206	null
2024-09-05	Continual Skill and Task Learning via Dialogue	Weiwei Gu et.al.	2409.03166	null
2024-09-04	Non-Orthogonal Multiple-Access Strategies for Direct-to-Satellite IoT Networks	Felipe Augusto Tondo et.al.	2409.02748	null
2024-09-04	Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA	Shuangyi Chen et.al.	2409.02346	null
2024-08-31	CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models	Xiaojun Xiao et.al.	2409.02119	null
2024-09-02	LoGex: Improved tail detection of extremely rare histopathology classes via guided diffusion	Maximilian Mueller et.al.	2409.01317	link
2024-09-02	Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning	Chongjie Si et.al.	2409.01035	link
2024-09-02	Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language	Jeong Hun Yeo et.al.	2409.00986	link
2024-08-30	Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs	Mazal Bethany et.al.	2409.00209	null
2024-08-30	DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model	Mona Sheikh Zeinoddin et.al.	2408.17433	link
2024-08-30	MoRe Fine-Tuning with 10x Fewer Parameters	Wenxuan Tan et.al.	2408.17383	link
2024-08-30	Wireless Integrated Authenticated Communication System (WIA-Comm)	Amith N Bharadwaj et.al.	2408.17112	null
2024-09-02	Instant Adversarial Purification with Adversarial Consistency Distillation	Chun Tong Lei et.al.	2408.17064	null
2024-08-30	Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL	Haiyang Zhao et.al.	2408.17060	null
2024-08-29	LoraMap: Harnessing the Power of LoRA Connections	Hyeryun Park et.al.	2408.16264	null
2024-08-28	LeMON: Learning to Learn Multi-Operator Networks	Jingmin Sun et.al.	2408.16168	link
2024-08-28	Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models	Yuncheng Yang et.al.	2408.15915	link
2024-08-28	StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements	Jillian Fisher et.al.	2408.15666	link
2024-08-28	TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation	Junbao Zhou et.al.	2408.15657	link
2024-08-28	Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models	Yiyang Zhao et.al.	2408.15585	null
2024-08-28	VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech	Heeseung Kim et.al.	2408.14739	null
2024-08-27	PAT: Pruning-Aware Tuning for Large Language Models	Yijiang Liu et.al.	2408.14721	link
2024-08-27	StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-Speech	Haowei Lou et.al.	2408.14713	link
2024-08-26	CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation	Muhammad Fawi et.al.	2408.14572	link
2024-08-27	Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models	Aradhye Agarwal et.al.	2408.14470	link
2024-08-26	Reprogramming Foundational Large Language Models (LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning	Sakhinana Sagar Srinivas et.al.	2408.14387	null
2024-08-27	SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher	Trung Dao et.al.	2408.14176	link
2024-08-25	TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation	Jack Saunders et.al.	2408.13714	null
2024-08-24	Can Visual Foundation Models Achieve Long-term Point Tracking?	Görkay Aydemir et.al.	2408.13575	null
2024-08-23	The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities	Venkatesh Balavadhani Parthasarathy et.al.	2408.13296	null
2024-08-23	CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition	Yafeng Zhang et.al.	2408.12834	null
2024-08-23	Investigating LLM Applications in E-Commerce	Chester Palen-Michel et.al.	2408.12779	null
2024-08-22	EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts	Nicy Scaria et.al.	2408.12226	link
2024-08-21	Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Omar Erak et.al.	2408.11775	link
2024-08-21	EAGLE: Elevating Geometric Reasoning through LLM-empowered Visual Instruction Tuning	Zhihao Li et.al.	2408.11397	null
2024-08-20	EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech	Xin Qi et.al.	2408.10852	null
2024-08-21	Flexora: Flexible Low Rank Adaptation for Large Language Models	Chenxing Wei et.al.	2408.10774	null
2024-08-20	Large Language Models for Multimodal Deformable Image Registration	Mingrui Ma et.al.	2408.10703	link
2024-08-20	Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper	Tianyi Xu et.al.	2408.10680	null
2024-08-20	CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation	Yuting Liu et.al.	2408.10645	null
2024-08-18	NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models	Cheng Lin et.al.	2408.10280	null
2024-08-19	SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models	Anke Tang et.al.	2408.10174	link
2024-08-19	Customizing Language Models with Instance-wise LoRA for Sequential Recommendation	Xiaoyu Kong et.al.	2408.10159	link
2024-08-19	TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition	Tianwei Lin et.al.	2408.09856	link
2024-08-18	Infinite Scrolling, Finite Satisfaction: Exploring User Behavior and Satisfaction on Social Media in Bangladesh	Sanzana Karim Lora et.al.	2408.09601	null
2024-08-17	ConVerSum: A Contrastive Learning based Approach for Data-Scarce Solution of Cross-Lingual Summarization Beyond Direct Equivalents	Sanzana Karim Lora et.al.	2408.09273	null
2024-08-17	An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation	Junjie Li et.al.	2408.09078	link
2024-08-17	MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality	Zhiyi Shi et.al.	2408.09064	null
2024-08-16	AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation	Yihe Dong et.al.	2408.09015	link
2024-08-16	ML Study of Malicious Transactions in Ethereum	Natan Katz et.al.	2408.08749	null
2024-08-16	RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS	Shuaijun Chen et.al.	2408.08699	null
2024-08-16	LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression	Yuqi Ye et.al.	2408.08682	null
2024-08-16	Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning	Alessio Devoto et.al.	2408.08670	null
2024-08-16	A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth	Yujia Gu et.al.	2408.08561	null
2024-08-15	Heavy Labels Out! Dataset Distillation with Label Space Lightening	Ruonan Yu et.al.	2408.08201	null
2024-08-15	When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding	Pingping Zhang et.al.	2408.08093	null
2024-08-14	Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification	Yongcheng Li et.al.	2408.07467	link
2024-08-13	SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis	Yuchen Mao et.al.	2408.07196	null
2024-08-13	Imagen 3	Imagen-Team-Google et.al.	2408.07009	null
2024-08-13	New refinements of Narayana polynomials and Motzkin polynomials	Janet J. W. Dong et.al.	2408.06912	null
2024-08-13	LoRA$^2$: Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models	Jia-Chen Zhang et.al.	2408.06854	null
2024-08-13	DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion	Yujia Wu et.al.	2408.06740	null
2024-08-13	Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model	Yongcheng Li et.al.	2408.06716	link
2024-08-13	Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach	Haowei Ni et.al.	2408.06634	null
2024-08-13	Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models	Sungmin Cha et.al.	2408.06621	link
2024-08-15	ControlNeXt: Powerful and Efficient Control for Image and Video Generation	Bohao Peng et.al.	2408.06070	link
2024-08-11	Hotfixing Large Language Models for Code	Zhou Yang et.al.	2408.05727	null
2024-08-09	TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning	Yujie Feng et.al.	2408.05200	link
2024-08-09	LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description	Yizhang Jin et.al.	2408.04957	link
2024-08-09	Energy performance of LR-FHSS: analysis and evaluation	Roger Sanchez-Vital et.al.	2408.04908	null
2024-08-08	Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models	Yupeng Chang et.al.	2408.04556	link
2024-08-08	UNLEARN Efficient Removal of Knowledge in Large Language Models	Tyler Lizzo et.al.	2408.04140	null
2024-08-07	Image-to-LaTeX Converter for Mathematical Formulas and Text	Daniil Gurgurov et.al.	2408.04015	link
2024-08-07	Speaker Adaptation for Quantised End-to-End ASR Models	Qiuming Zhao et.al.	2408.03979	null
2024-08-07	A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case	Sonia Meyer et.al.	2408.03562	null
2024-08-11	Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation	Jiachen Zhu et.al.	2408.03533	null
2024-08-06	FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning	Zhi Chen et.al.	2408.03355	null
2024-08-06	SARA: Singular-Value Based Adaptive Low-Rank Adaption	Jihao Gu et.al.	2408.03290	null
2024-08-06	Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi	Pranita Deshmukh et.al.	2408.03172	null
2024-08-06	L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization	Elvys Linhares Pontes et.al.	2408.03033	null
2024-08-06	Towards Smart Microfarming in an Urban Computing Continuum	Marla Grunewald et.al.	2408.02992	null
2024-08-05	StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion	Zhichao Wang et.al.	2408.02178	null
2024-08-04	SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning	Biqing Qi et.al.	2408.01970	null
2024-08-03	Music2P: A Multi-Modal AI-Driven Tool for Simplifying Album Cover Design	Joong Ho Choi et.al.	2408.01651	link
2024-08-02	MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts	Lin Ning et.al.	2408.01505	null
2024-08-02	Conditional LoRA Parameter Generation	Xiaolong Jin et.al.	2408.01415	null
2024-08-02	Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer	Yu Yang et.al.	2408.01402	null
2024-08-02	Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration	Donwon Park et.al.	2408.01099	null
2024-08-02	Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs	Afia Anjum et.al.	2408.01008	null
2024-08-02	PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting	Liam Hebert et.al.	2408.00960	null
2024-08-01	Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization	Róisín Luo et.al.	2408.00923	null
2024-07-31	Ge-based Clinopyroxene series: first principles and experimental local probe study	Ricardo P. Moreira et.al.	2407.21749	null
2024-07-31	A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation	Mothilal Asokan et.al.	2407.21739	null
2024-07-31	Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation	Xiang Luo et.al.	2407.21633	link
2024-07-30	CELLM: An Efficient Communication in Large Language Models Training for Federated Learning	Raja Vavekanand et.al.	2407.20557	null
2024-07-29	Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations	Fangyijie Wang et.al.	2407.20072	link
2024-07-28	Memory-efficient Training of LLMs with Larger Mini-batches	Dang Nguyen et.al.	2407.19580	null
2024-07-27	Parameter-Efficient Fine-Tuning via Circular Convolution	Aochuan Chen et.al.	2407.19342	null
2024-07-27	The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations	Thanh-Dung Le et.al.	2407.19299	null
2024-07-26	VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain Diffusion Trained on Uniplex Stains	Shikha Dubey et.al.	2407.19113	null
2024-07-25	Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications	Alon Halfon et.al.	2407.18990	null
2024-07-25	LoRA-Pro: Are Low-Rank Adapters Properly Optimized?	Zhengbo Wang et.al.	2407.18242	link
2024-07-25	DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability	Florent Brondolo et.al.	2407.18100	link
2024-07-24	Channel-Aware Low-Rank Adaptation in Time Series Forecasting	Tong Nie et.al.	2407.17246	link
2024-07-24	Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance	Ao Shen et.al.	2407.17029	link
2024-07-22	Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters	Kartikeya Bhardwaj et.al.	2407.16712	null
2024-07-23	DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models	Zhenyu Xie et.al.	2407.16511	null
2024-07-23	Harmonizing Visual Text Comprehension and Generation	Zhen Zhao et.al.	2407.16364	link
2024-07-23	FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network	Weiying Xie et.al.	2407.16129	link
2024-07-22	Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models	Raza Imam et.al.	2407.15913	link
2024-07-22	Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders	Laura Niss et.al.	2407.15731	null
2024-07-22	LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models	Xi Chen et.al.	2407.15415	link
2024-07-21	Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization	Jiajun Hu et.al.	2407.15085	link
2024-07-21	MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM	Navyansh Mahla et.al.	2407.15042	null

Model Compression

Publish Date	Title	Authors	PDF	Code
2025-05-01	Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading	Shuo Tong et.al.	2505.00592	null
2025-04-30	Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization	Anas Anwarul Haq Khan et.al.	2504.21831	null
2025-04-30	Smart Environmental Monitoring of Marine Pollution using Edge AI	Mohamed Moursi et.al.	2504.21759	null
2025-04-30	CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation	Zherui Zhang et.al.	2504.21478	null
2025-04-30	Enhancing New-item Fairness in Dynamic Recommender Systems	Huizhong Guo et.al.	2504.21362	null
2025-04-30	How to Backdoor the Knowledge Distillation	Chen Wu et.al.	2504.21323	null
2025-04-30	Redundancy Analysis and Mitigation for Machine Learning-Based Process Monitoring of Additive Manufacturing	Jiarui Xie et.al.	2504.21317	null
2025-04-29	Federated One-Shot Learning with Data Privacy and Objective-Hiding	Maximilian Egger et.al.	2504.21182	null
2025-04-29	A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection	Andreas Karathanasis et.al.	2504.21066	null
2025-04-30	DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition	Yanghui Song et.al.	2504.20948	link
2025-04-30	Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition	Tyler McDonald et.al.	2504.20946	null
2025-04-29	Evaluating Effects of Augmented SELFIES for Molecular Understanding Using QK-LSTM	Collin Beaudoin et.al.	2504.20789	null
2025-04-29	SNR-aware Semantic Image Transmission with Deep Learning-based Channel Estimation in Fading Channels	Mahmoud M. Salim et.al.	2504.20557	null
2025-04-29	SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Jia Wang et.al.	2504.20501	null
2025-04-29	Group Relative Knowledge Distillation: Learning from Teacher's Relational Inductive Bias	Chao Li et.al.	2504.20482	null
2025-04-29	The Estimation of Continual Causal Effect for Dataset Shifting Streams	Baining Chen et.al.	2504.20471	null
2025-04-29	Head-Tail-Aware KL Divergence in Knowledge Distillation for Spiking Neural Networks	Tianqing Zhang et.al.	2504.20445	null
2025-04-28	Mitigating Catastrophic Forgetting in the Incremental Learning of Medical Images	Sara Yavari et.al.	2504.20033	null
2025-04-28	Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom	Rishika Sen et.al.	2504.20000	null
2025-04-28	Federated Out-of-Distribution Generalization: A Causal Augmentation View	Runhui Zhang et.al.	2504.19882	null
2025-04-28	Towards Faster and More Compact Foundation Models for Molecular Property Prediction	Yasir Ghunaim et.al.	2504.19538	null
2025-04-27	Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation	Qianren Mao et.al.	2504.19101	null
2025-04-26	KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation	Jiabin Fan et.al.	2504.19024	null
2025-04-26	Revisiting Transformers through the Lens of Low Entropy and Dynamic Sparsity	Ruifeng Ren et.al.	2504.18929	null
2025-04-25	Intelligent Attacks and Defense Methods in Federated Learning-enabled Energy-Efficient Wireless Networks	Han Zhang et.al.	2504.18519	null
2025-04-24	Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction	Farhad Pourkamali-Anaraki et.al.	2504.17655	null
2025-04-24	Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Xin Yi et.al.	2504.17480	null
2025-04-24	Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs	Tiancheng Gu et.al.	2504.17432	null
2025-04-24	On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration	Maoyang Xiang et.al.	2504.17376	null
2025-04-24	Range Image-Based Implicit Neural Compression for LiDAR Point Clouds	Akihiro Kuwabara et.al.	2504.17229	null
2025-04-24	Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?	Kaidong Feng et.al.	2504.17220	null
2025-04-23	Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Alexander Shvets et.al.	2504.16856	null
2025-04-23	Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection	Linhua Kong et.al.	2504.16368	null
2025-04-21	Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT	Stanley Mugisha et.al.	2504.16128	null
2025-04-21	MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation	Xingxing Zuo et.al.	2504.16127	null
2025-04-22	Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability	Daniel Hendriks et.al.	2504.16056	null
2025-04-21	Linear Item-Item Model with Neural Knowledge for Session-based Recommendation	Minjin Choi et.al.	2504.15057	null
2025-04-22	Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification	Shiben Liu et.al.	2504.15041	link
2025-04-20	Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions	Luyang Fang et.al.	2504.14772	null
2025-04-20	Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis	Jingjing Ren et.al.	2504.14470	null
2025-04-19	Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models	Patrick Haller et.al.	2504.14366	null
2025-04-19	Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation	Muhammad Haseeb Aslam et.al.	2504.14307	null
2025-04-19	A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models	Chengming Wang et.al.	2504.14241	null
2025-04-19	Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration	Hongji Li et.al.	2504.14214	link
2025-04-18	Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models	Junjie Yang et.al.	2504.13825	null
2025-04-18	From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs	Jiliang Ni et.al.	2504.13471	null
2025-04-17	ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs	Yan Yang et.al.	2504.13237	null
2025-04-17	Scaling Laws for Data-Efficient Visual Transfer Learning	Wenxuan Yang et.al.	2504.13219	null
2025-04-16	Transferable Deployment of Semantic Edge Inference Systems via Unsupervised Domain Adaption	Weiqiang Jiao et.al.	2504.11873	null
2025-04-15	A Dual-Space Framework for General Knowledge Distillation of Large Language Models	Xue Zhang et.al.	2504.11426	null
2025-04-15	Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning	Ali Taghibakhshi et.al.	2504.11409	null
2025-04-15	Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution	Xinning Chai et.al.	2504.11271	link
2025-04-15	Efficient Reasoning Models: A Survey	Sicheng Feng et.al.	2504.10903	link
2025-04-14	Optimising Intrusion Detection Systems in Cloud-Edge Continuum with Knowledge Distillation for Privacy-Preserving and Efficient Communication	Soad Almabdy et.al.	2504.10698	null
2025-04-14	Better Estimation of the KL Divergence Between Language Models	Afra Amini et.al.	2504.10637	link
2025-04-14	Digital Staining with Knowledge Distillation: A Unified Framework for Unpaired and Paired-But-Misaligned Data	Ziwang Xu et.al.	2504.09899	link
2025-04-14	DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation	Beomseok Kang et.al.	2504.09814	null
2025-04-14	CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices	Jingxuan Zhou et.al.	2504.09803	null
2025-04-13	Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?	Christophe El Zeinaty et.al.	2504.09685	null
2025-04-12	Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking	You Wu et.al.	2504.09228	null
2025-04-12	Langformers: Unified NLP Pipelines for Language Models	Rabindra Lamsal et.al.	2504.09170	null
2025-04-12	Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization	Gen Li et.al.	2504.09039	null
2025-04-11	Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities	Maria Santos-Villafranca et.al.	2504.08578	null
2025-04-11	Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery	Alireza Fathalizadeh et.al.	2504.08550	link
2025-04-11	Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images	Jinghe Yang et.al.	2504.08253	null
2025-04-10	Towards Unconstrained 2D Pose Estimation of the Human Spine	Muhammad Saif Ullah Khan et.al.	2504.08110	null
2025-04-10	SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement	Xiyao Wang et.al.	2504.07934	link
2025-04-10	Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation	Yanglin Huang et.al.	2504.07691	null
2025-04-10	ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement	Anning Hu et.al.	2504.07418	null
2025-04-10	WK-Pnet: FM-Based Positioning via Wavelet Packet Decomposition and Knowledge Distillation	Shilian Zheng et.al.	2504.07399	null
2025-04-09	Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer	Shi Pan et.al.	2504.07061	null
2025-04-08	Multi-Sense Embeddings for Language Models and Knowledge Distillation	Qitong Wang et.al.	2504.06036	null
2025-04-08	CoA: Towards Real Image Dehazing via Compression-and-Adaptation	Long Ma et.al.	2504.05590	null
2025-04-07	Learning Activity View-invariance Under Extreme Viewpoint Changes via Curriculum Knowledge Distillation	Arjun Somayazulu et.al.	2504.05451	null
2025-04-07	Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization	Zeqin Yu et.al.	2504.05224	null
2025-04-07	Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework	Yu Min Park et.al.	2504.05187	null
2025-04-07	GOTHAM: Graph Class Incremental Learning Framework under Weak Supervision	Aditya Hemant Shahane et.al.	2504.04954	link
2025-04-07	Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models	Yoojin Jung et.al.	2504.04747	null
2025-04-07	T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models	Minki Kang et.al.	2504.04718	null
2025-04-06	A Novel Algorithm for Personalized Federated Learning: Knowledge Distillation with Weighted Combination Loss	Hengrui Hu et.al.	2504.04642	null
2025-04-08	Your Image Generator Is Your New Private Dataset	Nicolo Resmini et.al.	2504.04582	null
2025-04-06	Compression Laws for Large Language Models	Ayan Sengupta et.al.	2504.04342	null
2025-04-05	Towards Understanding and Improving Refusal in Compressed Models via Mechanistic Interpretability	Vishnu Kabir Chhabra et.al.	2504.04215	null
2025-04-05	CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation	Kai Fang et.al.	2504.04156	null
2025-04-04	RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation	Hanbo Bi et.al.	2504.03166	null
2025-04-03	Compositionality Unlocks Deep Interpretable Models	Thomas Dooms et.al.	2504.02667	null
2025-04-03	UNDO: Understanding Distillation as Optimization	Kushal Jain et.al.	2504.02521	null
2025-04-03	Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation	Laibin Chang et.al.	2504.02391	null
2025-04-03	Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation	Chengxi Zeng et.al.	2504.02351	null
2025-04-03	Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation	Wupeng Wang et.al.	2504.02302	null
2025-04-03	Beyond Conventional Transformers: The Medical X-ray Attention (MXA) Block for Improved Multi-Label Diagnosis Using Knowledge Distillation	Amit Rand et.al.	2504.02277	link
2025-04-02	MDP: Multidimensional Vision Model Pruning with Latency Constraint	Xinglong Sun et.al.	2504.02168	null
2025-04-02	FlowDistill: Scalable Traffic Flow Prediction via Distillation from LLMs	Chenyang Yu et.al.	2504.02094	link
2025-04-02	A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines	Calvin Kinateder et.al.	2504.01798	null
2025-04-02	KD$^{2}$M: An unifying framework for feature knowledge distillation	Eduardo Fernandes Montesuma et.al.	2504.01757	null
2025-04-02	Style over Substance: Distilled Language Models Reason Via Stylistic Replication	Philip Lippmann et.al.	2504.01738	null
2025-04-01	Data-free Knowledge Distillation with Diffusion Models	Xiaohua Qi et.al.	2504.00870	null
2025-04-01	Global Intervention and Distillation for Federated Out-of-Distribution Generalization	Zhuang Qi et.al.	2504.00850	null
2025-04-01	Sample-level Adaptive Knowledge Distillation for Action Recognition	Ping Li et.al.	2504.00606	null
2025-04-02	Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks	Yuang Jia et.al.	2504.00540	null
2025-03-31	Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?	Yewei Song et.al.	2503.24102	null
2025-03-31	A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing	Lars Möllenbrok et.al.	2503.24088	null
2025-03-31	Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification	Chenqi Guo et.al.	2503.24017	null
2025-03-31	Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Jiagen Li et.al.	2503.23721	null
2025-03-28	Efficient Verified Machine Unlearning For Distillation	Yijun Quan et.al.	2503.22539	null
2025-03-28	Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces	Wonhyeok Choi et.al.	2503.22209	null
2025-03-28	Multi-modal Knowledge Distillation-based Human Trajectory Forecasting	Jaewoo Jeong et.al.	2503.22201	link
2025-03-28	Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation	Chuan-Wei Kuo et.al.	2503.22074	null
2025-03-28	Multi-Task Semantic Communications via Large Models	Wanli Ni et.al.	2503.22064	null
2025-03-27	Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration	Yujie Chen et.al.	2503.21970	null
2025-03-27	A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices	Ci-Hao Wu et.al.	2503.21335	null
2025-03-27	DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset	Ling Feng et.al.	2503.21323	null
2025-03-27	Delving Deep into Semantic Relation Distillation	Zhaoyi Yan et.al.	2503.21269	null
2025-03-27	MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness	Zihao Zheng et.al.	2503.21135	null
2025-03-27	Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search	Yedan Shen et.al.	2503.21098	null
2025-03-26	Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications	Mahya Nikouei et.al.	2503.20516	null
2025-03-26	MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation	Rongyu Zhang et.al.	2503.20384	null
2025-03-26	Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning	Yousef Sadegheih et.al.	2503.20326	link
2025-03-25	Scaling Down Text Encoders of Text-to-Image Diffusion Models	Lifu Wang et.al.	2503.19897	link
2025-03-23	FedSKD: Aggregation-free Model-heterogeneous Federated Learning using Multi-dimensional Similarity Knowledge Distillation	Ziqiao Weng et.al.	2503.18981	null
2025-03-24	DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Karim Abou Zeid et.al.	2503.18944	link
2025-03-24	Distilling Stereo Networks for Performant and Efficient Leaner Networks	Rafia Rahim et.al.	2503.18544	link
2025-03-24	Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control	Basim Azam et.al.	2503.18324	null
2025-03-23	CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation	Jungsoo Lee et.al.	2503.18244	null
2025-03-22	OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery	Vignesh Prabhakar et.al.	2503.17604	null
2025-03-21	Efficient Knowledge Distillation via Curriculum Extraction	Shivam Gupta et.al.	2503.17494	null
2025-03-21	Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs	Reem Gody et.al.	2503.17336	null
2025-03-21	Large Language Model Compression via the Nested Activation-Aware Decomposition	Jun Lu et.al.	2503.17101	null
2025-03-21	Distilling Monocular Foundation Model for Fine-grained Depth Completion	Yingping Liang et.al.	2503.16970	null
2025-03-21	Temporal Action Detection Model Compression by Progressive Block Drop	Xiaoyong Chen et.al.	2503.16916	null
2025-03-21	Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs	Anshumann et.al.	2503.16870	null
2025-03-21	City2Scene: Improving Acoustic Scene Classification with City Features	Yiqiang Cai et.al.	2503.16862	null
2025-03-20	Bezier Distillation	Ling Feng et.al.	2503.16562	null
2025-03-20	Federated Quantum-Train Long Short-Term Memory for Gravitational Wave Signal	Chen-Yu Liu et.al.	2503.16049	null
2025-03-20	InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer	Tony Zhang et.al.	2503.15983	null
2025-03-19	KoGNER: A Novel Framework for Knowledge Graph Distillation on Biomedical Named Entity Recognition	Heming Zhang et.al.	2503.15737	null
2025-03-19	Technical Report for the 5th CLVision Challenge at CVPR: Addressing the Class-Incremental with Repetition using Unlabeled Data -- 4th Place Solution	Panagiota Moraiti et.al.	2503.15697	link
2025-03-19	High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight	Cédric Vincent et.al.	2503.15676	link
2025-03-19	DCA: Dividing and Conquering Amnesia in Incremental Object Detection	Aoting Zhang et.al.	2503.15295	null
2025-03-20	Distilling 3D distinctive local descriptors for 6D pose estimation	Amir Hamza et.al.	2503.15106	null
2025-03-19	Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening	Zihan Cao et.al.	2503.14975	null
2025-03-19	Ensemble Knowledge Distillation for Machine Learning Interatomic Potentials	Sakib Matin et.al.	2503.14293	null
2025-03-18	SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation	Weihong Chen et.al.	2503.14097	null
2025-03-18	Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection	Chunlei Li et.al.	2503.13828	link
2025-03-17	DynSTG-Mamba: Dynamic Spatio-Temporal Graph Mamba with Cross-Graph Knowledge Distillation for Gait Disorders Recognition	Zakariae Zrimek et.al.	2503.13156	null
2025-03-17	ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning	Baohao Liao et.al.	2503.13089	null
2025-03-17	Historic Scripts to Modern Vision: A Novel Dataset and A VLM Framework for Transliteration of Modi Script to Devanagari	Harshal Kausadikar et.al.	2503.13060	null
2025-03-17	Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation	Nassim Ali Ousalah et.al.	2503.13053	null
2025-03-17	Knowledge Distillation: Enhancing Neural Network Compression with Integrated Gradients	David E. Hernandez et.al.	2503.13008	null
2025-03-17	ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing	Aditi Tiwari et.al.	2503.12852	null
2025-03-17	CompMarkGS: Robust Watermarking for Compression 3D Gaussian Splatting	Sumin In et.al.	2503.12836	null
2025-03-17	Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation	Kailin Li et.al.	2503.12820	null
2025-03-16	Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning	Khayrul Islam et.al.	2503.12622	link
2025-03-16	UniBERTs: Adversarial Training for Language-Universal Representations	Andrei-Marius Avram et.al.	2503.12608	null
2025-03-14	Exploring Performance-Complexity Trade-Offs in Sound Event Detection	Tobias Morocutti et.al.	2503.11373	link
2025-03-14	Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification	Tobias Morocutti et.al.	2503.11363	null
2025-03-14	Enabling Weak Client Participation via On-device Knowledge Distillation in Heterogenous Federated Learning	Jihyun Lim et.al.	2503.11151	null
2025-03-12	CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation	Hariprasath Govindarajan et.al.	2503.09878	null
2025-03-12	Vi-LAD: Vision-Language Attention Distillation for Socially-Aware Robot Navigation in Dynamic Environments	Mohamed Elnoor et.al.	2503.09820	null
2025-03-16	xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation	Elio Musacchio et.al.	2503.09313	null
2025-03-12	Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge	Maximilian Abstreiter et.al.	2503.09114	null
2025-03-12	Discovering Influential Neuron Path in Vision Transformers	Yifan Wang et.al.	2503.09046	null
2025-03-12	Adaptive Temperature Based on Logits Correlation in Knowledge Distillation	Kazuhiro Matsuyama et.al.	2503.09030	link
2025-03-12	Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds	Dikai Liu et.al.	2503.08997	null
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-11	Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency	Siqi Fan et.al.	2503.08524	null
2025-03-11	Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation	Deyi Ji et.al.	2503.08043	null
2025-03-11	Generalized Kullback-Leibler Divergence Loss	Jiequan Cui et.al.	2503.08038	null
2025-03-10	Training Domain Draft Models for Speculative Decoding: Best Practices and Insights	Fenglu Hong et.al.	2503.07807	null
2025-03-10	ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning	Soumya Banerjee et.al.	2503.07506	null
2025-03-10	Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification	Thomas Boucher et.al.	2503.07294	null
2025-03-10	CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting	Haicheng Liao et.al.	2503.07234	null
2025-03-10	PTMs-TSCIL Pre-Trained Models Based Class-Incremental Learning	Yuanlong Wu et.al.	2503.07153	null
2025-03-10	Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation	Pengchen Liang et.al.	2503.06976	null
2025-03-09	Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence	Zhaowei Chen et.al.	2503.06685	null
2025-03-09	Towards Superior Quantization Accuracy: A Layer-sensitive Approach	Feng Zhang et.al.	2503.06518	null
2025-03-09	HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast	Yiting Zheng et.al.	2503.06511	null
2025-03-09	Causality Enhanced Origin-Destination Flow Prediction in Data-Scarce Cities	Tao Feng et.al.	2503.06398	null
2025-03-08	ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation	Qizhen Lan et.al.	2503.06307	null
2025-03-07	Semantic Shift Estimation via Dual-Projection and Classifier Reconstruction for Exemplar-Free Class-Incremental Learning	Run He et.al.	2503.05423	null
2025-03-07	Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification	Dingkun Liu et.al.	2503.05349	link
2025-03-07	Similarity-Based Domain Adaptation with LLMs	Jie He et.al.	2503.05281	null
2025-03-06	LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression	Souvik Kundu et.al.	2503.04982	null
2025-03-06	TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation	Lin Sun et.al.	2503.04872	null
2025-03-05	ZAugNet for Z-Slice Augmentation in Bio-Imaging	Alessandro Pasqui et.al.	2503.04843	link
2025-03-07	No Forgetting Learning: Memory-free Continual Learning	Mohammad Ali Vahedifar et.al.	2503.04638	null
2025-03-06	CrowdHMTware: A Cross-level Co-adaptation Middleware for Context-aware Mobile DL Deployment	Sicong Liu et.al.	2503.04183	null
2025-03-05	Evaluating Compression and Nanoindentation in FCC Nickel: A Methodology for Interatomic Potential Selection	K. Cichocki et.al.	2503.03723	null
2025-03-05	KLiNQ: Knowledge Distillation-Assisted Lightweight Neural Network for Qubit Readout on FPGA	Xiaorang Guo et.al.	2503.03544	null
2025-03-05	Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks	Kairong Yu et.al.	2503.03144	null
2025-03-05	FairSense-AI: Responsible AI Meets Sustainability	Shaina Raza et.al.	2503.02865	null
2025-03-04	10K is Enough: An Ultra-Lightweight Binarized Network for Infrared Small-Target Detection	Biqiao Xin et.al.	2503.02662	null
2025-03-04	It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation	Sohan Patnaik et.al.	2503.02463	null
2025-03-04	Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration	Pengchen Liang et.al.	2503.02321	null
2025-03-03	Mamba base PKD for efficient knowledge compression	José Medina et.al.	2503.01727	null
2025-03-03	DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems	Minoo Hosseinzadeh et.al.	2503.01704	null
2025-03-03	Revisiting Large Language Model Pruning using Neuron Semantic Attribution	Yizhuo Ding et.al.	2503.01542	null
2025-03-01	SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection	Xin Lin et.al.	2503.00414	link
2025-03-01	Energy-Efficient Edge Inference in Integrated Sensing, Communication, and Computation Networks	Jiacheng Yao et.al.	2503.00298	null
2025-02-28	Real-Time Aerial Fire Detection on Resource-Constrained Devices Using Knowledge Distillation	Sabina Jangirova et.al.	2502.20979	null
2025-02-28	VRM: Knowledge Distillation via Virtual Relation Matching	Weijia Zhang et.al.	2502.20760	null
2025-02-27	SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models	Zicheng Cai et.al.	2502.20422	null
2025-02-27	KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model	Kai Zhang et.al.	2502.20350	null
2025-02-27	Granite Embedding Models	Parul Awasthy et.al.	2502.20204	null
2025-02-28	Behind the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models	Sibo Yi et.al.	2502.19883	null
2025-02-28	Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval	Jiaxing Li et.al.	2502.19751	null
2025-02-27	XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs	Linyang He et.al.	2502.19737	null
2025-02-26	Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents	Ashley Lewis et.al.	2502.19545	null
2025-02-26	Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach	Anton Backhaus et.al.	2502.19177	null
2025-02-25	AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages	Joshua Sakthivel Raju et.al.	2502.18020	null
2025-02-25	Advantage-Guided Distillation for Preference Alignment in Small Language Models	Shiping Gao et.al.	2502.17927	link
2025-02-25	From underwater to aerial: a novel multi-scale knowledge distillation approach for coral reef monitoring	Matteo Contini et.al.	2502.17883	link
2025-02-24	Knowledge Distillation with Training Wheels	Guanlin Liu et.al.	2502.17717	null
2025-02-24	The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?	Zhenheng Tang et.al.	2502.17535	null
2025-02-24	CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation	Vishal Thengane et.al.	2502.17429	link
2025-02-24	Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing	Zhuoran Li et.al.	2502.17308	null
2025-02-24	Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation	Wenyuan Wu et.al.	2502.17003	null
2025-02-24	PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation	Eleftherios Ioannou et.al.	2502.16996	null
2025-02-25	CoT2Align: Cross-Chain of Thought Distillation via Optimal Transport Alignment for Language Models with Different Tokenizers	Anh Duc Le et.al.	2502.16806	null
2025-02-24	A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition	Dewan Tauhid Rahman et.al.	2502.16762	null
2025-02-23	EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation	Hong Cai Chen et.al.	2502.16541	null
2025-02-21	A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models	Yuchen Jiang et.al.	2502.15959	link
2025-02-21	Scaling Sparse and Dense Retrieval in Decoder-Only LLMs	Hansi Zeng et.al.	2502.15526	link
2025-02-21	When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models	Weilan Wang et.al.	2502.15443	null
2025-02-20	Optimizing Singular Spectrum for Large Language Model Compression	Dengjie Li et.al.	2502.15092	null
2025-02-20	Modifying Final Splits of Classification Tree for Fine-tuning Subpopulation Target in Policy Making	Lei Bill Wang et.al.	2502.15072	null
2025-02-20	TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation	Juntong Ni et.al.	2502.15016	null
2025-02-20	Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery	Minh-Quyet Ha et.al.	2502.14631	null
2025-02-21	Vision Foundation Models in Medical Image Analysis: Advances and Challenges	Pengchen Liang et.al.	2502.14584	null
2025-02-20	Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining	Wonhyeok Choi et.al.	2502.14573	null
2025-02-20	Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications	Kayhan Behdin et.al.	2502.14305	null
2025-02-20	Designing Parameter and Compute Efficient Diffusion Transformers using Distillation	Vignesh Sundaresha et.al.	2502.14226	null
2025-02-19	MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation	Romina Aalishah et.al.	2502.14090	null
2025-02-19	Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture	Shijin Duan et.al.	2502.14075	null
2025-02-19	Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles	Orestis Konstantaropoulos et.al.	2502.14023	null
2025-02-19	MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures	Jiayu Qin et.al.	2502.14008	null
2025-02-19	Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning	Caihua Liu et.al.	2502.13754	null
2025-02-19	JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework	Ziyuan Liu et.al.	2502.13407	link
2025-02-18	NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions	Weizhe Yuan et.al.	2502.13124	null
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models	Gyeongman Kim et.al.	2502.12947	null
2025-02-18	Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models	Neeraj Gangwar et.al.	2502.12855	null
2025-02-18	PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery	Bowei He et.al.	2502.12594	null
2025-02-17	FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control	Yutong Ye et.al.	2502.11937	null
2025-02-17	Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation	Zengkui Sun et.al.	2502.11766	link
2025-02-17	Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?	Leyi Pan et.al.	2502.11598	link
2025-02-17	Leave No One Behind: Enhancing Diversity While Maintaining Accuracy in Social Recommendation	Lei Li et.al.	2502.11374	link
2025-02-16	Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation	Hieu Nguyen et.al.	2502.11306	null
2025-02-16	Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification	Thanushon Sivakaran et.al.	2502.11258	null
2025-02-16	DAViMNet: SSMs-Based Domain Adaptive Object Detection	A. Enes Doruk et.al.	2502.11178	link
2025-02-16	Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping	Yijie Chen et.al.	2502.11104	link
2025-02-15	LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs	Amit Roy et.al.	2502.10914	null
2025-02-15	OPTISHEAR: Towards Efficient and Adaptive Pruning of Large Language Models via Evolutionary Optimization	Shuqi Liu et.al.	2502.10735	null
2025-02-14	Forget the Data and Fine-Tuning! Just Fold the Network to Compress	Dong Wang et.al.	2502.10216	link
2025-02-14	Can Post-Training Quantization Benefit from an Additional QLoRA Integration?	Xiliang Zhu et.al.	2502.10202	null
2025-02-13	Automatic Pruning via Structured Lasso with Class-wise Information	Xiang Liu et.al.	2502.09125	null
2025-02-13	AIDE: Agentically Improve Visual Language Model with Domain Experts	Ming-Chang Chiu et.al.	2502.09051	null
2025-02-12	PLayer-FL: A Principled Approach to Personalized Layer-wise Cross-Silo Federated Learning	Ahmed Elhussein et.al.	2502.08829	link
2025-02-12	LLM Pretraining with Continuous Concepts	Jihoon Tack et.al.	2502.08524	null
2025-02-12	Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning	Barnaby Schmitt et.al.	2502.08323	null
2025-02-11	Vision-Language Models for Edge Networks: A Comprehensive Survey	Ahmed Sharshar et.al.	2502.07855	null
2025-02-11	DarwinLM: Evolutionary Structured Pruning of Large Language Models	Shengkun Tang et.al.	2502.07780	link
2025-02-11	Breaking Down Bias: On The Limits of Generalizable Pruning Strategies	Sibo Ma et.al.	2502.07771	null
2025-02-11	Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers	Zhaodong Bing et.al.	2502.07436	null
2025-02-11	OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms	Lumen AI et.al.	2502.07312	link
2025-02-11	Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification	Zicheng Liu et.al.	2502.07299	null
2025-02-10	DROP: Poison Dilution via Knowledge Distillation for Federated Learning	Georgios Syros et.al.	2502.07011	link
2025-02-10	A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer	Lirong Wu et.al.	2502.06913	link
2025-02-13	Rationalization Models for Text-to-SQL	Gaetano Rossiello et.al.	2502.06759	null
2025-02-10	Systematic Outliers in Large Language Models	Yongqi An et.al.	2502.06415	link
2025-02-10	Progressive Collaborative and Semantic Knowledge Fusion for Generative Recommendation	Longtao Xiao et.al.	2502.06269	null
2025-02-10	Right Time to Learn: Promoting Generalization via Bio-inspired Spacing Effect in Knowledge Distillation	Guanglong Sun et.al.	2502.06192	null
2025-02-10	Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures	Yaoxin Yang et.al.	2502.06189	null
2025-02-10	A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar	Seung-Hyun Song et.al.	2502.06114	null
2025-02-09	ClinKD: Cross-Modal Clinic Knowledge Distiller For Multi-Task Medical Images	Hongyu Ge et.al.	2502.05928	link
2025-02-09	Learning Accurate, Efficient, and Interpretable MLPs on Multiplex Graphs via Node-wise Multi-View Ensemble Distillation	Yunhui Liu et.al.	2502.05864	null
2025-02-09	Synergistic Effects of Knowledge Distillation and Structured Pruning for Self-Supervised Speech Models	Shiva Kumar C et.al.	2502.05837	null
2025-02-09	Contrastive Representation Distillation via Multi-Scale Feature Decoupling	Cuipeng Wang et.al.	2502.05835	null
2025-02-07	Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement	Xihao Yuan et.al.	2502.04711	null
2025-02-06	Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation	Chenyang Huang et.al.	2502.04537	link
2025-02-06	Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)	Zony Yu et.al.	2502.04499	null
2025-02-06	PGB: One-Shot Pruning for BERT via Weight Grouping and Permutation	Hyemin Lim et.al.	2502.03984	null
2025-02-06	Towards Unified Music Emotion Recognition across Dimensional and Categorical Models	Jaeyong Kang et.al.	2502.03979	link
2025-02-06	BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation	Bo Pang et.al.	2502.03860	null
2025-02-06	Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation	Lin Li et.al.	2502.03856	null
2025-02-05	Knowledge Distillation from Large Language Models for Household Energy Modeling	Mohannad Takrouri et.al.	2502.03034	null
2025-02-05	Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons	Renjun Hu et.al.	2502.02988	null
2025-02-04	Theoretical Guarantees for Low-Rank Compression of Deep Neural Networks	Shihao Zhang et.al.	2502.02766	null
2025-02-04	On Teacher Hacking in Language Model Distillation	Daniil Tiapkin et.al.	2502.02671	null
2025-02-04	Activation-Informed Merging of Large Language Models	Amin Heyrani Nobari et.al.	2502.02421	link
2025-02-03	Memorization Inheritance in Sequence-Level Knowledge Distillation for Neural Machine Translation	Verna Dankers et.al.	2502.01491	null
2025-02-03	Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity	Alessandro Pierro et.al.	2502.01330	null
2025-02-03	CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation	Xiao Lin et.al.	2502.01312	null
2025-02-03	A Framework for Double-Blind Federated Adaptation of Foundation Models	Nurbek Tastan et.al.	2502.01289	null
2025-02-03	MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks	Alejandro Guerra-Manzanares et.al.	2502.01158	null
2025-02-02	Huff-LLM: End-to-End Lossless Compression for Efficient LLM Inference	Patrick Yubeaton et.al.	2502.00922	null
2025-02-02	Attention Sinks and Outlier Features: A 'Catch, Tag, and Release' Mechanism for Embeddings	Stephen Zhang et.al.	2502.00919	null
2025-02-02	FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation	Wenzheng Jiang et.al.	2502.00870	link
2025-02-02	VLM-Assisted Continual learning for Visual Question Answering in Self-Driving	Yuxin Lin et.al.	2502.00843	null
2025-01-31	Imagine with the Teacher: Complete Shape in a Multi-View Distillation Way	Zhanpeng Luo et.al.	2501.19270	null
2025-01-31	Position: Curvature Matrices Should Be Democratized via Linear Operators	Felix Dangel et.al.	2501.19183	null
2025-01-31	Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models	Jialin Zhao et.al.	2501.19090	null
2025-02-04	Efficient Supernet Training with Orthogonal Softmax for Scalable ASR Model Compression	Jingjing Xu et.al.	2501.18895	null
2025-01-30	Rethinking the Upsampling Layer in Hyperspectral Image Super Resolution	Haohan Shi et.al.	2501.18664	null
2025-01-30	SAFL: Structure-Aware Personalized Federated Learning via Client-Specific Clustering and SCSI-Guided Model Pruning	Nan Li et.al.	2501.18659	null
2025-01-30	Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design	Amna Murtada et.al.	2501.18538	null
2025-01-30	SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer	Enze Xie et.al.	2501.18427	null
2025-01-29	RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems	Duy A. Nguyen et.al.	2501.18056	null
2025-01-29	Perforated Backpropagation: A Neuroscience Inspired Extension to Artificial Neural Networks	Rorry Brenner et.al.	2501.18018	link
2025-01-29	Distilling Knowledge for Designing Computational Imaging Systems	Leon Suarez-Rodriguez et.al.	2501.17898	link
2025-01-29	Tapor: 3D Hand Pose Reconstruction with Fully Passive Thermal Sensing for Around-device Interactions	Xie Zhang et.al.	2501.17585	link
2025-01-28	A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts	Hossein Mirzaei et.al.	2501.17289	null
2025-01-28	FedEFM: Federated Endovascular Foundation Model with Unseen Data	Tuong Do et.al.	2501.16992	null
2025-01-28	Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning	Xi Chen et.al.	2501.16966	null
2025-01-29	TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models	Makoto Shing et.al.	2501.16937	null
2025-01-28	Target-driven Self-Distillation for Partial Observed Trajectories Forecasting	Pengfei Zhu et.al.	2501.16767	null
2025-01-28	Efficient Knowledge Distillation of SAM for Medical Image Segmentation	Kunal Dasharath Patil et.al.	2501.16740	null
2025-01-30	Return of the Encoder: Maximizing Parameter Efficiency for SLMs	Mohamed Elfeki et.al.	2501.16273	link
2025-01-27	PISCO: Pretty Simple Compression for Retrieval-Augmented Generation	Maxime Louis et.al.	2501.16075	null
2025-01-26	MimicGait: A Model Agnostic approach for Occluded Gait Recognition using Correlational Knowledge Distillation	Ayush Gupta et.al.	2501.15666	link
2025-01-26	Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis	Robinson Umeike et.al.	2501.15370	null
2025-01-25	You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning	Ayan Sengupta et.al.	2501.15296	null
2025-01-25	Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning	Yu Qiao et.al.	2501.15257	null
2025-01-25	Quark: Implementing Convolutional Neural Networks Entirely on Programmable Data Plane	Mai Zhang et.al.	2501.15100	null
2025-01-25	Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval	Bingjun Luo et.al.	2501.15052	null
2025-01-25	On Accelerating Edge AI: Optimizing Resource-Constrained Environments	Jacob Sander et.al.	2501.15014	null
2025-01-24	Remining Hard Negatives for Generative Pseudo Labeled Domain Adaptation	Goksenin Yuksel et.al.	2501.14434	null
2025-01-24	Multimodal Prescriptive Deep Learning	Dimitris Bertsimas et.al.	2501.14152	null
2025-01-23	Unlearning Clients, Features and Samples in Vertical Federated Learning	Ayush K. Varshney et.al.	2501.13683	null
2025-01-24	Multi-aspect Knowledge Distillation with Large Language Model	Taegyeong Lee et.al.	2501.13341	link
2025-01-22	LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation	Jiahao Wang et.al.	2501.12976	null
2025-01-22	Practical quantum federated learning and its experimental demonstration	Zhi-Ping Liu et.al.	2501.12709	null
2025-01-24	EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation	Yifan Yu et.al.	2501.12689	null
2025-01-22	Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation	Jan Christian Blaise Cruz et.al.	2501.12660	null
2025-01-22	Toward Model-centric Heterogeneous Federated Graph Learning: A Knowledge-driven Approach	Huilin Lai et.al.	2501.12624	null
2025-01-21	Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor	Jiaqi Guo et.al.	2501.12524	link
2025-01-19	AI Based Font Pair Suggestion Modelling For Graphic Design	Aryan Singh et.al.	2501.10969	null
2025-01-18	Learning to reconstruct signals with inexact sensing operator via knowledge distillation	Roman Jacome et.al.	2501.10794	null
2025-01-18	DNA 1.0 Technical Report	Jungyup Lee et.al.	2501.10648	null
2025-01-17	MultiPruner: Balanced Structure Removal in Foundation Models	J. Pablo Muñoz et.al.	2501.09949	link
2025-01-16	Enhancing Generalization in Chain of Thought Reasoning for Smaller Models	Maxwell J. Yin et.al.	2501.09804	null
2025-01-16	Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures	Pratyush Dhingra et.al.	2501.09588	null
2025-01-19	Class Incremental Fault Diagnosis under Limited Fault Data via Supervised Contrastive Knowledge Distillation	Hanrong Zhang et.al.	2501.09525	link
2025-01-16	FASP: Fast and Accurate Structured Pruning of Large Language Models	Hanyu Hu et.al.	2501.09412	null
2025-01-16	Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression	Yongheng Zhang et.al.	2501.09321	null
2025-01-16	Knowledge Distillation for Image Restoration: Simultaneous Learning from Degraded and Clean Images	Yongheng Zhang et.al.	2501.09268	null
2025-01-15	Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians	Ishan Amin et.al.	2501.09009	link
2025-01-17	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	Feature-based One-For-All: A Universal Framework for Heterogeneous Knowledge Distillation	Jhe-Hao Lin et.al.	2501.08885	null
2025-01-15	SWSC: Shared Weight for Similar Channel in LLM	Binrui Zeng et.al.	2501.08631	null
2025-01-14	Self-Attentive Spatio-Temporal Calibration for Precise Intermediate Layer Matching in ANN-to-SNN Distillation	Di Hong et.al.	2501.08049	link
2025-01-14	Balance Divergence for Knowledge Distillation	Yafei Qi et.al.	2501.07804	null
2025-01-13	A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion	Fabio Montello et.al.	2501.07451	null
2025-01-13	Knowledge Distillation and Enhanced Subdomain Adaptation Using Graph Convolutional Network for Resource-Constrained Bearing Fault Diagnosis	Mohammadreza Kavianpour et.al.	2501.07173	null
2025-01-13	Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection	ZhouRui Zhang et.al.	2501.07101	null
2025-01-13	Research on the Online Update Method for Retrieval-Augmented Generation (RAG) Model with Incremental Learning	Yuxin Fan et.al.	2501.07063	null
2025-01-13	Rethinking Knowledge in Distillation: An In-context Sample Retrieval Perspective	Jinjing Zhu et.al.	2501.07040	null
2025-01-12	Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving	Haoxiang Gao et.al.	2501.06680	null
2025-01-10	Tensorization of neural networks for improved privacy and interpretability	José Ramón Pareja Monturiol et.al.	2501.06300	link
2025-01-10	Merging Feed-Forward Sublayers for Compressed Transformers	Neha Verma et.al.	2501.06126	link
2025-01-10	Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation	Daowan Peng et.al.	2501.05690	null
2025-01-09	LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts	Yuri Facanha Bezerra et.al.	2501.05554	link
2025-01-09	Neural Architecture Codesign for Fast Physics Applications	Jason Weitz et.al.	2501.05515	link
2025-01-09	Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning	Laura Puccioni et.al.	2501.05248	null
2025-01-08	Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models	Miaoyang He et.al.	2501.04582	null
2025-01-08	Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions	Na Yan et.al.	2501.04436	null
2025-01-08	Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images	Yuze Wang et.al.	2501.04283	null
2025-01-08	UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles	Abhishek Balasubramaniam et.al.	2501.04213	null
2025-01-10	CURing Large Models: Compression via CUR Decomposition	Sanghyeon Park et.al.	2501.04211	null
2025-01-08	Generative Dataset Distillation Based on Self-knowledge Distillation	Longzhen Li et.al.	2501.04202	null
2025-01-07	FedKD-hybrid: Federated Hybrid Knowledge Distillation for Lithography Hotspot Detection	Yuqi Li et.al.	2501.04066	link
2025-01-07	A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving	Yi Zhang et.al.	2501.03670	link
2025-01-07	Effective and Efficient Mixed Precision Quantization of Speech Foundation Models	Haoning Xu et.al.	2501.03643	null
2025-01-07	ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting	Yifeng Yang et.al.	2501.03605	link
2025-01-05	Strategic Fusion Optimizes Transformer Compression	Md Shoaibur Rahman et.al.	2501.03273	null
2025-01-07	LightGNN: Simple Graph Neural Network for Recommendation	Guoxuan Chen et.al.	2501.03228	link
2025-01-06	Comprehensive Pathological Image Segmentation via Teacher Aggregation for Tumor Microenvironment Analysis	Daisuke Komura et.al.	2501.02909	null
2025-01-06	Knowledge Distillation with Adapted Weight	Sirong Wu et.al.	2501.02705	null
2025-01-04	Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison	Tsz Kin Lam et.al.	2501.02370	null
2025-01-04	V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection	Sichao Wang et.al.	2501.02363	link
2025-01-04	Optimizing Small Language Models for In-Vehicle Function-Calling	Yahya Sowti Khiabani et.al.	2501.02342	null
2025-01-04	KD-MSLRT: Lightweight Sign Language Recognition Model Based on Mediapipe and 3D to 1D Knowledge Distillation	Yulong Li et.al.	2501.02321	null
2025-01-04	Distillation-Enhanced Physical Adversarial Attacks	Wei Liu et.al.	2501.02232	null
2025-01-03	Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) for Passive Sonar Classification	Jarin Ritu et.al.	2501.01921	link
2025-01-03	MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders	Jiajun Cao et.al.	2501.01709	null
2025-01-02	DiagrammaticLearning: A Graphical Language for Compositional Training Regimes	Mason Lary et.al.	2501.01515	null
2024-12-31	Pan-infection Foundation Framework Enables Multiple Pathogen Prediction	Lingrui Zhang et.al.	2501.01462	null
2025-01-01	A Survey of Secure Semantic Communications	Rui Meng et.al.	2501.00842	null
2025-01-01	LENS-XAI: Redefining Lightweight and Explainable Network Security through Knowledge Distillation and Variational Autoencoders for Scalable Intrusion Detection in Cybersecurity	Muhammet Anil Yagiz et.al.	2501.00790	null
2024-12-30	Temporal reasoning for timeline summarisation in social media	Jiayu Song et.al.	2501.00152	null
2024-12-30	Improving Acoustic Scene Classification in Low-Resource Conditions	Zhi Chen et.al.	2412.20722	null
2024-12-28	Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems	Wen-Dong Jiang et.al.	2412.20201	null
2024-12-28	SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection	Phi Vu Tran et.al.	2412.20047	null
2024-12-28	Invariant debiasing learning for recommendation via biased imputation	Ting Bai et.al.	2412.20036	link
2024-12-28	Learning Adaptive and View-Invariant Vision Transformer with Multi-Teacher Knowledge Distillation for Real-Time UAV Tracking	You Wu et.al.	2412.20002	link
2024-12-27	Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis	Jiaqi Wang et.al.	2412.19654	link
2024-12-27	Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models	Shuo Wang et.al.	2412.19449	null
2024-12-26	SpectralKD: Understanding and Optimizing Vision Transformer Distillation through Spectral Analysis	Huiyuan Tian et.al.	2412.19055	link
2024-12-25	Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models	Haowei Yang et.al.	2412.18715	null
2024-12-23	Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings	Harsh Joshi et.al.	2412.18635	null
2024-12-24	HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation	Mohammed Hamdan et.al.	2412.18524	null
2024-12-24	Understanding Artificial Neural Network's Behavior from Neuron Activation Perspective	Yizhou Zhang et.al.	2412.18073	null
2024-12-23	CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction	Yuanyuan Gao et.al.	2412.17612	null
2024-12-23	GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference	Chao Zeng et.al.	2412.17560	null
2024-12-24	Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement	Hyeonjin Kim et.al.	2412.17387	link
2024-12-23	Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction	Yuying Wang et.al.	2412.17317	null
2024-12-23	LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation	Riku Uemura et.al.	2412.17282	null
2024-12-22	Lightweight Design and Optimization methods for DCNNs: Progress and Futures	Hanhua Long et.al.	2412.16886	null
2024-12-21	Large Language Models Compression via Low-Rank Feature Distillation	Yaya Sy et.al.	2412.16719	null
2024-12-21	CyberSentinel: Efficient Anomaly Detection in Programmable Switch using Knowledge Distillation	Sankalp Mittal et.al.	2412.16693	null
2024-12-21	Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers	Yunshan Zhong et.al.	2412.16553	null
2024-12-21	STKDRec: Spatial-Temporal Knowledge Distillation for Takeaway Recommendation	Shuyuan Zhao et.al.	2412.16502	null
2024-12-20	BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models	Patrick Haller et.al.	2412.15978	null
2024-12-20	A New Method to Capturing Compositional Knowledge in Linguistic Space	Jiahe Wan et.al.	2412.15632	null
2024-12-19	Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation	Meghana Karri et.al.	2412.15380	null
2024-12-19	Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models	Reza Shirkavand et.al.	2412.15341	link
2024-12-19	Self-Evolution Knowledge Distillation for LLM-based Machine Translation	Yuncheng Song et.al.	2412.15303	null
2024-12-19	Adaptive Pruning for Large Language Models with Structural Importance Awareness	Haotian Zheng et.al.	2412.15127	null
2024-12-19	SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection	Ruoyu Xu et.al.	2412.14571	null
2024-12-19	Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models	Xiao Cui et.al.	2412.14528	link
2024-12-19	Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance	Sukrit Leelaluk et.al.	2412.14526	link
2024-12-18	A Survey on Inference Optimization Techniques for Mixture of Experts Models	Jiacheng Liu et.al.	2412.14219	link
2024-12-18	Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective	Zhiyuan Zeng et.al.	2412.14135	null
2024-12-18	On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process	Gereziher Adhane et.al.	2412.13943	null
2024-12-18	Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN	Pengxiang Li et.al.	2412.13795	link
2024-12-18	Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation	Kaiwen Huang et.al.	2412.13742	link
2024-12-18	On the Compression of Language Models for Code: An Empirical Study on CodeBERT	Giordano d'Aloisio et.al.	2412.13737	null
2024-12-18	Hybrid Data-Free Knowledge Distillation	Jialiang Tang et.al.	2412.13525	link
2024-12-18	Deploying Foundation Model Powered Agent Services: A Survey	Wenchao Xu et.al.	2412.13437	null
2024-12-17	In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning	Yifei Duan et.al.	2412.13243	null
2024-12-17	Modality-Inconsistent Continual Learning of Multimodal Large Language Models	Weiguo Pian et.al.	2412.13050	null
2024-12-17	Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation	Jiaqi Wang et.al.	2412.12858	null
2024-12-17	RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification	Guanwenjie Zou et.al.	2412.12603	link
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460	link
2024-12-16	Neural Collapse Inspired Knowledge Distillation	Shuoxi Zhang et.al.	2412.11788	null
2024-12-16	Relation-Guided Adversarial Learning for Data-free Knowledge Transfer	Yingping Liang et.al.	2412.11380	link
2024-12-16	BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions	Wonyong Seo et.al.	2412.11365	null
2024-12-15	Wearable Accelerometer Foundation Models for Health via Knowledge Distillation	Salar Abbaspourazad et.al.	2412.11276	null
2024-12-15	TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs	Lanxiang Hu et.al.	2412.11242	null
2024-12-15	ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes	Pedro Miguel Sánchez Sánchez et.al.	2412.11207	null
2024-12-15	Leveraging Large Language Models for Active Merchant Non-player Characters	Byungjun Kim et.al.	2412.11189	link
2024-12-15	Knowledge Migration Framework for Smart Contract Vulnerability Detection	Luqi Wang et.al.	2412.11175	null
2024-12-15	Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection	Mohammadreza Salehi et.al.	2412.11148	link
2024-12-17	On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning	Pengfei Fang et.al.	2412.11017	null
2024-12-13	Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias	Jianhua Zhang et.al.	2412.09874	null
2024-12-13	ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression	Kai Yao et.al.	2412.09812	null
2024-12-13	LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering	Patrick Sutanto et.al.	2412.09807	null
2024-12-12	SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dongting Hu et.al.	2412.09619	null
2024-12-12	A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks	Saptarshi Mandal et.al.	2412.09579	null
2024-12-12	All You Need in Knowledge Distillation Is a Tailored Coordinate System	Junjie Zhou et.al.	2412.09388	null
2024-12-12	Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices	Thanaphon Suwannaphong et.al.	2412.09289	null
2024-12-15	DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification	Kunlun Xu et.al.	2412.09224	link
2024-12-12	Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation	Xinyue Liu et.al.	2412.08949	link
2024-12-12	Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration	Yunshuai Zhou et.al.	2412.08939	link
2024-12-11	Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach	Xihua Zhu et.al.	2412.08672	null
2024-12-11	Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation	Jiaming Lv et.al.	2412.08139	null
2024-12-11	DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation	Jaeho Moon et.al.	2412.08116	null
2024-12-10	Low-Rank Correction for Quantized LLMs	Meyer Scetbon et.al.	2412.07902	null
2024-12-10	Unlocking the Potential of Reverse Distillation for Anomaly Detection	Xinyue Liu et.al.	2412.07579	link
2024-12-10	TT-MPD: Test Time Model Pruning and Distillation	Haihang Wu et.al.	2412.07114	null
2024-12-09	FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering	Amirhossein Abaskohi et.al.	2412.07030	link
2024-12-09	VQ4ALL: Efficient Neural Network Representation via a Universal Codebook	Juncan Deng et.al.	2412.06875	null
2024-12-09	Compression for Better: A General and Stable Lossless Compression Framework	Boyang Zhang et.al.	2412.06868	null
2024-12-09	Lossless Model Compression via Joint Low-Rank Factorization Optimization	Boyang Zhang et.al.	2412.06867	null
2024-12-08	GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model	Haotong Yang et.al.	2412.06849	null
2024-12-10	Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks	Junhe Zhang et.al.	2412.06414	null
2024-12-09	U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening	Sungpyo Kim et.al.	2412.06243	null
2024-12-08	Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation	Aymen Sekhri et.al.	2412.06003	null
2024-12-07	Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery	Ye Wang et.al.	2412.05573	null
2024-12-07	Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search	Boxun Xu et.al.	2412.05505	null
2024-12-06	BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits	Wazib Ansar et.al.	2412.05225	null
2024-12-06	One-shot Federated Learning via Synthetic Distiller-Distillate Communication	Junyuan Zhang et.al.	2412.05186	link
2024-12-06	CCS: Continuous Learning for Customized Incremental Wireless Sensing Services	Qunhang Fu et.al.	2412.04821	null
2024-12-05	Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation	Ali Abbasi et.al.	2412.04668	null
2024-12-05	FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning	Jiayu Liu et.al.	2412.04521	link
2024-12-05	Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer	Gaole Dai et.al.	2412.04060	null
2024-12-04	Designing DNNs for a trade-off between robustness and processing performance in embedded devices	Jon Gutiérrez-Zaballa et.al.	2412.03682	null
2024-12-04	Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective	Jon Gutiérrez-Zaballa et.al.	2412.03630	link
2024-12-03	CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models	Amitash Nanda et.al.	2412.03599	null
2024-12-07	Enhancing CLIP Conceptual Embedding through Knowledge Distillation	Kuei-Chun Kao et.al.	2412.03513	null
2024-12-04	Distillation of Diffusion Features for Semantic Correspondence	Frank Fundel et.al.	2412.03512	null
2024-12-03	Efficient Model Compression Techniques with FishLeg	Jamie McGowan et.al.	2412.02328	null
2024-12-02	Multi-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model	Qianhan Feng et.al.	2412.01282	link
2024-12-02	Reducing Inference Energy Consumption Using Dual Complementary CNNs	Michail Kinnas et.al.	2412.01039	link
2024-12-01	QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval	T. Y. S. S. Santosh et.al.	2412.00934	null
2024-12-01	Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps	Girmaw Abebe Tadesse et.al.	2412.00777	null
2024-11-30	Continuous Concepts Removal in Text-to-image Diffusion Models	Tingxu Han et.al.	2412.00580	null
2024-11-30	Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling	Peihao Dong et.al.	2412.00562	link
2024-11-30	Toward Fair Graph Neural Networks Via Dual-Teacher Knowledge Distillation	Chengyu Li et.al.	2412.00382	null
2024-11-29	Reverse Thinking Makes LLMs Stronger Reasoners	Justin Chih-Yao Chen et.al.	2411.19865	null
2024-11-28	Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG	Xinxu Wei et.al.	2411.19230	null
2024-12-03	Puzzle: Distillation-Based NAS for Inference-Optimized LLMs	Akhiad Bercovich et.al.	2411.19146	null
2024-11-28	Headache to Overstock? Promoting Long-tail Items through Debiased Product Bundling	Shuo Xu et.al.	2411.19107	null
2024-11-28	Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems	Mansi Rana et.al.	2411.18980	null
2024-11-27	Active Data Curation Effectively Distills Large-Scale Multimodal Models	Vishaal Udandarao et.al.	2411.18674	null
2024-11-27	Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models	Yiming Wu et.al.	2411.18375	null
2024-11-27	Vision Mamba Distillation for Low-resolution Fine-grained Image Classification	Yao Chen et.al.	2411.17980	link
2024-11-27	Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery	Zhenyu Yu et.al.	2411.17973	null
2024-11-26	Attamba: Attending To Multi-Token States	Yash Akhauri et.al.	2411.17685	link
2024-11-26	Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation	Minh-Tuan Tran et.al.	2411.17046	null
2024-11-26	Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation	Shambhavi Mishra et.al.	2411.17002	link
2024-11-25	Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models	Yao Fu et.al.	2411.16991	null
2024-11-25	Leveraging Foundation Models To learn the shape of semi-fluid deformable objects	Omar El Assal et.al.	2411.16802	null
2024-11-25	O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?	Zhen Huang et.al.	2411.16489	link
2024-11-25	When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?	Srikrishna Iyer et.al.	2411.16487	link
2024-11-25	Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Yanan Wang et.al.	2411.16196	link
2024-11-25	Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics	Tian Bowen et.al.	2411.16139	null
2024-11-25	Ensemble Learning via Knowledge Transfer for CTR Prediction	Honghao Li et.al.	2411.16122	link
2024-11-23	Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models	Tianhao Chen et.al.	2411.15525	null
2024-11-23	Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance	Jiayi Chen et.al.	2411.15438	link
2024-11-23	Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning	Xiaoyu Gan et.al.	2411.15403	null
2024-11-22	Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion	Samarth N Ramesh et.al.	2411.15113	null
2024-11-22	RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency	Wentao Huang et.al.	2411.15076	null
2024-11-22	Adaptive Group Robust Ensemble Knowledge Distillation	Patrik Kenfack et.al.	2411.14984	null
2024-11-25	Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation	Aniket Bhattacharyya et.al.	2411.14957	null
2024-11-22	Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers	Hongbo Liu et.al.	2411.14789	null
2024-11-22	Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation	Xunyu Zhu et.al.	2411.14698	null
2024-11-21	TaQ-DiT: Time-aware Quantization for Diffusion Transformers	Xinyan Liu et.al.	2411.14172	null
2024-11-21	DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization	Hexuan Deng et.al.	2411.14055	link
2024-11-21	Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference	Yunhui Liu et.al.	2411.14035	link
2024-11-21	CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition	Md Mahedi Hasan et.al.	2411.13886	null
2024-11-20	RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content	Yuxuan Jiang et.al.	2411.13362	null
2024-11-20	FASTNav: Fine-tuned Adaptive Small-language-models Trained for Multi-point Robot Navigation	Yuxuan Chen et.al.	2411.13262	null
2024-11-20	Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning	Gang Zhao et.al.	2411.13045	null
2024-11-19	Puppet-CNN: Input-Adaptive Convolutional Neural Networks with Model Compression using Ordinary Differential Equation	Yucheng Xing et.al.	2411.12876	null
2024-11-19	Reward Modeling with Ordinal Feedback: Wisdom of the Crowd	Shang Liu et.al.	2411.12843	null
2024-11-19	What Makes a Good Dataset for Knowledge Distillation?	Logan Frank et.al.	2411.12817	null
2024-11-19	FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning	Qingsong Lv et.al.	2411.12781	link
2024-11-19	KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder	Maheswar Bora et.al.	2411.12270	null
2024-11-19	Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes	Rahul Garg et.al.	2411.12174	null
2024-11-18	Federated Incremental Named Entity Recognition	Duzhen Zhang et.al.	2411.11623	link
2024-11-18	Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms	Haizhou Ge et.al.	2411.11406	null
2024-11-17	Map-Free Trajectory Prediction with Map Distillation and Hierarchical Encoding	Xiaodong Liu et.al.	2411.10961	null
2024-11-16	Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting	Ebrahim Farahmand et.al.	2411.10703	link
2024-11-16	Multi-perspective Contrastive Logit Distillation	Qi Wang et.al.	2411.10693	null
2024-11-16	Exploring Feature-based Knowledge Distillation For Recommender System: A Frequency Perspective	Zhangchi Zhu et.al.	2411.10676	link
2024-11-15	Scaling Law for Post-training after Model Pruning	Xiaodong Chen et.al.	2411.10272	null
2024-11-15	Evidential Federated Learning for Skin Lesion Image Classification	Rutger Hendrix et.al.	2411.10071	null
2024-11-14	VPBSD: Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation	Xi Lin et.al.	2411.09567	null
2024-11-14	Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition	Zixing Zhang et.al.	2411.09339	null
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-14	Toward Democratized Generative AI in Next-Generation Mobile Edge Networks	Ruichen Zhang et.al.	2411.09148	null
2024-11-13	Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head	Penghui Yang et.al.	2411.08937	null
2024-11-13	UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation	Chengyuan Zhang et.al.	2411.08569	null
2024-11-13	Federated Graph Learning with Graphless Clients	Xingbo Fu et.al.	2411.08374	null
2024-11-12	Joint Diffusion models in Continual Learning	Paweł Skierś et.al.	2411.08224	null
2024-11-12	Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data	Juanhui Li et.al.	2411.08028	null
2024-11-13	Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models	Youan Cong et.al.	2411.07820	null
2024-11-12	ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization	Weibo Zhao et.al.	2411.07762	null
2024-11-12	Optimizing Traffic Signal Control using High-Dimensional State Representation and Efficient Deep Reinforcement Learning	Lawrence Francis et.al.	2411.07759	null
2024-11-12	ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation	Liang Zhao et.al.	2411.07752	null
2024-11-12	OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework	Jiaxi Li et.al.	2411.07711	link
2024-11-13	Feature Interaction Fusion Self-Distillation Network For CTR Prediction	Lei Sang et.al.	2411.07508	null
2024-11-12	Quantifying Knowledge Distillation Using Partial Information Decomposition	Pasan Dissanayake et.al.	2411.07483	null
2024-11-11	SAMPart3D: Segment Any Part in 3D Objects	Yunhan Yang et.al.	2411.07184	link
2024-11-11	LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models	Runming Yang et.al.	2411.06839	null
2024-11-11	ScaleKD: Strong Vision Transformers Could Be Excellent Teachers	Jiawei Fan et.al.	2411.06786	link
2024-11-11	An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning	Dong Li et.al.	2411.06659	link
2024-11-10	CULL-MT: Compression Using Language and Layer pruning for Machine Translation	Pedram Rostami et.al.	2411.06506	null
2024-11-10	Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation	Yu-Liang Zhan et.al.	2411.06448	link
2024-11-09	Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification	Hongyu Chen et.al.	2411.06023	null
2024-11-09	Multi-hop RIS-aided Learning Model Sharing for Urban Air Mobility	Kai Xiong et.al.	2411.06015	null
2024-11-08	Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine	Anantha Sharma et.al.	2411.05936	null
2024-11-08	Asterisk: Keep it Simple*	Andrew Semenov et.al.	2411.05691	null
2024-11-08	Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles	Ayobami Adewale et.al.	2411.05618	null
2024-11-08	Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion	Nan Song et.al.	2411.05544	null
2024-11-07	ZipNN: Lossless Compression for AI Models	Moshik Hershcovitch et.al.	2411.05239	link
2024-11-07	Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale	Flavio Di Palo et.al.	2411.05045	null
2024-11-06	From Word Vectors to Multimodal Embeddings: Techniques, Applications, and Future Directions For Large Language Models	Charles Zhang et.al.	2411.05036	null
2024-11-07	Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers	Zhichao Geng et.al.	2411.04403	null
2024-11-07	GazeGen: Gaze-Driven User Interaction for Visual Content Generation	He-Yen Hsieh et.al.	2411.04335	null
2024-11-06	Towards Personalized Federated Learning via Comprehensive Knowledge Distillation	Pengju Wang et.al.	2411.03569	null
2024-11-05	Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy	Razvan-Gabriel Dumitru et.al.	2411.03513	link
2024-11-05	Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation	Francisco Giral et.al.	2411.02975	null
2024-11-05	Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery	Bowei Du et.al.	2411.02861	null
2024-11-05	Brewing Vodka: Distilling Pure Knowledge for Lightweight Threat Detection in Audit Logs	Weiheng Wu et.al.	2411.02775	null
2024-11-05	Multimodal Commonsense Knowledge Distillation for Visual Question Answering	Shuo Yang et.al.	2411.02722	null
2024-11-04	Information plane and compression-gnostic feedback in quantum machine learning	Nathan Haboury et.al.	2411.02313	null
2024-11-04	Training on the Test Model: Contamination in Ranking Distillation	Vishakha Suresh Kalal et.al.	2411.02284	link
2024-11-03	Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment	Chengting Yu et.al.	2411.01547	null
2024-11-01	On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance	Jaskirat Singh et.al.	2411.00907	null
2024-11-01	Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation	Bohan Lyu et.al.	2411.00412	null
2024-11-01	Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation	Arjun Ramesh Kaushik et.al.	2411.00403	null
2024-11-01	Efficient Model Compression for Bayesian Neural Networks	Diptarka Saha et.al.	2411.00273	null
2024-10-31	Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification	Thanh-Dung Le et.al.	2411.00209	link
2024-10-31	Mutual Information Preserving Neural Network Pruning	Charles Westphal et.al.	2411.00147	null
2024-10-30	Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation	Ahmed Akib Jawad Karim et.al.	2411.00052	null
2024-10-30	IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking	Run Luo et.al.	2410.23907	null
2024-10-29	ML Research Benchmark	Matthew Kenney et.al.	2410.22553	link
2024-11-01	Leveraging Recurrent Neural Networks for Predicting Motor Movements from Primate Motor Cortex Neural Recordings	Yuanxi Wang et.al.	2410.22283	null
2024-10-28	Unveiling Context-Aware Criteria in Self-Assessing LLMs	Taneesh Gupta et.al.	2410.21545	null
2024-10-28	Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications	Kemal Altwlkany et.al.	2410.21478	null
2024-10-31	LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment	Ge Yang et.al.	2410.21352	link
2024-10-28	EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Shih-Yang Liu et.al.	2410.21271	null
2024-10-28	Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study	Jiacheng Hu et.al.	2410.20792	null
2024-10-28	KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation	Rambod Azimi et.al.	2410.20777	link
2024-10-28	Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning	Bing Han et.al.	2410.20775	null
2024-10-28	Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA	Sangmin Bae et.al.	2410.20672	null
2024-10-27	Uncovering Capabilities of Model Pruning in Graph Contrastive Learning	Wu Junran et.al.	2410.20356	null
2024-10-25	A Survey of Small Language Models	Chien Van Nguyen et.al.	2410.20011	null
2024-10-25	GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing	Hosam Elgendy et.al.	2410.19552	link
2024-10-25	SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models	Jahyun Koo et.al.	2410.19503	null
2024-10-24	Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts	Danyal Aftab et.al.	2410.19185	null
2024-10-24	AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Ziqi Liang et.al.	2410.19134	null
2024-10-24	High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws	M. Emrullah Ildiz et.al.	2410.18837	null
2024-10-24	Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data	Anup Shirgaonkar et.al.	2410.18588	null
2024-10-24	SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning	Shivam Adarsh et.al.	2410.18574	link
2024-10-23	ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams	Srija Anand et.al.	2410.17901	null
2024-10-23	Beware of Calibration Data for Pruning Large Language Models	Yixin Ji et.al.	2410.17711	null
2024-10-23	Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need	Jon Irureta et.al.	2410.17648	null
2024-10-23	Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Muquan Li et.al.	2410.17606	link
2024-10-23	Multimodal Information Bottleneck for Deep Reinforcement Learning with Multiple Sensors	Bang You et.al.	2410.17551	null
2024-10-23	Physics-driven AI for Channel Estimation in Cellular Network	Xiaoqian Qi et.al.	2410.17525	null
2024-10-22	MiniPLM: Knowledge Distillation for Pre-Training Language Models	Yuxian Gu et.al.	2410.17215	link
2024-10-22	Self-calibration for Language Model Quantization and Pruning	Miles Williams et.al.	2410.17170	null
2024-10-22	DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization	Haowei Zhu et.al.	2410.16942	null
2024-10-22	Mitigating Vanishing Activations in Deep CapsNets Using Channel Pruning	Siddharth Sahu et.al.	2410.16908	link
2024-10-22	CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Nicholas I-Hsien Kuo et.al.	2410.16872	null
2024-10-22	AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models	Yongjian Wu et.al.	2410.16820	link
2024-10-22	SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation	Jing-Jing Li et.al.	2410.16665	null
2024-10-21	Pre-training Distillation for Large Language Models: A Design Space Exploration	Hao Peng et.al.	2410.16215	null
2024-10-18	Interpreting Microbiome Relative Abundance Data Using Symbolic Regression	Swagatam Haldar et.al.	2410.16109	link
2024-10-21	Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples	Kirill Lukyanov et.al.	2410.15889	null
2024-10-20	GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Haiwen Diao et.al.	2410.15266	link
2024-10-19	LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound	Xuechen Guo et.al.	2410.15074	null
2024-10-19	Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS	Tuan Nam Nguyen et.al.	2410.14997	null
2024-10-18	EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search	Oliver Sieberling et.al.	2410.14649	link
2024-10-18	Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation	Shuai Zhao et.al.	2410.14425	link
2024-10-18	Preview-based Category Contrastive Learning for Knowledge Distillation	Muhe Ding et.al.	2410.14143	null
2024-10-17	Leveraging Fine-Tuned Language Models for Efficient and Accurate Smart Contract Auditing	Zhiyuan Wei et.al.	2410.13918	link
2024-10-17	An Active Learning Framework for Inclusive Generation by Large Language Models	Sabit Hassan et.al.	2410.13641	null
2024-10-18	Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach	Luyao Zou et.al.	2410.13602	null
2024-10-18	Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach	Luyao Zou et.al.	2410.13260	null
2024-10-16	TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant	Guopeng Li et.al.	2410.12342	null
2024-10-16	Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm	Guanming Huang et.al.	2410.12259	null
2024-10-16	TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration	Yiwei Guo et.al.	2410.12183	link
2024-10-17	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-15	MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router	Yanyue Xie et.al.	2410.12013	null
2024-10-15	Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation	Andong Lu et.al.	2410.11586	link
2024-10-15	Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL	Qihuang Zhong et.al.	2410.11371	null
2024-10-15	Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling	Wenda Xu et.al.	2410.11325	null
2024-10-14	ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection	Martin Aubard et.al.	2410.10554	link
2024-10-14	QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models	Zhumazhan Balapanov et.al.	2410.10318	link
2024-10-14	Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation	Siru Ouyang et.al.	2410.10141	null
2024-10-15	Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices	Xiaoyu Xia et.al.	2410.10128	link
2024-10-14	REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation	Zhiyun Song et.al.	2410.10097	null
2024-10-12	SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs	Mohammad Mozaffari et.al.	2410.09615	link
2024-10-12	Distilling Invariant Representations with Dual Augmentation	Nikolaos Giakoumoglou et.al.	2410.09474	null
2024-10-12	Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets	Thomas Eiter et.al.	2410.09428	link
2024-10-15	Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI	Muhammet Anil Yagiz et.al.	2410.09043	null
2024-10-11	Mentor-KD: Making Small Language Models Better Multi-step Reasoners	Hojae Lee et.al.	2410.09037	link
2024-10-11	Contrastive Knowledge Distillation for Robust Multimodal Sentiment Analysis	Zhongyi Sang et.al.	2410.08692	null
2024-10-11	GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning	Yubo Peng et.al.	2410.08634	null
2024-10-11	Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both	Abhijnan Nath et.al.	2410.08458	null
2024-10-10	What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias	Aida Mohammadshahi et.al.	2410.08407	null
2024-10-10	Non-transferable Pruning	Ruyi Ding et.al.	2410.08015	null
2024-10-10	A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways	Jing Su et.al.	2410.07915	null
2024-10-10	SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks	Haiyang Wang et.al.	2410.07857	link
2024-10-12	Relational Diffusion Distillation for Efficient Image Generation	Weilun Feng et.al.	2410.07679	link
2024-10-10	CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression	Wenyuan Liu et.al.	2410.07505	null
2024-10-09	Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing	Ismail Erbas et.al.	2410.07364	null
2024-10-09	S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning	Weihao Lin et.al.	2410.07046	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching	Wenqi Niu et.al.	2410.06561	null
2024-10-08	SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching	Tianyi Zhang et.al.	2410.06364	null
2024-10-08	QT-DoG: Quantization-aware Training for Domain Generalization	Saqib Javed et.al.	2410.06020	link
2024-10-10	KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server	Wenhao Wang et.al.	2410.05725	link
2024-10-07	Progressive distillation induces an implicit curriculum	Abhishek Panigrahi et.al.	2410.05464	null
2024-10-07	ESPACE: Dimensionality Reduction of Activations for Model Compression	Charbel Sakr et.al.	2410.05437	null
2024-10-07	ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation	Yuelyu Ji et.al.	2410.05168	null
2024-10-06	CAPEEN: Image Captioning with Early Exits and Knowledge Distillation	Divya Jyoti Bajpai et.al.	2410.04433	link
2024-10-06	DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs	Divya Jyoti Bajpai et.al.	2410.04424	link
2024-10-05	Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution	Jianze Li et.al.	2410.04224	link
2024-10-05	Accelerating Diffusion Models with One-to-Many Knowledge Distillation	Linfeng Zhang et.al.	2410.04191	null
2024-10-05	DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech	Dominika Woszczyk et.al.	2410.04188	null
2024-10-05	Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher	Yong Guo et.al.	2410.04140	null
2024-10-04	Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models	Zhuochun Li et.al.	2410.03663	null
2024-10-04	DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models	Sungnyun Kim et.al.	2410.03061	null
2024-10-03	Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models	Mingxue Xu et.al.	2410.03040	null
2024-10-03	Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks	Siddharth Joshi et.al.	2410.02116	link
2024-10-02	Review Non-convex Optimization Method for Machine Learning	Greg B Fotopoulos et.al.	2410.02017	null
2024-10-02	PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation	Mike Ranzinger et.al.	2410.01680	null
2024-10-04	HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models	Seanie Lee et.al.	2410.01524	link
2024-10-02	Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks	Edan Kinderman et.al.	2410.01483	link
2024-10-02	PairDistill: Pairwise Relevance Distillation for Dense Retrieval	Chao-Wei Huang et.al.	2410.01383	link
2024-10-02	"No Matter What You Do!": Mitigating Backdoor Attacks in Graph Neural Networks	Jiale Zhang et.al.	2410.01272	link
2024-10-01	Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging	Ismail Erbas et.al.	2410.00948	null
2024-10-01	Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading	Mostafa Hajighasemloua et.al.	2410.00779	null
2024-10-01	Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation	Jiyoon Myung et.al.	2410.00683	null
2024-10-01	AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation	Ziyang Luo et.al.	2410.00558	link
2024-10-01	Self-Updatable Large Language Models with Parameter Integration	Yu Wang et.al.	2410.00487	null
2024-09-30	Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation	Vlad-Cristian Matei et.al.	2409.20498	null
2024-10-02	Linear Projections of Teacher Embeddings for Few-Class Distillation	Noel Loo et.al.	2409.20449	null
2024-09-30	Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies	Shalini Sarode et.al.	2409.20237	null
2024-09-30	Aggressive Post-Training Compression on Extremely Large Language Models	Zining Zhang et.al.	2409.20094	null
2024-10-01	HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning	Momin Ahmad Khan et.al.	2409.19912	null
2024-09-29	Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation	Huidong Tang et.al.	2409.19741	null
2024-09-29	InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries	Mengze Hong et.al.	2409.19689	null
2024-09-28	Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training	Pihe Hu et.al.	2409.19391	null
2024-09-28	Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment	Tianyi Liu et.al.	2409.19366	null
2024-09-27	Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models	Shihua Qin et.al.	2409.19185	null
2024-09-27	MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation	Junyou Zhu et.al.	2409.18800	null
2024-09-27	Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation	Chaomin Shen et.al.	2409.18785	null
2024-09-27	Harmonizing knowledge Transfer in Neural Network with Unified Distillation	Yaomin Huang et.al.	2409.18565	null
2024-09-27	Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration	Mahdi Morafah et.al.	2409.18461	link
2024-09-26	EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation	Jiaxiang Tang et.al.	2409.18114	null
2024-09-26	Weak-To-Strong Backdoor Attacks for LLMs with Contrastive Knowledge Distillation	Shuai Zhao et.al.	2409.17946	null
2024-09-26	Kendall's $τ$ Coefficient for Logits Distillation	Yuchen Guan et.al.	2409.17823	null
2024-09-26	General Compression Framework for Efficient Transformer Object Tracking	Lingyi Hong et.al.	2409.17564	null
2024-09-26	Shape-intensity knowledge distillation for robust medical image segmentation	Wenhui Dong et.al.	2409.17503	link
2024-09-25	Search for Efficient Large Language Models	Xuan Shen et.al.	2409.17372	link
2024-09-25	MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events	Xiaoyu Yang et.al.	2409.17010	null
2024-09-25	Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation	Hanyu Zhou et.al.	2409.17001	null
2024-09-25	SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling	Laurent Dillard et.al.	2409.16581	null
2024-09-24	AIM 2024 Challenge on UHD Blind Photo Quality Assessment	Vlad Hosu et.al.	2409.16271	null
2024-09-25	Privacy Evaluation Benchmarks for NLP Models	Wei Huang et.al.	2409.15868	link
2024-09-24	Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization	Lucas Deckers et.al.	2409.15849	null
2024-09-23	TS-TCD: Triplet-Level Cross-Modal Distillation for Time-Series Forecasting Using Large Language Models	Pengfei Wang et.al.	2409.14978	null
2024-09-23	DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models	Sangyeon Cho et.al.	2409.14904	link
2024-09-23	Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation	Li Li et.al.	2409.14810	null
2024-09-23	An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding	Wei-Bin Kou et.al.	2409.14737	null
2024-09-18	Applications of Knowledge Distillation in Remote Sensing: A Survey	Yassine Himeur et.al.	2409.12111	null
2024-09-18	Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction	Jin Jie Sean Yeo et.al.	2409.11964	null
2024-09-18	Distillation-free Scaling of Large SSMs for Images and Videos	Hamid Suleman et.al.	2409.11867	null
2024-09-18	EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis	Shaojie Li et.al.	2409.11817	null
2024-09-18	RUIE: Retrieval-based Unified Information Extraction using Large Language Model	Xincheng Liao et.al.	2409.11673	link
2024-09-17	Time-Series Forecasting, Knowledge Distillation, and Refinement within a Multimodal PDE Foundation Model	Derek Jollie et.al.	2409.11609	link
2024-09-17	Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation	Rui Yu et.al.	2409.11018	null
2024-09-17	Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation	Gerard I. Gállego et.al.	2409.11003	null
2024-09-16	Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning	Amin Karimi Monsefi et.al.	2409.10362	link
2024-09-16	Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference	Huy-Dung Nguyen et.al.	2409.10095	null
2024-09-15	ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration	Ning-Chi Huang et.al.	2409.09708	null
2024-09-14	Effective Pre-Training of Audio Transformers for Sound Event Detection	Florian Schmid et.al.	2409.09546	link
2024-09-14	Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification	Wenhao Yang et.al.	2409.09389	null
2024-09-14	Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility	Xiaoyu Liu et.al.	2409.09357	null
2024-09-13	Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection	Dixi Yao et.al.	2409.08858	null
2024-09-13	An Efficient Privacy-aware Split Learning Framework for Satellite Communications	Jianfei Sun et.al.	2409.08538	null
2024-09-13	AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Zechao Sun et.al.	2409.08516	null
2024-09-12	DiReDi: Distillation and Reverse Distillation for AIoT Applications	Chen Sun et.al.	2409.08308	null
2024-09-12	Ruri: Japanese General Text Embeddings	Hayato Tsukagoshi et.al.	2409.07737	link
2024-09-12	Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios	Xinlei Huang et.al.	2409.07694	null
2024-09-11	DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis	Ke Chen et.al.	2409.07584	null
2024-09-11	EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data	Grégoire Petit et.al.	2409.07566	link
2024-09-11	NVRC: Neural Video Representation Compression	Ho Man Kwan et.al.	2409.07414	null
2024-09-11	Enhancing CTC-Based Visual Speech Recognition	Hendrik Laux et.al.	2409.07210	null
2024-09-11	A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption	Marcus Rüb et.al.	2409.07114	null
2024-09-11	Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator	Kangyang Luo et.al.	2409.06955	null
2024-09-10	Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study	Ilias Siniosoglou et.al.	2409.06904	null
2024-09-10	EasyST: A Simple Framework for Spatio-Temporal Prediction	Jiabin Tang et.al.	2409.06748	link
2024-09-10	SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation	Teng Hu et.al.	2409.06633	null
2024-09-10	Knowledge Distillation via Query Selection for Detection Transformer	Yi Liu et.al.	2409.06443	null
2024-09-10	Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition	Junzheng Zhang et.al.	2409.06371	null
2024-09-10	Enhancing Long Video Understanding via Hierarchical Event-Based Memory	Dingxin Cheng et.al.	2409.06299	null
2024-09-09	Joint Input and Output Coordination for Class-Incremental Learning	Shuai Wang et.al.	2409.05620	null
2024-09-09	LEROjD: Lidar Extended Radar-Only Object Detection	Patrick Palmer et.al.	2409.05564	link
2024-09-09	Federated Transfer Learning Based Cooperative Wideband Spectrum Sensing with Model Pruning	Jibin Jia et.al.	2409.05462	null
2024-09-09	Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition	Shiming Ge et.al.	2409.05384	null
2024-09-09	Application Specific Compression of Deep Learning Models	Rohit Raj Rai et.al.	2409.05368	link
2024-09-09	FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data	Rasoul Jafari Gohari et.al.	2409.05359	link
2024-09-08	Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation	Haichao Zhu et.al.	2409.05151	null
2024-09-07	LoCa: Logit Calibration for Knowledge Distillation	Runming Yang et.al.	2409.04778	null
2024-09-06	SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields	Yuze Wang et.al.	2409.04482	null
2024-09-05	Experimentation in Content Moderation using RWKV	Umut Yildirim et.al.	2409.03939	null
2024-09-05	DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture	Qianlong Xiang et.al.	2409.03550	link
2024-09-05	Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration	Pei Wang et.al.	2409.03455	null
2024-09-05	Efficient Image Compression Using Advanced State Space Models	Bouzid Arezki et.al.	2409.02743	null
2024-09-04	CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation	Minhee Cho et.al.	2409.02699	null
2024-09-04	Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation	Kangkai Zhang et.al.	2409.02555	null
2024-09-04	A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing	Davi Rodrigues et.al.	2409.02528	null
2024-09-04	Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation	Yilong Chen et.al.	2409.02438	null
2024-09-03	Low-Resolution Face Recognition via Adaptable Instance-Relation Distillation	Ruixin Shi et.al.	2409.02049	null
2024-09-03	Foundations of Large Language Model Compression -- Part 1: Weight Quantization	Sean I. Young et.al.	2409.02026	link
2024-09-03	Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique	Qiang Zheng et.al.	2409.02020	null
2024-09-03	Contemporary Model Compression on Large Language Models Inference	Dong Liu et.al.	2409.01990	link
2024-09-03	Adaptive Explicit Knowledge Transfer for Knowledge Distillation	Hyungkeun Park et.al.	2409.01679	null
2024-08-30	How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition	Pedro C. Neto et.al.	2408.17399	link
2024-08-30	HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution	Masoomeh Aslahishahri et.al.	2408.16959	link
2024-08-29	VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition	Zaiwei Zhang et.al.	2408.16930	null
2024-08-29	Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling	Hritik Bansal et.al.	2408.16737	null
2024-08-29	MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition	Eduarda Caldeira et.al.	2408.16563	link
2024-08-29	Convolutional Neural Network Compression Based on Low-Rank Decomposition	Yaping He et.al.	2408.16289	null
2024-08-28	LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation	Fangxun Shu et.al.	2408.15881	link
2024-08-28	ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation	Tiantian Feng et.al.	2408.15803	null
2024-08-28	Online pre-training with long-form videos	Itsuki Kato et.al.	2408.15651	null
2024-08-28	Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation	Lujun Gui et.al.	2408.15562	null
2024-08-27	Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification	Yiqiang Cai et.al.	2408.14862	link
2024-08-27	Learning effective pruning at initialization from iterative pruning	Shengkai Liu et.al.	2408.14757	link
2024-08-26	Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems	Nikhil Khani et.al.	2408.14678	null
2024-08-25	Variational autoencoder-based neural network model compression	Liang Cheng et.al.	2408.14513	null
2024-08-26	TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines	Hymalai Bello et.al.	2408.14146	null
2024-08-27	GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets	Sven Oehri et.al.	2408.14131	link
2024-08-26	Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection	Yuncheng Jiang et.al.	2408.14051	null
2024-08-25	Condensed Sample-Guided Model Inversion for Knowledge Distillation	Kuluhan Binici et.al.	2408.13850	null
2024-08-25	Bring the Power of Diffusion Model to Defect Detection	Xuyi Yu et.al.	2408.13845	null
2024-08-24	Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic	Yifei He et.al.	2408.13656	link
2024-08-24	MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning	Seungbeom Hu et.al.	2408.13482	null
2024-08-23	Growing Deep Neural Network Considering with Similarity between Neurons	Taigo Sakai et.al.	2408.13291	null
2024-08-23	Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption	Sakhinana Sagar Srinivas et.al.	2408.13248	null
2024-08-23	A Web-Based Solution for Federated Learning with LLM-Based Automation	Chamith Mawela et.al.	2408.13010	null
2024-08-23	A Survey on Drowsiness Detection -- Modern Applications and Methods	Biying Fu et.al.	2408.12990	null
2024-08-22	Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers	Sayed Mohammad Vakilzadeh Hatefi et.al.	2408.12568	link
2024-08-22	Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models	Meiyun Wang et.al.	2408.12326	link
2024-08-22	Rebalancing Multi-Label Class-Incremental Learning	Kaile Du et.al.	2408.12161	null
2024-08-22	Vision-Based Detection of Uncooperative Targets and Components on Small Satellites	Hannah Grauer et.al.	2408.12084	null
2024-08-22	Aligning (Medical) LLMs for (Counterfactual) Fairness	Raphael Poulain et.al.	2408.12055	link
2024-08-22	LAKD-Activation Mapping Distillation Based on Local Learning	Yaoze Zhang et.al.	2408.11478	null
2024-08-21	A Practical Trigger-Free Backdoor Attack on Neural Networks	Jiahao Wang et.al.	2408.11444	null
2024-08-21	Pano2Room: Novel View Synthesis from a Single Indoor Panorama	Guo Pu et.al.	2408.11413	link
2024-08-21	Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection	Liang Yao et.al.	2408.11407	null
2024-08-21	A Unified Framework for Continual Learning and Machine Unlearning	Romit Chatterjee et.al.	2408.11374	null
2024-08-20	SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection	Huafeng Chen et.al.	2408.10760	null
2024-08-20	Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation	Md Fahim Sikder et.al.	2408.10755	null
2024-08-20	Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches	Yanjie Dong et.al.	2408.10691	null
2024-08-20	LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models	Yupeng Su et.al.	2408.10631	link
2024-08-20	Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers	Thanh Thi Nguyen et.al.	2408.10503	null
2024-08-19	Transferring Backdoors between Large Language Models by Knowledge Distillation	Pengzhou Cheng et.al.	2408.09878	link
2024-08-20	MoDeGPT: Modular Decomposition for Large Language Model Compression	Chi-Heng Lin et.al.	2408.09632	null
2024-08-18	MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment	Tianyi Liu et.al.	2408.09465	null
2024-08-18	CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination	Kaicheng Yang et.al.	2408.09441	null
2024-08-18	OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Muhammad Rameez Ur Rahman et.al.	2408.09424	link
2024-08-17	RepControlNet: ControlNet Reparameterization	Zhaoli Deng et.al.	2408.09240	null
2024-08-16	Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition	Muhammad Haseeb Aslam et.al.	2408.09035	link
2024-08-16	Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase	Yicong Li et.al.	2408.08684	null
2024-08-16	ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models	Chao Zeng et.al.	2408.08554	link
2024-08-15	Computer Vision Model Compression Techniques for Embedded Systems: A Survey	Alexandre Lopes et.al.	2408.08250	link
2024-08-15	MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU	Yan Li et.al.	2408.08144	null
2024-08-19	Knowledge Distillation with Refined Logits	Wujie Sun et.al.	2408.07703	link
2024-08-14	FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher	Alessio Mora et.al.	2408.07587	null
2024-08-14	Towards Real-time Video Compressive Sensing on Mobile Devices	Miao Cao et.al.	2408.07530	link
2024-08-14	One Step Diffusion-based Super-Resolution with Time-Aware Distillation	Xiao He et.al.	2408.07476	link
2024-08-14	Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection	Zhonglin Chen et.al.	2408.07455	null
2024-08-13	Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach	Tong Wang et.al.	2408.07238	null
2024-08-15	An Event Structure-aware Generative Model for Biomedical Event Extraction	Haohan Yuan et.al.	2408.06583	null
2024-08-12	Optimizing Vision Transformers with Data-Free Knowledge Transfer	Gousia Habib et.al.	2408.05952	null
2024-08-11	Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation	Xiaoxiong Zhang et.al.	2408.05748	null
2024-08-11	Efficient Federated Learning Using Dynamic Update and Adaptive Pruning with Momentum on Shared Server Data	Ji Liu et.al.	2408.05678	null
2024-08-08	LaDiMo: Layer-wise Distillation Inspired MoEfier	Sungyoon Kim et.al.	2408.04278	null
2024-08-08	Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement	Runduo Han et.al.	2408.04267	null
2024-08-14	ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model	Yifan Chen et.al.	2408.04145	null
2024-08-07	AdapMTL: Adaptive Pruning Framework for Multitask Learning Model	Mingcan Xiang et.al.	2408.03913	null
2024-08-07	Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection	Xinyue Liu et.al.	2408.03888	null
2024-08-07	Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields	Joo Chan Lee et.al.	2408.03822	null
2024-08-07	Iterative Knowledge Distillation through Feedback-Driven Learning Cycles	Yujia Chen et.al.	2408.03680	null
2024-08-07	Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration	Zhongyao Luo et.al.	2408.03647	link
2024-08-07	Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation	Feng Zhou et.al.	2408.03616	link
2024-08-06	EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures	Teng Liang et.al.	2408.03449	link
2024-08-06	DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers	Lianwei Yang et.al.	2408.03291	null
2024-08-06	Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments	Angie Boggust et.al.	2408.03274	null
2024-08-06	Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization	Yanghai Zhang et.al.	2408.03149	link
2024-08-06	Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations	Leo Donisch et.al.	2408.03130	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	link
2024-08-06	VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation	Ju-Hyeon Nam et.al.	2408.02888	null
2024-08-05	An approach to optimize inference of the DIART speaker diarization pipeline	Roman Aperdannier et.al.	2408.02341	null
2024-08-05	Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution	Hojung Lee et.al.	2408.02307	link
2024-08-05	Unsupervised Domain Adaption Harnessing Vision-Language Pre-training	Wenlve Zhou et.al.	2408.02192	link
2024-08-03	Joint Model Pruning and Resource Allocation for Wireless Time-triggered Federated Learning	Xinlu Zhang et.al.	2408.01765	null
2024-08-02	An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression	Shiyi Luo et.al.	2408.01534	null
2024-08-02	Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning	Lu Yu et.al.	2408.01076	link
2024-08-02	Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs	Afia Anjum et.al.	2408.01008	null
2024-08-01	DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects	Yiheng Huang et.al.	2408.00337	null
2024-08-01	Clover-2: Accurate Inference for Regressive Lightweight Speculative Decoding	Bin Xiao et.al.	2408.00264	null
2024-08-01	Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation	Kohei Matsuura et.al.	2408.00205	null
2024-07-31	StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization	Kaiyuan Tang et.al.	2408.00150	null
2024-08-02	Gemma 2: Improving Open Language Models at a Practical Size	Gemma Team et.al.	2408.00118	null
2024-07-31	Dynamic Object Queries for Transformer-based Incremental Object Detection	Jichuan Zhang et.al.	2407.21687	null
2024-07-31	Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins	Lukas Gienapp et.al.	2407.21515	null
2024-07-31	VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning	Yuhang Ming et.al.	2407.21416	null
2024-07-31	Lifelong Person Search	Jae-Won Yang et.al.	2407.21252	null
2024-07-29	SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation	Chakkrit Termritthikun et.al.	2407.20062	link
2024-07-29	ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality	Guoliang Xu et.al.	2407.19820	null
2024-07-29	Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices	Hayun Lee et.al.	2407.19644	null
2024-07-28	Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models	Mohammed Al-Maamari et.al.	2407.19610	link
2024-07-28	Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Knowledge Distillation and Random Data Erasing	Heejoon Koo et.al.	2407.19540	link
2024-07-28	LLAVADI: What Matters For Multimodal Large Language Models Distillation	Shilin Xu et.al.	2407.19409	null
2024-07-28	Logic Distillation: Learning from Code Function by Function for Planning and Decision-making	Dong Chen et.al.	2407.19405	null
2024-07-27	Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network	Gang Pan et.al.	2407.19271	null
2024-07-26	Automatic Detection of Moral Values in Music Lyrics	Vjosa Preniqi et.al.	2407.18787	link
2024-07-26	Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers	Longkun Zou et.al.	2407.18534	link
2024-07-26	FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction	Wentao Ouyang et.al.	2407.18472	null
2024-07-26	Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation	Jiabo Ma et.al.	2407.18449	null
2024-07-25	Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT	Niels G. Faber et.al.	2407.18288	link
2024-07-25	Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning	Tianduo Wang et.al.	2407.18248	link
2024-07-25	How to Train the Teacher Model for Effective Knowledge Distillation	Shayan Mohajer Hamidi et.al.	2407.18041	link
2024-07-25	Peak-Controlled Logits Poisoning Attack in Federated Distillation	Yuhan Tang et.al.	2407.18039	null
2024-07-25	Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach	Kangil Lee et.al.	2407.17909	null
2024-07-25	NC-NCD: Novel Class Discovery for Node Classification	Yue Hou et.al.	2407.17816	link
2024-07-24	CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis	Muhammad Alberb et.al.	2407.17620	link
2024-07-24	(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork	Tianjin Huang et.al.	2407.17412	null
2024-07-23	Strike a Balance in Continual Panoptic Segmentation	Jinpeng Chen et.al.	2407.16354	link
2024-07-23	OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection	Fan Cui et.al.	2407.16237	link
2024-07-23	DDK: Distilling Domain Knowledge for Efficient Large Language Models	Jiaheng Liu et.al.	2407.16154	null

(back to top)
