Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction | Changjun Li et.al. | 2505.00259 | null |
2025-04-24 | Precision Neural Network Quantization via Learnable Adaptive Modules | Wenqiang Zhou et.al. | 2504.17263 | null |
2025-04-21 | StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models | Yeona Hong et.al. | 2504.14915 | null |
2025-04-14 | Enhancing Ultra-Low-Bit Quantization of Large Language Models Through Saliency-Aware Partial Retraining | Deyu Cao et.al. | 2504.13932 | null |
2025-04-13 | Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization | Yamato Arai et.al. | 2504.09629 | null |
2025-04-12 | DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models | Wenjin Ke et.al. | 2504.09223 | null |
2025-04-10 | Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression | Hanqi Xiao et.al. | 2504.07389 | link |
2025-04-09 | Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation | Sirine Arfa et.al. | 2504.06748 | null |
2025-04-07 | Achieving binary weight and activation for LLMs using Post-Training Quantization | Siqing Song et.al. | 2504.05352 | null |
2025-03-29 | RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm | Yongyi Yang et.al. | 2504.03717 | null |
2025-04-04 | Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency | Erik Johannes Husom et.al. | 2504.03360 | null |
2025-04-03 | APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers | Zhuguanyu Wu et.al. | 2504.02508 | link |
2025-04-02 | LLMPi: Optimizing LLMs for High-Throughput on Raspberry Pi | Mahsa Ardakani et.al. | 2504.02118 | null |
2025-04-03 | Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models | Hung-Yueh Chiang et.al. | 2503.22879 | link |
2025-03-24 | Wireless Hearables With Programmable Speech AI Accelerators | Malek Itani et.al. | 2503.18698 | null |
2025-03-24 | GranQ: Granular Zero-Shot Quantization with Unified Layer-Channel Awareness | Inpyo Hong et.al. | 2503.18339 | null |
2025-03-20 | QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge | Xuan Shen et.al. | 2503.16709 | null |
2025-03-22 | Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation | Yuqing Wang et.al. | 2503.16430 | null |
2025-03-19 | PARQ: Piecewise-Affine Regularized Quantization | Lisa Jin et.al. | 2503.15748 | null |
2025-03-19 | FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers | Ruichen Chen et.al. | 2503.15465 | link |
2025-03-14 | Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix | Junbiao Pang et.al. | 2503.11159 | null |
2025-03-13 | OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models | Akshat Ramachandran et.al. | 2503.10959 | null |
2025-03-12 | Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks | Idris Zakariyya et.al. | 2503.08973 | null |
2025-03-10 | QuantU-Net: Efficient Wearable Medical Imaging Using Bitwidth as a Trainable Parameter | Christiaan Boerkamp et.al. | 2503.08719 | null |
2025-03-10 | Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping | Ning Ding et.al. | 2503.06930 | null |
2025-03-09 | SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model | Jing Zhang et.al. | 2503.06515 | null |
2025-03-05 | AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model | Wenlun Zhang et.al. | 2503.03088 | null |
2025-03-04 | Q&C: When Quantization Meets Cache in Efficient Image Generation | Xin Ding et.al. | 2503.02508 | null |
2025-02-28 | Identifying Sensitive Weights via Post-quantization Integral | Yuezhou Hu et.al. | 2503.01901 | null |
2025-03-03 | KurTail : Kurtosis-based LLM Quantization | Mohammad Sadegh Akhondzadeh et.al. | 2503.01483 | null |
2025-03-05 | Regularization-based Framework for Quantization-, Fault- and Variability-Aware Training | Anmol Biswas et.al. | 2503.01297 | null |
2025-02-27 | HALO: Hardware-aware quantization with low critical-path-delay weights for LLM acceleration | Rohan Juneja et.al. | 2502.19662 | null |
2025-02-26 | Binary Neural Networks for Large Language Model: A Survey | Liangdong Liu et.al. | 2502.19008 | null |
2025-02-23 | Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression | Xiaoyi Qu et.al. | 2502.16638 | link |
2025-02-17 | Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer | Euntae Choi et.al. | 2502.15779 | null |
2025-02-21 | Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2502.15488 | null |
2025-02-21 | CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution | Kai Liu et.al. | 2502.15478 | link |
2025-02-21 | LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design | Renjie Wei et.al. | 2502.15260 | null |
2025-02-20 | Hardware-Friendly Static Quantization Method for Video Diffusion Transformers | Sanghyun Yi et.al. | 2502.15077 | null |
2025-02-18 | PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models | Jiaqi Zhao et.al. | 2502.13179 | link |
2025-02-18 | Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis | Jiaqi Zhao et.al. | 2502.13178 | null |
2025-02-17 | Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? | Jacob Nielsen et.al. | 2502.11895 | null |
2025-02-17 | On Quantizing Neural Representation for Variable-Rate Video Coding | Junqi Shi et.al. | 2502.11729 | link |
2025-02-14 | Can Post-Training Quantization Benefit from an Additional QLoRA Integration? | Xiliang Zhu et.al. | 2502.10202 | null |
2025-02-13 | NestQuant: Nested Lattice Quantization for Matrix Products and LLMs | Semyon Savkin et.al. | 2502.09720 | null |
2025-02-13 | RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models | Quan Wei et.al. | 2502.09003 | null |
2025-02-12 | Compression of Site-Specific Deep Neural Networks for Massive MIMO Precoding | Ghazal Kasalaee et.al. | 2502.08758 | null |
2025-02-06 | Exploring Model Invariance with Discrete Search for Ultra-Low-Bit Quantization | Yuqiao Wen et.al. | 2502.06844 | null |
2025-02-07 | BCQ: Block Clustered Quantization for 4-bit (W4A4) LLM Inference | Reena Elangovan et.al. | 2502.05376 | null |
2025-02-07 | QuEST: Stable Training of LLMs with 1-Bit Weights and Activations | Andrei Panferov et.al. | 2502.05003 | link |
2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
2025-02-04 | Survey of Quantization Techniques for On-Device Vision-based Crack Detection | Yuxuan Zhang et.al. | 2502.02269 | null |
2025-02-03 | Nearly Lossless Adaptive Bit Switching | Haiduo Huang et.al. | 2502.01199 | link |
2025-02-03 | On the impact of the parametrization of deep convolutional neural networks on post-training quantization | Samy Houache et.al. | 2502.01156 | null |
2025-02-01 | Oscillations Make Neural Networks Robust to Quantization | Jonathan Wenshøj et.al. | 2502.00490 | null |
2025-02-01 | MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization | JiangYong Yu et.al. | 2502.00425 | null |
2025-01-30 | Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models | Wanlong Liu et.al. | 2501.18154 | null |
2025-01-28 | Post-Training Quantization for 3D Medical Image Segmentation: A Practical Study on Real Inference Engines | Chongyu Qu et.al. | 2501.17343 | null |
2025-01-28 | Post-Training Quantization for Vision Mamba with k-Scaled Quantization and Reparameterization | Bo-Yun Shi et.al. | 2501.16738 | null |
2025-01-24 | End-to-end workflow for machine learning-based qubit readout with QICK and hls4ml | Giuseppe Di Guglielmo et.al. | 2501.14663 | null |
2025-01-24 | On Hardening DNNs against Noisy Computations | Xiao Wang et.al. | 2501.14531 | null |
2025-01-23 | OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting | Xing Hu et.al. | 2501.13987 | link |
2025-01-23 | QMamba: Post-Training Quantization for Vision State Space Models | Yinglong Li et.al. | 2501.13624 | null |
2025-01-23 | MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods | Zukang Xu et.al. | 2501.13484 | link |
2025-01-21 | UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model | Branislava Jankovic et.al. | 2501.12087 | null |
2025-01-15 | Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach | Alireza Ghaffari et.al. | 2501.09107 | null |
2025-01-14 | D | Qian Zeng et.al. | 2501.08180 | link |
2025-01-10 | Mix-QViT: Mixed-Precision Vision Transformer Quantization Driven by Layer Importance and Quantization Sensitivity | Navin Ranjan et.al. | 2501.06357 | null |
2025-01-09 | Neural Architecture Codesign for Fast Physics Applications | Jason Weitz et.al. | 2501.05515 | link |
2025-01-09 | JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration | Mingzi Wang et.al. | 2501.05339 | null |
2025-01-09 | Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning | Dmytro Kuzmenko et.al. | 2501.05329 | null |
2025-01-06 | The Power of Negative Zero: Datatype Customization for Quantized Large Language Models | Yuzong Chen et.al. | 2501.04052 | null |
2025-01-05 | HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning | Saleh Ashkboos et.al. | 2501.02625 | link |
2024-12-30 | PQD: Post-training Quantization for Efficient Diffusion Models | Jiaojiao Ye et.al. | 2501.00124 | null |
2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
2024-12-29 | PTQ4VM: Post-Training Quantization for Visual Mamba | Younghyun Cho et.al. | 2412.20386 | link |
2024-12-28 | IMSSA: Deploying modern state-space models on memristive in-memory compute hardware | Sebastian Siegel et.al. | 2412.20215 | null |
2024-12-27 | Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales | Shuokai Pan et.al. | 2412.19867 | null |
2024-12-27 | MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Shiyao Li et.al. | 2412.19509 | link |
2024-12-24 | Unified Stochastic Framework for Neural Network Quantization and Pruning | Haoyu Zhang et.al. | 2412.18184 | null |
2024-12-21 | TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models | Haocheng Huang et.al. | 2412.16700 | null |
2024-12-20 | Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart | Chengting Yu et.al. | 2412.15846 | null |
2024-12-19 | Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers | Rui Ding et.al. | 2412.14633 | null |
2024-12-19 | Qua | Keith G. Mills et.al. | 2412.14628 | null |
2024-12-18 | ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals | Utkarsh Saxena et.al. | 2412.14363 | link |
2024-12-15 | Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment | Haisheng Lu et.al. | 2412.11186 | link |
2024-12-13 | TTAQ: Towards Stable Post-training Quantization in Continuous Domain Adaptation | Junrui Xiao et.al. | 2412.09899 | null |
2024-12-12 | CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs | Yuzhuang Xu et.al. | 2412.09282 | null |
2024-12-10 | Post-Training Non-Uniform Quantization for Convolutional Neural Networks | Ahmed Luqman et.al. | 2412.07391 | null |
2024-12-09 | FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization | Boyang Zhang et.al. | 2412.06865 | null |
2024-12-09 | Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion | Shuaiting Li et.al. | 2412.06661 | null |
2024-12-07 | GAQAT: gradient-adaptive quantization-aware training for domain generalization | Jiacheng Jiang et.al. | 2412.05551 | null |
2024-12-07 | SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization | Runsheng Bai et.al. | 2412.04180 | null |
2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
2024-12-03 | CPTQuant - A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models | Amitash Nanda et.al. | 2412.03599 | null |
2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
2024-12-03 | PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Libo Zhu et.al. | 2411.17106 | link |
2024-11-23 | freePruner: A Training-free Approach for Large Multimodal Model Acceleration | Bingxin Xu et.al. | 2411.15446 | null |
2024-11-22 | FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration | Donghyeon Yi et.al. | 2411.14733 | null |
2024-11-17 | EfQAT: An Efficient Framework for Quantization-Aware Training | Saleh Ashkboos et.al. | 2411.11038 | null |
2024-11-12 | ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Weibo Zhao et.al. | 2411.07762 | null |
2024-11-09 | Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques | Jahid Hasan et.al. | 2411.06084 | null |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-30 | Scaling Laws for Precision | Tanishq Kumar et.al. | 2411.04330 | null |
2024-11-06 | Interactions Across Blocks in Post-Training Quantization of Large Language Models | Khasmamad Shabanovi et.al. | 2411.03934 | null |
2024-11-06 | An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging | Públio Elon Correa da Silva et.al. | 2411.03835 | link |
2024-11-06 | TATAA: Programmable Mixed-Precision Transformer Acceleration with a Transformable Arithmetic Architecture | Jiajun Wu et.al. | 2411.03697 | null |
2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et.al. | 2410.22110 | link |
2024-10-30 | IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models | Hang Guo et.al. | 2410.21759 | link |
2024-10-26 | DQRM: Deep Quantized Recommendation Models | Yang Zhou et.al. | 2410.20046 | link |
2024-10-14 | Real-Time Stress Detection via Photoplethysmogram Signals: Implementation of a Combined Continuous Wavelet Transform and Convolutional Neural Network on Resource-Constrained Microcontrollers | Yasin Hasanpoor et.al. | 2410.19776 | null |
2024-10-24 | TesseraQ: Ultra Low-Bit LLM Post-Training Quantization with Block Reconstruction | Yuhang Li et.al. | 2410.19103 | null |
2024-10-18 | Understanding the difficulty of low-precision post-training quantization of large language models | Zifei Xu et.al. | 2410.14570 | null |
2024-10-17 | Quamba: A Post-Training Quantization Recipe for Selective State Space Models | Hung-Yueh Chiang et.al. | 2410.13229 | link |
2024-10-17 | Scaling laws for post-training quantized large language models | Zifei Xu et.al. | 2410.12119 | null |
2024-10-15 | Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks | Alireza Khodamoradi et.al. | 2410.11203 | link |
2024-10-06 | Continuous Approximations for Improving Quantization Aware Training of LLMs | He Li et.al. | 2410.10849 | null |
2024-10-12 | SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs | Mohammad Mozaffari et.al. | 2410.09615 | link |
2024-10-12 | FlatQuant: Flatness Matters for LLM Quantization | Yuxuan Sun et.al. | 2410.09426 | link |
2024-10-10 | Q-VLM: Post-training Quantization for Large Vision-Language Models | Changyuan Wang et.al. | 2410.08119 | link |
2024-10-10 | Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection | Hubert Cecotti et.al. | 2410.07920 | null |
2024-10-10 | CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression | Wenyuan Liu et.al. | 2410.07505 | null |
2024-10-09 | Scaling Laws for Mixed quantization in Large Language Models | Zeyu Cao et.al. | 2410.06722 | null |
2024-10-08 | QERA: an Analytical Framework for Quantization Error Reconstruction | Cheng Zhang et.al. | 2410.06040 | null |
2024-10-08 | QT-DoG: Quantization-aware Training for Domain Generalization | Saqib Javed et.al. | 2410.06020 | link |
2024-10-10 | ARB-LLM: Alternating Refined Binarizations for Large Language Models | Zhiteng Li et.al. | 2410.03129 | link |
2024-10-03 | Lightweight Diffusion Models for Resource-Constrained Semantic Communication | Giovanni Pignata et.al. | 2410.02491 | link |
2024-10-01 | Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Ismail Erbas et.al. | 2410.00948 | null |
2024-09-30 | Constraint Guided Model Quantization of Neural Networks | Quinten Van Baelen et.al. | 2409.20138 | null |
2024-09-26 | P4Q: Learning to Prompt for Quantization in Visual-language Models | Huixin Sun et.al. | 2409.17634 | null |
2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
2024-09-25 | VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu et.al. | 2409.17066 | link |
2024-09-25 | PTQ4RIS: Post-Training Quantization for Referring Image Segmentation | Xiaoyan Jiang et.al. | 2409.17020 | link |
2024-09-26 | INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Shimao Chen et.al. | 2409.16997 | link |
2024-09-20 | PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models | Jayneel Vora et.al. | 2409.13894 | null |
2024-09-18 | Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview | Yanshu Wang et.al. | 2409.11650 | null |
2024-09-12 | LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs | Han Xu et.al. | 2409.11424 | null |
2024-09-12 | DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing | Zhenyuan Dong et.al. | 2409.07756 | link |
2024-08-31 | Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization | Vage Egiazarian et.al. | 2409.00492 | null |
2024-08-29 | A machine learning approach for computing solar flare locations in X-rays on-board Solar Orbiter/STIX | Paolo Massa et.al. | 2408.16642 | link |
2024-08-29 | On-device AI: Quantization-aware Training of Transformers in Time-Series | Tianheng Ling et.al. | 2408.16495 | null |
2024-08-27 | The Uniqueness of LLaMA3-70B with Per-Channel Quantization: An Empirical Study | Minghai Qin et.al. | 2408.15301 | null |
2024-08-25 | MobileQuant: Mobile-friendly Quantization for On-device Language Models | Fuwen Tan et.al. | 2408.13933 | link |
2024-08-25 | Infrared Domain Adaptation with Zero-Shot Quantization | Burak Sevsay et.al. | 2408.13925 | null |
2024-08-23 | ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Chao Zeng et.al. | 2408.08554 | link |
2024-08-14 | Analog Spiking Neuron in CMOS 28 nm Towards Large-Scale Neuromorphic Processors | Marwan Besrour et.al. | 2408.07734 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-11 | RTF-Q: Unsupervised domain adaptation based retraining-free quantization network | Nanyang Du et.al. | 2408.05752 | null |
2024-08-16 | DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers | Lianwei Yang et.al. | 2408.03291 | null |
2024-08-05 | HQOD: Harmonious Quantization for Object Detection | Long Huang et.al. | 2408.02561 | link |
2024-08-01 | Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization | Róisín Luo et.al. | 2408.00923 | null |
2024-08-07 | Temporal Feature Matters: A Framework for Diffusion Model Quantization | Yushi Huang et.al. | 2407.19547 | null |
2024-07-25 | Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Sanae Lotfi et.al. | 2407.18158 | null |
2024-07-27 | MetaAug: Meta-Data Augmentation for Post-Training Quantization | Cuong Pham et.al. | 2407.14726 | link |
2024-07-17 | AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Zhuguanyu Wu et.al. | 2407.12951 | link |
2024-07-17 | Mamba-PTQ: Outlier Channels in Recurrent Large Language Models | Alessandro Pierro et.al. | 2407.12397 | null |
2024-07-17 | StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators | Ethan G Rogers et.al. | 2407.12378 | null |
2024-07-17 | Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models | Ayush Kaushal et.al. | 2407.12327 | link |
2024-07-17 | QVD: Post-training Quantization for Video Diffusion Models | Shilong Tian et.al. | 2407.11585 | null |
2024-07-16 | LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices | Jung Hyun Lee et.al. | 2407.11534 | link |
2024-07-11 | Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients | Zhenyu Zhang et.al. | 2407.08296 | link |
2024-07-10 | RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization | Xijie Huang et.al. | 2407.08044 | link |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation | Chaitali Bhattacharyya et.al. | 2505.00624 | null |
2025-04-30 | TinyMA-IEI-PPO: Exploration Incentive-Driven Multi-Agent DRL with Self-Adaptive Pruning for Vehicular Embodied AI Agent Twins Migration | Zhuoqi Zeng et.al. | 2505.00055 | null |
2025-04-29 | Efficient LLMs with AMP: Attention Heads and MLP Pruning | Leandro Giusti Mugnaini et.al. | 2504.21174 | null |
2025-04-28 | Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs | Muhammad Sabih et.al. | 2504.19659 | null |
2025-04-25 | Study on Real-Time Road Surface Reconstruction Using Stereo Vision | Deepak Ghimire et.al. | 2504.18112 | null |
2025-04-20 | NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models | Lawrence Liu et.al. | 2504.14569 | link |
2025-04-19 | Diffusion-based Dynamic Contract for Federated AI Agent Construction in Mobile Metaverses | Jinbo Wen et.al. | 2504.14326 | null |
2025-04-19 | A Real-time and Hardware Efficient Artfecat-free Spike Sorting Using Deep Spike Detection | Xiaoyu Jiang et.al. | 2504.14279 | null |
2025-04-17 | Enhanced Pruning Strategy for Multi-Component Neural Architectures Using Component-Aware Graph Analysis | Ganesh Sundaram et.al. | 2504.13296 | null |
2025-04-12 | Sparse Hybrid Linear-Morphological Networks | Konstantinos Fotopoulos et.al. | 2504.09289 | null |
2025-04-08 | Mosaic: Composite Projection Pruning for Resource-efficient LLMs | Bailey J. Eccles et.al. | 2504.06323 | null |
2025-04-06 | Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression | Ivan Ilin et.al. | 2504.05346 | link |
2025-04-05 | The Effects of Grouped Structural Global Pruning of Vision Transformers on Domain Generalisation | Hamza Riaz et.al. | 2504.04196 | null |
2025-04-02 | MDP: Multidimensional Vision Model Pruning with Latency Constraint | Xinglong Sun et.al. | 2504.02168 | null |
2025-04-01 | FedPaI: Achieving Extreme Sparsity in Federated Learning via Pruning at Initialization | Haonan Wang et.al. | 2504.00308 | null |
2025-03-28 | Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning | Yupei Li et.al. | 2503.21419 | null |
2025-03-19 | Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure | Fatemeh Dehrouyeh et.al. | 2503.14799 | link |
2025-03-14 | Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity | Chi Xu et.al. | 2503.11164 | null |
2025-03-18 | Týr-the-Pruner: Unlocking Accurate 50% Structural Pruning for LLMs via Global Sparsity Distribution Optimization | Guanchen Li et.al. | 2503.09657 | null |
2025-03-08 | Sample-aware Adaptive Structured Pruning for Large Language Models | Jun Kong et.al. | 2503.06184 | null |
2025-03-07 | IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining | Yixiao Li et.al. | 2503.05920 | null |
2025-03-06 | How can representation dimension dominate structurally pruned LLMs? | Mingxue Xu et.al. | 2503.04377 | null |
2025-02-24 | Delta Decompression for MoE-based LLMs Compression | Hao Gu et.al. | 2502.17298 | link |
2025-02-23 | Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression | Xiaoyi Qu et.al. | 2502.16638 | link |
2025-03-15 | Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification | Arshia Kermani et.al. | 2502.16627 | null |
2025-02-21 | PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation | Tao Fan et.al. | 2502.15857 | null |
2025-02-21 | Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing | Qi Le et.al. | 2502.15618 | link |
2025-02-19 | EvoP: Robust LLM Inference via Evolutionary Pruning | Shangyu Wu et.al. | 2502.14910 | null |
2025-02-20 | Towards Efficient Automatic Self-Pruning of Large Language Models | Weizhong Huang et.al. | 2502.14413 | null |
2025-02-19 | MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures | Jiayu Qin et.al. | 2502.14008 | null |
2025-02-19 | Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models | Jun Zhang et.al. | 2502.13533 | link |
2025-02-17 | An Efficient Row-Based Sparse Fine-Tuning | Cen-Jhih Li et.al. | 2502.11439 | null |
2025-02-21 | DarwinLM: Evolutionary Structured Pruning of Large Language Models | Shengkun Tang et.al. | 2502.07780 | link |
2025-02-11 | Exploring Neural Network Pruning with Screening Methods | Mingyuan Wang et.al. | 2502.07189 | null |
2025-02-11 | EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models | Xingrun Xing et.al. | 2502.06663 | null |
2025-02-09 | QP-SNN: Quantized and Pruned Spiking Neural Networks | Wenjie Wei et.al. | 2502.05905 | null |
2025-02-09 | Synergistic Effects of Knowledge Distillation and Structured Pruning for Self-Supervised Speech Models | Shiva Kumar C et.al. | 2502.05837 | null |
2025-02-06 | PGB: One-Shot Pruning for BERT via Weight Grouping and Permutation | Hyemin Lim et.al. | 2502.03984 | null |
2025-02-05 | Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Boyao Wang et.al. | 2502.03460 | null |
2025-02-08 | Progressive Binarization with Semi-Structured Pruning for LLMs | Xianglong Yan et.al. | 2502.01705 | link |
2025-02-02 | Structural Latency Perturbation in Large Language Models Through Recursive State Induction | Michael Mangrum et.al. | 2502.00758 | null |
2025-02-02 | CoNNect: A Swiss-Army-Knife Regularizer for Pruning of Neural Networks | Christian Franssen et.al. | 2502.00744 | null |
2025-02-01 | ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs | Hongyi Liu et.al. | 2502.00258 | null |
2025-01-31 | Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models | Jialin Zhao et.al. | 2501.19090 | null |
2025-01-29 | 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Fabrizio Sandri et.al. | 2501.17771 | link |
2025-01-28 | B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning | Nikolaos Kaparinos et.al. | 2501.16917 | null |
2025-01-25 | ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning | Shangqian Gao et.al. | 2501.15316 | null |
2025-01-25 | PIP: Perturbation-based Iterative Pruning for Large Language Models | Yi Cao et.al. | 2501.15278 | null |
2025-01-25 | Lightweight and Post-Training Structured Pruning for On-Device Large Lanaguage Models | Zihuai Xu et.al. | 2501.15255 | null |
2025-01-23 | One-cycle Structured Pruning with Stability Driven Structure Search | Deepak Ghimire et.al. | 2501.13439 | null |
2025-01-16 | Pruning for Sparse Diffusion Models based on Gradient Flow | Ben Wan et.al. | 2501.09464 | null |
2025-01-16 | FASP: Fast and Accurate Structured Pruning of Large Language Models | Hanyu Hu et.al. | 2501.09412 | null |
2025-01-15 | SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization | Waqwoya Abebe et.al. | 2501.08504 | link |
2025-01-14 | PolyLUT: Ultra-low Latency Polynomial Inference with Hardware-Aware Structured Pruning | Marta Andronic et.al. | 2501.08043 | null |
2025-01-09 | Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning | Laura Puccioni et.al. | 2501.05248 | null |
2025-01-09 | A 1Mb mixed-precision quantized encoder for image classification and patch-based compression | Van Thien Nguyen et.al. | 2501.05097 | null |
2025-01-05 | Efficient Deployment of Large Language Models on Resource-constrained Devices | Zhiwei Yao et.al. | 2501.02438 | null |
2025-01-04 | Optimizing Small Language Models for In-Vehicle Function-Calling | Yahya Sowti Khiabani et.al. | 2501.02342 | null |
2025-01-07 | Instruction-Following Pruning for Large Language Models | Bairu Hou et.al. | 2501.02086 | null |
2024-12-24 | SlimGPT: Layer-wise Structured Pruning for Large Language Models | Gui Ling et.al. | 2412.18110 | null |
2024-12-23 | GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Chao Zeng et.al. | 2412.17560 | null |
2024-12-28 | Lillama: Large Language Models Compression via Low-Rank Feature Distillation | Yaya Sy et.al. | 2412.16719 | null |
2024-12-21 | V"Mean"ba: Visual State Space Models only need 1 hidden dimension | Tien-Yu Chi et.al. | 2412.16602 | null |
2024-12-20 | Less is More: Towards Green Code Large Language Models via Unified Structural Pruning | Guang Yang et.al. | 2412.15921 | null |
2024-12-20 | All-in-One Tuning and Structural Pruning for Domain-Specific LLMs | Lei Lu et.al. | 2412.14426 | null |
2024-12-17 | Learning Coarse-to-Fine Pruning of Graph Convolutional Networks for Skeleton-based Recognition | Hichem Sahbi et.al. | 2412.12887 | null |
2024-12-17 | A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting | Nicholas Kiefer et.al. | 2412.12883 | null |
2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | link |
2024-12-19 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guangwenjie Zou et.al. | 2412.12603 | link |
2024-12-16 | Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition | Hichem Sahbi et.al. | 2412.11813 | null |
2024-12-16 | QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models | Changhai Zhou et.al. | 2412.11629 | null |
2024-12-09 | LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation | Haihang Wu et.al. | 2412.06419 | null |
2024-12-03 | Effortless Efficiency: Low-Cost Pruning of Diffusion Models | Yang Zhang et.al. | 2412.02852 | null |
2024-11-25 | Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization | Abdesselam Ferdi et.al. | 2411.16901 | null |
2024-11-21 | FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers | Zehua Pei et.al. | 2411.14507 | null |
2024-11-21 | Layer Pruning with Consensus: A Triple-Win Solution | Leandro Giusti Mugnaini et.al. | 2411.14345 | link |
2024-11-21 | DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Hexuan Deng et.al. | 2411.14055 | link |
2024-11-19 | FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning | Qingsong Lv et.al. | 2411.12781 | link |
2024-11-17 | Electrostatic Force Regularization for Neural Structured Pruning | Abdesselam Ferdi et.al. | 2411.11079 | null |
2024-11-15 | Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems | Pedro Palacios et.al. | 2411.10285 | null |
2024-12-16 | P | Xiaodong Chen et.al. | 2411.10272 | null |
2024-11-10 | RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration | Boyao Wang et.al. | 2411.06463 | link |
2024-11-05 | Layer-Adaptive State Pruning for Deep State Space Models | Minseon Gwak et.al. | 2411.02824 | link |
2024-11-04 | Automatic Structured Pruning for Efficient Architecture in Federated Learning | Thai Vu Nguyen et.al. | 2411.01759 | link |
2024-10-31 | Mutual Information Preserving Neural Network Pruning | Charles Westphal et.al. | 2411.00147 | null |
2024-10-24 | Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts | Danyal Aftab et.al. | 2410.19185 | null |
2024-10-18 | EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search | Oliver Sieberling et.al. | 2410.14649 | link |
2024-11-04 | DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Shangqian Gao et.al. | 2410.11988 | link |
2024-11-12 | Self-Data Distillation for Recovering Quality in Pruned Large Language Models | Vithursan Thangarasa et.al. | 2410.09982 | null |
2024-10-11 | Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients | Yan Li et.al. | 2410.08457 | null |
2024-10-11 | Chip-Tuning: Classify Before Language Models Say | Fangwei Zhu et.al. | 2410.06541 | link |
2024-11-04 | Large Language Model Compression with Neural Architecture Search | Rhea Sanjay Sukthanker et.al. | 2410.06479 | null |
2024-09-29 | Investigating the Effect of Network Pruning on Performance and Interpretability | Jonathan von Rad et.al. | 2409.19727 | link |
2024-10-30 | Search for Efficient Large Language Models | Xuan Shen et.al. | 2409.17372 | link |
2024-09-22 | SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms | Niraj Pudasaini et.al. | 2409.14515 | null |
2024-09-20 | CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information | Yuxin Wang et.al. | 2409.13199 | link |
2024-09-17 | KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models | Bo Lv et.al. | 2409.11057 | null |
2024-09-11 | HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning | Tianyi Chen et.al. | 2409.09085 | link |
2024-09-12 | Structured Pruning for Efficient Visual Place Recognition | Oliver Grainge et.al. | 2409.07834 | null |
2024-09-10 | STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning | Jaeseong Lee et.al. | 2409.06211 | null |
2024-09-05 | TropNNC: Structured Neural Network Compression Using Tropical Geometry | Konstantinos Fotopoulos et.al. | 2409.03945 | null |
2024-09-02 | Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks | Samer Francy et.al. | 2409.02134 | null |
2024-08-27 | PAT: Pruning-Aware Tuning for Large Language Models | Yijiang Liu et.al. | 2408.14721 | link |
2024-08-15 | PQV-Mobile: A Combined Pruning and Quantization Toolkit to Optimize Vision Transformers for Mobile Applications | Kshitij Bhardwaj et.al. | 2408.08437 | link |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | link |
2024-08-02 | Sustainable Diffusion-based Incentive Mechanism for Generative AI-driven Digital Twins in Industrial Cyber-Physical Systems | Jinbo Wen et.al. | 2408.01173 | null |
2024-08-22 | Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models | Jiang Hao et.al. | 2407.21316 | link |
2024-07-26 | Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining | Jianwei Li et.al. | 2407.19126 | null |
2024-07-17 | MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs | Junfeng Gong et.al. | 2407.18267 | null |
2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | null |
2024-07-22 | Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Aayush Saxena et.al. | 2407.15904 | null |
2024-07-19 | Shapley Pruning for Neural Network Compression | Kamil Adamczewski et.al. | 2407.15875 | null |
2024-07-22 | A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism | Yu Xue et.al. | 2407.15600 | null |
2024-07-19 | Straightforward Layer-wise Pruning for More Efficient Visual Adaptation | Ruizi Han et.al. | 2407.14330 | null |
2024-07-18 | Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset | Yi Sheng et.al. | 2407.13896 | null |
2024-07-18 | Reconstruct the Pruned Model without Any Retraining | Pingjie Wang et.al. | 2407.13331 | null |
2024-07-18 | MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets | Peng Liao et.al. | 2407.13122 | null |
2024-07-16 | MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | Hongrong Cheng et.al. | 2407.11681 | null |
2024-07-15 | DDFAD: Dataset Distillation Framework for Audio Data | Wenbo Jiang et.al. | 2407.10446 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | Emergent Synaptic Plasticity from Tunable Dynamics of Probabilistic Bits | Sagnik Banerjee et.al. | 2505.00252 | null |
2025-04-30 | Low latency FPGA implementation of twisted Edward curve cryptography hardware accelerator over prime field | Md Rownak Hossain et.al. | 2504.21342 | null |
2025-04-28 | Systematic Hardware Integration Testing for Smart Video-based Medical Device Prototypes | Oliver Bause et.al. | 2504.19533 | null |
2025-04-28 | From Cluster to Desktop: A Cache-Accelerated INR framework for Interactive Visualization of Tera-Scale Data | Daniel Zavorotny et.al. | 2504.18001 | null |
2025-04-25 | RapidPIV: Full Flow-Field kHz PIV for Real-Time Display and Control | Scott A. Bollt et.al. | 2504.17987 | null |
2025-04-24 | ApproXAI: Energy-Efficient Hardware Acceleration of Explainable AI using Approximate Computing | Ayesha Siddique et.al. | 2504.17929 | null |
2025-04-24 | Energy Considerations of Large Language Model Inference and Efficiency Optimizations | Jared Fernandez et.al. | 2504.17674 | null |
2025-04-24 | On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration | Maoyang Xiang et.al. | 2504.17376 | null |
2025-04-24 | Fine-Grained Fusion: The Missing Piece in Area-Efficient State Space Model Acceleration | Robin Geens et.al. | 2504.17333 | null |
2025-04-21 | SCALE-Sim v3: A modular cycle-accurate systolic accelerator simulator for end-to-end system analysis | Ritik Raj et.al. | 2504.15377 | null |
2025-04-21 | To Offload or Not To Offload: Model-driven Comparison of Edge-native and On-device Processing | Nathan Ng et.al. | 2504.15162 | null |
2025-04-22 | GainSight: Application-Guided Profiling for Composing Heterogeneous On-Chip Memories in AI Hardware Accelerators | Peijing Li et.al. | 2504.14866 | null |
2025-04-26 | vApps: Verifiable Applications at Internet Scale | Isaac Zhang et.al. | 2504.14809 | null |
2025-04-19 | FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference | Coleman Hooper et.al. | 2504.14152 | null |
2025-04-25 | HyDra: SOT-CAM Based Vector Symbolic Macro for Hyperdimensional Computing | Md Mizanur Rahaman Nayan et.al. | 2504.14020 | null |
2025-04-18 | MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework | Zhenkai Qin et.al. | 2504.13574 | null |
2025-04-17 | CardioFit: A WebGL-Based Tool for Fast and Efficient Parameterization of Cardiac Action Potential Models to Fit User-Provided Data | Darby I. Cairns et.al. | 2504.13274 | null |
2025-04-15 | A Unified Hardware Accelerator for Fast Fourier Transform and Number Theoretic Transform | Rishabh Shrivastava et.al. | 2504.11124 | null |
2025-04-14 | Adaptive Synaptogenesis Implemented on a Nanomagnetic Platform | Faiyaz Elahi Mullick et.al. | 2504.10767 | null |
2025-04-14 | FPGA-Optimized Hardware Accelerator for Fast Fourier Transform and Singular Value Decomposition in AI | Hong Ding et.al. | 2504.10411 | null |
2025-04-14 | Carbon-Efficient 3D DNN Acceleration: Optimizing Performance and Sustainability | Aikaterini Maria Panteleaki et.al. | 2504.09851 | null |
2025-04-11 | ML For Hardware Design Interpretability: Challenges and Opportunities | Raymond Baartmans et.al. | 2504.08852 | null |
2025-04-11 | TensorNEAT: A GPU-accelerated Library for NeuroEvolution of Augmenting Topologies | Lishuang Wang et.al. | 2504.08339 | link |
2025-04-14 | Improving Multiresource Job Scheduling with Markovian Service Rate Policies | Zhongrui Chen et.al. | 2504.08094 | link |
2025-04-20 | Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Erin Carson et.al. | 2504.07835 | link |
2025-04-09 | Rapid inference and comparison of gravitational-wave population models with neural variational posteriors | Matthew Mould et.al. | 2504.07197 | null |
2025-04-08 | Accelerating Hybrid XOR | Haesol Im et.al. | 2504.06476 | null |
2025-04-08 | FETTA: Flexible and Efficient Hardware Accelerator for Tensorized Neural Network Training | Jinming Lu et.al. | 2504.06474 | null |
2025-04-06 | Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression | Ivan Ilin et.al. | 2504.05346 | link |
2025-04-07 | 3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization | Isha Sharma et.al. | 2504.04857 | null |
2025-04-07 | A High-Performance Curve25519 and Curve448 Unified Elliptic Curve Cryptography Accelerator | Aniket Banerjee et.al. | 2504.04731 | null |
2025-04-06 | pc-COP: An Efficient and Configurable 2048-p-Bit Fully-Connected Probabilistic Computing Accelerator for Combinatorial Optimization | Kiran Magar et.al. | 2504.04543 | null |
2025-04-04 | Efficient FPGA-accelerated Convolutional Neural Networks for Cloud Detection on CubeSats | Angela Cratere et.al. | 2504.03891 | null |
2025-04-01 | Enhancing Biologically Inspired Hierarchical Temporal Memory with Hardware-Accelerated Reflex Memory | Pavia Bera et.al. | 2504.03746 | null |
2025-03-31 | PIM-LLM: A High-Throughput Hybrid PIM Architecture for 1-bit LLMs | Jinendra Malekar et.al. | 2504.01994 | null |
2025-04-01 | SCRec: A Scalable Computational Storage System with Statistical Sharding and Tensor-train Decomposition for Recommendation Models | Jinho Yang et.al. | 2504.00520 | null |
2025-03-31 | Single-Shot Matrix-Matrix Multiplication Optical Tensor Processor for Deep Learning | Chao Luan et.al. | 2503.24356 | null |
2025-03-30 | FlexMem: High-Parallel Near-Memory Architecture for Flexible Dataflow in Fully Homomorphic Encryption | Shangyi Shi et.al. | 2503.23496 | null |
2025-04-03 | Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models | Hung-Yueh Chiang et.al. | 2503.22879 | link |
2025-03-31 | Residual-based Chebyshev filtered subspace iteration for sparse Hermitian eigenvalue problems tolerant to inexact matrix-vector products | Nikhil Kodali et.al. | 2503.22652 | null |
2025-03-27 | An Efficient Training Algorithm for Models with Block-wise Sparsity | Ding Zhu et.al. | 2503.21928 | null |
2025-03-27 | Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization | Zhenyu Liang et.al. | 2503.20286 | link |
2025-03-26 | VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers | Ching-Yao Chen et.al. | 2503.20246 | null |
2025-03-25 | Hardware Efficient Accelerator for Spiking Transformer With Reconfigurable Parallel Time Step Computing | Bo-Yu Chen et.al. | 2503.19643 | null |
2025-03-25 | An Efficient Data Reuse with Tile-Based Adaptive Stationary for Transformer Accelerators | Tseng-Jen Li et.al. | 2503.19640 | null |
2025-03-23 | Reliable Replication Protocols on SmartNICs | M. R. Siavash Katebzadeh et.al. | 2503.18093 | null |
2025-03-21 | Hardware Acceleration for HPS Algorithms in Two and Three Dimensions | Owen Melia et.al. | 2503.17535 | null |
2025-03-20 | QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge | Xuan Shen et.al. | 2503.16709 | null |
2025-03-20 | Accelerating Transformer Inference and Training with 2:4 Activation Sparsity | Daniel Haziza et.al. | 2503.16672 | null |
2025-03-20 | Explainable AI-Guided Efficient Approximate DNN Generation for Multi-Pod Systolic Arrays | Ayesha Siddique et.al. | 2503.16583 | null |
2025-03-19 | QEA: An Accelerator for Quantum Circuit Simulation with Resources Efficiency and Flexibility | Van Duy Tran et.al. | 2503.14951 | link |
2025-03-17 | Performance Analysis and Industry Deployment of Post-Quantum Cryptography Algorithms | Elif Dicle Demir et.al. | 2503.12952 | null |
2025-03-12 | EDEA: Efficient Dual-Engine Accelerator for Depthwise Separable Convolution with Direct Data Transfer | Yi Chen et.al. | 2503.11707 | null |
2025-03-13 | Bridging Machine Learning and Cosmological Simulations: Using Neural Operators to emulate Chemical Evolution | Pelle van de Bor et.al. | 2503.10736 | null |
2025-03-12 | Hardware.jl - An MLIR-based Julia HLS Flow (Work in Progress) | Benedict Short et.al. | 2503.09463 | null |
2025-03-11 | SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting | Shuaiting Li et.al. | 2503.08668 | null |
2025-03-11 | V-Max: Making RL practical for Autonomous Driving | Valentin Charraut et.al. | 2503.08388 | link |
2025-03-10 | Hardware acceleration for next-to-leading order event generation within MadGraph5_aMC@NLO | Zenny Wettersten et.al. | 2503.07439 | null |
2025-03-09 | Hardware-Accelerated Event-Graph Neural Networks for Low-Latency Time-Series Classification on SoC FPGA | Hiroshi Nakano et.al. | 2503.06629 | null |
2025-03-17 | Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models | Xubin Wang et.al. | 2503.06027 | null |
2025-03-06 | FORTALESA: Fault-Tolerant Reconfigurable Systolic Array for DNN Inference | Natalia Cherezova et.al. | 2503.04426 | null |
2025-03-06 | DiRe-JAX: A JAX based Dimensionality Reduction Algorithm for Large-scale Data | Alexander Kolpakov et.al. | 2503.03156 | link |
2025-02-26 | Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies | Shaibal Saha et.al. | 2503.02891 | null |
2025-03-04 | TFHE-SBC: Software Designs for Fully Homomorphic Encryption over the Torus on Single Board Computers | Marin Matsumoto et.al. | 2503.02559 | null |
2025-03-04 | POPGym Arcade: Parallel Pixelated POMDPs | Zekang Wang et.al. | 2503.01450 | link |
2025-02-28 | Supporting the development of Machine Learning for fundamental science in a federated Cloud with the AI_INFN platform | Lucio Anderlini et.al. | 2502.21266 | null |
2025-03-07 | GreenDFL: a Framework for Assessing the Sustainability of Decentralized Federated Learning Systems | Chao Feng et.al. | 2502.20242 | null |
2025-02-24 | Evaluating IOMMU-Based Shared Virtual Addressing for RISC-V Embedded Heterogeneous SoCs | Cyril Koenig et.al. | 2502.17398 | link |
2025-02-24 | APINT: A Full-Stack Framework for Acceleration of Privacy-Preserving Inference of Transformers based on Garbled Circuits | Hyunjun Cho et.al. | 2502.16877 | null |
2025-02-22 | A Hybrid Neural Network for High-Throughput Attosecond Resolution Single-shot X-ray Pulse Characterization | Jack Hirschman et.al. | 2502.16141 | null |
2025-02-20 | Micro Blossom: Accelerated Minimum-Weight Perfect Matching Decoding for Quantum Error Correction | Yue Wu et.al. | 2502.14787 | null |
2025-02-18 | RTPD: Penetration Depth calculation using Hardware accelerated Ray-Tracing | YoungWoo Kim et.al. | 2502.12463 | null |
2025-02-20 | TherAIssist: Assisting Art Therapy Homework and Client-Practitioner Collaboration through Human-AI Interaction | Di Liu et.al. | 2502.12443 | null |
2025-02-17 | Gem5-AcceSys: Enabling System-Level Exploration of Standard Interconnects for Novel Accelerators | Qunyou Liu et.al. | 2502.12273 | null |
2025-02-17 | SFTs: a scalable data-analysis framework for long-duration gravitational-wave signals | Rodrigo Tenorio et.al. | 2502.11823 | null |
2025-02-15 | Pushing up to the Limit of Memory Bandwidth and Capacity Utilization for Efficient LLM Decoding on Embedded FPGA | Jindong Li et.al. | 2502.10659 | null |
2025-02-13 | Recipe: Hardware-Accelerated Replication Protocols | Dimitra Giantsidi et.al. | 2502.09251 | null |
2025-02-12 | Scalable Thermodynamic Second-order Optimization | Kaelan Donatella et.al. | 2502.08603 | null |
2025-02-10 | Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | Tousif Rahman et.al. | 2502.07823 | null |
2025-02-10 | Accelerating Berends-Giele recursion for gluons in arbitrary dimensions over finite fields | Juan M. Cruz-Martinez et.al. | 2502.07060 | link |
2025-02-07 | TNIC: A Trusted NIC Architecture | Dimitra Giantsidi et.al. | 2502.05338 | null |
2025-02-07 | Gaussian Models to Non-Gaussian Realms of Quantum Photonic Simulators | Dennis Delali Kwesi Wayo et.al. | 2502.05245 | null |
2025-02-04 | SpinGlassPEPS.jl: Tensor-network package for Ising-like optimization on quasi-two-dimensional graphs | Tomasz Śmierzchalski et.al. | 2502.02317 | null |
2025-02-01 | Life-Cycle Emissions of AI Hardware: A Cradle-To-Grave Approach and Generational Trends | Ian Schneider et.al. | 2502.01671 | null |
2025-02-01 | A Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression | Shupeng Ning et.al. | 2502.01670 | null |
2025-02-02 | A Flexible Precision Scaling Deep Neural Network Accelerator with Efficient Weight Combination | Liang Zhao et.al. | 2502.00687 | null |
2025-02-01 | Late Breaking Results: Leveraging Approximate Computing for Carbon-Aware DNN Accelerators | Aikaterini Maria Panteleaki et.al. | 2502.00286 | null |
2025-01-31 | StruM: Structured Mixed Precision for Efficient Deep Learning Hardware Codesign | Michael Wu et.al. | 2501.18953 | null |
2025-01-30 | REDACTOR: eFPGA Redaction for DNN Accelerator Security | Yazan Baddour et.al. | 2501.18740 | link |
2025-01-30 | FLASH-FHE: A Heterogeneous Architecture for Fully Homomorphic Encryption Acceleration | Junxue Zhang et.al. | 2501.18371 | null |
2025-01-24 | HWPQ: Hessian-free Weight Pruning-Quantization For LLM Compression And Acceleration | Yuhan Kang et.al. | 2501.16376 | null |
2025-01-24 | Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel | Zhuoran Liu et.al. | 2501.14512 | null |
2025-02-02 | | Guobin Shen et.al. | 2501.14484 | null |
2025-01-22 | Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators | Filip Masar et.al. | 2501.12818 | link |
2025-01-22 | HEPPO: Hardware-Efficient Proximal Policy Optimization -- A Universal Pipelined Architecture for Generalized Advantage Estimation | Hazem Taha et.al. | 2501.12703 | null |
2025-01-20 | Hybrid Photonic-digital Accelerator for Attention Mechanism | Huize Li et.al. | 2501.11286 | null |
2025-01-20 | Ditto: Accelerating Diffusion Model via Temporal Value Similarity | Sungbin Kim et.al. | 2501.11211 | null |
2025-01-18 | LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator | Guoyu Li et.al. | 2501.10658 | null |
2025-01-17 | Optimizing Structured-Sparse Matrix Multiplication in RISC-V Vector Processors | Vasileios Titopoulos et.al. | 2501.10189 | null |
2025-01-17 | AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations | Jamin Seo et.al. | 2501.09954 | link |
2025-01-15 | RouteNet-Gauss: Hardware-Enhanced Network Modeling with Machine Learning | Carlos Güemes-Palau et.al. | 2501.08848 | null |
2025-01-15 | Detecting Wildfire Flame and Smoke through Edge Computing using Transfer Learning Enhanced Deep Learning Models | Giovanny Vazquez et.al. | 2501.08639 | null |
2025-01-14 | An Efficient Sparse Hardware Accelerator for Spike-Driven Transformer | Zhengke Li et.al. | 2501.07825 | null |
2025-01-13 | fastrerandomize: An R Package for Fast Rerandomization Using Accelerated Computing | Rebecca Goldstein et.al. | 2501.07642 | link |
2025-01-12 | Turing-Completeness and Undecidability in Coupled Nonlinear Optical Resonators | Gordon Li et.al. | 2501.06966 | null |
2025-01-10 | Axon: A novel systolic array architecture for improved run time and energy efficient GeMM and Conv operation with on-chip im2col | Md Mizanur Rahaman Nayan et.al. | 2501.06043 | null |
2025-01-10 | EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration | Zhifan Song et.al. | 2501.05885 | link |
2025-01-16 | TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response Scenarios | Daniel Rossi et.al. | 2501.05880 | link |
2025-01-09 | JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration | Mingzi Wang et.al. | 2501.05339 | null |
2025-01-08 | IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX | Erik Recio-Armengol et.al. | 2501.04776 | link |
2025-01-08 | Probabilistic Greedy Algorithm Solver Using Magnetic Tunneling Junctions for Traveling Salesman Problem | Ran Zhang et.al. | 2501.04447 | null |
2025-01-04 | Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies | Xubin Wang et.al. | 2501.03265 | link |
2025-01-04 | Optimizing Small Language Models for In-Vehicle Function-Calling | Yahya Sowti Khiabani et.al. | 2501.02342 | null |
2025-01-03 | DSLR-CNN: Efficient CNN Acceleration using Digit-Serial Left-to-Right Arithmetic | Malik Zohaib Nisar et.al. | 2501.01737 | null |
2025-01-02 | Harnessing Hardware Acceleration in High-Energy Physics through High-Level Synthesis Techniques | Pelayo Leguina López et.al. | 2501.01338 | null |
2024-12-30 | DeepLL: Considering Linear Logic for the Analysis of Deep Learning Experiments | Nick Papoulias et.al. | 2501.00169 | null |
2024-12-29 | A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier | Amit Sarkar et.al. | 2412.20393 | null |
2024-12-29 | Open-Source Heterogeneous SoCs for AI: The PULP Platform Experience | Francesco Conti et.al. | 2412.20391 | null |
2024-12-27 | HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models | Ze Yang et.al. | 2412.19925 | null |
2024-12-26 | Evolution, Challenges, and Optimization in Computer Architecture: The Role of Reconfigurable Systems | Jefferson Ederhion et.al. | 2412.19234 | null |
2024-12-24 | GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks | Christodoulos Peltekis et.al. | 2412.18534 | null |
2024-12-23 | Advantages of density in tensor network geometries for gradient based training | Sergi Masot-Llima et.al. | 2412.17497 | null |
2024-12-20 | Chorba: A novel CRC32 implementation | Sam Russell et.al. | 2412.16398 | null |
2024-12-20 | Designing Visual Explanations and Learner Controls to Engage Adolescents in AI-Supported Exercise Selection | Jeroen Ooge et.al. | 2412.16034 | null |
2024-12-20 | A survey on FPGA-based accelerator for ML models | Feng Yan et.al. | 2412.15666 | null |
2024-12-19 | LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation | Chenxu Zhou et.al. | 2412.15199 | null |
2024-12-18 | Pattern Matching in AI Compilers and its Formalization (Extended Version) | Joseph W. Cutler et.al. | 2412.13398 | null |
2024-12-17 | if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs | Shahzad Ahmad Butt et.al. | 2412.12481 | null |
2024-12-13 | Strong Structural Bounds for MaxSAT: The Fine Details of Using Neuromorphic and Quantum Hardware Accelerators | Max Bannach et.al. | 2412.10289 | null |
2024-12-16 | MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Shuaiting Li et.al. | 2412.10261 | null |
2024-12-12 | MPAX: Mathematical Programming in JAX | Haihao Lu et.al. | 2412.09734 | link |
2024-12-12 | Evaluating the Potential of In-Memory Processing to Accelerate Homomorphic Encryption | Mpoki Mwaisela et.al. | 2412.09144 | null |
2024-12-12 | Analyzing Practical Policies for Multiresource Job Scheduling | Zhongrui Chen et.al. | 2412.08915 | null |
2024-12-09 | LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation | Haihang Wu et.al. | 2412.06419 | null |
2024-12-03 | Demonstrating the Advantages of Analog Wafer-Scale Neuromorphic Hardware | Hartmut Schmidt et.al. | 2412.02619 | null |
2024-12-03 | Multi-timescale synaptic plasticity on analog neuromorphic hardware | Amani Atoui et.al. | 2412.02515 | null |
2024-11-27 | Deterministic and Probabilistic Rounding Error Analysis for Mixed-Precision Arithmetic on Modern Computing Units | Sahil Bhola et.al. | 2411.18747 | null |
2024-11-26 | Scalable iterative pruning of large language and vision models using block coordinate descent | Gili Rosenberg et.al. | 2411.17796 | null |
2024-11-25 | Limitations of tensor network approaches for optimization and sampling: A comparison against quantum and classical Ising machines | Anna Maria Dziubyna et.al. | 2411.16431 | link |
2024-11-25 | MixPE: Quantization and Hardware Co-design for Efficient LLM Inference | Yu Zhang et.al. | 2411.16158 | null |
2024-11-20 | Hardware Accelerators for Artificial Intelligence | S M Mojahidul Ahsan et.al. | 2411.13717 | null |
2024-11-20 | Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training | Jared Fernandez et.al. | 2411.13055 | null |
2024-11-19 | FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning | Qingsong Lv et.al. | 2411.12781 | link |
2024-11-19 | Design of an FPGA-Based Neutral Atom Rearrangement Accelerator for Quantum Computing | Xiaorang Guo et.al. | 2411.12401 | null |
2024-11-18 | SILVIA: Automated Superword-Level Parallelism Exploitation via HLS-Specific LLVM Passes for Compute-Intensive FPGA Accelerators | Giovanni Brignone et.al. | 2411.11384 | link |
2024-12-01 | InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma | Xiaoxuan Hou et.al. | 2411.09856 | link |
2024-11-21 | OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling | Xiaoling Yi et.al. | 2411.09543 | link |
2024-11-15 | Communication Compression for Tensor Parallel LLM Inference | Jan Hansen-Palmus et.al. | 2411.09510 | null |
2024-11-18 | RPCAcc: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator | Jie Zhang et.al. | 2411.07632 | null |
2024-11-11 | Spiking Transformer Hardware Accelerators in 3D Integration | Boxun Xu et.al. | 2411.07397 | null |
2024-11-10 | AMAZE: Accelerated MiMC Hardware Architecture for Zero-Knowledge Applications on the Edge | Anees Ahmed et.al. | 2411.06350 | link |
2024-11-03 | Stochastic Communication Avoidance for Recommendation Systems | Lutfi Eren Erdogan et.al. | 2411.01611 | null |
2024-11-01 | Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks | David A. Danhofer et.al. | 2411.00288 | null |
2024-10-31 | LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators | Krishna Teja Chitty-Venkata et.al. | 2411.00136 | link |
2024-10-30 | Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Michael Matthews et.al. | 2410.23208 | link |
2024-10-24 | Watermarking Large Language Models and the Generated Content: Opportunities and Challenges | Ruisi Zhang et.al. | 2410.19096 | null |
2024-10-21 | Hacking the Fabric: Targeting Partial Reconfiguration for Fault Injection in FPGA Fabrics | Jayeeta Chaudhuri et.al. | 2410.16497 | null |
2024-10-21 | Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure | Juan Cruz-Martinez et.al. | 2410.16248 | null |
2024-10-20 | A Remedy to Compute-in-Memory with Dynamic Random Access Memory: 1FeFET-1C Technology for Neuro-Symbolic AI | Xunzhao Yin et.al. | 2410.15296 | null |
2024-10-18 | Self-Satisfied: An end-to-end framework for SAT generation and prediction | Christopher R. Serrano et.al. | 2410.14888 | null |
2024-10-17 | Quamba: A Post-Training Quantization Recipe for Selective State Space Models | Hung-Yueh Chiang et.al. | 2410.13229 | link |
2024-10-16 | Mixed-precision finite element kernels and assembly: Rounding error analysis and hardware acceleration | M. Croci et.al. | 2410.12614 | link |
2024-10-15 | Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination | Arturo Salmi et.al. | 2410.11625 | null |
2024-10-15 | Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy | Shuntaro Takahashi et.al. | 2410.11553 | link |
2024-10-14 | Differentiable Weightless Neural Networks | Alan T. L. Bacellar et.al. | 2410.11112 | link |
2024-10-14 | SLaNC: Static LayerNorm Calibration | Mahsa Salmani et.al. | 2410.10553 | null |
2024-10-11 | MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Mohamed Amine Hamdi et.al. | 2410.08855 | link |
2024-10-09 | Optimized Spatial Architecture Mapping Flow for Transformer Accelerators | Haocheng Xu et.al. | 2410.07407 | null |
2024-10-09 | Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing | Ismail Erbas et.al. | 2410.07364 | null |
2024-10-03 | CAX: Cellular Automata Accelerated in JAX | Maxence Faldor et.al. | 2410.02651 | link |
2024-10-03 | Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition | Hansheng Wang et.al. | 2410.02170 | null |
2024-10-01 | Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Ismail Erbas et.al. | 2410.00948 | null |
2024-09-26 | Leader Selection and Follower Association for UE-centric Distributed Learning in Future Wireless Networks | Saeedeh Parsaeefard et.al. | 2409.18268 | null |
2024-09-26 | A 5T-2MTJ STT-assisted Spin Orbit Torque based Ternary Content Addressable Memory for Hardware Accelerators | Siri Narla et.al. | 2409.17863 | null |
2024-09-24 | Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware | Luca Scomparin et.al. | 2409.16177 | null |
2024-09-25 | Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA | Lorenzo Borella et.al. | 2409.16075 | null |
2024-09-19 | Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention | Rengan Xu et.al. | 2409.15373 | null |
2024-09-23 | Efficient Tabular Data Preprocessing of ML Pipelines | Yu Zhu et.al. | 2409.14912 | null |
2024-09-21 | FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs | Ehsan Kabir et.al. | 2409.14023 | null |
2024-09-21 | ProTEA: Programmable Transformer Encoder Acceleration on FPGA | Ehsan Kabir et.al. | 2409.13975 | null |
2024-09-23 | Towards Efficient Neuro-Symbolic AI: From Workload Characterization to Hardware Architecture | Zishen Wan et.al. | 2409.13153 | null |
2024-09-20 | Learning to Compare Hardware Designs for High-Level Synthesis | Yunsheng Bai et.al. | 2409.13138 | null |
2024-09-19 | Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAML | Chelsea Maria John et.al. | 2409.12994 | link |
2024-09-19 | CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications | Vladimir Frolov et.al. | 2409.12617 | null |
2024-09-15 | Pack my weights and run! Minimizing overheads for in-memory computing accelerators | Pouya Houshmand et.al. | 2409.11437 | null |
2024-09-11 | Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-design | Tathagata Srimani et.al. | 2409.11422 | null |
2024-09-09 | Hardware Acceleration of Kolmogorov-Arnold Network (KAN) for Lightweight Edge Inference | Wei-Hsing Huang et.al. | 2409.11418 | null |
2024-09-17 | Dynamic Range Reduction via Branch-and-Bound | Thore Gerlach et.al. | 2409.10863 | null |
2024-09-16 | Count2Multiply: Reliable In-memory High-Radix Counting | João Paulo Cardoso de Lima et.al. | 2409.10136 | null |
2024-09-16 | Hardware-Accelerated Ray Tracing for Discrete and Continuous Collision Detection on GPUs | Sizhe Sui et.al. | 2409.09918 | null |
2024-09-13 | Distributed Binary Optimization with In-Memory Computing: An Application for the SAT Problem | Xiangyi Zhang et.al. | 2409.09152 | null |
2024-09-13 | Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators | Konstantin Lübeck et.al. | 2409.08595 | null |
2024-09-17 | Foragax: An Agent-Based Modelling Framework Based on JAX | Siddharth Chaturvedi et.al. | 2409.06345 | link |
2024-09-10 | PIM-MMU: A Memory Management Unit for Accelerating Data Transfers in Commercial PIM Systems | Dongjae Lee et.al. | 2409.06204 | null |
2024-09-06 | Towards Narrowing the Generalization Gap in Deep Boolean Networks | Youngsung Kim et.al. | 2409.05905 | null |
2024-09-09 | Supervised Learning for Stochastic Optimal Control | Vince Kurtz et.al. | 2409.05792 | null |
2024-09-08 | BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration | Yuzong Chen et.al. | 2409.05227 | link |
2024-09-05 | Libra: Architectural Support For Principled, Secure And Efficient Balanced Execution On High-End Processors (Extended Version) | Hans Winderix et.al. | 2409.03743 | null |
2024-09-05 | Hardware Acceleration of LLMs: A comprehensive survey and comparison | Nikoletta Koilia et.al. | 2409.03384 | null |
2024-09-05 | Towards training digitally-tied analog blocks via hybrid gradient computation | Timothy Nest et.al. | 2409.03306 | null |
2024-08-30 | The picasso gas model: Painting intracluster gas on gravity-only simulations | F. Kéruzoré et.al. | 2408.17445 | link |
2024-08-29 | Serial and Parallel Two-Column Probing for Mixed-Integer Programming | Yongzheng Dai et.al. | 2408.16927 | link |
2024-08-29 | On-device AI: Quantization-aware Training of Transformers in Time-Series | Tianheng Ling et.al. | 2408.16495 | null |
2024-08-29 | Accelerating Image-based Pest Detection on a Heterogeneous Multi-core Microcontroller | Luca Bompani et.al. | 2408.15911 | link |
2024-08-28 | FireFly-S: Exploiting Dual-Side Sparsity for Spiking Neural Networks Acceleration with Reconfigurable Spatial Architecture | Tenglong Li et.al. | 2408.15578 | null |
2024-08-29 | CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific Edge Computing | G Abarajithan et.al. | 2408.15561 | null |
2024-08-27 | SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search | Hung-Yueh Chiang et.al. | 2408.15395 | null |
2024-08-27 | SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration | Runzhen Xue et.al. | 2408.15089 | null |
2024-08-26 | On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise | M. Reza Eslami et.al. | 2408.14680 | null |
2024-08-26 | HAPM -- Hardware Aware Pruning Method for CNN hardware accelerators in resource constrained devices | Federico Nicolas Peccia et.al. | 2408.14055 | null |
2024-08-22 | Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments | Maciej Besta et.al. | 2408.12173 | null |
2024-08-21 | Floating-Point Multiply-Add with Approximate Normalization for Low-Cost Matrix Engines | Kosmas Alexandridis et.al. | 2408.11997 | null |
2024-08-21 | Cage: Hardware-Accelerated Safe WebAssembly | Martin Fink et.al. | 2408.11456 | null |
2024-08-20 | Tapping in a Remote Vehicle's onboard LLM to Complement the Ego Vehicle's Field-of-View | Malsha Ashani Mahawatta Dona et.al. | 2408.10794 | null |
2024-08-16 | Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers | Zihang Song et.al. | 2408.08794 | null |
2024-08-16 | Cross-Chip Partial Reconfiguration for the Initialisation of Modular and Scalable Heterogeneous Systems | Marvin Fuchs et.al. | 2408.08626 | null |
2024-08-13 | HLSPilot: LLM-based High-Level Synthesis | Chenwei Xiong et.al. | 2408.06810 | link |
2024-08-12 | Hardware Architecture Design of Model-Based Image Reconstruction Towards Palm-size Photoacoustic Tomography | Yuwei Zheng et.al. | 2408.06049 | null |
2024-08-12 | SZKP: A Scalable Accelerator Architecture for Zero-Knowledge Proofs | Alhad Daftardar et.al. | 2408.05890 | null |
2024-08-10 | LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Jaehong Cho et.al. | 2408.05499 | link |
2024-08-08 | Noise-augmented Chaotic Ising Machines for Combinatorial Optimization and Sampling | Kyle Lee et.al. | 2408.04744 | null |
2024-08-07 | Hardware-Assisted Virtualization of Neural Processing Units for Cloud Platforms | Yuqi Xue et.al. | 2408.04104 | null |
2024-08-07 | Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Zhongyao Luo et.al. | 2408.03647 | link |
2024-08-06 | LLM-Aided Compilation for Tensor Accelerators | Charles Hong et.al. | 2408.03408 | null |
2024-08-06 | HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration | Pratyush Dhingra et.al. | 2408.03397 | null |
2024-08-05 | PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy | Rachmad Vidya Wicaksana Putra et.al. | 2408.02412 | null |
2024-08-02 | Digitized Phase Change Material Heterostack for Diffractive Optical Neural Network | Ruiyang Chen et.al. | 2408.01404 | null |
2024-08-02 | Search-in-Memory (SiM): Reliable, Versatile, and Efficient Data Matching in SSD's NAND Flash Memory Chip for Data Indexing Acceleration | Yun-Chih Chen et.al. | 2408.00327 | null |
2024-08-07 | Temporal Feature Matters: A Framework for Diffusion Model Quantization | Yushi Huang et.al. | 2407.19547 | null |
2024-07-16 | Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) | Seyed Nima Omidsajedi et.al. | 2407.18264 | null |
2024-07-22 | KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer | Aness Al-Qawlaq et.al. | 2407.16026 | null |
2024-07-18 | Integrated Hardware Architecture and Device Placement Search | Irene Wang et.al. | 2407.13143 | link |
2024-07-17 | ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | Salma Afifi et.al. | 2407.12638 | null |
2024-07-17 | StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators | Ethan G Rogers et.al. | 2407.12378 | null |
2024-07-16 | Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment | Yuhao Ji et.al. | 2407.12070 | null |
2024-07-16 | Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads | Aritra Dhar et.al. | 2407.11888 | null |
2024-07-15 | Hierarchical search method for gravitational waves from stellar-mass binary black holes in noisy space-based detector data | Yao Fu et.al. | 2407.10797 | null |
2024-07-14 | Accelerator-as-a-Service in Public Clouds: An Intra-Host Traffic Management View for Performance Isolation in the Wild | Jiechen Zhao et.al. | 2407.10098 | null |
2024-07-12 | 68-Channel Highly-Integrated Neural Signal Processing PSoC with On-Chip Feature Extraction, Compression, and Hardware Accelerators for Neuroprosthetics in 22nm FDSOI | Liyuan Guo et.al. | 2407.09166 | null |
2024-07-12 | Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim et.al. | 2407.08975 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | Large Language Models as AI Agents for Digital Atoms and Molecules: Catalyzing a New Era in Computational Biophysics | Yijie Xia et.al. | 2505.00270 | null |
2025-04-30 | Smart Environmental Monitoring of Marine Pollution using Edge AI | Mohamed Moursi et.al. | 2504.21759 | null |
2025-04-27 | Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality | Majid Behravan et.al. | 2504.21033 | null |
2025-04-29 | DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs | Hao Luan et.al. | 2504.20754 | null |
2025-04-29 | CarbonCall: Sustainability-Aware Function Calling for Large Language Models on Edge Devices | Varatheepan Paramanayakam et.al. | 2504.20348 | null |
2025-04-27 | Personalized Artificial General Intelligence (AGI) via Neuroscience-Inspired Continuous Learning Systems | Rajeev Gupta et.al. | 2504.20109 | null |
2025-04-28 | Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs | Muhammad Sabih et.al. | 2504.19659 | null |
2025-04-22 | TinyML for Speech Recognition | Andrew Barovic et.al. | 2504.16213 | null |
2025-04-21 | Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT | Stanley Mugisha et.al. | 2504.16128 | null |
2025-04-23 | SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems | Abhishek Tyagi et.al. | 2504.15305 | null |
2025-04-21 | Time-Series Analysis on Edge-AI Hardware for Healthcare Monitoring | Jinhai Hu et.al. | 2504.15178 | null |
2025-04-20 | Explainability for Embedding AI: Aspirations and Actuality | Thomas Weber et.al. | 2504.14631 | null |
2025-04-03 | Edge Intelligence for Wildlife Conservation: Real-Time Hornbill Call Classification Using TinyML | Kong Ka Hing et.al. | 2504.12272 | null |
2025-04-19 | MultiCore+TPU Accelerated Multi-Modal TinyML for Livestock Behaviour Recognition | Qianxue Zhang et.al. | 2504.11467 | null |
2025-04-14 | VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification | Lucas Heublein et.al. | 2504.10556 | null |
2025-04-13 | Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models? | Christophe El Zeinaty et.al. | 2504.09685 | null |
2025-04-20 | MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications | Aashaka Shah et.al. | 2504.09014 | link |
2025-04-11 | Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices | Shengyuan Ye et.al. | 2504.08242 | null |
2025-04-09 | Neural Signal Compression using RAMAN tinyML Accelerator for BCI Applications | Adithya Krishna et.al. | 2504.06996 | null |
2025-04-08 | Enhanced Anomaly Detection for Capsule Endoscopy Using Ensemble Learning Strategies | Julia Werner et.al. | 2504.06039 | null |
2025-04-03 | Advancing Air Quality Monitoring: TinyML-Based Real-Time Ozone Prediction with Cost-Effective Edge Devices | Huam Ming Ken et.al. | 2504.03776 | null |
2025-04-02 | Efficient Calibration for RRAM-based In-Memory Computing using DoRA | Weirong Dong et.al. | 2504.03763 | null |
2025-04-04 | Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency | Erik Johannes Husom et.al. | 2504.03360 | null |
2025-04-02 | Satellite Edge Artificial Intelligence with Large Models: Architectures and Technologies | Yuanming Shi et.al. | 2504.01676 | null |
2025-04-02 | HH-PIM: Dynamic Optimization of Power and Performance with Heterogeneous-Hybrid PIM for Edge AI Devices | Sangmin Jeon et.al. | 2504.01468 | null |
2025-04-01 | Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems | Rachmad Vidya Wicaksana Putra et.al. | 2504.00957 | null |
2025-04-01 | IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval | Bangwei Liu et.al. | 2504.00954 | null |
2025-04-01 | QSViT: A Methodology for Quantizing Spiking Vision Transformers | Rachmad Vidya Wicaksana Putra et.al. | 2504.00948 | null |
2025-03-19 | Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI | Jianyi Zhang et.al. | 2503.18958 | null |
2025-03-12 | Intanify AI Platform: Embedded AI for Automated IP Audit and Due Diligence | Viktor Dorfler et.al. | 2503.17374 | null |
2025-03-21 | Replay4NCL: An Efficient Memory Replay-based Methodology for Neuromorphic Continual Learning in Embedded AI Systems | Mishal Fatima Minhas et.al. | 2503.17061 | null |
2025-03-21 | On-Sensor Convolutional Neural Networks with Early-Exits | Hazem Hesham Yousef Shalby et.al. | 2503.16939 | null |
2025-03-20 | Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Hadi Amini et.al. | 2503.16585 | link |
2025-03-19 | Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure | Fatemeh Dehrouyeh et.al. | 2503.14799 | link |
2025-03-17 | Semantic-Relevance Based Sensor Selection for Edge-AI Empowered Sensing Systems | Zhiyan Liu et.al. | 2503.12785 | null |
2025-03-15 | End-to-End Edge AI Service Provisioning Framework in 6G ORAN | Yun Tang et.al. | 2503.11933 | null |
2025-03-04 | CORDIC Is All You Need | Omkar Kokane et.al. | 2503.11685 | null |
2025-03-12 | BioSpark: Beyond Analogical Inspiration to LLM-augmented Transfer | Hyeonsu Kang et.al. | 2503.09838 | null |
2025-03-19 | Edge AI for Real-time Fetal Assessment in Rural Guatemala | Nasim Katebi et.al. | 2503.09659 | null |
2025-03-12 | Edge AI-Powered Real-Time Decision-Making for Autonomous Vehicles in Adverse Weather Conditions | Milad Rahmati et.al. | 2503.09638 | null |
2025-03-12 | Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks | Idris Zakariyya et.al. | 2503.08973 | null |
2025-03-07 | SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs | Jaewoo Song et.al. | 2503.07657 | null |
2025-03-07 | Compliance of AI Systems | Julius Schöning et.al. | 2503.05571 | null |
2025-03-06 | Dynamic Pricing for On-Demand DNN Inference in the Edge-AI Market | Songyuan Li et.al. | 2503.04521 | null |
2025-03-03 | Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective | Rakshit Aralimatti et.al. | 2503.01933 | null |
2025-03-03 | Dendron: Enhancing Human Activity Recognition with On-Device TinyML Learning | Hazem Hesham Yousef Shalby et.al. | 2503.01353 | null |
2025-03-05 | Regularization-based Framework for Quantization-, Fault- and Variability-Aware Training | Anmol Biswas et.al. | 2503.01297 | null |
2025-02-28 | Transforming Cyber Defense: Harnessing Agentic and Frontier AI for Proactive, Ethical Threat Intelligence | Krti Tallam et.al. | 2503.00164 | null |
2025-02-26 | AI and Semantic Communication for Infrastructure Monitoring in 6G-Driven Drone Swarms | Tasnim Ahmed et.al. | 2503.00053 | null |
2025-02-25 | On-device edge learning for IoT data streams: a survey | Afonso Lourenço et.al. | 2502.17788 | null |
2025-02-22 | A Hybrid Neural Network for High-Throughput Attosecond Resolution Single-shot X-ray Pulse Characterization | Jack Hirschman et.al. | 2502.16141 | null |
2025-02-19 | Qwen2.5-VL Technical Report | Shuai Bai et.al. | 2502.13923 | null |
2025-02-19 | AnDB: Breaking Boundaries with an AI-Native Database for Universal Semantic Analysis | Tianqing Wang et.al. | 2502.13805 | link |
2025-02-19 | Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency | Jiangrong Shen et.al. | 2502.13572 | null |
2025-02-18 | Fast Data Aware Neural Architecture Search via Supernet Accelerated Evaluation | Emil Njor et.al. | 2502.12690 | null |
2025-02-13 | nanoML for Human Activity Recognition | Alan T. L. Bacellar et.al. | 2502.12173 | null |
2025-02-17 | InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems | Habib Larian et.al. | 2502.11644 | link |
2025-02-17 | Biases in Edge Language Models: Detection, Analysis, and Mitigation | Vinamra Sharma et.al. | 2502.11349 | null |
2025-02-14 | A Hybrid Edge Classifier: Combining TinyML-Optimised CNN with RRAM-CMOS ACAM for Energy-Efficient Inference | Kieran Woodward et.al. | 2502.10089 | null |
2025-02-13 | SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest | Jack Erhardt et.al. | 2502.09528 | null |
2025-02-10 | Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | Tousif Rahman et.al. | 2502.07823 | null |
2025-02-18 | XAMBA: Enabling Efficient State Space Models on Resource-Constrained Neural Processing Units | Arghadip Das et.al. | 2502.06924 | link |
2025-02-08 | ETHEREAL: Energy-efficient and High-throughput Inference using Compressed Tsetlin Machine | Shengyu Duan et.al. | 2502.05640 | null |
2025-02-07 | Demonstrating CavePI: Autonomous Exploration of Underwater Caves by Semantic Guidance | Alankrit Gupta et.al. | 2502.05384 | null |
2025-02-08 | Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models | Haoran Ye et.al. | 2502.02444 | null |
2025-02-03 | EdgeMark: An Automation and Benchmarking System for Embedded Artificial Intelligence Tools | Mohammad Amin Hasanpour et.al. | 2502.01700 | null |
2025-02-01 | Enhancing Field-Oriented Control of Electric Drives with Tiny Neural Network Optimized for Micro-controllers | Martin Joel Mouk Elele et.al. | 2502.00532 | null |
2025-01-31 | Infer-EDGE: Dynamic DNN Inference Optimization in 'Just-in-time' Edge-AI Implementations | Motahare Mounesan et.al. | 2501.18842 | null |
2025-01-30 | Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization | Kevin Cooper et.al. | 2501.18174 | null |
2025-01-28 | On Accelerating Edge AI: Optimizing Resource-Constrained Environments | Jacob Sander et.al. | 2501.15014 | null |
2025-02-06 | SplitQuant: Layer Splitting for Low-Bit Neural Network Quantization | Jaewoo Song et.al. | 2501.12428 | null |
2025-01-20 | Consolidating TinyML Lifecycle with Large Language Models: Reality, Illusion, or Opportunity? | Guanghan Wu et.al. | 2501.12420 | null |
2025-01-17 | Michscan: Black-Box Neural Network Integrity Checking at Runtime Through Power Analysis | Robi Paul et.al. | 2501.10174 | null |
2025-01-13 | QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications | Jeongseok Kim et.al. | 2501.07161 | null |
2025-01-12 | Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G | Zhiyan Liu et.al. | 2501.06726 | null |
2025-01-09 | Towards smart and adaptive agents for active sensing on edge devices | Devendra Vyas et.al. | 2501.06262 | null |
2025-01-21 | Distilling Calibration via Conformalized Credal Inference | Jiayi Huang et.al. | 2501.06066 | null |
2025-01-08 | Decentralised Resource Sharing in TinyML: Wireless Bilayer Gossip Parallel SGD for Collaborative Learning | Ziyuan Bao et.al. | 2501.04817 | null |
2025-01-07 | ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono | Jingquan Wang et.al. | 2501.04062 | null |
2025-01-04 | Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies | Xubin Wang et.al. | 2501.03265 | link |
2025-01-01 | AI-ANNE: (A) (N)eural (N)et for (E)xploration: Transferring Deep Learning Models onto Microcontrollers and Embedded Systems | Dennis Klinkhammer et.al. | 2501.03256 | null |
2025-01-01 | Communication Efficient Cooperative Edge AI via Event-Triggered Computation Offloading | You Zhou et.al. | 2501.02001 | null |
2024-12-25 | Tempus Core: Area-Power Efficient Temporal-Unary Convolution Core for Low-Precision Edge DLAs | Prabhu Vellaisamy et.al. | 2412.19002 | null |
2024-12-23 | Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings | Harsh Joshi et.al. | 2412.18635 | null |
2024-12-23 | tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI | Harideep Nair et.al. | 2412.17966 | null |
2024-12-22 | Fatigue Monitoring Using Wearables and AI: Trends, Challenges, and Future Opportunities | Kourosh Kakhi et.al. | 2412.16847 | null |
2024-12-19 | ElectraSight: Smart Glasses with Fully Onboard Non-Invasive Eye Tracking Using Hybrid Contact and Contactless EOG | Nicolas Schärer et.al. | 2412.14848 | null |
2025-01-05 | Overview of AI and Communication for 6G Network: Fundamentals, Challenges, and Future Research Opportunities | Qimei Cui et.al. | 2412.14538 | null |
2024-12-17 | Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation | Abraham G. Taye et.al. | 2412.14206 | null |
2024-12-16 | Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads | Mukul Lokhande et.al. | 2412.11702 | link |
2024-12-13 | Edge AI-based Radio Frequency Fingerprinting for IoT Networks | Ahmed Mohamed Hussain et.al. | 2412.10553 | null |
2024-12-13 | EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models | Hanchu Zhou et.al. | 2412.09782 | null |
2024-12-12 | Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | Thanaphon Suwannaphong et.al. | 2412.09289 | null |
2024-12-10 | Performance Evaluation of ROS2-DDS middleware implementations facilitating Cooperative Driving in Autonomous Vehicle | Sumit Paul et.al. | 2412.07485 | null |
2024-12-07 | Innovative Sentiment Analysis and Prediction of Stock Price Using FinBERT, GPT-4 and Logistic Regression: A Data-Driven Approach | Olamilekan Shobayo et.al. | 2412.06837 | null |
2024-12-09 | DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators | Taesik Gong et.al. | 2412.06566 | link |
2024-12-09 | Sequential Printed MLP Circuits for Super TinyML Multi-Sensory Applications | Gurol Saglam et.al. | 2412.06542 | null |
2024-12-02 | Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping | Marla Grunewald et.al. | 2412.01609 | null |
2024-12-01 | Toward Real-Time Edge AI: Model-Agnostic Task-Oriented Communication with Visual Feature Alignment | Songjie Xie et.al. | 2412.00862 | link |
2024-11-28 | Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras | Jicheng Yuan et.al. | 2411.19143 | null |
2024-11-28 | Towards an Implementation of the Knowledge-Based Control Plane for Intelligent Swarm Networks | Xuanchi Guo et.al. | 2411.19068 | null |
2024-11-24 | Space-ground Fluid AI for 6G Edge Intelligence | Qian Chen et.al. | 2411.15845 | null |
2024-11-20 | Federated Continual Learning for Edge-AI: A Comprehensive Survey | Zi Wang et.al. | 2411.13740 | null |
2024-11-16 | Enhanced FIWARE-Based Architecture for Cyberphysical Systems With Tiny Machine Learning and Machine Learning Operations: A Case Study on Urban Mobility Systems | Javier Conde et.al. | 2411.13583 | null |
2024-11-19 | Signformer is all you need: Towards Edge AI for Sign Language | Eta Yang et.al. | 2411.12901 | link |
2024-11-16 | DEBUG-HD: Debugging TinyML models on-device using Hyper-Dimensional computing | Nikhil P Ghanathe et.al. | 2411.10692 | null |
2024-11-14 | ABCI 3.0: Evolution of the leading AI infrastructure in Japan | Ryousei Takano et.al. | 2411.09134 | null |
2024-11-13 | A Cost-effective, Stand-alone, and Real-time TinyML-Based Gait Diagnosis Unit Aimed at Lower-limb Robotic Prostheses and Exoskeletons | Zarin Anjum Madhiha et.al. | 2411.08474 | null |
2024-11-12 | Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge | Emmanuel Azuh Mensah et.al. | 2411.07834 | null |
2024-11-16 | Enhancing Predictive Maintenance in Mining Mobile Machinery through a TinyML-enabled Hierarchical Inference Network | Raúl de la Fuente et.al. | 2411.07168 | null |
2024-11-11 | A Primer on Word Embeddings: AI Techniques for Text Analysis in Social Work | Brian E. Perron et.al. | 2411.07156 | null |
2024-11-11 | TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems | Jacob Huckelberry et.al. | 2411.07114 | null |
2024-11-10 | Activation Map Compression through Tensor Decomposition for Deep Learning | Le-Trung Nguyen et.al. | 2411.06346 | link |
2024-11-09 | TinyML NLP Approach for Semantic Wireless Sentiment Classification | Ahmed Y. Radwan et.al. | 2411.06291 | link |
2024-11-03 | Energy-Aware FPGA Implementation of Spiking Neural Network with LIF Neurons | Asmer Hamid Ali et.al. | 2411.01628 | null |
2024-11-01 | On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance | Jaskirat Singh et.al. | 2411.00907 | null |
2024-10-30 | Profiling AI Models: Towards Efficient Computation Offloading in Heterogeneous Edge AI Systems | Juan Marcelo Parra-Ullauri et.al. | 2411.00859 | null |
2024-11-01 | GPT for Games: An Updated Scoping Review (2020-2024) | Daijin Yang et.al. | 2411.00308 | null |
2024-10-31 | Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge | Stefano Albini et.al. | 2410.24066 | link |
2024-10-28 | FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge | Sifat Ut Taki et.al. | 2410.21120 | link |
2024-10-28 | Edge Perception: Intelligent Wireless Sensing at Network Edge | Yuanhao Cui et.al. | 2410.21017 | null |
2024-10-25 | Neuromorphic IoT Architecture for Efficient Water Management: A Smart Village Case Study | Mugdim Bublin et.al. | 2410.19562 | null |
2024-10-17 | SouLLMate: An Application Enhancing Diverse Mental Health Support with Adaptive LLMs, Prompt Engineering, and RAG Techniques | Qiming Guo et.al. | 2410.16322 | null |
2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et.al. | 2410.15602 | null |
2024-10-15 | SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments | Syed Abdul Gaffar Shakhadri et.al. | 2410.11331 | null |
2024-10-14 | ABBA-VSM: Time Series Classification using Symbolic Representation on the Edge | Meerzhan Kanatbekova et.al. | 2410.10285 | null |
2024-10-12 | Token Pruning using a Lightweight Background Aware Vision Transformer | Sudhakar Sah et.al. | 2410.09324 | null |
2024-10-11 | MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Mohamed Amine Hamdi et.al. | 2410.08855 | link |
2024-10-11 | Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation | Gleb Radchenko et.al. | 2410.08651 | null |
2024-10-10 | Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices | Yiwei Zhao et.al. | 2410.08326 | null |
2024-10-10 | L-VITeX: Light-weight Visual Intuition for Terrain Exploration | Antar Mazumder et.al. | 2410.07872 | null |
2024-10-10 | Towards Robust IoT Defense: Comparative Statistics of Attack Detection in Resource-Constrained Scenarios | Zainab Alwaisi et.al. | 2410.07810 | null |
2024-10-10 | vCLIC: Towards Fast Interrupt Handling in Virtualized RISC-V Mixed-criticality Systems | Enrico Zelioli et.al. | 2410.07798 | null |
2024-10-07 | SoK: Towards Security and Safety of Edge AI | Tatjana Wingarz et.al. | 2410.05349 | null |
2024-10-10 | SONAR: A Synthetic AI-Audio Detection Framework and Benchmark | Xiang Li et.al. | 2410.04324 | link |
2024-09-28 | MicroFlow: An Efficient Rust-Based Inference Engine for TinyML | Matteo Carnelos et.al. | 2409.19432 | link |
2024-09-27 | Analog fast Fourier transforms for scalable and efficient signal processing | T. Patrick Xiao et.al. | 2409.19071 | null |
2024-09-26 | Development of an Edge Resilient ML Ensemble to Tolerate ICS Adversarial Attacks | Likai Yao et.al. | 2409.18244 | null |
2024-09-25 | Susceptibility Formulation of Density Matrix Perturbation Theory | Anders M. N. Niklasson et.al. | 2409.17033 | null |
2024-09-25 | Ethical and Scalable Automation: A Governance and Compliance Framework for Business Applications | Haocheng Lin et.al. | 2409.16872 | null |
2024-09-25 | Accelerating TinyML Inference on Microcontrollers through Approximate Kernels | Giorgos Armeniakos et.al. | 2409.16815 | link |
2024-09-23 | Benchmarking Edge AI Platforms for High-Performance ML Inference | Rakshith Jayanth et.al. | 2409.14803 | null |
2024-09-24 | CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks | Zhaozhi Qian et.al. | 2409.12623 | null |
2024-09-17 | AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances | Dhruv Agarwal et.al. | 2409.11360 | null |
2024-09-17 | Optimizing TinyML: The Impact of Reduced Data Acquisition Rates for Time Series Classification on Microcontrollers | Riya Samanta et.al. | 2409.10942 | null |
2024-09-13 | Pushing the boundaries of event subsampling in event-based video classification using CNNs | Hesam Araghi et.al. | 2409.08953 | link |
2024-09-12 | E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning | Le Zhang et.al. | 2409.08369 | link |
2024-09-12 | DiReDi: Distillation and Reverse Distillation for AIoT Applications | Chen Sun et.al. | 2409.08308 | null |
2024-09-11 | A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption | Marcus Rüb et.al. | 2409.07114 | null |
2024-09-08 | Transformer with Leveraged Masked Autoencoder for video-based Pain Assessment | Minh-Duc Nguyen et.al. | 2409.05088 | null |
2024-09-02 | Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks | Samer Francy et.al. | 2409.02134 | null |
2024-09-01 | Research on LLM Acceleration Using the High-Performance RISC-V Processor "Xiangshan" (Nanhu Version) Based on the Open-Source Matrix Instruction Set Extension (Vector Dot Product) | Xu-Hao Chen et.al. | 2409.00661 | null |
2024-08-26 | Towards Sustainable Personalized On-Device Human Activity Recognition with TinyML and Cloud-Enabled Auto Deployment | Bidyut Saha et.al. | 2409.00093 | null |
2024-08-29 | TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Bidyut Saha et.al. | 2408.16535 | link |
2024-08-08 | An Edge AI System Based on FPGA Platform for Railway Fault Detection | Jiale Li et.al. | 2408.15245 | null |
2024-08-23 | S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis | Kamal Basha S et.al. | 2408.12833 | link |
2024-08-20 | Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning | Bei Ouyang et.al. | 2408.10746 | null |
2024-08-21 | Challenges and Responses in the Practice of Large Language Models | Hongyin Zhu et.al. | 2408.09416 | null |
2024-08-15 | Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices | Tess Watt et.al. | 2408.08215 | null |
2024-08-14 | Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator | Federico Nicolas Peccia et.al. | 2408.07404 | null |
2024-08-13 | Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach | Haowei Ni et.al. | 2408.06634 | null |
2024-08-06 | Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW | Elia Cereda et.al. | 2408.03168 | null |
2024-08-05 | Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow | Philip Wiese et.al. | 2408.02473 | null |
2024-08-05 | PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy | Rachmad Vidya Wicaksana Putra et.al. | 2408.02412 | null |
2024-08-02 | A Tiny Supervised ODL Core with Auto Data Pruning for Human Activity Recognition | Hiroki Matsutani et.al. | 2408.01283 | null |
2024-07-29 | HOAA: Hybrid Overestimating Approximate Adder for Enhanced Performance Processing Engine | Omkar Kokane et.al. | 2408.00806 | link |
2024-07-31 | TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors | Zhaolan Huang et.al. | 2407.21453 | link |
2024-07-31 | SHA-CNN: Scalable Hierarchical Aware Convolutional Neural Network for Edge AI | Narendra Singh Dhakad et.al. | 2407.21370 | null |
2024-07-30 | On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds | Xu Chen et.al. | 2407.20710 | null |
2024-07-29 | Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference | Claudio Angione et.al. | 2407.19775 | null |
2024-07-25 | A Sensitivity Analysis of Cellular Automata and Heterogeneous Topology Networks: Partially-Local Cellular Automata and Homogeneous Homogeneous Random Boolean Networks | Tom Eivind Glover et.al. | 2407.18017 | null |
2024-07-22 | StreamTinyNet: video streaming analysis with spatial-temporal TinyML | Hazem Hesham Yousef Shalby et.al. | 2407.17524 | null |
2024-07-22 | KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer | Aness Al-Qawlaq et.al. | 2407.16026 | null |
2024-07-18 | Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence | Shubha R. Kharel et.al. | 2407.14560 | null |
2024-07-18 | Ultra-Low-Latency Edge Inference for Distributed Sensing | Zhanwei Wang et.al. | 2407.13360 | null |
2024-07-17 | Computing: Looking Back and Moving Forward | Muhammed Golec et.al. | 2407.12558 | null |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-18 | Enhancing TinyML Security: Study of Adversarial Attack Transferability | Parin Shah et.al. | 2407.11599 | null |
2024-07-13 | Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks | Zhenyu Wang et.al. | 2407.10016 | null |
2024-07-11 | Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware | James Seekings et.al. | 2407.08704 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-23 | Trends in AI Supercomputers | Konstantin F. Pilz et.al. | 2504.16026 | null |
2025-04-22 | GainSight: Application-Guided Profiling for Composing Heterogeneous On-Chip Memories in AI Hardware Accelerators | Peijing Li et.al. | 2504.14866 | null |
2025-04-16 | HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Stefan Abi-Karam et.al. | 2504.12268 | null |
2025-04-14 | Carbon-Efficient 3D DNN Acceleration: Optimizing Performance and Sustainability | Aikaterini Maria Panteleaki et.al. | 2504.09851 | null |
2025-03-21 | Fused-Tiled Layers: Minimizing Data Movement on RISC-V SoCs with Software-Managed Caches | Victor J. B. Jung et.al. | 2504.03676 | null |
2025-03-31 | DiffuSE: Cross-Layer Design Space Exploration of DNN Accelerator via Diffusion-Driven Optimization | Yi Ren et.al. | 2503.23945 | null |
2025-03-17 | LIMCA: LLM for Automating Analog In-Memory Computing Architecture Design Exploration | Deepak Vungarala et.al. | 2503.13301 | null |
2025-03-06 | FORTALESA: Fault-Tolerant Reconfigurable Systolic Array for DNN Inference | Natalia Cherezova et.al. | 2503.04426 | null |
2025-02-13 | GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units | Arghadip Das et.al. | 2502.06921 | link |
2025-02-09 | MetaML-Pro: Cross-Stage Design Flow Automation for Efficient Deep Learning Acceleration | Zhiqiang Que et.al. | 2502.05850 | null |
2025-02-06 | Systolic Sparse Tensor Slices: FPGA Building Blocks for Sparse and Dense AI Acceleration | Endri Taka et.al. | 2502.03763 | null |
2025-02-01 | Late Breaking Results: Leveraging Approximate Computing for Carbon-Aware DNN Accelerators | Aikaterini Maria Panteleaki et.al. | 2502.00286 | null |
2025-01-31 | StruM: Structured Mixed Precision for Efficient Deep Learning Hardware Codesign | Michael Wu et.al. | 2501.18953 | null |
2025-01-30 | REDACTOR: eFPGA Redaction for DNN Accelerator Security | Yazan Baddour et.al. | 2501.18740 | link |
2025-01-22 | SoMa: Identifying, Exploring, and Understanding the DRAM Communication Scheduling Space for DNN Accelerators | Jingwei Cai et.al. | 2501.12634 | link |
2025-01-17 | AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations | Jamin Seo et.al. | 2501.09954 | link |
2025-01-13 | Leveraging ASIC AI Chips for Homomorphic Encryption | Jianming Tong et.al. | 2501.07047 | link |
2025-01-12 | COMPASS: A Compiler Framework for Resource-Constrained Crossbar-Array Based In-Memory Deep Learning Accelerators | Jihoon Park et.al. | 2501.06780 | null |
2024-12-21 | Leveraging Highly Approximated Multipliers in DNN Inference | Georgios Zervakis et.al. | 2412.16757 | null |
2024-12-13 | Panacea: Novel DNN Accelerator using Accuracy-Preserving Asymmetric Quantization and Energy-Saving Bit-Slice Sparsity | Dongyun Kam et.al. | 2412.10059 | null |
2024-12-06 | HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design | Jinwei Tang et.al. | 2412.05393 | null |
2024-12-06 | MC3: Memory Contention based Covert Channel Communication on Shared DRAM System-on-Chips | Ismet Dagli et.al. | 2412.05228 | null |
2024-11-28 | PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers | Gwangoo Yeo et.al. | 2411.19114 | null |
2024-12-06 | FAMES: Fast Approximate Multiplier Substitution for Mixed-Precision Quantized DNNs--Down to 2 Bits! | Yi Ren et.al. | 2411.18055 | null |
2024-11-19 | Travel Time Based Task Mapping for NoC-Based DNN Accelerator | Yizhi Chen et.al. | 2411.12710 | null |
2024-10-29 | Systolic Array Data Flows for Efficient Matrix Multiplication in Deep Neural Networks | Tejas Raja et.al. | 2410.22595 | null |
2024-10-21 | Adventures with Grace Hopper AI Super Chip and the National Research Platform | J. Alex Hurt et.al. | 2410.16487 | null |
2024-10-17 | Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting | Mikael Rinkinen et.al. | 2410.13415 | null |
2024-10-11 | MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Mohamed Amine Hamdi et.al. | 2410.08855 | link |
2024-09-23 | MESC: Re-thinking Algorithmic Priority and/or Criticality Inversions for Heterogeneous MCSs | Jiapeng Guan et.al. | 2409.14837 | null |
2024-10-14 | LoopTree: Exploring the Fused-layer Dataflow Accelerator Design Space | Michael Gilbert et.al. | 2409.13625 | link |
2024-09-13 | Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators | Konstantin Lübeck et.al. | 2409.08595 | null |
2024-09-08 | BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration | Yuzong Chen et.al. | 2409.05227 | link |
2024-09-08 | HYDRA: Hybrid Data Multiplexing and Run-time Layer Configurable DNN Accelerator | Sonu Kumar et.al. | 2409.04976 | null |
2024-08-27 | SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration | Runzhen Xue et.al. | 2408.15089 | null |
2024-08-24 | SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision Deep Neural Networks | Niharika Thakuria et.al. | 2408.13617 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-09-24 | Scaling Deep Learning Computation over the Inter-Core Connected Intelligence Processor with T10 | Yiqi Liu et.al. | 2408.04808 | null |
2024-07-30 | Optical Computing for Deep Neural Network Acceleration: Foundations, Recent Developments, and Emerging Directions | Sudeep Pasricha et.al. | 2407.21184 | null |
2024-07-29 | Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices | Hayun Lee et.al. | 2407.19644 | null |
2024-07-24 | The Magnificent Seven Challenges and Opportunities in Domain-Specific Accelerator Design for Autonomous Systems | Sabrina M. Neuman et.al. | 2407.17311 | null |
2024-07-17 | StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators | Ethan G Rogers et.al. | 2407.12378 | null |
2024-07-11 | NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2 | Tengfei Xue et.al. | 2407.12057 | null |
2024-07-22 | ARCO:Adaptive Multi-Agent Reinforcement Learning-Based Hardware/Software Co-Optimization Compiler for Improved Performance in DNN Accelerator Design | Arya Fayyazi et.al. | 2407.08192 | null |
2024-06-20 | SWANN: Shuffling Weights in Crossbar Arrays for Enhanced DNN Accuracy in Deeply Scaled Technologies | Jeffry Victor et.al. | 2406.14706 | null |
2024-06-14 | CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories | Man Shi et.al. | 2406.14574 | null |
2024-06-15 | Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training | Akul Malhotra et.al. | 2406.10528 | null |
2024-07-17 | Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis | Zongyue Qin et.al. | 2406.09606 | null |
2024-06-05 | HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator | Zhewen Yu et.al. | 2406.03088 | link |
2024-06-03 | A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing | P. J. Zhou et.al. | 2406.01151 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | Block Circulant Adapter for Large Language Models | Xinyu Ding et.al. | 2505.00582 | null |
2025-05-01 | Communication-Efficient Wireless Federated Fine-Tuning for Large-Scale AI Models | Bumjun Kim et.al. | 2505.00333 | null |
2025-05-01 | AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care | Md Asaduzzaman Jabin et.al. | 2505.00275 | null |
2025-04-30 | SAM4EM: Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks | Uzair Shah et.al. | 2504.21544 | null |
2025-04-29 | TT-LoRA MoE: Unifying Parameter-Efficient Fine-Tuning and Sparse Mixture-of-Experts | Pradip Kunwar et.al. | 2504.21190 | null |
2025-04-29 | X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation | Guy Hadad et.al. | 2504.20859 | null |
2025-04-29 | Reinforcement Learning for LLM Reasoning Under Memory Constraints | Alan Lee et.al. | 2504.20834 | null |
2025-04-29 | In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer | Zechuan Zhang et.al. | 2504.20690 | null |
2025-04-29 | What Causes Knowledge Loss in Multilingual Language Models? | Maria Khelli et.al. | 2504.20356 | null |
2025-04-28 | DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Mamadou Keita et.al. | 2504.19876 | null |
2025-04-27 | Low-Rank Adaptive Structural Priors for Generalizable Diabetic Retinopathy Grading | Yunxuan Wang et.al. | 2504.19362 | null |
2025-04-25 | TLoRA: Tri-Matrix Low-Rank Adaptation of Large Language Models | Tanvir Islam et.al. | 2504.18735 | null |
2025-04-25 | Pushing the boundary on Natural Language Inference | Pablo Miralles-González et.al. | 2504.18376 | null |
2025-04-25 | Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding | Kun Li et.al. | 2504.18204 | null |
2025-04-25 | NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation | Rob Romijnders et.al. | 2504.18147 | null |
2025-04-25 | Automating Function-Level TARA for Automotive Full-Lifecycle Security | Yuqiao Yang et.al. | 2504.18083 | null |
2025-04-24 | Replay to Remember: Retaining Domain Knowledge in Streaming Language Models | Sneh Pillai et.al. | 2504.17780 | null |
2025-04-23 | Federated Learning of Low-Rank One-Shot Image Detection Models in Edge Devices with Scalable Accuracy and Compute Complexity | Abdul Hannaan et.al. | 2504.16515 | null |
2025-04-23 | EMRModel: A Large Language Model for Extracting Medical Consultation Dialogues into Structured Medical Records | Shuguang Zhao et.al. | 2504.16448 | null |
2025-04-22 | PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning | Song Wang et.al. | 2504.16023 | null |
2025-04-22 | Low-Rank Adaptation of Neural Fields | Anh Truong et.al. | 2504.15933 | null |
2025-04-22 | Tina: Tiny Reasoning Models via LoRA | Shangshang Wang et.al. | 2504.15777 | null |
2025-04-23 | A LoRA-Based Approach to Fine-Tuning LLMs for Educational Guidance in Resource-Constrained Settings | Md Millat Hosen et.al. | 2504.15610 | link |
2025-04-21 | SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation | Yue Li et.al. | 2504.15035 | null |
2025-04-21 | What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale | Xiaoyong Yuan et.al. | 2504.14815 | null |
2025-04-21 | When Cloud Removal Meets Diffusion Model in Remote Sensing | Zhenyu Yu et.al. | 2504.14785 | null |
2025-04-20 | Efficient Federated Split Learning for Large Language Models over Communication Networks | Kai Zhao et.al. | 2504.14667 | null |
2025-04-20 | TrustLoRA: Low-Rank Adaptation for Failure Detection under Out-of-distribution Data | Fei Zhu et.al. | 2504.14545 | null |
2025-04-19 | Cross-attention for State-based model RWKV-7 | Liu Xiao et.al. | 2504.14260 | link |
2025-04-18 | 6G WavesFM: A Foundation Model for Sensing, Communication, and Localization | Ahmed Aboulfotouh et.al. | 2504.14100 | null |
2025-04-18 | ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis | Andrea Rigo et.al. | 2504.13745 | null |
2025-04-18 | Efficient Parameter Adaptation for Multi-Modal Medical Image Segmentation and Prognosis | Numan Saeed et.al. | 2504.13645 | null |
2025-04-18 | LoRA-Based Continual Learning with Constraints on Critical Parameter Changes | Shimou Ling et.al. | 2504.13407 | link |
2025-04-17 | Mirror, Mirror of the Flow: How Does Regularization Shape Implicit Bias? | Tom Jacobs et.al. | 2504.12883 | null |
2025-04-17 | Chinese-Vicuna: A Chinese Instruction-following Llama-based Model | Chenghao Fan et.al. | 2504.12737 | null |
2025-04-17 | Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models | Zhenyu Yu et.al. | 2504.12574 | null |
2025-04-19 | Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex | Azadeh Beiranvand et.al. | 2504.12474 | null |
2025-04-16 | You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models | Shiwei Ding et.al. | 2504.12471 | null |
2025-04-16 | Activated LoRA: Fine-tuned LLMs for Intrinsics | Kristjan Greenewald et.al. | 2504.12397 | null |
2025-04-16 | Super-LoRa: Enhancing LoRa Throughput via Payload Superposition | Salah Abdeljabar et.al. | 2504.11927 | null |
2025-04-16 | ACE: Attentional Concept Erasure in Diffusion Models | Finn Carter et.al. | 2504.11850 | null |
2025-04-16 | Résumé abstractif à partir d'une transcription audio (Abstractive Summarization from an Audio Transcription) | Ilia Derkach et.al. | 2504.11803 | null |
2025-04-16 | A Library of LLM Intrinsics for Retrieval-Augmented Generation | Marina Danilevsky et.al. | 2504.11704 | null |
2025-04-15 | Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Nicolas Baumann et.al. | 2504.11514 | link |
2025-04-15 | UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer | Xiang Wang et.al. | 2504.11289 | link |
2025-04-15 | Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution | Xinning Chai et.al. | 2504.11271 | link |
2025-04-15 | FHBench: Towards Efficient and Personalized Federated Learning for Multimodal Healthcare | Penghao Wang et.al. | 2504.10817 | link |
2025-04-14 | CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation | Junchen Fu et.al. | 2504.10307 | link |
2025-04-14 | UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval | Yating Liu et.al. | 2504.10084 | link |
2025-04-13 | AeroLite: Tag-Guided Lightweight Generation of Aerial Image Captions | Xing Zi et.al. | 2504.09528 | null |
2025-04-13 | CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models | Pooja Guhan et.al. | 2504.09472 | null |
2025-04-13 | Vision Transformers Exhibit Human-Like Biases: Evidence of Orientation and Color Selectivity, Categorical Perception, and Phase Transitions | Nooshin Bahador et.al. | 2504.09393 | null |
2025-04-12 | FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment | Sijing Wu et.al. | 2504.09255 | null |
2025-04-12 | DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models | Wenjin Ke et.al. | 2504.09223 | null |
2025-04-11 | Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models | Jiahuan Long et.al. | 2504.08915 | null |
2025-04-11 | Spatial Audio Processing with Large Language Model on Wearable Devices | Ayushi Mishra et.al. | 2504.08907 | null |
2025-04-11 | AI-University: An LLM-based platform for instructional alignment to scientific classrooms | Mostafa Faghih Shojaei et.al. | 2504.08846 | link |
2025-04-10 | LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution | Danielle Sullivan-Pao et.al. | 2504.08149 | link |
2025-04-08 | CDM-QTA: Quantized Training Acceleration for Efficient LoRA Fine-Tuning of Diffusion Model | Jinming Lu et.al. | 2504.07998 | null |
2025-04-10 | LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation | Juzheng Zhang et.al. | 2504.07448 | link |
2025-04-09 | TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Liang-Hsuan Tseng et.al. | 2504.07053 | link |
2025-04-09 | DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation | Wangbo Zhao et.al. | 2504.06803 | null |
2025-04-08 | Can you Finetune your Binoculars? Embedding Text Watermarks into the Weights of Large Language Models | Fay Elhassan et.al. | 2504.06446 | null |
2025-04-08 | S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning | Hanqing Zeng et.al. | 2504.06426 | null |
2025-04-08 | Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images | Hicham Talaoubrid et.al. | 2504.06330 | null |
2025-04-11 | Optuna vs Code Llama: Are LLMs a New Paradigm for Hyperparameter Tuning? | Roman Kochnev et.al. | 2504.06006 | null |
2025-04-06 | AROMA: Autonomous Rank-one Matrix Adaptation | Hao Nan Sheng et.al. | 2504.05343 | link |
2025-04-07 | Enhancing Smart Contract Vulnerability Detection in DApps Leveraging Fine-Tuned LLM | Jiuyang Bu et.al. | 2504.05006 | null |
2025-04-07 | TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment | Adnan Khan et.al. | 2504.04722 | null |
2025-04-07 | LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts | Yimu Wang et.al. | 2504.04653 | null |
2025-04-06 | KnowsLM: A framework for evaluation of small language models for knowledge augmentation and humanised conversations | Chitranshu Harbola et.al. | 2504.04569 | null |
2025-04-05 | FISH-Tuning: Enhancing PEFT Methods with Fisher Information | Kang Xue et.al. | 2504.04050 | null |
2025-04-03 | The Self-Learning Agent with a Progressive Neural Network Integrated Transformer | Ajay Sivakumar et.al. | 2504.02489 | null |
2025-04-03 | Cognitive Memory in Large Language Models | Lianlei Shan et.al. | 2504.02441 | null |
2025-04-03 | AC-LoRA: Auto Component LoRA for Personalized Artistic Style Image Generation | Zhipu Cui et.al. | 2504.02231 | null |
2025-04-02 | CLIP-SLA: Parameter-Efficient CLIP Adaptation for Continuous Sign Language Recognition | Sarah Alyami et.al. | 2504.01666 | link |
2025-04-02 | Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning | Yiting Lu et.al. | 2504.01655 | link |
2025-04-01 | Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations | Chongjie Si et.al. | 2504.00851 | null |
2025-04-01 | DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism | Dengchun Li et.al. | 2504.00661 | link |
2025-04-01 | Next Generation LoRaWAN: Integrating Multi-Hop Communications at 2.4 GHz | Riccardo Marini et.al. | 2504.00489 | null |
2025-04-01 | Exploring the Collaborative Advantage of Low-level Information on Generalizable AI-Generated Image Detection | Ziyin Zhou et.al. | 2504.00463 | null |
2025-04-01 | MetaLoRA: Tensor-Enhanced Adaptive Low-Rank Fine-tuning | Maolin Wang et.al. | 2504.00460 | null |
2025-03-31 | ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning | Huandong Chang et.al. | 2504.00254 | null |
2025-03-31 | ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion | Rana Muhammad Shahroz Khan et.al. | 2503.24354 | null |
2025-03-31 | JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation | Fangda Chen et.al. | 2503.23951 | null |
2025-03-31 | Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation | Yongle Li et.al. | 2503.23869 | null |
2025-04-01 | Evaluating small vision-language models as AI assistants for radio astronomical source analysis tasks | S. Riggi et.al. | 2503.23859 | link |
2025-03-30 | Mixture of Routers | Jia-Chen Zhang et.al. | 2503.23362 | null |
2025-03-30 | Not All LoRA Parameters Are Essential: Insights on Inference Necessity | Guanhua Chen et.al. | 2503.23360 | null |
2025-03-29 | Efficient Adaptation For Remote Sensing Visual Grounding | Hasan Moughnieh et.al. | 2503.23083 | null |
2025-03-29 | InkFM: A Foundational Model for Full-Page Online Handwritten Note Understanding | Anastasiia Fadeeva et.al. | 2503.23081 | null |
2025-03-29 | Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning | Xinlei Shao et.al. | 2503.23012 | link |
2025-03-29 | Multimodal machine learning with large language embedding model for polymer property prediction | Tianren Zhang et.al. | 2503.22962 | null |
2025-03-28 | ActionStudio: A Lightweight Framework for Data and Training of Action Models | Jianguo Zhang et.al. | 2503.22673 | link |
2025-03-28 | Shadow and gravitational lensing produced by the nonlinear accretion of a scalar field onto a black hole | J. C. Acevedo-Muñoz et.al. | 2503.22624 | null |
2025-03-28 | Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities | Raman Dutt et.al. | 2503.22517 | null |
2025-03-28 | Fighting Fire with Fire: Channel-Independent RF Fingerprinting via the Ratio of Linear to Logarithmic Differential Spectrum | Tianshu Chen et.al. | 2503.22378 | null |
2025-03-28 | Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization | Barış Batuhan Topal et.al. | 2503.22352 | null |
2025-03-28 | Make Some Noise: Towards LLM audio reasoning and generation using sound tokens | Shivam Mehta et.al. | 2503.22275 | null |
2025-03-28 | Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation | Minho Park et.al. | 2503.22172 | null |
2025-03-27 | RocketPPA: Ultra-Fast LLM-Based PPA Estimator at Code-Level Abstraction | Armin Abdollahi et.al. | 2503.21971 | null |
2025-03-27 | VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models | Chi-Pin Huang et.al. | 2503.21781 | null |
2025-03-27 | Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation | Reza Qorbani et.al. | 2503.21780 | link |
2025-03-27 | Resource-Efficient Federated Fine-Tuning Large Language Models for Heterogeneous Data | Jun Liu et.al. | 2503.21213 | null |
2025-03-27 | Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing | Fan Qi et.al. | 2503.21069 | null |
2025-03-26 | Vision as LoRA | Han Wang et.al. | 2503.20680 | link |
2025-03-26 | TeleLoRA: Teleporting Model-Specific Alignment Across LLMs | Xiao Lin et.al. | 2503.20228 | null |
2025-03-26 | ProtoBERT-LoRA: Parameter-Efficient Prototypical Finetuning for Immunotherapy Study Identification | Shijia Zhang et.al. | 2503.20179 | null |
2025-03-25 | iNatAg: Multi-Class Classification Models Enabled by a Large-Scale Benchmark Dataset with 4.7M Images of 2,959 Crop and Weed Species | Naitik Jain et.al. | 2503.20068 | link |
2025-03-25 | An Overview of Low-Rank Structures in the Training and Adaptation of Large Models | Laura Balzano et.al. | 2503.19859 | null |
2025-03-25 | fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models | Saurav Sharma et.al. | 2503.19670 | null |
2025-03-25 | Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion | Haim Sawdayee et.al. | 2503.19557 | null |
2025-03-24 | A Shared Low-Rank Adaptation Approach to Personalized RLHF | Renpu Liu et.al. | 2503.19201 | null |
2025-03-24 | Efficient Self-Supervised Adaptation for Medical Image Analysis | Moein Sorkhei et.al. | 2503.18873 | link |
2025-03-24 | Advancing Cross-Organ Domain Generalization with Test-Time Style Transfer and Diversity Enhancement | Biwen Meng et.al. | 2503.18567 | null |
2025-03-24 | Hiding Images in Diffusion Models by Editing Learned Score Functions | Haoyu Chen et.al. | 2503.18459 | null |
2025-03-24 | Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners | Wen Zheng Terence Ng et.al. | 2503.18347 | null |
2025-03-24 | Surgical Action Planning with Large Language Models | Mengya Xu et.al. | 2503.18296 | null |
2025-03-23 | Decoupling Angles and Strength in Low-rank Adaptation | Massimo Bini et.al. | 2503.18225 | link |
2025-03-23 | The Power of Small LLMs in Geometry Generation for Physical Simulations | Ossama Shafiq et.al. | 2503.18178 | null |
2025-03-23 | | Javad SeraJ et.al. | 2503.18089 | null |
2025-03-23 | Investigating Recent Large Language Models for Vietnamese Machine Reading Comprehension | Anh Duc Nguyen et.al. | 2503.18062 | null |
2025-03-22 | Serial Low-rank Adaptation of Vision Transformer | Houqiang Zhong et.al. | 2503.17750 | null |
2025-03-21 | Revisiting End To End Sparse Autoencoder Training -- A Short Finetune is All You Need | Adam Karvonen et.al. | 2503.17272 | link |
2025-03-21 | TRACE: Time SeRies PArameter EffiCient FinE-tuning | Yuze Li et.al. | 2503.16991 | null |
2025-03-21 | HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis | Mengtian Li et.al. | 2503.16944 | null |
2025-03-21 | LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models | Jian Liang et.al. | 2503.16843 | null |
2025-03-20 | LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Ying Shen et.al. | 2503.16334 | null |
2025-03-20 | Ultra-Resolution Adaptation with Ease | Ruonan Yu et.al. | 2503.16322 | link |
2025-03-20 | SALT: Singular Value Adaptation with Low-Rank Transformation | Abdelrahman Elsayed et.al. | 2503.16055 | link |
2025-03-20 | Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras | Beilei Cui et.al. | 2503.15917 | null |
2025-03-19 | Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices | Ziyao Wang et.al. | 2503.14932 | null |
2025-03-18 | MusicInfuser: Making Video Diffusion Listen and Dance | Susung Hong et.al. | 2503.14505 | null |
2025-03-17 | Atyaephyra at SemEval-2025 Task 4: Low-Rank NPO | Jan Bronec et.al. | 2503.13690 | link |
2025-03-17 | Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model | Kai Tong et.al. | 2503.13575 | null |
2025-03-17 | VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning | Ye Liu et.al. | 2503.13444 | link |
2025-03-17 | Edit Transfer: Learning Image Editing via Vision In-Context Relations | Lan Chen et.al. | 2503.13327 | null |
2025-03-17 | MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Portrait Few-Step Synthesis | Shitong Shao et.al. | 2503.13319 | null |
2025-03-17 | Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation | Henghui Du et.al. | 2503.13068 | null |
2025-03-17 | ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM | Wenqiang Wang et.al. | 2503.12988 | null |
2025-03-17 | Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction | Zheyuan Liu et.al. | 2503.12953 | null |
2025-03-17 | Quantum-Enhanced LLM Efficient Fine Tuning | Xiaofei Kong et.al. | 2503.12790 | null |
2025-03-16 | RaSA: Rank-Sharing Low-Rank Adaptation | Zhiwei He et.al. | 2503.12576 | null |
2025-03-16 | Towards Suturing World Models: Learning Predictive Models for Robotic Surgical Tasks | Mehmet Kerem Turkcan et.al. | 2503.12531 | null |
2025-03-16 | Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation | Byung Hyun Lee et.al. | 2503.12356 | link |
2025-03-14 | Multi-Stage Generative Upscaler: Reconstructing Football Broadcast Images via Diffusion Models | Luca Martini et.al. | 2503.11181 | null |
2025-03-13 | Phishsense-1B: A Technical Perspective on an AI-Powered Phishing Detection Model | SE Blake et.al. | 2503.10944 | null |
2025-03-14 | Distilling Diversity and Control in Diffusion Models | Rohit Gandikota et.al. | 2503.10637 | null |
2025-03-16 | Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models | Andy Zhou et.al. | 2503.10617 | null |
2025-03-13 | ConsisLoRA: Enhancing Content and Style Consistency for LoRA-based Style Transfer | Bolin Chen et.al. | 2503.10614 | null |
2025-03-13 | Piece it Together: Part-Based Concepting with IP-Priors | Elad Richardson et.al. | 2503.10365 | null |
2025-03-13 | A Hybrid Architecture with Efficient Fine Tuning for Abstractive Patent Document Summarization | Nevidu Jayatilleke et.al. | 2503.10354 | null |
2025-03-13 | Singular Value Fine-tuning for Few-Shot Class-Incremental Learning | Zhiwu Wang et.al. | 2503.10214 | null |
2025-03-13 | PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation | Sen Wang et.al. | 2503.09938 | null |
2025-03-12 | Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection | Romain Thoreau et.al. | 2503.09493 | null |
2025-03-12 | SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery | Jiayuan Huang et.al. | 2503.09474 | null |
2025-03-12 | UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer | Haoxuan Wang et.al. | 2503.09277 | null |
2025-03-12 | Fine-Tuning Large Language Models for Educational Support: Leveraging Gagne's Nine Events of Instruction for Lesson Planning | Linzhao Jia et.al. | 2503.09276 | null |
2025-03-12 | InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images | Jiun Tian Hoe et.al. | 2503.09130 | null |
2025-03-11 | OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models | Jialv Zou et.al. | 2503.08686 | link |
2025-03-11 | Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation | Mingkang Zhu et.al. | 2503.08575 | null |
2025-03-11 | 1LoRA: Summation Compression for Very Low-Rank Adaptation | Alessio Quercia et.al. | 2503.08333 | null |
2025-03-11 | MGHanD: Multi-modal Guidance for authentic Hand Diffusion | Taehyeon Eum et.al. | 2503.08133 | null |
2025-03-11 | Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection | Ying Fu Lim et.al. | 2503.08045 | null |
2025-03-11 | MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models | Han Zhao et.al. | 2503.08007 | null |
2025-03-11 | A Study to Evaluate the Impact of LoRA Fine-tuning on the Performance of Non-functional Requirements Classification | Xia Li et.al. | 2503.07927 | null |
2025-03-10 | AdaptSR: Low-Rank Adaptation for Efficient and Scalable Real-World Super-Resolution | Cansu Korkmaz et.al. | 2503.07748 | null |
2025-03-10 | DreamRelation: Relation-Centric Video Customization | Yujie Wei et.al. | 2503.07602 | null |
2025-03-10 | Balanced Image Stylization with Style Matching Score | Yuxin Jiang et.al. | 2503.07601 | null |
2025-03-10 | TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision | Shaobin Zhuang et.al. | 2503.07416 | null |
2025-03-10 | FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates | Sangwoo Park et.al. | 2503.07216 | null |
2025-03-10 | EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer | Yuxuan Zhang et.al. | 2503.07027 | null |
2025-03-10 | Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization | Ziqing Xu et.al. | 2503.06982 | null |
2025-03-10 | Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation | Pengchen Liang et.al. | 2503.06976 | null |
2025-03-10 | A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis | Xiang Liu et.al. | 2503.06973 | link |
2025-03-09 | Conceptrol: Concept Control of Zero-shot Personalized Image Generation | Qiyuan He et.al. | 2503.06568 | link |
2025-03-09 | Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs | Umberto Cappellazzo et.al. | 2503.06362 | null |
2025-03-08 | X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation | Jian Ma et.al. | 2503.06134 | link |
2025-03-08 | A Novel Trustworthy Video Summarization Algorithm Through a Mixture of LoRA Experts | Wenzhuo Du et.al. | 2503.06064 | null |
2025-03-07 | Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints | Parameswaran Kamalaruban et.al. | 2503.05684 | null |
2025-03-07 | Nuanced Safety for Generative AI: How Demographics Shape Responsiveness to Severity | Pushkar Mishra et.al. | 2503.05609 | null |
2025-03-07 | Quantum-PEFT: Ultra parameter-efficient fine-tuning | Toshiaki Koike-Akino et.al. | 2503.05431 | null |
2025-03-07 | LoRACode: LoRA Adapters for Code Embeddings | Saumya Chaturvedi et.al. | 2503.05315 | null |
2025-03-06 | Wanda++: Pruning Large Language Models via Regional Gradients | Yifan Yang et.al. | 2503.04992 | null |
2025-03-06 | Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | Soumyadeep Ro et.al. | 2503.04918 | null |
2025-03-05 | Enhancing Collective Intelligence in Large Language Models Through Emotional Integration | Likith Kadiyala et.al. | 2503.04849 | null |
2025-03-06 | TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models | Xinyi He et.al. | 2503.04396 | null |
2025-03-07 | GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI | Cecilia Diana-Albelda et.al. | 2503.04325 | link |
2025-03-06 | Continual Optimization with Symmetry Teleportation for Multi-Task Learning | Zhipeng Zhou et.al. | 2503.04046 | null |
2025-03-05 | Personalized Federated Fine-tuning for Heterogeneous Data: An Automatic Rank Learning Approach via Two-Level LoRA | Jie Hao et.al. | 2503.03920 | null |
2025-03-05 | Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset | Jessica Hoffmann et.al. | 2503.03654 | null |
2025-03-05 | WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models | Tao Feng et.al. | 2503.03110 | null |
2025-03-04 | LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models | Pengwei Tang et.al. | 2503.02659 | null |
2025-03-04 | Efficient Long Sequential Low-rank Adaptive Attention for Click-through rate Prediction | Xin Song et.al. | 2503.02542 | null |
2025-03-04 | AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking | Iraklis Premptis et.al. | 2503.02443 | null |
2025-03-04 | Measuring Intrinsic Dimension of Token Embeddings | Takuya Kataiwa et.al. | 2503.02142 | null |
2025-03-03 | CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom | Yisen Li et.al. | 2503.01836 | link |
2025-03-03 | ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition | Nastaran Mansourian et.al. | 2503.01750 | null |
2025-03-03 | Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs | Abdelrahman Abouelenin et.al. | 2503.01743 | null |
2025-03-03 | CoPL: Collaborative Preference Learning for Personalizing LLMs | Youngbin Choi et.al. | 2503.01658 | null |
2025-03-03 | Liger: Linearizing Large Language Models to Gated Recurrent Structures | Disen Lan et.al. | 2503.01496 | null |
2025-03-03 | Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace | Jia-Chen Zhang et.al. | 2503.01419 | null |
2025-02-28 | Unsupervised Parameter Efficient Source-free Post-pretraining | Abhishek Jha et.al. | 2502.21313 | null |
2025-02-28 | RuCCoD: Towards Automated ICD Coding in Russian | Aleksandr Nesterov et.al. | 2502.21263 | link |
2025-02-28 | Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs | Weixiang Zhao et.al. | 2502.20968 | null |
2025-02-28 | Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content | Hongyuan Shen et.al. | 2502.20952 | null |
2025-02-28 | Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA | Ojonugwa Oluwafemi Ejiga Peter et.al. | 2502.20667 | null |
2025-02-27 | AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Xuyang Wei et.al. | 2502.20035 | link |
2025-02-27 | Image Referenced Sketch Colorization Based on Animation Creation Workflow | Dingkun Yan et.al. | 2502.19937 | link |
2025-03-04 | HaLoRA: Hardware-aware Low-Rank Adaptation for Large Language Models Based on Hybrid Compute-in-Memory Architecture | Taiqiang Wu et.al. | 2502.19747 | null |
2025-02-26 | Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing | Akshat Gupta et.al. | 2502.19416 | null |
2025-02-26 | CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning | Ping Zhang et.al. | 2502.18910 | null |
2025-02-25 | K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs | Ziheng Ouyang et.al. | 2502.18461 | null |
2025-02-25 | VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention | Adnan Iltaf et.al. | 2502.18185 | link |
2025-02-27 | SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models | Yuxuan Zhang et.al. | 2502.18168 | null |
2025-02-25 | C-LoRA: Continual Low-Rank Adaptation for Pre-trained Models | Xin Zhang et.al. | 2502.17920 | null |
2025-02-24 | Function-Space Learning Rates | Edward Milsom et.al. | 2502.17405 | link |
2025-02-24 | UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings | Layba Fiaz et.al. | 2502.16961 | null |
2025-02-24 | Design of a communication system Images for identification of vehicle plates | Fabrizio Andre Farfán Prado et.al. | 2502.16909 | null |
2025-02-26 | Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment | Chenghao Fan et.al. | 2502.16894 | null |
2025-02-23 | Efficient 4D Gaussian Stream with Low Rank Adaptation | Zhenhuan Liu et.al. | 2502.16575 | null |
2025-02-22 | Orthogonality Analysis in LoRa Uplink Satellite Communications Affected by Doppler Effect | Jikang Deng et.al. | 2502.16179 | null |
2025-02-22 | MedForge: Building Medical Foundation Models Like Open Source Software Development | Zheling Tan et.al. | 2502.16055 | link |
2025-02-21 | Sparsity May Be All You Need: Sparse Random Parameter Adaptation | Jesus Rios et.al. | 2502.15975 | null |
2025-02-21 | Pastiche Novel Generation Creating: Fan Fiction You Love in Your Favorite Author's Style | Xueran Han et.al. | 2502.15616 | null |
2025-02-21 | R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning | Jinda Liu et.al. | 2502.15455 | link |
2025-02-21 | Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning | Raghav Singhal et.al. | 2502.15436 | link |
2025-02-21 | On Performance of LoRa Fluid Antenna Systems | Gaoze Mu et.al. | 2502.15258 | null |
2025-02-21 | M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment | Chuan Cui et.al. | 2502.15167 | null |
2025-02-20 | Dynamic Concepts Personalization from Single Videos | Rameen Abdal et.al. | 2502.14844 | null |
2025-02-20 | Dynamic Low-Rank Sparse Adaptation for Large Language Models | Weizhong Huang et.al. | 2502.14816 | link |
2025-02-20 | Beyond Performance Scores: Directed Functional Connectivity as a Brain-Based Biomarker for Motor Skill Learning and Retention | Anil Kamat et.al. | 2502.14731 | null |
2025-02-20 | LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization | Yupeng Chang et.al. | 2502.14538 | link |
2025-02-20 | How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? | Sergey Pletenev et.al. | 2502.14502 | link |
2025-02-20 | NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models | Chenlu Guo et.al. | 2502.14482 | link |
2025-02-19 | PitVQA++: Vector Matrix-Low-Rank Adaptation for Open-Ended Visual Question Answering in Pituitary Surgery | Runlong He et.al. | 2502.14149 | link |
2025-02-19 | On the Duality between Gradient Transformations and Adapters | Lucas Torroba-Hennigen et.al. | 2502.13811 | null |
2025-02-19 | Adapting Large Language Models for Time Series Modeling via a Novel Parameter-efficient Adaptation Method | Juyuan Zhang et.al. | 2502.13725 | null |
2025-02-19 | BeamLoRA: Beam-Constraint Low-Rank Adaptation | Naibin Gu et.al. | 2502.13604 | null |
2025-02-19 | LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation | Xin Li et.al. | 2502.13568 | null |
2025-02-19 | Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models | Jun Zhang et.al. | 2502.13533 | link |
2025-02-19 | Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models | Chenyu Zhu et.al. | 2502.13474 | null |
2025-02-19 | Dynamic directed functional connectivity as a neural biomarker for objective motor skill assessment | Anil Kamat et.al. | 2502.13362 | null |
2025-02-18 | Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models | Soumi Das et.al. | 2502.13313 | null |
2025-02-18 | GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning | Sifan Zhou et.al. | 2502.12913 | null |
2025-02-18 | Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation | Kounianhua Du et.al. | 2502.12492 | null |
2025-02-16 | Efficient and Effective Prompt Tuning via Prompt Decomposition and Compressed Outer Product | Pengxiang Lan et.al. | 2502.12200 | null |
2025-02-17 | Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA | Patryk Marszałek et.al. | 2502.12122 | link |
2025-02-17 | Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis | Xu Wang et.al. | 2502.11812 | null |
2025-02-17 | DATA: Decomposed Attention-based Task Adaptation for Rehearsal-Free Continual Learning | Huanxuan Liao et.al. | 2502.11482 | link |
2025-02-17 | An Efficient Row-Based Sparse Fine-Tuning | Cen-Jhih Li et.al. | 2502.11439 | null |
2025-02-16 | Integrating Language Models for Enhanced Network State Monitoring in DRL-Based SFC Provisioning | Parisa Fard Moshiri et.al. | 2502.11298 | null |
2025-02-18 | AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks | Ming Xie et.al. | 2502.11158 | null |
2025-02-15 | Generalizable speech deepfake detection via meta-learned LoRA | Janne Laakkonen et.al. | 2502.10838 | null |
2025-02-15 | Code-Mixed Telugu-English Hate Speech Detection | Santhosh Kakarla et.al. | 2502.10632 | null |
2025-02-14 | Hallucinations and Truth: A Comprehensive Accuracy Evaluation of RAG, LoRA and DoRA | Mohammad Baqar et.al. | 2502.10497 | null |
2025-02-14 | Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages | Daniil Gurgurov et.al. | 2502.10140 | null |
2025-02-14 | Precise Parameter Localization for Textual Generation in Diffusion Models | Łukasz Staniszewski et.al. | 2502.09935 | null |
2025-02-14 | Port-LLM: A Port Prediction Method for Fluid Antenna based on Large Language Models | Yali Zhang et.al. | 2502.09857 | null |
2025-02-14 | HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation | Tianwei Lin et.al. | 2502.09838 | link |
2025-02-13 | Improving Acoustic Side-Channel Attacks on Keyboards Using Transformers and Large Language Models | Jin Hyun Park et.al. | 2502.09782 | null |
2025-02-14 | LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won't Fail) | Junsu Kim et.al. | 2502.09376 | null |
2025-02-13 | DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation | Tangyu Jiang et.al. | 2502.08905 | null |
2025-02-13 | BrainWavLM: Fine-tuning Speech Representations with Brain Responses to Language | Nishitha Vattikonda et.al. | 2502.08866 | null |
2025-02-12 | LoRa Fine Synchronization with Two-Pass Time and Frequency Offset Estimation | Joachim Tapparel et.al. | 2502.08485 | null |
2025-02-12 | LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits | Zikai Zhou et.al. | 2502.08141 | null |
2025-02-11 | Curvature Tuning: Provable Training-free Model Steering From a Single Parameter | Leyang Hu et.al. | 2502.07783 | link |
2025-02-11 | HRP: High-Rank Preheating for Superior LoRA Initialization | Yuzhu Chen et.al. | 2502.07739 | null |
2025-02-11 | LoRP-TTS: Low-Rank Personalized Text-To-Speech | Łukasz Bondaruk et.al. | 2502.07562 | null |
2025-02-11 | LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! | Dacheng Li et.al. | 2502.07374 | link |
2025-02-10 | Hyper Compressed Fine-Tuning of Large Foundation Models with Quantum Inspired Adapters | Snehal Raj et.al. | 2502.06916 | null |
2025-02-10 | CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | D. She et.al. | 2502.06527 | null |
2025-02-10 | Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis | Sanket Jantre et.al. | 2502.06173 | null |
2025-02-09 | DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations | Krishna Sri Ipsit Mantri et.al. | 2502.06029 | link |
2025-02-11 | VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer | Xinyu Liu et.al. | 2502.05979 | null |
2025-02-09 | Skill Expansion and Composition in Parameter Space | Tenglong Liu et.al. | 2502.05932 | link |
2025-02-08 | Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning | Beining Zhang et.al. | 2502.05573 | null |
2025-02-08 | SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation | Yixian Shen et.al. | 2502.05539 | null |
2025-02-07 | Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs | Thierry Bossy et.al. | 2502.05087 | link |
2025-02-07 | SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model | Jiayang Yu et.al. | 2502.04958 | link |
2025-02-07 | Cached Multi-Lora Composition for Multi-Concept Image Generation | Xiandong Zou et.al. | 2502.04923 | link |
2025-02-07 | SelaFD:Seamless Adaptation of Vision Transformer Fine-tuning for Radar-based Human Activity | Yijun Wang et.al. | 2502.04740 | link |
2025-02-07 | EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference | Prakhar Kaushik et.al. | 2502.04700 | link |
2025-02-07 | Contrastive Learning-Enhanced Large Language Models for Monolith-to-Microservice Decomposition | Khaled Sellami et.al. | 2502.04604 | null |
2025-02-05 | FedP | Royson Lee et.al. | 2502.04387 | null |
2025-02-09 | ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters | Kamer Ali Yuksel et.al. | 2502.04315 | link |
2025-02-07 | Efficient Few-Shot Continual Learning in Vision-Language Models | Aristeidis Panos et.al. | 2502.04098 | null |
2025-02-06 | Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning | Peizhuang Cong et.al. | 2502.03884 | null |
2025-02-05 | Resource-Efficient & Effective Code Summarization | Saima Afrin et.al. | 2502.03617 | null |
2025-02-05 | Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach | Abdullahi Isa Ahmed et.al. | 2502.03377 | null |
2025-02-05 | RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts | Tuan Truong et.al. | 2502.03044 | null |
2025-02-05 | SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs | Dinithi Jayasuriya et.al. | 2502.02909 | null |
2025-02-04 | Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation | Atharva Mangeshkumar Agrawal et.al. | 2502.02249 | null |
2025-02-04 | LoRA-TTT: Low-Rank Test-Time Training for Vision-Language Models | Yuto Kojima et.al. | 2502.02069 | null |
2025-02-03 | Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling | Kang Yang et.al. | 2502.01826 | null |
2025-02-03 | Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA | Shuangyi Chen et.al. | 2502.01755 | null |
2025-02-03 | Adapter-Based Multi-Agent AVSR Extension for Pre-Trained ASR Models | Christopher Simic et.al. | 2502.01709 | null |
2025-02-03 | QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning | Moses Ananta et.al. | 2502.01703 | link |
2025-02-05 | MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation | Yiren Song et.al. | 2502.01572 | null |
2025-02-03 | CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models | Guanduo Chen et.al. | 2502.01378 | null |
2025-02-03 | One-step full gradient suffices for low-rank fine-tuning, provably and efficiently | Yuanhe Zhang et.al. | 2502.01235 | null |
2025-02-03 | Joint Localization and Activation Editing for Low-Resource Fine-Tuning | Wen Lai et.al. | 2502.01179 | link |
2025-01-31 | Low-Rank Adapting Models for Sparse Autoencoders | Matthew Chen et.al. | 2501.19406 | link |
2025-01-31 | Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models | Wenzhi Fang et.al. | 2501.19389 | link |
2025-02-03 | SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Dominik Wagner et.al. | 2501.19377 | null |
2025-01-31 | Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification | Xiangyu Sun et.al. | 2501.19086 | null |
2025-01-31 | Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations | Dahye Kim et.al. | 2501.19066 | link |
2025-01-31 | Norm-Bounded Low-Rank Adaptation | Ruigang Wang et.al. | 2501.19050 | null |
2025-01-31 | Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin et.al. | 2501.18824 | null |
2025-01-30 | High-Accuracy ECG Image Interpretation using Parameter-Efficient LoRA Fine-Tuning with Multimodal LLaMA 3.2 | Nandakishor M et.al. | 2501.18670 | null |
2025-01-30 | CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization | Yanxia Deng et.al. | 2501.18475 | null |
2025-01-30 | Impact of Reactive Jamming Attacks on LoRaWAN: a Theoretical and Experimental Study | Amavi Dossa et.al. | 2501.18339 | null |
2025-01-29 | Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning? | Pouya Pezeshkpour et.al. | 2501.17840 | link |
2025-01-29 | U2A: Unified Unimodal Adaptation for Robust and Efficient Multimodal Learning | Md Kaykobad Reza et.al. | 2501.17823 | null |
2025-01-30 | In-Context Meta LoRA Generation | Yihua Shao et.al. | 2501.17635 | null |
2025-01-27 | A Comprehensive Study on Fine-Tuning Large Language Models for Medical Question Answering Using Classification Models and Comparative Analysis | Aysegul Ucar et.al. | 2501.17190 | null |
2025-01-28 | Algorithm for Automatic Legislative Text Consolidation | Matias Etcheverry et.al. | 2501.16794 | null |
2025-01-28 | One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning | Chunpeng Zhou et.al. | 2501.16720 | null |
2025-01-28 | Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models | Huijie Liu et.al. | 2501.16714 | null |
2025-01-27 | LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation | Farzad Farhadzadeh et.al. | 2501.16559 | null |
2025-01-27 | Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width | Zheng Liu et.al. | 2501.16302 | null |
2025-01-27 | FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments | Zhiyuan Fu et.al. | 2501.16029 | null |
2025-01-26 | LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs | Peizhuo Lv et.al. | 2501.15478 | null |
2025-01-26 | InfoBFR: Real-World Blind Face Restoration via Information Bottleneck | Nan Gao et.al. | 2501.15443 | null |
2025-01-26 | Fine Tuning without Catastrophic Forgetting via Selective Low Rank Adaptation | Reza Akbarian Bafghi et.al. | 2501.15377 | null |
2025-01-26 | Decentralized Low-Rank Fine-Tuning of Large Language Models | Sajjad Ghiasvand et.al. | 2501.15361 | null |
2025-01-25 | Exploring Primitive Visual Measurement Understanding and the Role of Output Format in Learning in Vision-Language Models | Ankit Yadav et.al. | 2501.15144 | null |
2025-01-25 | DAGPrompT: Pushing the Limits of Graph Prompting with a Distribution-aware Graph Prompt Tuning Approach | Qin Chen et.al. | 2501.15142 | link |
2025-01-25 | ABXI: Invariant Interest Adaptation for Task-Guided Cross-Domain Sequential Recommendation | Qingtian Bian et.al. | 2501.15118 | link |
2025-01-25 | Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning | Ziyu Zhao et.al. | 2501.15103 | null |
2025-01-24 | FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing | James Seale Smith et.al. | 2501.14713 | null |
2025-01-21 | ZKLoRA: Efficient Zero-Knowledge Proofs for LoRA Verification | Bidhan Roy et.al. | 2501.13965 | null |
2025-01-23 | Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models | Linh Tran et.al. | 2501.13904 | null |
2025-01-23 | Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation | Rong Shan et.al. | 2501.13344 | link |
2025-01-23 | SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network | Songge Zhang et.al. | 2501.13318 | null |
2025-01-22 | S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning | Yichen Wu et.al. | 2501.13198 | null |
2025-01-22 | LLM4WM: Adapting LLM for Wireless Multi-Tasking | Xuanyu Liu et.al. | 2501.12983 | null |
2025-01-22 | D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network | Ruiqi Wang et.al. | 2501.12589 | null |
2025-01-21 | A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data | Minh Tran et.al. | 2501.12501 | null |
2025-01-21 | EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition | Hamid Nasiri et.al. | 2501.12067 | link |
2025-01-21 | ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation | Peter Devine et.al. | 2501.11929 | link |
2025-01-20 | Recurrent Diffusion for Large-Scale Parameter Generation | Kai Wang et.al. | 2501.11587 | link |
2025-01-17 | OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning | Jinyuan Feng et.al. | 2501.10062 | null |
2025-01-16 | Practical Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2501.09705 | link |
2025-01-17 | SEAL: Entangled White-box Watermarks on Low-Rank Adaptation | Giyeong Oh et.al. | 2501.09284 | null |
2025-01-15 | Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models | Zerui Tao et.al. | 2501.08727 | null |
2025-01-15 | LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model | Yuxuan Hu et.al. | 2501.08582 | null |
2025-01-14 | DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Hyeonwoo Kim et.al. | 2501.08333 | null |
2025-01-14 | TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning | Yao Liang et.al. | 2501.08008 | null |
2025-01-14 | GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism | Chen Tang et.al. | 2501.07890 | null |
2025-01-14 | Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques | Shobhit Ratan et.al. | 2501.07853 | null |
2025-01-13 | Implementing LoRa MIMO System for Internet of Things | Atonu Ghosh et.al. | 2501.07148 | null |
2025-01-12 | Language Fusion for Parameter-Efficient Cross-lingual Transfer | Philipp Borchert et.al. | 2501.06892 | link |
2025-01-12 | Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning | Hanwen Zhong et.al. | 2501.06884 | link |
2025-01-12 | Better Prompt Compression Without Multi-Layer Perceptrons | Edouardo Honig et.al. | 2501.06730 | null |
2025-01-10 | Aggregating Low Rank Adapters in Federated Fine-tuning | Evelyn Trautmann et.al. | 2501.06332 | null |
2025-01-14 | | Qi Sun et.al. | 2501.06252 | link |
2025-01-10 | How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters | Romina Oji et.al. | 2501.06025 | link |
2025-01-09 | LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts | Yuri Facanha Bezerra et.al. | 2501.05554 | link |
2025-01-09 | JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis | Jun-Hyeok Cha et.al. | 2501.04904 | null |
2025-01-11 | RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation | Jun Liu et.al. | 2501.04315 | null |
2025-01-07 | Spectral-Aware Low-Rank Adaptation for Speaker Verification | Zhe Li et.al. | 2501.03829 | link |
2025-01-08 | MADation: Face Morphing Attack Detection with Foundation Models | Eduarda Caldeira et.al. | 2501.03800 | link |
2025-01-07 | Extending Internet Access Over LoRa for Internet of Things and Critical Applications | Atonu Ghosh et.al. | 2501.03465 | null |
2025-01-06 | Rate-My-LoRA: Efficient and Adaptive Federated Model Tuning for Cardiac MRI Segmentation | Xiaoxiao He et.al. | 2501.03223 | null |
2025-01-06 | The Scaling Law for LoRA Base on Mutual Information Upper Bound | Jing Zhang et.al. | 2501.03152 | null |
2025-01-06 | TransPixar: Advancing Text-to-Video Generation with Transparency | Luozhou Wang et.al. | 2501.03006 | link |
2025-01-06 | FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection | Guray Ozgur et.al. | 2501.02892 | link |
2025-01-05 | LoRaConnect: Unlocking HTTP Potential on LoRa Backbones for Remote Areas and Ad-Hoc Networks | Atonu Ghosh et.al. | 2501.02469 | null |
2025-01-05 | Efficient Deployment of Large Language Models on Resource-constrained Devices | Zhiwei Yao et.al. | 2501.02438 | null |
2025-01-07 | Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers | Markus J. Buehler et.al. | 2501.02393 | link |
2025-01-04 | tCURLoRA: Tensor CUR Decomposition Based Low-Rank Parameter Adaptation for Medical Image Segmentation | Guanghua He et.al. | 2501.02227 | null |
2025-01-03 | SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation | Mingjie Li et.al. | 2501.01765 | null |
2025-01-03 | MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders | Jiajun Cao et.al. | 2501.01709 | null |
2025-01-03 | Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption | Zhang Ruoyan et.al. | 2501.01672 | null |
2025-01-02 | Towards Interactive Deepfake Analysis | Lixiong Qin et.al. | 2501.01164 | link |
2025-01-01 | Alzheimer's disease detection based on large language model prompt engineering | Tian Zheng et.al. | 2501.00861 | null |
2025-01-01 | Beyond Words: AuralLLM and SignMST-C for Precise Sign Language Production and Bidirectional Accessibility | Yulong Li et.al. | 2501.00765 | null |
2024-12-31 | Low-Rank Adaptation for Foundation Models: A Comprehensive Review | Menglin Yang et.al. | 2501.00365 | null |
2024-12-30 | Adversarial Attack and Defense for LoRa Device Identification and Authentication via Deep Learning | Yalin E. Sagduyu et.al. | 2412.21164 | null |
2024-12-30 | Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring | Ehsan Latif et.al. | 2412.21065 | null |
2024-12-30 | DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models | Xiaolin Hu et.al. | 2412.20891 | null |
2024-12-30 | Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation | Shubh Singhal et.al. | 2412.20838 | null |
2024-12-30 | VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Shaojin Wu et.al. | 2412.20800 | link |
2025-01-02 | EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers | Daiheng Gao et.al. | 2412.20413 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2024-12-28 | VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition | Lan Chen et.al. | 2412.20064 | link |
2024-12-28 | Adaptive Parameter-Efficient Federated Fine-Tuning on Heterogeneous Devices | Jun Liu et.al. | 2412.20004 | null |
2024-12-27 | Gradient Weight-normalized Low-rank Projection for Efficient LLM Training | Jia-Hong Huang et.al. | 2412.19616 | link |
2024-12-27 | Performance Evaluation of IoT LoRa Networks on Mars Through ns-3 Simulations | Manuele Favero et.al. | 2412.19549 | link |
2024-12-27 | KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing | Shu Zhao et.al. | 2412.19417 | link |
2024-12-25 | Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks | Jiacheng Hu et.al. | 2412.18729 | null |
2024-12-24 | Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models | Zihan Zhou et.al. | 2412.18419 | null |
2024-12-18 | Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting | Vijay Goyal et.al. | 2412.17846 | link |
2024-12-25 | DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder | Ente Lin et.al. | 2412.17644 | null |
2024-12-23 | Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Prakash Aryan et.al. | 2412.17548 | link |
2024-12-21 | Label Privacy in Split Learning for Large Models with Parameter-Efficient Training | Philip Zmushko et.al. | 2412.16669 | link |
2024-12-20 | Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline | Guancheng Zeng et.al. | 2412.15660 | null |
2024-12-23 | CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training | Xiuli Bi et.al. | 2412.15646 | link |
2024-12-20 | AutoRank: MCDA Based Rank Personalization for LoRA-Enabled Distributed Learning | Shuaijun Chen et.al. | 2412.15553 | null |
2024-12-19 | Knowledge Injection via Prompt Distillation | Kalle Kujanpää et.al. | 2412.14964 | null |
2024-12-20 | All-in-One Tuning and Structural Pruning for Domain-Specific LLMs | Lei Lu et.al. | 2412.14426 | null |
2024-12-18 | CoRa: A Collision-Resistant LoRa Symbol Detector of Low Complexity | José Álamos et.al. | 2412.13930 | null |
2024-12-18 | A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection | Beiqi Zhang et.al. | 2412.13801 | link |
2024-12-18 | Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration | Xuhan Zuo et.al. | 2412.13551 | null |
2024-12-18 | Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models | Xinxin Liu et.al. | 2412.13488 | null |
2024-12-18 | Transducer Tuning: Efficient Model Adaptation for Software Tasks Using Code Property Graphs | Imam Nur Bani Yusuf et.al. | 2412.13467 | link |
2024-12-17 | Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models | Elvis Nunez et.al. | 2412.13328 | null |
2024-12-17 | FineGates: LLMs Finetuning with Compression using Stochastic Gates | Jonathan Svirsky et.al. | 2412.12951 | null |
2024-12-17 | Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion | Syed Zohaib Hassan et.al. | 2412.12710 | null |
2024-12-17 | Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT | Jenny Kunz et.al. | 2412.12674 | link |
2024-12-17 | NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning | Xin Yi et.al. | 2412.12497 | link |
2024-12-16 | Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering | Jinhe Bi et.al. | 2412.12359 | link |
2024-12-16 | Can video generation replace cinematographers? Research on the cinematic language of generated video | Xiaozhe Li et.al. | 2412.12223 | null |
2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
2024-12-16 | The Open Source Advantage in Large Language Models (LLMs) | Jiya Manchanda et.al. | 2412.12004 | null |
2024-12-17 | No More Adam: Learning Rate Scaling at Initialization is All You Need | Minghao Xu et.al. | 2412.11768 | link |
2024-12-16 | IDEA-Bench: How Far are Generative Models from Professional Designing? | Chen Liang et.al. | 2412.11767 | link |
2024-12-16 | Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing | Anika Tabassum et.al. | 2412.11381 | link |
2024-12-16 | FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation | Dannong Wang et.al. | 2412.11378 | link |
2024-12-15 | Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models | Di Wu et.al. | 2412.11041 | null |
2024-12-15 | SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation | Hang Zhang et.al. | 2412.11026 | null |
2024-12-14 | Efficient Adaptation of Multilingual Models for Japanese ASR | Mark Bajo et.al. | 2412.10705 | link |
2024-12-13 | SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation | Runtao Liu et.al. | 2412.10493 | null |
2024-12-13 | OP-LoRA: The Blessing of Dimensionality | Piotr Teterwak et.al. | 2412.10362 | null |
2024-12-16 | ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers | Junyan Hu et.al. | 2412.10135 | null |
2024-12-13 | CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models | Dongyu Yao et.al. | 2412.09936 | link |
2024-12-13 | Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models | Changqun Li et.al. | 2412.09827 | null |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition | Zhisheng Zhong et.al. | 2412.09501 | link |
2024-12-15 | GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning | Abdessalam Ed-dib et.al. | 2412.09250 | null |
2024-12-12 | RAD: Region-Aware Diffusion Models for Image Inpainting | Sora Kim et.al. | 2412.09191 | null |
2024-12-12 | DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization | Geonhui Jang et.al. | 2412.09169 | null |
2024-12-12 | MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning | Lulu Zhao et.al. | 2412.08946 | null |
2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637 | link |
2024-12-10 | Accretion onto WD 2226 | S. Estrada-Dorado et.al. | 2412.07863 | null |
2024-12-10 | PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition | Kartik Narayan et.al. | 2412.07771 | null |
2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
2024-12-10 | ChocoLlama: Lessons Learned From Teaching Llamas Dutch | Matthieu Meeus et.al. | 2412.07633 | null |
2024-12-10 | MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning | Yufei Ma et.al. | 2412.07405 | null |
2024-12-10 | Attention Head Purification: A New Perspective to Harness CLIP for Domain Generalization | Yingfan Wang et.al. | 2412.07226 | null |
2024-12-09 | Optimal Routing and Link Configuration for Covert Heterogeneous Wireless Networks | Amna Gillani et.al. | 2412.07059 | null |
2024-12-09 | Sequential Compression Layers for Efficient Federated Learning in Foundational Models | Navyansh Mahla et.al. | 2412.07021 | null |
2024-12-09 | BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation | Qiushi Wang et.al. | 2412.06441 | null |
2024-12-10 | S | Xinyu Yang et.al. | 2412.06289 | null |
2024-12-08 | Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling | Kaleel Mahmood et.al. | 2412.06106 | null |
2024-12-08 | KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models | Fan Wang et.al. | 2412.06071 | link |
2024-12-07 | Training-Free Bayesianization for Low-Rank Adapters of Large Language Models | Haizhou Shi et.al. | 2412.05723 | link |
2024-12-07 | Plasmonic Electro-Optic Modulators based on Epsilon-Near-Zero Materials: Comparing the Classical Drift-Diffusion and Schrödinger-Poisson Coupling Models | Masoud Shabaninezhad et.al. | 2412.05690 | null |
2024-12-06 | QueEn: A Large Language Model for Quechua-English Translation | Junhao Chen et.al. | 2412.05184 | null |
2024-12-06 | LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Donald Shenaj et.al. | 2412.05148 | link |
2024-12-05 | Performance Evaluation of LoRa Technology for Rural Connectivity: An Experimental Analysis in Nepal | Atit Pokharel et.al. | 2412.04563 | null |
2024-12-04 | Prompting Large Language Models for Clinical Temporal Relation Extraction | Jianping He et.al. | 2412.04512 | null |
2024-12-05 | UnZipLoRA: Separating Content and Style from a Single Image | Chang Liu et.al. | 2412.04465 | null |
2024-12-08 | Discriminative Fine-tuning of LVLMs | Yassine Ouali et.al. | 2412.04378 | null |
2024-12-05 | Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts | Chenyang Zhu et.al. | 2412.04220 | null |
2024-12-05 | SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Seokju Yun et.al. | 2412.04077 | link |
2024-12-04 | Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis | Davide Bucciarelli et.al. | 2412.03665 | null |
2024-12-04 | Imagine360: Immersive 360 Video Generation from Perspective Anchor | Jing Tan et.al. | 2412.03552 | null |
2024-12-04 | DIVE: Taming DINO for Subject-Driven Video Editing | Yi Huang et.al. | 2412.03347 | null |
2024-12-04 | Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Lingchen Sun et.al. | 2412.03017 | link |
2024-12-03 | EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras | Dmitrii Torbunov et.al. | 2412.02890 | link |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-03 | LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization | Ethan Smith et.al. | 2412.02352 | null |
2024-12-03 | SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models | Sabina Martyniak et.al. | 2412.02332 | link |
2024-12-03 | Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs | Zixuan Hu et.al. | 2412.02220 | null |
2024-12-02 | Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping | Marla Grunewald et.al. | 2412.01609 | null |
2024-12-02 | CellSeg1: Robust Cell Segmentation with One Training Image | Peilin Zhou et.al. | 2412.01410 | link |
2024-12-02 | Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking | Marco Federici et.al. | 2412.01380 | null |
2024-12-02 | MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost | Sen Xing et.al. | 2412.01271 | null |
2024-12-02 | RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy | Geonho Lee et.al. | 2412.01129 | link |
2024-12-03 | Adaptive Rank, Reduced Forgetting: Knowledge Retention in Continual Learning Vision-Language Models with Dynamic Rank-Selective LoRA | Haodong Lu et.al. | 2412.01004 | null |
2024-11-29 | SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks | Kim-Celine Kahl et.al. | 2411.19688 | link |
2024-11-29 | Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning | Kaustubh Ponkshe et.al. | 2411.19557 | link |
2024-11-28 | PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning | Shenghui Li et.al. | 2411.19335 | null |
2024-11-28 | Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation | Son Thai Ly et.al. | 2411.19297 | link |
2024-11-28 | LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair | Xue Song et.al. | 2411.19156 | null |
2024-11-28 | DESIRE: Dynamic Knowledge Consolidation for Rehearsal-Free Continual Learning | Haiyang Guo et.al. | 2411.19154 | null |
2024-11-28 | Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures | Yicheng Zhang et.al. | 2411.19128 | link |
2024-11-27 | Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning | Omkar Khade et.al. | 2411.18571 | null |
2024-11-27 | Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models | Minhyeok Lee et.al. | 2411.18530 | link |
2024-11-27 | Adaptive Blind All-in-One Image Restoration | David Serrano-Lozano et.al. | 2411.18412 | link |
2024-11-27 | Thai Financial Domain Adaptation of THaLLE -- Technical Report | KBTG Labs et.al. | 2411.18242 | null |
2024-11-27 | ROICtrl: Boosting Instance Control for Visual Generation | Yuchao Gu et.al. | 2411.17949 | null |
2024-11-26 | Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading | Suyeol Yun et.al. | 2411.17900 | link |
2024-11-26 | Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Sudarshan Rajagopalan et.al. | 2411.17814 | null |
2024-11-26 | PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning | Zhen Sun et.al. | 2411.17453 | null |
2024-11-26 | CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy | Fanxu Meng et.al. | 2411.17426 | link |
2024-11-26 | Efficient Deployment of Transformer Models in Analog In-Memory Computing Hardware | Chen Li et.al. | 2411.17367 | link |
2024-11-26 | ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System | Shuiqiao Yang et.al. | 2411.17058 | null |
2024-11-26 | PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation | Hengjia Li et.al. | 2411.17048 | null |
2024-11-25 | RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks | Nazia Tasnim et.al. | 2411.16870 | link |
2024-11-25 | Parameter Efficient Instruction Tuning: An Empirical Study | Pengfei He et.al. | 2411.16775 | link |
2024-11-23 | LoBAM: LoRA-Based Backdoor Attack on Model Merging | Ming Yin et.al. | 2411.16746 | null |
2024-11-24 | Modality Alignment Meets Federated Broadcasting | Yuting Ma et.al. | 2411.15837 | null |
2024-11-24 | LoRA-Mini : Adaptation Matrices Decomposition and Selective Training | Ayush Singh et.al. | 2411.15804 | null |
2024-11-23 | Reassessing Layer Pruning in LLMs: New Insights and Methods | Yao Lu et.al. | 2411.15558 | link |
2024-11-23 | Gradient dynamics for low-rank fine-tuning beyond kernels | Arif Kerem Dayi et.al. | 2411.15385 | null |
2024-11-22 | On the Impact of Fine-Tuning on Chain-of-Thought Reasoning | Elita Lobo et.al. | 2411.15382 | null |
2024-11-22 | ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation | Junzhang Liu et.al. | 2411.15281 | null |
2024-11-21 | IterIS: Iterative Inference-Solving Alignment for LoRA Merging | Hongxu Chen et.al. | 2411.15231 | null |
2024-11-22 | Exploring Foundation Models Fine-Tuning for Cytology Classification | Manon Dausort et.al. | 2411.14975 | link |
2024-11-22 | LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement | Jieming Bian et.al. | 2411.14961 | null |
2024-11-21 | Interpreting seasonal and interannual Hadley cell descending edge migrations via the cell-mean Rossby number | Spencer A Hill et.al. | 2411.14544 | null |
2024-11-21 | Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model | Ege Kesim et.al. | 2411.14064 | null |
2024-11-21 | Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning | Ziqi Wang et.al. | 2411.13949 | null |
2024-11-21 | Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel KAN Adapter for Enhanced Feature Adaptation | Gayatri Deshmukh et.al. | 2411.13901 | null |
2024-11-21 | AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning | Changhai Zhou et.al. | 2411.13814 | null |
2024-11-20 | Unleashing the Power of Large Language Models for Group POI Recommendations | Jing Long et.al. | 2411.13415 | null |
2024-11-20 | On the Way to LLM Personalization: Learning to Remember User Conversations | Lucie Charlotte Magister et.al. | 2411.13405 | null |
2024-11-19 | Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning | Pengkun Jiao et.al. | 2411.12787 | null |
2024-11-16 | LoRA Unlearns More and Retains More (Student Abstract) | Atharv Mittal et.al. | 2411.11907 | link |
2024-11-18 | SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions | Shuo Zhang et.al. | 2411.11530 | null |
2024-11-16 | Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts | Jinqiang Long et.al. | 2411.10669 | link |
2024-11-15 | AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment | Yonggan Fu et.al. | 2411.10606 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning | Yahe Yang et.al. | 2411.09947 | null |
2024-11-12 | Structured Pattern Expansion with Diffusion Models | Marzia Riso et.al. | 2411.08930 | null |
2024-11-13 | Dynamic Subset Tuning: Expanding the Operational Range of Parameter-Efficient Training for Large Language Models | Felix Stahlberg et.al. | 2411.08610 | null |
2024-11-13 | Machine Unlearning on Pre-trained Models by Residual Feature Alignment Using LoRA | Laiqiao Qin et.al. | 2411.08443 | null |
2024-11-11 | LoRA-BERT: a Natural Language Processing Model for Robust and Accurate Prediction of long non-coding RNAs | Nicholas Jeon et.al. | 2411.08073 | null |
2024-11-12 | FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training | Philip Zmushko et.al. | 2411.07837 | link |
2024-11-12 | Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices | Kilian Pfeiffer et.al. | 2411.07826 | null |
2024-11-12 | Federated Low-Rank Adaptation with Differential Privacy over Wireless Networks | Tianqu Kang et.al. | 2411.07806 | null |
2024-11-12 | ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Weibo Zhao et.al. | 2411.07762 | null |
2024-11-11 | DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning | Zecheng Zhang et.al. | 2411.07239 | null |
2024-11-11 | Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation | Ziwei Liu et.al. | 2411.07021 | null |
2024-11-11 | MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps | Xue Xia et.al. | 2411.06971 | link |
2024-11-11 | LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Runming Yang et.al. | 2411.06839 | null |
2024-11-10 | Federated LLMs Fine-tuned with Adaptive Importance-Aware LoRA | Yang Su et.al. | 2411.06581 | null |
2024-11-10 | Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques | Daniil Sulimov et.al. | 2411.06445 | null |
2024-11-08 | Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation | Aayush Shah et.al. | 2411.05966 | null |
2024-11-08 | Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation | Xiwen Wei et.al. | 2411.05663 | link |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Wenqiang Sun et.al. | 2411.04928 | null |
2024-11-07 | StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration | Panwen Hu et.al. | 2411.04925 | null |
2024-11-07 | LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG | Laifa Tao et.al. | 2411.04476 | null |
2024-11-09 | Variational Low-Rank Adaptation Using IVON | Bai Cong et.al. | 2411.04421 | link |
2024-11-08 | Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation | Ayan Sengupta et.al. | 2411.04358 | link |
2024-11-06 | PyroGuardian: An IoT-Enabled System for Health and Location Monitoring in High-Risk Firefighting Environments | Berkay Kaplan et.al. | 2411.03654 | null |
2024-11-05 | LLM-based Framework for Bearing Fault Diagnosis | Laifa Tao et.al. | 2411.02718 | null |
2024-11-04 | TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nouf Alabbasi et.al. | 2411.02617 | link |
2024-11-04 | Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study | André Storhaug et.al. | 2411.02462 | null |
2024-11-04 | Expanding Sparse Tuning for Low Memory Usage | Shufan Shen et.al. | 2411.01800 | link |
2024-11-02 | PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment | Dongxu Liu et.al. | 2411.01245 | null |
2024-11-02 | One Arrow, Many Targets: Probing LLMs for Multi-Attribute Controllable Text Summarization | Tathagato Roy et.al. | 2411.01213 | null |
2024-11-02 | Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models | Wonguk Cho et.al. | 2411.01179 | null |
2024-11-02 | LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding | Jian Chen et.al. | 2411.01106 | null |
2024-11-01 | V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM | Liang Mi et.al. | 2411.00915 | null |
2024-11-01 | Dual Low-Rank Adaptation for Continual Learning with Pre-Trained Models | Huancheng Chen et.al. | 2411.00623 | null |
2024-10-31 | DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Weicai Ye et.al. | 2410.24203 | link |
2024-11-05 | In-Context LoRA for Diffusion Transformers | Lianghua Huang et.al. | 2410.23775 | link |
2024-10-30 | Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation | Stefan Stojanovic et.al. | 2410.23434 | null |
2024-10-31 | SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong et.al. | 2410.23277 | null |
2024-10-31 | Why Gradient Subspace? Identifying and Mitigating LoRA's Bottlenecks in Federated Fine-Tuning of Large Language Models | Navyansh Mahla et.al. | 2410.23111 | null |
2024-10-30 | Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation | Wei Dong et.al. | 2410.22952 | null |
2024-10-30 | CopRA: A Progressive LoRA Training Strategy | Zhan Zhuang et.al. | 2410.22911 | null |
2024-10-30 | Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients | Jabin Koo et.al. | 2410.22815 | null |
2024-10-30 | MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning | Xujia Wang et.al. | 2410.22782 | null |
2024-10-29 | Meta-Learning Adaptable Foundation Models | Jacob L. Block et.al. | 2410.22264 | null |
2024-10-30 | IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models | Hang Guo et.al. | 2410.21759 | link |
2024-10-28 | LoRA vs Full Fine-tuning: An Illusion of Equivalence | Reece Shuttleworth et.al. | 2410.21228 | null |
2024-10-28 | Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices | Hiroki Matsutani et.al. | 2410.21073 | null |
2024-10-28 | KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Rambod Azimi et.al. | 2410.20777 | link |
2024-10-28 | Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | Sangmin Bae et.al. | 2410.20672 | null |
2024-10-28 | PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation | Leyao Wang et.al. | 2410.20667 | null |
2024-10-28 | Collaborative Knowledge Fusion: A Novel Approach for Multi-task Recommender Systems via LLMs | Chuang Zhao et.al. | 2410.20642 | null |
2024-10-27 | LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization | Jui-Nan Yen et.al. | 2410.20625 | null |
2024-10-27 | FoldMark: Protecting Protein Generative Models with Watermarking | Zaixi Zhang et.al. | 2410.20354 | link |
2024-10-26 | An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation | Dongdong Lin et.al. | 2410.20202 | null |
2024-10-25 | Model merging with SVD to tie the Knots | George Stoica et.al. | 2410.19735 | link |
2024-10-25 | Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs | Yifei Zhang et.al. | 2410.19694 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-24 | Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts | Danyal Aftab et.al. | 2410.19185 | null |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | PSY: Posterior Sampling Based Privacy Enhancer in Large Language Models | Yulian Sun et.al. | 2410.18824 | null |
2024-10-24 | GeoLoRA: Geometric integration for parameter efficient fine-tuning | Steffen Schotthöfer et.al. | 2410.18720 | null |
2024-10-24 | Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model | Ali Hamza et.al. | 2410.18678 | null |
2024-10-23 | CLEAR: Character Unlearning in Textual and Visual Modalities | Alexey Dontsov et.al. | 2410.18057 | null |
2024-10-23 | MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | Jingfan Zhang et.al. | 2410.18035 | null |
2024-10-23 | Closed-form merging of parameter-efficient modules for Federated Continual Learning | Riccardo Salami et.al. | 2410.17961 | null |
2024-10-23 | AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning | Yehonathan Refael et.al. | 2410.17881 | null |
2024-10-23 | Understanding Layer Significance in LLM Alignment | Guangyuan Shi et.al. | 2410.17875 | null |
2024-10-23 | VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning | Yifan Peng et.al. | 2410.17485 | link |
2024-10-22 | FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation | Rohan Sukumaran et.al. | 2410.17358 | null |
2024-10-22 | Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups | Charvi Rastogi et.al. | 2410.17032 | null |
2024-10-23 | GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks | Shuyang Hou et.al. | 2410.17031 | null |
2024-10-22 | LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices | Chuntao Ding et.al. | 2410.16954 | link |
2024-10-22 | Can Large Language Models Act as Ensembler for Multi-GNNs? | Hanqi Duan et.al. | 2410.16822 | null |
2024-10-22 | Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models | Yuheng Lu et.al. | 2410.16801 | null |
2024-10-22 | MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report | Samrajya Thapa et.al. | 2410.16239 | link |
2024-10-21 | Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs | Kang Zhao et.al. | 2410.16135 | null |
2024-10-21 | Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning | Arijit Das et.al. | 2410.16029 | link |
2024-10-21 | How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? | Zuojin Tang et.al. | 2410.15885 | null |
2024-10-21 | The effect of fine-tuning on language model toxicity | Will Hawkins et.al. | 2410.15821 | link |
2024-10-21 | Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization | Hanseon Joo et.al. | 2410.15794 | link |
2024-10-21 | Students Rather Than Experts: A New AI For Education Pipeline To Model More Human-Like And Personalised Early Adolescences | Yiping Ma et.al. | 2410.15701 | null |
2024-10-20 | MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models | Ahmed Elbakary et.al. | 2410.15524 | null |
2024-10-20 | EVA: An Embodied World Model for Future Video Anticipation | Xiaowei Chi et.al. | 2410.15461 | null |
2024-10-20 | LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration | Yuang Ai et.al. | 2410.15385 | link |
2024-10-18 | Fine-Tuning DeepONets to Enhance Physics-informed Neural Networks for solving Partial Differential Equations | Sidi Wu et.al. | 2410.14134 | null |
2024-10-17 | FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model | ZiDong Wang et.al. | 2410.13925 | link |
2024-10-17 | Improving Multi-modal Large Language Model through Boosting Vision Capabilities | Yanpeng Sun et.al. | 2410.13733 | null |
2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | link |
2024-10-18 | MoR: Mixture of Ranks for Low-Rank Adaptation Tuning | Chuanyu Tang et.al. | 2410.13408 | null |
2024-10-17 | FAMSeC: A Few-shot-sample-based General AI-generated Image Detection Method | Juncong Xu et.al. | 2410.13156 | null |
2024-10-16 | LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks | Akshara Prabhakar et.al. | 2410.13025 | link |
2024-10-16 | DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving Federated Low-rank Adaptation | Meilu Zhu et.al. | 2410.12926 | link |
2024-10-15 | In-context KV-Cache Eviction for LLMs via Attention-Gate | Zihao Zeng et.al. | 2410.12876 | null |
2024-10-16 | FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction | Akriti Jain et.al. | 2410.12513 | null |
2024-10-15 | LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models | Hossein Abdi et.al. | 2410.11551 | null |
2024-10-15 | Transfer Learning with Foundational Models for Time Series Forecasting using Low-Rank Adaptations | M. Germán-Morales et.al. | 2410.11539 | null |
2024-10-15 | Energy Efficient Transmission Parameters Selection Method Using Reinforcement Learning in Distributed LoRa Networks | Ryotai Airiyoshi et.al. | 2410.11270 | null |
2024-10-14 | Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning | Bokai Hu et.al. | 2410.11020 | null |
2024-10-14 | LoLCATs: On Low-Rank Linearizing of Large Language Models | Michael Zhang et.al. | 2410.10254 | link |
2024-10-14 | Fed-piLot: Optimizing LoRA Assignment for Efficient Federated Foundation Model Fine-Tuning | Zikai Zhang et.al. | 2410.10200 | null |
2024-10-14 | Scalable Multi-Domain Adaptation of Language Models using Modular Experts | Peter Schafhalter et.al. | 2410.10181 | null |
2024-10-14 | Is Parameter Collision Hindering Continual Learning in LLMs? | Shuo Yang et.al. | 2410.10179 | link |
2024-10-14 | AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality | Peijun Qing et.al. | 2410.10054 | link |
2024-10-13 | Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning | Pengfei Jin et.al. | 2410.09908 | null |
2024-10-13 | A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning | Chen-Yu Liu et.al. | 2410.09846 | null |
2024-10-13 | Understanding Robustness of Parameter-Efficient Tuning for Image Classification | Jiacheng Ruan et.al. | 2410.09845 | link |
2024-10-13 | BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation | Peijia Qin et.al. | 2410.09758 | null |
2024-10-13 | AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model | Yuchen Li et.al. | 2410.09714 | null |
2024-10-11 | Parameter-Efficient Fine-Tuning of State Space Models | Kevin Galim et.al. | 2410.09016 | link |
2024-10-10 | Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation | Grigory Malinovsky et.al. | 2410.08305 | null |
2024-10-10 | SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture | Jiayi Han et.al. | 2410.07739 | null |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | link |
2024-10-09 | SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers | Viktoriia Chekalina et.al. | 2410.07383 | link |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
2024-10-09 | Industrial complexity and the evolution of formal employment in developing cities | Neave O'Clery et.al. | 2410.06971 | null |
2024-10-11 | Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization | Changli Tang et.al. | 2410.06682 | null |
2024-10-08 | Systematic 2.5 D resistive MHD simulations with ambipolar diffusion and Hall effect for fast magnetic reconnection | Gabriela Landinez et.al. | 2410.06391 | null |
2024-10-08 | HyperDet: Generalizable Detection of Synthesized Images by Generating and Merging A Mixture of Hyper LoRAs | Huangsen Cao et.al. | 2410.06044 | null |
2024-10-08 | QERA: an Analytical Framework for Quantization Error Reconstruction | Cheng Zhang et.al. | 2410.06040 | null |
2024-10-08 | Hyper Adversarial Tuning for Boosting Adversarial Robustness of Pretrained Large Vision Models | Kangtao Lv et.al. | 2410.05951 | null |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-08 | PAMLR: A Passive-Active Multi-Armed Bandit-Based Solution for LoRa Channel Allocation | Jihoon Yun et.al. | 2410.05147 | null |
2024-10-07 | HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation | Xinyu Zhou et.al. | 2410.05090 | link |
2024-10-07 | Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation | Vince Zhu et.al. | 2410.04689 | null |
2024-10-06 | Learning De-Biased Representations for Remote-Sensing Imagery | Zichen Tian et.al. | 2410.04546 | link |
2024-10-05 | Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models | Theo et.al. | 2410.04207 | null |
2024-10-05 | LoRTA: Low Rank Tensor Adaptation of Large Language Models | Ignacio Hounie et.al. | 2410.04060 | null |
2024-10-05 | Hyperbolic Fine-tuning for Large Language Models | Menglin Yang et.al. | 2410.04010 | link |
2024-10-04 | AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models | Artur Kasymov et.al. | 2410.03941 | link |
2024-10-04 | Collaborative and Efficient Personalization with Mixtures of Adaptors | Abdulla Jasem Almansoori et.al. | 2410.03497 | null |
2024-10-03 | Neutral residues: revisiting adapters for model extension | Franck Signe Talla et.al. | 2410.02744 | null |
2024-10-03 | Encryption-Friendly LLM Architecture | Donghwan Rho et.al. | 2410.02486 | null |
2024-10-02 | NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models | Yibo Zhong et.al. | 2410.01870 | null |
2024-10-02 | Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | Xi Chen et.al. | 2410.01623 | link |
2024-10-02 | DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models | Yuxuan Zhang et.al. | 2410.01497 | link |
2024-10-04 | Selective Aggregation for Low-Rank Adaptation in Federated Learning | Pengxin Guo et.al. | 2410.01463 | link |
2024-10-02 | FlashMask: Efficient and Rich Mask Extension of FlashAttention | Guoxia Wang et.al. | 2410.01359 | link |
2024-10-01 | MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards | Sheng Wang et.al. | 2410.00938 | null |
2024-10-02 | Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models | Saurav Jha et.al. | 2410.00700 | null |
2024-10-01 | PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models | Yang Li et.al. | 2410.00433 | null |
2024-09-30 | Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models | Ji Liu et.al. | 2410.00131 | null |
2024-09-30 | UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation | Cheng Zhang et.al. | 2409.20197 | link |
2024-09-30 | BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain | Kaisi Guan et.al. | 2409.20075 | null |
2024-09-30 | HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models | Bingshen Mu et.al. | 2409.19878 | null |
2024-09-29 | Learning Attentional Mixture of LoRAs for Language Model Continual Learning | Jialin Liu et.al. | 2409.19611 | null |
2024-09-29 | Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers | Prakash Dhakal et.al. | 2409.19566 | null |
2024-09-27 | HM3: Heterogeneous Multi-Class Model Merging | Stefan Hackmann et.al. | 2409.19173 | null |
2024-09-26 | MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications | Jothi Prasanna Shanmuga Sundaram et.al. | 2409.18043 | null |
2024-09-26 | PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification | Tianfang Xie et.al. | 2409.17834 | null |
2024-09-30 | Efficient In-Domain Question Answering for Resource-Constrained Environments | Isaac Chung et.al. | 2409.17648 | null |
2024-09-26 | On the Implicit Relation Between Low-Rank Adaptation and Differential Privacy | Saber Malekmohammadi et.al. | 2409.17538 | null |
2024-09-26 | A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction | Guangyu Wang et.al. | 2409.17440 | link |
2024-09-25 | Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation | Richard D. Paul et.al. | 2409.17085 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning | Qibin Wang et.al. | 2409.16722 | null |
2024-09-25 | GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning | Zhe-Rui Yang et.al. | 2409.16670 | link |
2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | link |
2024-09-24 | Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering | Ziyu Zhao et.al. | 2409.16167 | null |
2024-09-24 | Evaluation of state-of-the-art ASR Models in Child-Adult Interactions | Aditya Ashvin et.al. | 2409.16135 | null |
2024-09-24 | Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs | Yang Yuhang et.al. | 2409.16005 | null |
2024-09-24 | Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | Fengrun Zhang et.al. | 2409.15905 | null |
2024-09-24 | Aided design of bridge aesthetics based on Stable Diffusion fine-tuning | Leye Zhang et.al. | 2409.15812 | link |
2024-09-17 | Chain-of-Thought Prompting for Speech Translation | Ke Hu et.al. | 2409.11538 | null |
2024-09-17 | Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models | Divij Gupta et.al. | 2409.11302 | null |
2024-09-17 | LoRa Communication for Agriculture 4.0: Opportunities, Challenges, and Future Directions | Lameya Aldhaheri et.al. | 2409.11200 | null |
2024-09-17 | Few-Shot Domain Adaptation for Learned Image Compression | Tianyu Zhang et.al. | 2409.11111 | null |
2024-09-17 | KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models | Bo Lv et.al. | 2409.11057 | null |
2024-09-18 | Propulsion: Steering LLM with Tiny Fine-Tuning | Md Kowsher et.al. | 2409.10927 | link |
2024-09-16 | A Bayesian Interpretation of Adaptive Low-Rank Adaptation | Haolin Chen et.al. | 2409.10673 | link |
2024-09-16 | From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs | Navya Jain et.al. | 2409.10245 | null |
2024-09-16 | Robust Bird's Eye View Segmentation by Adapting DINOv2 | Merve Rabia Barın et.al. | 2409.10228 | null |
2024-09-19 | jina-embeddings-v3: Multilingual Embeddings With Task LoRA | Saba Sturua et.al. | 2409.10173 | null |
2024-09-16 | Rapid Adaptation of Earth Observation Foundation Models for Segmentation | Karthick Panner Selvam et.al. | 2409.09907 | null |
2024-09-15 | AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs | Madhusudan Ghosh et.al. | 2409.09704 | link |
2024-09-14 | COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare | Chia-Hao Li et.al. | 2409.09549 | null |
2024-09-14 | SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2 | Xinrun Chen et.al. | 2409.09286 | link |
2024-09-13 | Data Efficient Child-Adult Speaker Diarization with Simulated Conversations | Anfeng Xu et.al. | 2409.08881 | link |
2024-09-13 | Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Lingwei Meng et.al. | 2409.08596 | link |
2024-09-13 | ATFLRec: A Multimodal Recommender System with Audio-Text Fusion and Low-Rank Adaptation via Instruction-Tuned Large Language Model | Zezheng Qin et.al. | 2409.08543 | null |
2024-09-13 | Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights | Dixi Yao et.al. | 2409.08482 | null |
2024-09-13 | Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design | Lingyao Li et.al. | 2409.08459 | null |
2024-09-12 | AudioBERT: Audio Knowledge Augmented Language Model | Hyunjong Ok et.al. | 2409.08199 | link |
2024-09-12 | Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy | Bojian Li et.al. | 2409.07723 | null |
2024-09-11 | Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region | Muhammad Akhtar Munir et.al. | 2409.07585 | link |
2024-09-11 | Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models | Xinhu Zheng et.al. | 2409.07016 | null |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-09 | Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models | Rohit Jena et.al. | 2409.06493 | null |
2024-09-10 | HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data | Hossein Hajipour et.al. | 2409.06446 | link |
2024-09-10 | VE: Modeling Multivariate Time Series Correlation with Variate Embedding | Shangjiong Wang et.al. | 2409.06169 | link |
2024-09-09 | FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations | Ziyao Wang et.al. | 2409.05976 | link |
2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | null |
2024-09-09 | TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency | Ahmed Imteaj et.al. | 2409.05347 | null |
2024-09-08 | Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation | Zhe Cao et.al. | 2409.05224 | link |
2024-09-06 | Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning | Xinyue Liu et.al. | 2409.04574 | null |
2024-09-06 | Fast Forwarding Low-Rank Training | Adir Rahamim et.al. | 2409.04206 | null |
2024-09-05 | Continual Skill and Task Learning via Dialogue | Weiwei Gu et.al. | 2409.03166 | null |
2024-09-04 | Non-Orthogonal Multiple-Access Strategies for Direct-to-Satellite IoT Networks | Felipe Augusto Tondo et.al. | 2409.02748 | null |
2024-09-04 | Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA | Shuangyi Chen et.al. | 2409.02346 | null |
2024-08-31 | CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models | Xiaojun Xiao et.al. | 2409.02119 | null |
2024-09-02 | LoGex: Improved tail detection of extremely rare histopathology classes via guided diffusion | Maximilian Mueller et.al. | 2409.01317 | link |
2024-09-02 | Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning | Chongjie Si et.al. | 2409.01035 | link |
2024-09-02 | Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language | Jeong Hun Yeo et.al. | 2409.00986 | link |
2024-08-30 | Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs | Mazal Bethany et.al. | 2409.00209 | null |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433 | link |
2024-08-30 | MoRe Fine-Tuning with 10x Fewer Parameters | Wenxuan Tan et.al. | 2408.17383 | link |
2024-08-30 | Wireless Integrated Authenticated Communication System (WIA-Comm) | Amith N Bharadwaj et.al. | 2408.17112 | null |
2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
2024-08-30 | Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL | Haiyang Zhao et.al. | 2408.17060 | null |
2024-08-29 | LoraMap: Harnessing the Power of LoRA Connections | Hyeryun Park et.al. | 2408.16264 | null |
2024-08-28 | LeMON: Learning to Learn Multi-Operator Networks | Jingmin Sun et.al. | 2408.16168 | link |
2024-08-28 | Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models | Yuncheng Yang et.al. | 2408.15915 | link |
2024-08-28 | StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements | Jillian Fisher et.al. | 2408.15666 | link |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-08-28 | Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models | Yiyang Zhao et.al. | 2408.15585 | null |
2024-08-28 | VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech | Heeseung Kim et.al. | 2408.14739 | null |
2024-08-27 | PAT: Pruning-Aware Tuning for Large Language Models | Yijiang Liu et.al. | 2408.14721 | link |
2024-08-27 | StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-Speech | Haowei Lou et.al. | 2408.14713 | link |
2024-08-26 | CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation | Muhammad Fawi et.al. | 2408.14572 | link |
2024-08-27 | Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models | Aradhye Agarwal et.al. | 2408.14470 | link |
2024-08-27 | Reprogramming Foundational Large Language Models (LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning | Sakhinana Sagar Srinivas et.al. | 2408.14387 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-25 | TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation | Jack Saunders et.al. | 2408.13714 | null |
2024-08-24 | Can Visual Foundation Models Achieve Long-term Point Tracking? | Görkay Aydemir et.al. | 2408.13575 | null |
2024-08-23 | The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Venkatesh Balavadhani Parthasarathy et.al. | 2408.13296 | null |
2024-08-23 | CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition | Yafeng Zhang et.al. | 2408.12834 | null |
2024-08-23 | Investigating LLM Applications in E-Commerce | Chester Palen-Michel et.al. | 2408.12779 | null |
2024-08-22 | EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts | Nicy Scaria et.al. | 2408.12226 | link |
2024-08-21 | Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Omar Erak et.al. | 2408.11775 | link |
2024-08-21 | EAGLE: Elevating Geometric Reasoning through LLM-empowered Visual Instruction Tuning | Zhihao Li et.al. | 2408.11397 | null |
2024-08-20 | EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech | Xin Qi et.al. | 2408.10852 | null |
2024-08-21 | Flexora: Flexible Low Rank Adaptation for Large Language Models | Chenxing Wei et.al. | 2408.10774 | null |
2024-08-20 | Large Language Models for Multimodal Deformable Image Registration | Mingrui Ma et.al. | 2408.10703 | link |
2024-08-20 | Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper | Tianyi Xu et.al. | 2408.10680 | null |
2024-08-20 | CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation | Yuting Liu et.al. | 2408.10645 | null |
2024-08-18 | NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models | Cheng Lin et.al. | 2408.10280 | null |
2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
2024-08-19 | Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Xiaoyu Kong et.al. | 2408.10159 | link |
2024-08-19 | TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition | Tianwei Lin et.al. | 2408.09856 | link |
2024-08-18 | Infinite Scrolling, Finite Satisfaction: Exploring User Behavior and Satisfaction on Social Media in Bangladesh | Sanzana Karim Lora et.al. | 2408.09601 | null |
2024-08-17 | ConVerSum: A Contrastive Learning based Approach for Data-Scarce Solution of Cross-Lingual Summarization Beyond Direct Equivalents | Sanzana Karim Lora et.al. | 2408.09273 | null |
2024-08-17 | An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation | Junjie Li et.al. | 2408.09078 | link |
2024-08-17 | MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality | Zhiyi Shi et.al. | 2408.09064 | null |
2024-08-16 | AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation | Yihe Dong et.al. | 2408.09015 | link |
2024-08-16 | ML Study of Malicious Transactions in Ethereum | Natan Katz et.al. | 2408.08749 | null |
2024-08-16 | RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS | Shuaijun Chen et.al. | 2408.08699 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning | Alessio Devoto et.al. | 2408.08670 | null |
2024-08-16 | A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth | Yujia Gu et.al. | 2408.08561 | null |
2024-08-15 | Heavy Labels Out! Dataset Distillation with Label Space Lightening | Ruonan Yu et.al. | 2408.08201 | null |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-14 | Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification | Yongcheng Li et.al. | 2408.07467 | link |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | Imagen 3 | Imagen-Team-Google et.al. | 2408.07009 | null |
2024-08-13 | New refinements of Narayana polynomials and Motzkin polynomials | Janet J. W. Dong et.al. | 2408.06912 | null |
2024-08-13 | LoRA | Jia-Chen Zhang et.al. | 2408.06854 | null |
2024-08-13 | DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion | Yujia Wu et.al. | 2408.06740 | null |
2024-08-13 | Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model | Yongcheng Li et.al. | 2408.06716 | link |
2024-08-13 | Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach | Haowei Ni et.al. | 2408.06634 | null |
2024-08-13 | Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models | Sungmin Cha et.al. | 2408.06621 | link |
2024-08-15 | ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Bohao Peng et.al. | 2408.06070 | link |
2024-08-11 | Hotfixing Large Language Models for Code | Zhou Yang et.al. | 2408.05727 | null |
2024-08-09 | TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning | Yujie Feng et.al. | 2408.05200 | link |
2024-08-09 | LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Yizhang Jin et.al. | 2408.04957 | link |
2024-08-09 | Energy performance of LR-FHSS: analysis and evaluation | Roger Sanchez-Vital et.al. | 2408.04908 | null |
2024-08-08 | Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models | Yupeng Chang et.al. | 2408.04556 | link |
2024-08-08 | UNLEARN Efficient Removal of Knowledge in Large Language Models | Tyler Lizzo et.al. | 2408.04140 | null |
2024-08-07 | Image-to-LaTeX Converter for Mathematical Formulas and Text | Daniil Gurgurov et.al. | 2408.04015 | link |
2024-08-07 | Speaker Adaptation for Quantised End-to-End ASR Models | Qiuming Zhao et.al. | 2408.03979 | null |
2024-08-07 | A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case | Sonia Meyer et.al. | 2408.03562 | null |
2024-08-11 | Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation | Jiachen Zhu et.al. | 2408.03533 | null |
2024-08-06 | FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning | Zhi Chen et.al. | 2408.03355 | null |
2024-08-06 | SARA: Singular-Value Based Adaptive Low-Rank Adaption | Jihao Gu et.al. | 2408.03290 | null |
2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
2024-08-06 | L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Elvys Linhares Pontes et.al. | 2408.03033 | null |
2024-08-06 | Towards Smart Microfarming in an Urban Computing Continuum | Marla Grunewald et.al. | 2408.02992 | null |
2024-08-05 | StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion | Zhichao Wang et.al. | 2408.02178 | null |
2024-08-04 | SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning | Biqing Qi et.al. | 2408.01970 | null |
2024-08-03 | Music2P: A Multi-Modal AI-Driven Tool for Simplifying Album Cover Design | Joong Ho Choi et.al. | 2408.01651 | link |
2024-08-02 | MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts | Lin Ning et.al. | 2408.01505 | null |
2024-08-02 | Conditional LoRA Parameter Generation | Xiaolong Jin et.al. | 2408.01415 | null |
2024-08-02 | Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer | Yu Yang et.al. | 2408.01402 | null |
2024-08-02 | Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration | Donwon Park et.al. | 2408.01099 | null |
2024-08-02 | Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs | Afia Anjum et.al. | 2408.01008 | null |
2024-08-02 | PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting | Liam Hebert et.al. | 2408.00960 | null |
2024-08-01 | Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization | Róisín Luo et.al. | 2408.00923 | null |
2024-07-31 | Ge-based Clinopyroxene series: first principles and experimental local probe study | Ricardo P. Moreira et.al. | 2407.21749 | null |
2024-07-31 | A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation | Mothilal Asokan et.al. | 2407.21739 | null |
2024-07-31 | Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation | Xiang Luo et.al. | 2407.21633 | link |
2024-07-30 | CELLM: An Efficient Communication in Large Language Models Training for Federated Learning | Raja Vavekanand et.al. | 2407.20557 | null |
2024-07-29 | Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations | Fangyijie Wang et.al. | 2407.20072 | link |
2024-07-28 | Memory-efficient Training of LLMs with Larger Mini-batches | Dang Nguyen et.al. | 2407.19580 | null |
2024-07-27 | Parameter-Efficient Fine-Tuning via Circular Convolution | Aochuan Chen et.al. | 2407.19342 | null |
2024-07-27 | The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations | Thanh-Dung Le et.al. | 2407.19299 | null |
2024-07-26 | VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain Diffusion Trained on Uniplex Stains | Shikha Dubey et.al. | 2407.19113 | null |
2024-07-25 | Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications | Alon Halfon et.al. | 2407.18990 | null |
2024-07-25 | LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Zhengbo Wang et.al. | 2407.18242 | link |
2024-07-25 | DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability | Florent Brondolo et.al. | 2407.18100 | link |
2024-07-24 | Channel-Aware Low-Rank Adaptation in Time Series Forecasting | Tong Nie et.al. | 2407.17246 | link |
2024-07-24 | Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance | Ao Shen et.al. | 2407.17029 | link |
2024-07-22 | Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters | Kartikeya Bhardwaj et.al. | 2407.16712 | null |
2024-07-23 | DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models | Zhenyu Xie et.al. | 2407.16511 | null |
2024-07-23 | Harmonizing Visual Text Comprehension and Generation | Zhen Zhao et.al. | 2407.16364 | link |
2024-07-23 | FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network | Weiying Xie et.al. | 2407.16129 | link |
2024-07-22 | Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models | Raza Imam et.al. | 2407.15913 | link |
2024-07-22 | Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders | Laura Niss et.al. | 2407.15731 | null |
2024-07-22 | LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Xi Chen et.al. | 2407.15415 | link |
2024-07-21 | Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization | Jiajun Hu et.al. | 2407.15085 | link |
2024-07-21 | MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM | Navyansh Mahla et.al. | 2407.15042 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading | Shuo Tong et.al. | 2505.00592 | null |
2025-04-30 | Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization | Anas Anwarul Haq Khan et.al. | 2504.21831 | null |
2025-04-30 | Smart Environmental Monitoring of Marine Pollution using Edge AI | Mohamed Moursi et.al. | 2504.21759 | null |
2025-04-30 | CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation | Zherui Zhang et.al. | 2504.21478 | null |
2025-04-30 | Enhancing New-item Fairness in Dynamic Recommender Systems | Huizhong Guo et.al. | 2504.21362 | null |
2025-04-30 | How to Backdoor the Knowledge Distillation | Chen Wu et.al. | 2504.21323 | null |
2025-04-30 | Redundancy Analysis and Mitigation for Machine Learning-Based Process Monitoring of Additive Manufacturing | Jiarui Xie et.al. | 2504.21317 | null |
2025-04-29 | Federated One-Shot Learning with Data Privacy and Objective-Hiding | Maximilian Egger et.al. | 2504.21182 | null |
2025-04-29 | A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection | Andreas Karathanasis et.al. | 2504.21066 | null |
2025-04-30 | DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition | Yanghui Song et.al. | 2504.20948 | link |
2025-04-30 | Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition | Tyler McDonald et.al. | 2504.20946 | null |
2025-04-29 | Evaluating Effects of Augmented SELFIES for Molecular Understanding Using QK-LSTM | Collin Beaudoin et.al. | 2504.20789 | null |
2025-04-29 | SNR-aware Semantic Image Transmission with Deep Learning-based Channel Estimation in Fading Channels | Mahmoud M. Salim et.al. | 2504.20557 | null |
2025-04-29 | SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation | Jia Wang et.al. | 2504.20501 | null |
2025-04-29 | Group Relative Knowledge Distillation: Learning from Teacher's Relational Inductive Bias | Chao Li et.al. | 2504.20482 | null |
2025-04-29 | The Estimation of Continual Causal Effect for Dataset Shifting Streams | Baining Chen et.al. | 2504.20471 | null |
2025-04-29 | Head-Tail-Aware KL Divergence in Knowledge Distillation for Spiking Neural Networks | Tianqing Zhang et.al. | 2504.20445 | null |
2025-04-28 | Mitigating Catastrophic Forgetting in the Incremental Learning of Medical Images | Sara Yavari et.al. | 2504.20033 | null |
2025-04-28 | Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom | Rishika Sen et.al. | 2504.20000 | null |
2025-04-28 | Federated Out-of-Distribution Generalization: A Causal Augmentation View | Runhui Zhang et.al. | 2504.19882 | null |
2025-04-28 | Towards Faster and More Compact Foundation Models for Molecular Property Prediction | Yasir Ghunaim et.al. | 2504.19538 | null |
2025-04-27 | Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation | Qianren Mao et.al. | 2504.19101 | null |
2025-04-26 | KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation | Jiabin Fan et.al. | 2504.19024 | null |
2025-04-26 | Revisiting Transformers through the Lens of Low Entropy and Dynamic Sparsity | Ruifeng Ren et.al. | 2504.18929 | null |
2025-04-25 | Intelligent Attacks and Defense Methods in Federated Learning-enabled Energy-Efficient Wireless Networks | Han Zhang et.al. | 2504.18519 | null |
2025-04-24 | Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction | Farhad Pourkamali-Anaraki et.al. | 2504.17655 | null |
2025-04-24 | Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation | Xin Yi et.al. | 2504.17480 | null |
2025-04-24 | Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs | Tiancheng Gu et.al. | 2504.17432 | null |
2025-04-24 | On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration | Maoyang Xiang et.al. | 2504.17376 | null |
2025-04-24 | Range Image-Based Implicit Neural Compression for LiDAR Point Clouds | Akihiro Kuwabara et.al. | 2504.17229 | null |
2025-04-24 | Does Knowledge Distillation Matter for Large Language Model based Bundle Generation? | Kaidong Feng et.al. | 2504.17220 | null |
2025-04-23 | Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification | Alexander Shvets et.al. | 2504.16856 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-21 | Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT | Stanley Mugisha et.al. | 2504.16128 | null |
2025-04-21 | MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation | Xingxing Zuo et.al. | 2504.16127 | null |
2025-04-22 | Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability | Daniel Hendriks et.al. | 2504.16056 | null |
2025-04-21 | Linear Item-Item Model with Neural Knowledge for Session-based Recommendation | Minjin Choi et.al. | 2504.15057 | null |
2025-04-22 | Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification | Shiben Liu et.al. | 2504.15041 | link |
2025-04-20 | Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions | Luyang Fang et.al. | 2504.14772 | null |
2025-04-20 | Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis | Jingjing Ren et.al. | 2504.14470 | null |
2025-04-19 | Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models | Patrick Haller et.al. | 2504.14366 | null |
2025-04-19 | Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation | Muhammad Haseeb Aslam et.al. | 2504.14307 | null |
2025-04-19 | A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models | Chengming Wang et.al. | 2504.14241 | null |
2025-04-19 | Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration | Hongji Li et.al. | 2504.14214 | link |
2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
2025-04-18 | From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs | Jiliang Ni et.al. | 2504.13471 | null |
2025-04-17 | ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs | Yan Yang et.al. | 2504.13237 | null |
2025-04-17 | Scaling Laws for Data-Efficient Visual Transfer Learning | Wenxuan Yang et.al. | 2504.13219 | null |
2025-04-16 | Transferable Deployment of Semantic Edge Inference Systems via Unsupervised Domain Adaption | Weiqiang Jiao et.al. | 2504.11873 | null |
2025-04-15 | A Dual-Space Framework for General Knowledge Distillation of Large Language Models | Xue Zhang et.al. | 2504.11426 | null |
2025-04-15 | Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning | Ali Taghibakhshi et.al. | 2504.11409 | null |
2025-04-15 | Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution | Xinning Chai et.al. | 2504.11271 | link |
2025-04-15 | Efficient Reasoning Models: A Survey | Sicheng Feng et.al. | 2504.10903 | link |
2025-04-14 | Optimising Intrusion Detection Systems in Cloud-Edge Continuum with Knowledge Distillation for Privacy-Preserving and Efficient Communication | Soad Almabdy et.al. | 2504.10698 | null |
2025-04-14 | Better Estimation of the KL Divergence Between Language Models | Afra Amini et.al. | 2504.10637 | link |
2025-04-14 | Digital Staining with Knowledge Distillation: A Unified Framework for Unpaired and Paired-But-Misaligned Data | Ziwang Xu et.al. | 2504.09899 | link |
2025-04-14 | DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation | Beomseok Kang et.al. | 2504.09814 | null |
2025-04-14 | CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices | Jingxuan Zhou et.al. | 2504.09803 | null |
2025-04-13 | Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models? | Christophe El Zeinaty et.al. | 2504.09685 | null |
2025-04-12 | Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking | You Wu et.al. | 2504.09228 | null |
2025-04-12 | Langformers: Unified NLP Pipelines for Language Models | Rabindra Lamsal et.al. | 2504.09170 | null |
2025-04-12 | Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization | Gen Li et.al. | 2504.09039 | null |
2025-04-11 | Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities | Maria Santos-Villafranca et.al. | 2504.08578 | null |
2025-04-11 | Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery | Alireza Fathalizadeh et.al. | 2504.08550 | link |
2025-04-11 | Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images | Jinghe Yang et.al. | 2504.08253 | null |
2025-04-10 | Towards Unconstrained 2D Pose Estimation of the Human Spine | Muhammad Saif Ullah Khan et.al. | 2504.08110 | null |
2025-04-10 | SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement | Xiyao Wang et.al. | 2504.07934 | link |
2025-04-10 | Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation | Yanglin Huang et.al. | 2504.07691 | null |
2025-04-10 | ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement | Anning Hu et.al. | 2504.07418 | null |
2025-04-10 | WK-Pnet: FM-Based Positioning via Wavelet Packet Decomposition and Knowledge Distillation | Shilian Zheng et.al. | 2504.07399 | null |
2025-04-09 | Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer | Shi Pan et.al. | 2504.07061 | null |
2025-04-08 | Multi-Sense Embeddings for Language Models and Knowledge Distillation | Qitong Wang et.al. | 2504.06036 | null |
2025-04-08 | CoA: Towards Real Image Dehazing via Compression-and-Adaptation | Long Ma et.al. | 2504.05590 | null |
2025-04-07 | Learning Activity View-invariance Under Extreme Viewpoint Changes via Curriculum Knowledge Distillation | Arjun Somayazulu et.al. | 2504.05451 | null |
2025-04-07 | Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization | Zeqin Yu et.al. | 2504.05224 | null |
2025-04-07 | Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework | Yu Min Park et.al. | 2504.05187 | null |
2025-04-07 | GOTHAM: Graph Class Incremental Learning Framework under Weak Supervision | Aditya Hemant Shahane et.al. | 2504.04954 | link |
2025-04-07 | Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models | Yoojin Jung et.al. | 2504.04747 | null |
2025-04-07 | T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models | Minki Kang et.al. | 2504.04718 | null |
2025-04-06 | A Novel Algorithm for Personalized Federated Learning: Knowledge Distillation with Weighted Combination Loss | Hengrui Hu et.al. | 2504.04642 | null |
2025-04-08 | Your Image Generator Is Your New Private Dataset | Nicolo Resmini et.al. | 2504.04582 | null |
2025-04-06 | Compression Laws for Large Language Models | Ayan Sengupta et.al. | 2504.04342 | null |
2025-04-05 | Towards Understanding and Improving Refusal in Compressed Models via Mechanistic Interpretability | Vishnu Kabir Chhabra et.al. | 2504.04215 | null |
2025-04-05 | CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation | Kai Fang et.al. | 2504.04156 | null |
2025-04-04 | RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation | Hanbo Bi et.al. | 2504.03166 | null |
2025-04-03 | Compositionality Unlocks Deep Interpretable Models | Thomas Dooms et.al. | 2504.02667 | null |
2025-04-03 | UNDO: Understanding Distillation as Optimization | Kushal Jain et.al. | 2504.02521 | null |
2025-04-03 | Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation | Laibin Chang et.al. | 2504.02391 | null |
2025-04-03 | Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation | Chengxi Zeng et.al. | 2504.02351 | null |
2025-04-03 | Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation | Wupeng Wang et.al. | 2504.02302 | null |
2025-04-03 | Beyond Conventional Transformers: The Medical X-ray Attention (MXA) Block for Improved Multi-Label Diagnosis Using Knowledge Distillation | Amit Rand et.al. | 2504.02277 | link |
2025-04-02 | MDP: Multidimensional Vision Model Pruning with Latency Constraint | Xinglong Sun et.al. | 2504.02168 | null |
2025-04-02 | FlowDistill: Scalable Traffic Flow Prediction via Distillation from LLMs | Chenyang Yu et.al. | 2504.02094 | link |
2025-04-02 | A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines | Calvin Kinateder et.al. | 2504.01798 | null |
2025-04-02 | KD | Eduardo Fernandes Montesuma et.al. | 2504.01757 | null |
2025-04-02 | Style over Substance: Distilled Language Models Reason Via Stylistic Replication | Philip Lippmann et.al. | 2504.01738 | null |
2025-04-01 | Data-free Knowledge Distillation with Diffusion Models | Xiaohua Qi et.al. | 2504.00870 | null |
2025-04-01 | Global Intervention and Distillation for Federated Out-of-Distribution Generalization | Zhuang Qi et.al. | 2504.00850 | null |
2025-04-01 | Sample-level Adaptive Knowledge Distillation for Action Recognition | Ping Li et.al. | 2504.00606 | null |
2025-04-02 | Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks | Yuang Jia et.al. | 2504.00540 | null |
2025-03-31 | Is LLM the Silver Bullet to Low-Resource Languages Machine Translation? | Yewei Song et.al. | 2503.24102 | null |
2025-03-31 | A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing | Lars Möllenbrok et.al. | 2503.24088 | null |
2025-03-31 | Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | Chenqi Guo et.al. | 2503.24017 | null |
2025-03-31 | Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion | Jiagen Li et.al. | 2503.23721 | null |
2025-03-28 | Efficient Verified Machine Unlearning For Distillation | Yijun Quan et.al. | 2503.22539 | null |
2025-03-28 | Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces | Wonhyeok Choi et.al. | 2503.22209 | null |
2025-03-28 | Multi-modal Knowledge Distillation-based Human Trajectory Forecasting | Jaewoo Jeong et.al. | 2503.22201 | link |
2025-03-28 | Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation | Chuan-Wei Kuo et.al. | 2503.22074 | null |
2025-03-28 | Multi-Task Semantic Communications via Large Models | Wanli Ni et.al. | 2503.22064 | null |
2025-03-27 | Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration | Yujie Chen et.al. | 2503.21970 | null |
2025-03-27 | A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices | Ci-Hao Wu et.al. | 2503.21335 | null |
2025-03-27 | DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset | Ling Feng et.al. | 2503.21323 | null |
2025-03-27 | Delving Deep into Semantic Relation Distillation | Zhaoyi Yan et.al. | 2503.21269 | null |
2025-03-27 | MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness | Zihao Zheng et.al. | 2503.21135 | null |
2025-03-27 | Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search | Yedan Shen et.al. | 2503.21098 | null |
2025-03-26 | Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications | Mahya Nikouei et.al. | 2503.20516 | null |
2025-03-26 | MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation | Rongyu Zhang et.al. | 2503.20384 | null |
2025-03-26 | Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning | Yousef Sadegheih et.al. | 2503.20326 | link |
2025-03-25 | Scaling Down Text Encoders of Text-to-Image Diffusion Models | Lifu Wang et.al. | 2503.19897 | link |
2025-03-23 | FedSKD: Aggregation-free Model-heterogeneous Federated Learning using Multi-dimensional Similarity Knowledge Distillation | Ziqiao Weng et.al. | 2503.18981 | null |
2025-03-24 | DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Karim Abou Zeid et.al. | 2503.18944 | link |
2025-03-24 | Distilling Stereo Networks for Performant and Efficient Leaner Networks | Rafia Rahim et.al. | 2503.18544 | link |
2025-03-24 | Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control | Basim Azam et.al. | 2503.18324 | null |
2025-03-23 | CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation | Jungsoo Lee et.al. | 2503.18244 | null |
2025-03-22 | OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery | Vignesh Prabhakar et.al. | 2503.17604 | null |
2025-03-21 | Efficient Knowledge Distillation via Curriculum Extraction | Shivam Gupta et.al. | 2503.17494 | null |
2025-03-21 | Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs | Reem Gody et.al. | 2503.17336 | null |
2025-03-21 | Large Language Model Compression via the Nested Activation-Aware Decomposition | Jun Lu et.al. | 2503.17101 | null |
2025-03-21 | Distilling Monocular Foundation Model for Fine-grained Depth Completion | Yingping Liang et.al. | 2503.16970 | null |
2025-03-21 | Temporal Action Detection Model Compression by Progressive Block Drop | Xiaoyong Chen et.al. | 2503.16916 | null |
2025-03-21 | Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs | Anshumann et.al. | 2503.16870 | null |
2025-03-21 | City2Scene: Improving Acoustic Scene Classification with City Features | Yiqiang Cai et.al. | 2503.16862 | null |
2025-03-20 | Bezier Distillation | Ling Feng et.al. | 2503.16562 | null |
2025-03-20 | Federated Quantum-Train Long Short-Term Memory for Gravitational Wave Signal | Chen-Yu Liu et.al. | 2503.16049 | null |
2025-03-20 | InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer | Tony Zhang et.al. | 2503.15983 | null |
2025-03-19 | KoGNER: A Novel Framework for Knowledge Graph Distillation on Biomedical Named Entity Recognition | Heming Zhang et.al. | 2503.15737 | null |
2025-03-19 | Technical Report for the 5th CLVision Challenge at CVPR: Addressing the Class-Incremental with Repetition using Unlabeled Data -- 4th Place Solution | Panagiota Moraiti et.al. | 2503.15697 | link |
2025-03-19 | High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight | Cédric Vincent et.al. | 2503.15676 | link |
2025-03-19 | DCA: Dividing and Conquering Amnesia in Incremental Object Detection | Aoting Zhang et.al. | 2503.15295 | null |
2025-03-20 | Distilling 3D distinctive local descriptors for 6D pose estimation | Amir Hamza et.al. | 2503.15106 | null |
2025-03-19 | Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening | Zihan Cao et.al. | 2503.14975 | null |
2025-03-19 | Ensemble Knowledge Distillation for Machine Learning Interatomic Potentials | Sakib Matin et.al. | 2503.14293 | null |
2025-03-18 | SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation | Weihong Chen et.al. | 2503.14097 | null |
2025-03-18 | Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection | Chunlei Li et.al. | 2503.13828 | link |
2025-03-17 | DynSTG-Mamba: Dynamic Spatio-Temporal Graph Mamba with Cross-Graph Knowledge Distillation for Gait Disorders Recognition | Zakariae Zrimek et.al. | 2503.13156 | null |
2025-03-17 | ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning | Baohao Liao et.al. | 2503.13089 | null |
2025-03-17 | Historic Scripts to Modern Vision: A Novel Dataset and A VLM Framework for Transliteration of Modi Script to Devanagari | Harshal Kausadikar et.al. | 2503.13060 | null |
2025-03-17 | Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation | Nassim Ali Ousalah et.al. | 2503.13053 | null |
2025-03-17 | Knowledge Distillation: Enhancing Neural Network Compression with Integrated Gradients | David E. Hernandez et.al. | 2503.13008 | null |
2025-03-17 | ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing | Aditi Tiwari et.al. | 2503.12852 | null |
2025-03-17 | CompMarkGS: Robust Watermarking for Compression 3D Gaussian Splatting | Sumin In et.al. | 2503.12836 | null |
2025-03-17 | Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation | Kailin Li et.al. | 2503.12820 | null |
2025-03-16 | Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning | Khayrul Islam et.al. | 2503.12622 | link |
2025-03-16 | UniBERTs: Adversarial Training for Language-Universal Representations | Andrei-Marius Avram et.al. | 2503.12608 | null |
2025-03-14 | Exploring Performance-Complexity Trade-Offs in Sound Event Detection | Tobias Morocutti et.al. | 2503.11373 | link |
2025-03-14 | Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification | Tobias Morocutti et.al. | 2503.11363 | null |
2025-03-14 | Enabling Weak Client Participation via On-device Knowledge Distillation in Heterogenous Federated Learning | Jihyun Lim et.al. | 2503.11151 | null |
2025-03-12 | CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Hariprasath Govindarajan et.al. | 2503.09878 | null |
2025-03-12 | Vi-LAD: Vision-Language Attention Distillation for Socially-Aware Robot Navigation in Dynamic Environments | Mohamed Elnoor et.al. | 2503.09820 | null |
2025-03-16 | xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation | Elio Musacchio et.al. | 2503.09313 | null |
2025-03-12 | Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | Maximilian Abstreiter et.al. | 2503.09114 | null |
2025-03-12 | Discovering Influential Neuron Path in Vision Transformers | Yifan Wang et.al. | 2503.09046 | null |
2025-03-12 | Adaptive Temperature Based on Logits Correlation in Knowledge Distillation | Kazuhiro Matsuyama et.al. | 2503.09030 | link |
2025-03-12 | Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds | Dikai Liu et.al. | 2503.08997 | null |
2025-03-11 | LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization | Xianfeng Wu et.al. | 2503.08619 | link |
2025-03-11 | Position-Aware Depth Decay Decoding ( | Siqi Fan et.al. | 2503.08524 | null |
2025-03-11 | Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation | Deyi Ji et.al. | 2503.08043 | null |
2025-03-11 | Generalized Kullback-Leibler Divergence Loss | Jiequan Cui et.al. | 2503.08038 | null |
2025-03-10 | Training Domain Draft Models for Speculative Decoding: Best Practices and Insights | Fenglu Hong et.al. | 2503.07807 | null |
2025-03-10 | ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning | Soumya Banerjee et.al. | 2503.07506 | null |
2025-03-10 | Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification | Thomas Boucher et.al. | 2503.07294 | null |
2025-03-10 | CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting | Haicheng Liao et.al. | 2503.07234 | null |
2025-03-10 | PTMs-TSCIL Pre-Trained Models Based Class-Incremental Learning | Yuanlong Wu et.al. | 2503.07153 | null |
2025-03-10 | Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation | Pengchen Liang et.al. | 2503.06976 | null |
2025-03-09 | Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence | Zhaowei Chen et.al. | 2503.06685 | null |
2025-03-09 | Towards Superior Quantization Accuracy: A Layer-sensitive Approach | Feng Zhang et.al. | 2503.06518 | null |
2025-03-09 | HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast | Yiting Zheng et.al. | 2503.06511 | null |
2025-03-09 | Causality Enhanced Origin-Destination Flow Prediction in Data-Scarce Cities | Tao Feng et.al. | 2503.06398 | null |
2025-03-08 | ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation | Qizhen Lan et.al. | 2503.06307 | null |
2025-03-07 | Semantic Shift Estimation via Dual-Projection and Classifier Reconstruction for Exemplar-Free Class-Incremental Learning | Run He et.al. | 2503.05423 | null |
2025-03-07 | Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification | Dingkun Liu et.al. | 2503.05349 | link |
2025-03-07 | Similarity-Based Domain Adaptation with LLMs | Jie He et.al. | 2503.05281 | null |
2025-03-06 | LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Souvik Kundu et.al. | 2503.04982 | null |
2025-03-06 | TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation | Lin Sun et.al. | 2503.04872 | null |
2025-03-05 | ZAugNet for Z-Slice Augmentation in Bio-Imaging | Alessandro Pasqui et.al. | 2503.04843 | link |
2025-03-07 | No Forgetting Learning: Memory-free Continual Learning | Mohammad Ali Vahedifar et.al. | 2503.04638 | null |
2025-03-06 | CrowdHMTware: A Cross-level Co-adaptation Middleware for Context-aware Mobile DL Deployment | Sicong Liu et.al. | 2503.04183 | null |
2025-03-05 | Evaluating Compression and Nanoindentation in FCC Nickel: A Methodology for Interatomic Potential Selection | K. Cichocki et.al. | 2503.03723 | null |
2025-03-05 | KLiNQ: Knowledge Distillation-Assisted Lightweight Neural Network for Qubit Readout on FPGA | Xiaorang Guo et.al. | 2503.03544 | null |
2025-03-05 | Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks | Kairong Yu et.al. | 2503.03144 | null |
2025-03-05 | FairSense-AI: Responsible AI Meets Sustainability | Shaina Raza et.al. | 2503.02865 | null |
2025-03-04 | 10K is Enough: An Ultra-Lightweight Binarized Network for Infrared Small-Target Detection | Biqiao Xin et.al. | 2503.02662 | null |
2025-03-04 | It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation | Sohan Patnaik et.al. | 2503.02463 | null |
2025-03-04 | Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration | Pengchen Liang et.al. | 2503.02321 | null |
2025-03-03 | Mamba base PKD for efficient knowledge compression | José Medina et.al. | 2503.01727 | null |
2025-03-03 | DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems | Minoo Hosseinzadeh et.al. | 2503.01704 | null |
2025-03-03 | Revisiting Large Language Model Pruning using Neuron Semantic Attribution | Yizhuo Ding et.al. | 2503.01542 | null |
2025-03-01 | SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection | Xin Lin et.al. | 2503.00414 | link |
2025-03-01 | Energy-Efficient Edge Inference in Integrated Sensing, Communication, and Computation Networks | Jiacheng Yao et.al. | 2503.00298 | null |
2025-02-28 | Real-Time Aerial Fire Detection on Resource-Constrained Devices Using Knowledge Distillation | Sabina Jangirova et.al. | 2502.20979 | null |
2025-02-28 | VRM: Knowledge Distillation via Virtual Relation Matching | Weijia Zhang et.al. | 2502.20760 | null |
2025-02-27 | SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models | Zicheng Cai et.al. | 2502.20422 | null |
2025-02-27 | KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model | Kai Zhang et.al. | 2502.20350 | null |
2025-02-27 | Granite Embedding Models | Parul Awasthy et.al. | 2502.20204 | null |
2025-02-28 | Behind the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models | Sibo Yi et.al. | 2502.19883 | null |
2025-02-28 | Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval | Jiaxing Li et.al. | 2502.19751 | null |
2025-02-27 | XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs | Linyang He et.al. | 2502.19737 | null |
2025-02-26 | Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | Ashley Lewis et.al. | 2502.19545 | null |
2025-02-26 | Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach | Anton Backhaus et.al. | 2502.19177 | null |
2025-02-25 | AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | Joshua Sakthivel Raju et.al. | 2502.18020 | null |
2025-02-25 | Advantage-Guided Distillation for Preference Alignment in Small Language Models | Shiping Gao et.al. | 2502.17927 | link |
2025-02-25 | From underwater to aerial: a novel multi-scale knowledge distillation approach for coral reef monitoring | Matteo Contini et.al. | 2502.17883 | link |
2025-02-24 | Knowledge Distillation with Training Wheels | Guanlin Liu et.al. | 2502.17717 | null |
2025-02-24 | The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? | Zhenheng Tang et.al. | 2502.17535 | null |
2025-02-24 | CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Vishal Thengane et.al. | 2502.17429 | link |
2025-02-24 | Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing | Zhuoran Li et.al. | 2502.17308 | null |
2025-02-24 | Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation | Wenyuan Wu et.al. | 2502.17003 | null |
2025-02-24 | PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation | Eleftherios Ioannou et.al. | 2502.16996 | null |
2025-02-25 | CoT2Align: Cross-Chain of Thought Distillation via Optimal Transport Alignment for Language Models with Different Tokenizers | Anh Duc Le et.al. | 2502.16806 | null |
2025-02-24 | A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition | Dewan Tauhid Rahman et.al. | 2502.16762 | null |
2025-02-23 | EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation | Hong Cai Chen et.al. | 2502.16541 | null |
2025-02-21 | A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models | Yuchen Jiang et.al. | 2502.15959 | link |
2025-02-21 | Scaling Sparse and Dense Retrieval in Decoder-Only LLMs | Hansi Zeng et.al. | 2502.15526 | link |
2025-02-21 | When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models | Weilan Wang et.al. | 2502.15443 | null |
2025-02-20 | Optimizing Singular Spectrum for Large Language Model Compression | Dengjie Li et.al. | 2502.15092 | null |
2025-02-20 | Modifying Final Splits of Classification Tree for Fine-tuning Subpopulation Target in Policy Making | Lei Bill Wang et.al. | 2502.15072 | null |
2025-02-20 | TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation | Juntong Ni et.al. | 2502.15016 | null |
2025-02-20 | Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery | Minh-Quyet Ha et.al. | 2502.14631 | null |
2025-02-21 | Vision Foundation Models in Medical Image Analysis: Advances and Challenges | Pengchen Liang et.al. | 2502.14584 | null |
2025-02-20 | Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | Wonhyeok Choi et.al. | 2502.14573 | null |
2025-02-20 | Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications | Kayhan Behdin et.al. | 2502.14305 | null |
2025-02-20 | Designing Parameter and Compute Efficient Diffusion Transformers using Distillation | Vignesh Sundaresha et.al. | 2502.14226 | null |
2025-02-19 | MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation | Romina Aalishah et.al. | 2502.14090 | null |
2025-02-19 | Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture | Shijin Duan et.al. | 2502.14075 | null |
2025-02-19 | Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles | Orestis Konstantaropoulos et.al. | 2502.14023 | null |
2025-02-19 | MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures | Jiayu Qin et.al. | 2502.14008 | null |
2025-02-19 | Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning | Caihua Liu et.al. | 2502.13754 | null |
2025-02-19 | JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework | Ziyuan Liu et.al. | 2502.13407 | link |
2025-02-18 | NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions | Weizhe Yuan et.al. | 2502.13124 | null |
2025-02-18 | Does Training with Synthetic Data Truly Protect Privacy? | Yunpeng Zhao et.al. | 2502.12976 | link |
2025-02-18 | Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models | Gyeongman Kim et.al. | 2502.12947 | null |
2025-02-18 | Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models | Neeraj Gangwar et.al. | 2502.12855 | null |
2025-02-18 | PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery | Bowei He et.al. | 2502.12594 | null |
2025-02-17 | FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control | Yutong Ye et.al. | 2502.11937 | null |
2025-02-17 | Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation | Zengkui Sun et.al. | 2502.11766 | link |
2025-02-17 | Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation? | Leyi Pan et.al. | 2502.11598 | link |
2025-02-17 | Leave No One Behind: Enhancing Diversity While Maintaining Accuracy in Social Recommendation | Lei Li et.al. | 2502.11374 | link |
2025-02-16 | Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation | Hieu Nguyen et.al. | 2502.11306 | null |
2025-02-16 | Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | Thanushon Sivakaran et.al. | 2502.11258 | null |
2025-02-16 | DAViMNet: SSMs-Based Domain Adaptive Object Detection | A. Enes Doruk et.al. | 2502.11178 | link |
2025-02-16 | Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping | Yijie Chen et.al. | 2502.11104 | link |
2025-02-15 | LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs | Amit Roy et.al. | 2502.10914 | null |
2025-02-15 | OPTISHEAR: Towards Efficient and Adaptive Pruning of Large Language Models via Evolutionary Optimization | Shuqi Liu et.al. | 2502.10735 | null |
2025-02-14 | Forget the Data and Fine-Tuning! Just Fold the Network to Compress | Dong Wang et.al. | 2502.10216 | link |
2025-02-14 | Can Post-Training Quantization Benefit from an Additional QLoRA Integration? | Xiliang Zhu et.al. | 2502.10202 | null |
2025-02-13 | Automatic Pruning via Structured Lasso with Class-wise Information | Xiang Liu et.al. | 2502.09125 | null |
2025-02-13 | AIDE: Agentically Improve Visual Language Model with Domain Experts | Ming-Chang Chiu et.al. | 2502.09051 | null |
2025-02-12 | PLayer-FL: A Principled Approach to Personalized Layer-wise Cross-Silo Federated Learning | Ahmed Elhussein et.al. | 2502.08829 | link |
2025-02-12 | LLM Pretraining with Continuous Concepts | Jihoon Tack et.al. | 2502.08524 | null |
2025-02-12 | Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning | Barnaby Schmitt et.al. | 2502.08323 | null |
2025-02-11 | Vision-Language Models for Edge Networks: A Comprehensive Survey | Ahmed Sharshar et.al. | 2502.07855 | null |
2025-02-11 | DarwinLM: Evolutionary Structured Pruning of Large Language Models | Shengkun Tang et.al. | 2502.07780 | link |
2025-02-11 | Breaking Down Bias: On The Limits of Generalizable Pruning Strategies | Sibo Ma et.al. | 2502.07771 | null |
2025-02-11 | Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | Zhaodong Bing et.al. | 2502.07436 | null |
2025-02-11 | OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms | Lumen AI et.al. | 2502.07312 | link |
2025-02-11 | Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification | Zicheng Liu et.al. | 2502.07299 | null |
2025-02-10 | DROP: Poison Dilution via Knowledge Distillation for Federated Learning | Georgios Syros et.al. | 2502.07011 | link |
2025-02-10 | A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer | Lirong Wu et.al. | 2502.06913 | link |
2025-02-13 | Rationalization Models for Text-to-SQL | Gaetano Rossiello et.al. | 2502.06759 | null |
2025-02-10 | Systematic Outliers in Large Language Models | Yongqi An et.al. | 2502.06415 | link |
2025-02-10 | Progressive Collaborative and Semantic Knowledge Fusion for Generative Recommendation | Longtao Xiao et.al. | 2502.06269 | null |
2025-02-10 | Right Time to Learn: Promoting Generalization via Bio-inspired Spacing Effect in Knowledge Distillation | Guanglong Sun et.al. | 2502.06192 | null |
2025-02-10 | Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures | Yaoxin Yang et.al. | 2502.06189 | null |
2025-02-10 | A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar | Seung-Hyun Song et.al. | 2502.06114 | null |
2025-02-09 | ClinKD: Cross-Modal Clinic Knowledge Distiller For Multi-Task Medical Images | Hongyu Ge et.al. | 2502.05928 | link |
2025-02-09 | Learning Accurate, Efficient, and Interpretable MLPs on Multiplex Graphs via Node-wise Multi-View Ensemble Distillation | Yunhui Liu et.al. | 2502.05864 | null |
2025-02-09 | Synergistic Effects of Knowledge Distillation and Structured Pruning for Self-Supervised Speech Models | Shiva Kumar C et.al. | 2502.05837 | null |
2025-02-09 | Contrastive Representation Distillation via Multi-Scale Feature Decoupling | Cuipeng Wang et.al. | 2502.05835 | null |
2025-02-07 | Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement | Xihao Yuan et.al. | 2502.04711 | null |
2025-02-06 | Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation | Chenyang Huang et.al. | 2502.04537 | link |
2025-02-06 | Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much) | Zony Yu et.al. | 2502.04499 | null |
2025-02-06 | PGB: One-Shot Pruning for BERT via Weight Grouping and Permutation | Hyemin Lim et.al. | 2502.03984 | null |
2025-02-06 | Towards Unified Music Emotion Recognition across Dimensional and Categorical Models | Jaeyong Kang et.al. | 2502.03979 | link |
2025-02-06 | BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation | Bo Pang et.al. | 2502.03860 | null |
2025-02-06 | Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation | Lin Li et.al. | 2502.03856 | null |
2025-02-05 | Knowledge Distillation from Large Language Models for Household Energy Modeling | Mohannad Takrouri et.al. | 2502.03034 | null |
2025-02-05 | Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons | Renjun Hu et.al. | 2502.02988 | null |
2025-02-04 | Theoretical Guarantees for Low-Rank Compression of Deep Neural Networks | Shihao Zhang et.al. | 2502.02766 | null |
2025-02-04 | On Teacher Hacking in Language Model Distillation | Daniil Tiapkin et.al. | 2502.02671 | null |
2025-02-04 | Activation-Informed Merging of Large Language Models | Amin Heyrani Nobari et.al. | 2502.02421 | link |
2025-02-03 | Memorization Inheritance in Sequence-Level Knowledge Distillation for Neural Machine Translation | Verna Dankers et.al. | 2502.01491 | null |
2025-02-03 | Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity | Alessandro Pierro et.al. | 2502.01330 | null |
2025-02-03 | CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation | Xiao Lin et.al. | 2502.01312 | null |
2025-02-03 | A Framework for Double-Blind Federated Adaptation of Foundation Models | Nurbek Tastan et.al. | 2502.01289 | null |
2025-02-03 | MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks | Alejandro Guerra-Manzanares et.al. | 2502.01158 | null |
2025-02-02 | Huff-LLM: End-to-End Lossless Compression for Efficient LLM Inference | Patrick Yubeaton et.al. | 2502.00922 | null |
2025-02-02 | Attention Sinks and Outlier Features: A 'Catch, Tag, and Release' Mechanism for Embeddings | Stephen Zhang et.al. | 2502.00919 | null |
2025-02-02 | FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation | Wenzheng Jiang et.al. | 2502.00870 | link |
2025-02-02 | VLM-Assisted Continual learning for Visual Question Answering in Self-Driving | Yuxin Lin et.al. | 2502.00843 | null |
2025-01-31 | Imagine with the Teacher: Complete Shape in a Multi-View Distillation Way | Zhanpeng Luo et.al. | 2501.19270 | null |
2025-01-31 | Position: Curvature Matrices Should Be Democratized via Linear Operators | Felix Dangel et.al. | 2501.19183 | null |
2025-01-31 | Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models | Jialin Zhao et.al. | 2501.19090 | null |
2025-02-04 | Efficient Supernet Training with Orthogonal Softmax for Scalable ASR Model Compression | Jingjing Xu et.al. | 2501.18895 | null |
2025-01-30 | Rethinking the Upsampling Layer in Hyperspectral Image Super Resolution | Haohan Shi et.al. | 2501.18664 | null |
2025-01-30 | SAFL: Structure-Aware Personalized Federated Learning via Client-Specific Clustering and SCSI-Guided Model Pruning | Nan Li et.al. | 2501.18659 | null |
2025-01-30 | Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design | Amna Murtada et.al. | 2501.18538 | null |
2025-01-30 | SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer | Enze Xie et.al. | 2501.18427 | null |
2025-01-29 | RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems | Duy A. Nguyen et.al. | 2501.18056 | null |
2025-01-29 | Perforated Backpropagation: A Neuroscience Inspired Extension to Artificial Neural Networks | Rorry Brenner et.al. | 2501.18018 | link |
2025-01-29 | Distilling Knowledge for Designing Computational Imaging Systems | Leon Suarez-Rodriguez et.al. | 2501.17898 | link |
2025-01-29 | Tapor: 3D Hand Pose Reconstruction with Fully Passive Thermal Sensing for Around-device Interactions | Xie Zhang et.al. | 2501.17585 | link |
2025-01-28 | A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts | Hossein Mirzaei et.al. | 2501.17289 | null |
2025-01-28 | FedEFM: Federated Endovascular Foundation Model with Unseen Data | Tuong Do et.al. | 2501.16992 | null |
2025-01-28 | Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning | Xi Chen et.al. | 2501.16966 | null |
2025-01-29 | TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models | Makoto Shing et.al. | 2501.16937 | null |
2025-01-28 | Target-driven Self-Distillation for Partial Observed Trajectories Forecasting | Pengfei Zhu et.al. | 2501.16767 | null |
2025-01-28 | Efficient Knowledge Distillation of SAM for Medical Image Segmentation | Kunal Dasharath Patil et.al. | 2501.16740 | null |
2025-01-30 | Return of the Encoder: Maximizing Parameter Efficiency for SLMs | Mohamed Elfeki et.al. | 2501.16273 | link |
2025-01-27 | PISCO: Pretty Simple Compression for Retrieval-Augmented Generation | Maxime Louis et.al. | 2501.16075 | null |
2025-01-26 | MimicGait: A Model Agnostic approach for Occluded Gait Recognition using Correlational Knowledge Distillation | Ayush Gupta et.al. | 2501.15666 | link |
2025-01-26 | Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis | Robinson Umeike et.al. | 2501.15370 | null |
2025-01-25 | You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning | Ayan Sengupta et.al. | 2501.15296 | null |
2025-01-25 | Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | Yu Qiao et.al. | 2501.15257 | null |
2025-01-25 | Quark: Implementing Convolutional Neural Networks Entirely on Programmable Data Plane | Mai Zhang et.al. | 2501.15100 | null |
2025-01-25 | Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval | Bingjun Luo et.al. | 2501.15052 | null |
2025-01-25 | On Accelerating Edge AI: Optimizing Resource-Constrained Environments | Jacob Sander et.al. | 2501.15014 | null |
2025-01-24 | Remining Hard Negatives for Generative Pseudo Labeled Domain Adaptation | Goksenin Yuksel et.al. | 2501.14434 | null |
2025-01-24 | Multimodal Prescriptive Deep Learning | Dimitris Bertsimas et.al. | 2501.14152 | null |
2025-01-23 | Unlearning Clients, Features and Samples in Vertical Federated Learning | Ayush K. Varshney et.al. | 2501.13683 | null |
2025-01-24 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | link |
2025-01-22 | LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation | Jiahao Wang et.al. | 2501.12976 | null |
2025-01-22 | Practical quantum federated learning and its experimental demonstration | Zhi-Ping Liu et.al. | 2501.12709 | null |
2025-01-24 | EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation | Yifan Yu et.al. | 2501.12689 | null |
2025-01-22 | Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation | Jan Christian Blaise Cruz et.al. | 2501.12660 | null |
2025-01-22 | Toward Model-centric Heterogeneous Federated Graph Learning: A Knowledge-driven Approach | Huilin Lai et.al. | 2501.12624 | null |
2025-01-21 | Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor | Jiaqi Guo et.al. | 2501.12524 | link |
2025-01-19 | AI Based Font Pair Suggestion Modelling For Graphic Design | Aryan Singh et.al. | 2501.10969 | null |
2025-01-18 | Learning to reconstruct signals with inexact sensing operator via knowledge distillation | Roman Jacome et.al. | 2501.10794 | null |
2025-01-18 | DNA 1.0 Technical Report | Jungyup Lee et.al. | 2501.10648 | null |
2025-01-17 | MultiPruner: Balanced Structure Removal in Foundation Models | J. Pablo Muñoz et.al. | 2501.09949 | link |
2025-01-16 | Enhancing Generalization in Chain of Thought Reasoning for Smaller Models | Maxwell J. Yin et.al. | 2501.09804 | null |
2025-01-16 | Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures | Pratyush Dhingra et.al. | 2501.09588 | null |
2025-01-19 | Class Incremental Fault Diagnosis under Limited Fault Data via Supervised Contrastive Knowledge Distillation | Hanrong Zhang et.al. | 2501.09525 | link |
2025-01-16 | FASP: Fast and Accurate Structured Pruning of Large Language Models | Hanyu Hu et.al. | 2501.09412 | null |
2025-01-16 | Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression | Yongheng Zhang et.al. | 2501.09321 | null |
2025-01-16 | Knowledge Distillation for Image Restoration: Simultaneous Learning from Degraded and Clean Images | Yongheng Zhang et.al. | 2501.09268 | null |
2025-01-15 | Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians | Ishan Amin et.al. | 2501.09009 | link |
2025-01-17 | VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science | Youssef Abdalla et.al. | 2501.08995 | link |
2025-01-15 | Feature-based One-For-All: A Universal Framework for Heterogeneous Knowledge Distillation | Jhe-Hao Lin et.al. | 2501.08885 | null |
2025-01-15 | SWSC: Shared Weight for Similar Channel in LLM | Binrui Zeng et.al. | 2501.08631 | null |
2025-01-14 | Self-Attentive Spatio-Temporal Calibration for Precise Intermediate Layer Matching in ANN-to-SNN Distillation | Di Hong et.al. | 2501.08049 | link |
2025-01-14 | Balance Divergence for Knowledge Distillation | Yafei Qi et.al. | 2501.07804 | null |
2025-01-13 | A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion | Fabio Montello et.al. | 2501.07451 | null |
2025-01-13 | Knowledge Distillation and Enhanced Subdomain Adaptation Using Graph Convolutional Network for Resource-Constrained Bearing Fault Diagnosis | Mohammadreza Kavianpour et.al. | 2501.07173 | null |
2025-01-13 | Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection | ZhouRui Zhang et.al. | 2501.07101 | null |
2025-01-13 | Research on the Online Update Method for Retrieval-Augmented Generation (RAG) Model with Incremental Learning | Yuxin Fan et.al. | 2501.07063 | null |
2025-01-13 | Rethinking Knowledge in Distillation: An In-context Sample Retrieval Perspective | Jinjing Zhu et.al. | 2501.07040 | null |
2025-01-12 | Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Haoxiang Gao et.al. | 2501.06680 | null |
2025-01-10 | Tensorization of neural networks for improved privacy and interpretability | José Ramón Pareja Monturiol et.al. | 2501.06300 | link |
2025-01-10 | Merging Feed-Forward Sublayers for Compressed Transformers | Neha Verma et.al. | 2501.06126 | link |
2025-01-10 | Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation | Daowan Peng et.al. | 2501.05690 | null |
2025-01-09 | LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts | Yuri Facanha Bezerra et.al. | 2501.05554 | link |
2025-01-09 | Neural Architecture Codesign for Fast Physics Applications | Jason Weitz et.al. | 2501.05515 | link |
2025-01-09 | Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning | Laura Puccioni et.al. | 2501.05248 | null |
2025-01-08 | Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models | Miaoyang He et.al. | 2501.04582 | null |
2025-01-08 | Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions | Na Yan et.al. | 2501.04436 | null |
2025-01-08 | Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images | Yuze Wang et.al. | 2501.04283 | null |
2025-01-08 | UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles | Abhishek Balasubramaniam et.al. | 2501.04213 | null |
2025-01-10 | CURing Large Models: Compression via CUR Decomposition | Sanghyeon Park et.al. | 2501.04211 | null |
2025-01-08 | Generative Dataset Distillation Based on Self-knowledge Distillation | Longzhen Li et.al. | 2501.04202 | null |
2025-01-07 | FedKD-hybrid: Federated Hybrid Knowledge Distillation for Lithography Hotspot Detection | Yuqi Li et.al. | 2501.04066 | link |
2025-01-07 | A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving | Yi Zhang et.al. | 2501.03670 | link |
2025-01-07 | Effective and Efficient Mixed Precision Quantization of Speech Foundation Models | Haoning Xu et.al. | 2501.03643 | null |
2025-01-07 | ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting | Yifeng Yang et.al. | 2501.03605 | link |
2025-01-05 | Strategic Fusion Optimizes Transformer Compression | Md Shoaibur Rahman et.al. | 2501.03273 | null |
2025-01-07 | LightGNN: Simple Graph Neural Network for Recommendation | Guoxuan Chen et.al. | 2501.03228 | link |
2025-01-06 | Comprehensive Pathological Image Segmentation via Teacher Aggregation for Tumor Microenvironment Analysis | Daisuke Komura et.al. | 2501.02909 | null |
2025-01-06 | Knowledge Distillation with Adapted Weight | Sirong Wu et.al. | 2501.02705 | null |
2025-01-04 | Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison | Tsz Kin Lam et.al. | 2501.02370 | null |
2025-01-04 | V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection | Sichao Wang et.al. | 2501.02363 | link |
2025-01-04 | Optimizing Small Language Models for In-Vehicle Function-Calling | Yahya Sowti Khiabani et.al. | 2501.02342 | null |
2025-01-04 | KD-MSLRT: Lightweight Sign Language Recognition Model Based on Mediapipe and 3D to 1D Knowledge Distillation | Yulong Li et.al. | 2501.02321 | null |
2025-01-04 | Distillation-Enhanced Physical Adversarial Attacks | Wei Liu et.al. | 2501.02232 | null |
2025-01-03 | Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) for Passive Sonar Classification | Jarin Ritu et.al. | 2501.01921 | link |
2025-01-03 | MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders | Jiajun Cao et.al. | 2501.01709 | null |
2025-01-02 | DiagrammaticLearning: A Graphical Language for Compositional Training Regimes | Mason Lary et.al. | 2501.01515 | null |
2024-12-31 | Pan-infection Foundation Framework Enables Multiple Pathogen Prediction | Lingrui Zhang et.al. | 2501.01462 | null |
2025-01-01 | A Survey of Secure Semantic Communications | Rui Meng et.al. | 2501.00842 | null |
2025-01-01 | LENS-XAI: Redefining Lightweight and Explainable Network Security through Knowledge Distillation and Variational Autoencoders for Scalable Intrusion Detection in Cybersecurity | Muhammet Anil Yagiz et.al. | 2501.00790 | null |
2024-12-30 | Temporal reasoning for timeline summarisation in social media | Jiayu Song et.al. | 2501.00152 | null |
2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
2024-12-28 | Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems | Wen-Dong Jiang et.al. | 2412.20201 | null |
2024-12-28 | SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection | Phi Vu Tran et.al. | 2412.20047 | null |
2024-12-28 | Invariant debiasing learning for recommendation via biased imputation | Ting Bai et.al. | 2412.20036 | link |
2024-12-28 | Learning Adaptive and View-Invariant Vision Transformer with Multi-Teacher Knowledge Distillation for Real-Time UAV Tracking | You Wu et.al. | 2412.20002 | link |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | link |
2024-12-27 | Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models | Shuo Wang et.al. | 2412.19449 | null |
2024-12-26 | SpectralKD: Understanding and Optimizing Vision Transformer Distillation through Spectral Analysis | Huiyuan Tian et.al. | 2412.19055 | link |
2024-12-25 | Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models | Haowei Yang et.al. | 2412.18715 | null |
2024-12-23 | Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings | Harsh Joshi et.al. | 2412.18635 | null |
2024-12-24 | HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation | Mohammed Hamdan et.al. | 2412.18524 | null |
2024-12-24 | Understanding Artificial Neural Network's Behavior from Neuron Activation Perspective | Yizhou Zhang et.al. | 2412.18073 | null |
2024-12-23 | CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Chao Zeng et.al. | 2412.17560 | null |
2024-12-24 | Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement | Hyeonjin Kim et.al. | 2412.17387 | link |
2024-12-23 | Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction | Yuying Wang et.al. | 2412.17317 | null |
2024-12-23 | LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation | Riku Uemura et.al. | 2412.17282 | null |
2024-12-22 | Lightweight Design and Optimization methods for DCNNs: Progress and Futures | Hanhua Long et.al. | 2412.16886 | null |
2024-12-21 | Large Language Models Compression via Low-Rank Feature Distillation | Yaya Sy et.al. | 2412.16719 | null |
2024-12-21 | CyberSentinel: Efficient Anomaly Detection in Programmable Switch using Knowledge Distillation | Sankalp Mittal et.al. | 2412.16693 | null |
2024-12-21 | Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers | Yunshan Zhong et.al. | 2412.16553 | null |
2024-12-21 | STKDRec: Spatial-Temporal Knowledge Distillation for Takeaway Recommendation | Shuyuan Zhao et.al. | 2412.16502 | null |
2024-12-20 | BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models | Patrick Haller et.al. | 2412.15978 | null |
2024-12-20 | A New Method to Capturing Compositional Knowledge in Linguistic Space | Jiahe Wan et.al. | 2412.15632 | null |
2024-12-19 | Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation | Meghana Karri et.al. | 2412.15380 | null |
2024-12-19 | Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models | Reza Shirkavand et.al. | 2412.15341 | link |
2024-12-19 | Self-Evolution Knowledge Distillation for LLM-based Machine Translation | Yuncheng Song et.al. | 2412.15303 | null |
2024-12-19 | Adaptive Pruning for Large Language Models with Structural Importance Awareness | Haotian Zheng et.al. | 2412.15127 | null |
2024-12-19 | SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection | Ruoyu Xu et.al. | 2412.14571 | null |
2024-12-19 | Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models | Xiao Cui et.al. | 2412.14528 | link |
2024-12-19 | Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance | Sukrit Leelaluk et.al. | 2412.14526 | link |
2024-12-18 | A Survey on Inference Optimization Techniques for Mixture of Experts Models | Jiacheng Liu et.al. | 2412.14219 | link |
2024-12-18 | Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective | Zhiyuan Zeng et.al. | 2412.14135 | null |
2024-12-18 | On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process | Gereziher Adhane et.al. | 2412.13943 | null |
2024-12-18 | Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN | Pengxiang Li et.al. | 2412.13795 | link |
2024-12-18 | Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Kaiwen Huang et.al. | 2412.13742 | link |
2024-12-18 | On the Compression of Language Models for Code: An Empirical Study on CodeBERT | Giordano d'Aloisio et.al. | 2412.13737 | null |
2024-12-18 | Hybrid Data-Free Knowledge Distillation | Jialiang Tang et.al. | 2412.13525 | link |
2024-12-18 | Deploying Foundation Model Powered Agent Services: A Survey | Wenchao Xu et.al. | 2412.13437 | null |
2024-12-17 | In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning | Yifei Duan et.al. | 2412.13243 | null |
2024-12-17 | Modality-Inconsistent Continual Learning of Multimodal Large Language Models | Weiguo Pian et.al. | 2412.13050 | null |
2024-12-17 | Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation | Jiaqi Wang et.al. | 2412.12858 | null |
2024-12-17 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guanwenjie Zou et.al. | 2412.12603 | link |
2024-12-17 | PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | Kun Guo et.al. | 2412.12460 | link |
2024-12-16 | Neural Collapse Inspired Knowledge Distillation | Shuoxi Zhang et.al. | 2412.11788 | null |
2024-12-16 | Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Yingping Liang et.al. | 2412.11380 | link |
2024-12-16 | BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions | Wonyong Seo et.al. | 2412.11365 | null |
2024-12-15 | Wearable Accelerometer Foundation Models for Health via Knowledge Distillation | Salar Abbaspourazad et.al. | 2412.11276 | null |
2024-12-15 | TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs | Lanxiang Hu et.al. | 2412.11242 | null |
2024-12-15 | ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes | Pedro Miguel Sánchez Sánchez et.al. | 2412.11207 | null |
2024-12-15 | Leveraging Large Language Models for Active Merchant Non-player Characters | Byungjun Kim et.al. | 2412.11189 | link |
2024-12-15 | Knowledge Migration Framework for Smart Contract Vulnerability Detection | Luqi Wang et.al. | 2412.11175 | null |
2024-12-15 | Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection | Mohammadreza Salehi et.al. | 2412.11148 | link |
2024-12-17 | On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning | Pengfei Fang et.al. | 2412.11017 | null |
2024-12-13 | Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias | Jianhua Zhang et.al. | 2412.09874 | null |
2024-12-13 | ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression | Kai Yao et.al. | 2412.09812 | null |
2024-12-13 | LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering | Patrick Sutanto et.al. | 2412.09807 | null |
2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619 | null |
2024-12-12 | A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks | Saptarshi Mandal et.al. | 2412.09579 | null |
2024-12-12 | All You Need in Knowledge Distillation Is a Tailored Coordinate System | Junjie Zhou et.al. | 2412.09388 | null |
2024-12-12 | Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | Thanaphon Suwannaphong et.al. | 2412.09289 | null |
2024-12-15 | DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification | Kunlun Xu et.al. | 2412.09224 | link |
2024-12-12 | Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Xinyue Liu et.al. | 2412.08949 | link |
2024-12-12 | Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration | Yunshuai Zhou et.al. | 2412.08939 | link |
2024-12-11 | Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach | Xihua Zhu et.al. | 2412.08672 | null |
2024-12-11 | Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Jiaming Lv et.al. | 2412.08139 | null |
2024-12-11 | DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | Jaeho Moon et.al. | 2412.08116 | null |
2024-12-10 | Low-Rank Correction for Quantized LLMs | Meyer Scetbon et.al. | 2412.07902 | null |
2024-12-10 | Unlocking the Potential of Reverse Distillation for Anomaly Detection | Xinyue Liu et.al. | 2412.07579 | link |
2024-12-10 | TT-MPD: Test Time Model Pruning and Distillation | Haihang Wu et.al. | 2412.07114 | null |
2024-12-09 | FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering | Amirhossein Abaskohi et.al. | 2412.07030 | link |
2024-12-09 | VQ4ALL: Efficient Neural Network Representation via a Universal Codebook | Juncan Deng et.al. | 2412.06875 | null |
2024-12-09 | Compression for Better: A General and Stable Lossless Compression Framework | Boyang Zhang et.al. | 2412.06868 | null |
2024-12-09 | Lossless Model Compression via Joint Low-Rank Factorization Optimization | Boyang Zhang et.al. | 2412.06867 | null |
2024-12-08 | GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model | Haotong Yang et.al. | 2412.06849 | null |
2024-12-10 | Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks | Junhe Zhang et.al. | 2412.06414 | null |
2024-12-09 | U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening | Sungpyo Kim et.al. | 2412.06243 | null |
2024-12-08 | Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | Aymen Sekhri et.al. | 2412.06003 | null |
2024-12-07 | Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery | Ye Wang et.al. | 2412.05573 | null |
2024-12-07 | Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search | Boxun Xu et.al. | 2412.05505 | null |
2024-12-06 | BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | Wazib Ansar et.al. | 2412.05225 | null |
2024-12-06 | One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Junyuan Zhang et.al. | 2412.05186 | link |
2024-12-06 | CCS: Continuous Learning for Customized Incremental Wireless Sensing Services | Qunhang Fu et.al. | 2412.04821 | null |
2024-12-05 | Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation | Ali Abbasi et.al. | 2412.04668 | null |
2024-12-05 | FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning | Jiayu Liu et.al. | 2412.04521 | link |
2024-12-05 | Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer | Gaole Dai et.al. | 2412.04060 | null |
2024-12-04 | Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa et.al. | 2412.03682 | null |
2024-12-04 | Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective | Jon Gutiérrez-Zaballa et.al. | 2412.03630 | link |
2024-12-03 | CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models | Amitash Nanda et.al. | 2412.03599 | null |
2024-12-07 | Enhancing CLIP Conceptual Embedding through Knowledge Distillation | Kuei-Chun Kao et.al. | 2412.03513 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-03 | Efficient Model Compression Techniques with FishLeg | Jamie McGowan et.al. | 2412.02328 | null |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Qianhan Feng et.al. | 2412.01282 | link |
2024-12-02 | Reducing Inference Energy Consumption Using Dual Complementary CNNs | Michail Kinnas et.al. | 2412.01039 | link |
2024-12-01 | QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval | T. Y. S. S. Santosh et.al. | 2412.00934 | null |
2024-12-01 | Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps | Girmaw Abebe Tadesse et.al. | 2412.00777 | null |
2024-11-30 | Continuous Concepts Removal in Text-to-image Diffusion Models | Tingxu Han et.al. | 2412.00580 | null |
2024-11-30 | Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling | Peihao Dong et.al. | 2412.00562 | link |
2024-11-30 | Toward Fair Graph Neural Networks Via Dual-Teacher Knowledge Distillation | Chengyu Li et.al. | 2412.00382 | null |
2024-11-29 | Reverse Thinking Makes LLMs Stronger Reasoners | Justin Chih-Yao Chen et.al. | 2411.19865 | null |
2024-11-28 | Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG | Xinxu Wei et.al. | 2411.19230 | null |
2024-12-03 | Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Akhiad Bercovich et.al. | 2411.19146 | null |
2024-11-28 | Headache to Overstock? Promoting Long-tail Items through Debiased Product Bundling | Shuo Xu et.al. | 2411.19107 | null |
2024-11-28 | Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems | Mansi Rana et.al. | 2411.18980 | null |
2024-11-27 | Active Data Curation Effectively Distills Large-Scale Multimodal Models | Vishaal Udandarao et.al. | 2411.18674 | null |
2024-11-27 | Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models | Yiming Wu et.al. | 2411.18375 | null |
2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
2024-11-27 | Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery | Zhenyu Yu et.al. | 2411.17973 | null |
2024-11-26 | Attamba: Attending To Multi-Token States | Yash Akhauri et.al. | 2411.17685 | link |
2024-11-26 | Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation | Minh-Tuan Tran et.al. | 2411.17046 | null |
2024-11-26 | Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation | Shambhavi Mishra et.al. | 2411.17002 | link |
2024-11-25 | Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models | Yao Fu et.al. | 2411.16991 | null |
2024-11-25 | Leveraging Foundation Models To learn the shape of semi-fluid deformable objects | Omar El Assal et.al. | 2411.16802 | null |
2024-11-25 | O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? | Zhen Huang et.al. | 2411.16489 | link |
2024-11-25 | When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? | Srikrishna Iyer et.al. | 2411.16487 | link |
2024-11-25 | Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Yanan Wang et.al. | 2411.16196 | link |
2024-11-25 | Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics | Tian Bowen et.al. | 2411.16139 | null |
2024-11-25 | Ensemble Learning via Knowledge Transfer for CTR Prediction | Honghao Li et.al. | 2411.16122 | link |
2024-11-23 | Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models | Tianhao Chen et.al. | 2411.15525 | null |
2024-11-23 | Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance | Jiayi Chen et.al. | 2411.15438 | link |
2024-11-23 | Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning | Xiaoyu Gan et.al. | 2411.15403 | null |
2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | null |
2024-11-22 | RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency | Wentao Huang et.al. | 2411.15076 | null |
2024-11-22 | Adaptive Group Robust Ensemble Knowledge Distillation | Patrik Kenfack et.al. | 2411.14984 | null |
2024-11-25 | Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | Aniket Bhattacharyya et.al. | 2411.14957 | null |
2024-11-22 | Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers | Hongbo Liu et.al. | 2411.14789 | null |
2024-11-22 | Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation | Xunyu Zhu et.al. | 2411.14698 | null |
2024-11-21 | TaQ-DiT: Time-aware Quantization for Diffusion Transformers | Xinyan Liu et.al. | 2411.14172 | null |
2024-11-21 | DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Hexuan Deng et.al. | 2411.14055 | link |
2024-11-21 | Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference | Yunhui Liu et.al. | 2411.14035 | link |
2024-11-21 | CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition | Md Mahedi Hasan et.al. | 2411.13886 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-20 | FASTNav: Fine-tuned Adaptive Small-language-models Trained for Multi-point Robot Navigation | Yuxuan Chen et.al. | 2411.13262 | null |
2024-11-20 | Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning | Gang Zhao et.al. | 2411.13045 | null |
2024-11-19 | Puppet-CNN: Input-Adaptive Convolutional Neural Networks with Model Compression using Ordinary Differential Equation | Yucheng Xing et.al. | 2411.12876 | null |
2024-11-19 | Reward Modeling with Ordinal Feedback: Wisdom of the Crowd | Shang Liu et.al. | 2411.12843 | null |
2024-11-19 | What Makes a Good Dataset for Knowledge Distillation? | Logan Frank et.al. | 2411.12817 | null |
2024-11-19 | FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning | Qingsong Lv et.al. | 2411.12781 | link |
2024-11-19 | KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder | Maheswar Bora et.al. | 2411.12270 | null |
2024-11-19 | Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes | Rahul Garg et.al. | 2411.12174 | null |
2024-11-18 | Federated Incremental Named Entity Recognition | Duzhen Zhang et.al. | 2411.11623 | link |
2024-11-18 | Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms | Haizhou Ge et.al. | 2411.11406 | null |
2024-11-17 | Map-Free Trajectory Prediction with Map Distillation and Hierarchical Encoding | Xiaodong Liu et.al. | 2411.10961 | null |
2024-11-16 | Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting | Ebrahim Farahmand et.al. | 2411.10703 | link |
2024-11-16 | Multi-perspective Contrastive Logit Distillation | Qi Wang et.al. | 2411.10693 | null |
2024-11-16 | Exploring Feature-based Knowledge Distillation For Recommender System: A Frequency Perspective | Zhangchi Zhu et.al. | 2411.10676 | link |
2024-11-15 | Scaling Law for Post-training after Model Pruning | Xiaodong Chen et.al. | 2411.10272 | null |
2024-11-15 | Evidential Federated Learning for Skin Lesion Image Classification | Rutger Hendrix et.al. | 2411.10071 | null |
2024-11-14 | VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation | Xi Lin et.al. | 2411.09567 | null |
2024-11-14 | Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition | Zixing Zhang et.al. | 2411.09339 | null |
2024-11-14 | Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching | Yuran Wang et.al. | 2411.09151 | null |
2024-11-14 | Toward Democratized Generative AI in Next-Generation Mobile Edge Networks | Ruichen Zhang et.al. | 2411.09148 | null |
2024-11-13 | Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head | Penghui Yang et.al. | 2411.08937 | null |
2024-11-13 | UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Chengyuan Zhang et.al. | 2411.08569 | null |
2024-11-13 | Federated Graph Learning with Graphless Clients | Xingbo Fu et.al. | 2411.08374 | null |
2024-11-12 | Joint Diffusion models in Continual Learning | Paweł Skierś et.al. | 2411.08224 | null |
2024-11-12 | Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data | Juanhui Li et.al. | 2411.08028 | null |
2024-11-13 | Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models | Youan Cong et.al. | 2411.07820 | null |
2024-11-12 | ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Weibo Zhao et.al. | 2411.07762 | null |
2024-11-12 | Optimizing Traffic Signal Control using High-Dimensional State Representation and Efficient Deep Reinforcement Learning | Lawrence Francis et.al. | 2411.07759 | null |
2024-11-12 | ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao et.al. | 2411.07752 | null |
2024-11-12 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | link |
2024-11-13 | Feature Interaction Fusion Self-Distillation Network For CTR Prediction | Lei Sang et.al. | 2411.07508 | null |
2024-11-12 | Quantifying Knowledge Distillation Using Partial Information Decomposition | Pasan Dissanayake et.al. | 2411.07483 | null |
2024-11-11 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | link |
2024-11-11 | LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Runming Yang et.al. | 2411.06839 | null |
2024-11-11 | ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Jiawei Fan et.al. | 2411.06786 | link |
2024-11-11 | An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning | Dong Li et.al. | 2411.06659 | link |
2024-11-10 | CULL-MT: Compression Using Language and Layer pruning for Machine Translation | Pedram Rostami et.al. | 2411.06506 | null |
2024-11-10 | Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation | Yu-Liang Zhan et.al. | 2411.06448 | link |
2024-11-09 | Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification | Hongyu Chen et.al. | 2411.06023 | null |
2024-11-09 | Multi-hop RIS-aided Learning Model Sharing for Urban Air Mobility | Kai Xiong et.al. | 2411.06015 | null |
2024-11-08 | Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine | Anantha Sharma et.al. | 2411.05936 | null |
2024-11-08 | Asterisk: Keep it Simple* | Andrew Semenov et.al. | 2411.05691 | null |
2024-11-08 | Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles | Ayobami Adewale et.al. | 2411.05618 | null |
2024-11-08 | Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion | Nan Song et.al. | 2411.05544 | null |
2024-11-07 | ZipNN: Lossless Compression for AI Models | Moshik Hershcovitch et.al. | 2411.05239 | link |
2024-11-07 | Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo et.al. | 2411.05045 | null |
2024-11-06 | From Word Vectors to Multimodal Embeddings: Techniques, Applications, and Future Directions For Large Language Models | Charles Zhang et.al. | 2411.05036 | null |
2024-11-07 | Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers | Zhichao Geng et.al. | 2411.04403 | null |
2024-11-07 | GazeGen: Gaze-Driven User Interaction for Visual Content Generation | He-Yen Hsieh et.al. | 2411.04335 | null |
2024-11-06 | Towards Personalized Federated Learning via Comprehensive Knowledge Distillation | Pengju Wang et.al. | 2411.03569 | null |
2024-11-05 | Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy | Razvan-Gabriel Dumitru et.al. | 2411.03513 | link |
2024-11-05 | Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation | Francisco Giral et.al. | 2411.02975 | null |
2024-11-05 | Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery | Bowei Du et.al. | 2411.02861 | null |
2024-11-05 | Brewing Vodka: Distilling Pure Knowledge for Lightweight Threat Detection in Audit Logs | Weiheng Wu et.al. | 2411.02775 | null |
2024-11-05 | Multimodal Commonsense Knowledge Distillation for Visual Question Answering | Shuo Yang et.al. | 2411.02722 | null |
2024-11-04 | Information plane and compression-gnostic feedback in quantum machine learning | Nathan Haboury et.al. | 2411.02313 | null |
2024-11-04 | Training on the Test Model: Contamination in Ranking Distillation | Vishakha Suresh Kalal et.al. | 2411.02284 | link |
2024-11-03 | Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment | Chengting Yu et.al. | 2411.01547 | null |
2024-11-01 | On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance | Jaskirat Singh et.al. | 2411.00907 | null |
2024-11-01 | Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation | Bohan Lyu et.al. | 2411.00412 | null |
2024-11-01 | Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation | Arjun Ramesh Kaushik et.al. | 2411.00403 | null |
2024-11-01 | Efficient Model Compression for Bayesian Neural Networks | Diptarka Saha et.al. | 2411.00273 | null |
2024-10-31 | Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification | Thanh-Dung Le et.al. | 2411.00209 | link |
2024-10-31 | Mutual Information Preserving Neural Network Pruning | Charles Westphal et.al. | 2411.00147 | null |
2024-10-30 | Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation | Ahmed Akib Jawad Karim et.al. | 2411.00052 | null |
2024-10-30 | IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking | Run Luo et.al. | 2410.23907 | null |
2024-10-29 | ML Research Benchmark | Matthew Kenney et.al. | 2410.22553 | link |
2024-11-01 | Leveraging Recurrent Neural Networks for Predicting Motor Movements from Primate Motor Cortex Neural Recordings | Yuanxi Wang et.al. | 2410.22283 | null |
2024-10-28 | Unveiling Context-Aware Criteria in Self-Assessing LLMs | Taneesh Gupta et.al. | 2410.21545 | null |
2024-10-28 | Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications | Kemal Altwlkany et.al. | 2410.21478 | null |
2024-10-31 | LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Ge Yang et.al. | 2410.21352 | link |
2024-10-28 | EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation | Shih-Yang Liu et.al. | 2410.21271 | null |
2024-10-28 | Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study | Jiacheng Hu et.al. | 2410.20792 | null |
2024-10-28 | KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Rambod Azimi et.al. | 2410.20777 | link |
2024-10-28 | Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning | Bing Han et.al. | 2410.20775 | null |
2024-10-28 | Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | Sangmin Bae et.al. | 2410.20672 | null |
2024-10-27 | Uncovering Capabilities of Model Pruning in Graph Contrastive Learning | Wu Junran et.al. | 2410.20356 | null |
2024-10-25 | A Survey of Small Language Models | Chien Van Nguyen et.al. | 2410.20011 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-25 | SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models | Jahyun Koo et.al. | 2410.19503 | null |
2024-10-24 | Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts | Danyal Aftab et.al. | 2410.19185 | null |
2024-10-24 | AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Ziqi Liang et.al. | 2410.19134 | null |
2024-10-24 | High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws | M. Emrullah Ildiz et.al. | 2410.18837 | null |
2024-10-24 | Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data | Anup Shirgaonkar et.al. | 2410.18588 | null |
2024-10-24 | SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning | Shivam Adarsh et.al. | 2410.18574 | link |
2024-10-23 | ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams | Srija Anand et.al. | 2410.17901 | null |
2024-10-23 | Beware of Calibration Data for Pruning Large Language Models | Yixin Ji et.al. | 2410.17711 | null |
2024-10-23 | Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need | Jon Irureta et.al. | 2410.17648 | null |
2024-10-23 | Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation | Muquan Li et.al. | 2410.17606 | link |
2024-10-23 | Multimodal Information Bottleneck for Deep Reinforcement Learning with Multiple Sensors | Bang You et.al. | 2410.17551 | null |
2024-10-23 | Physics-driven AI for Channel Estimation in Cellular Network | Xiaoqian Qi et.al. | 2410.17525 | null |
2024-10-22 | MiniPLM: Knowledge Distillation for Pre-Training Language Models | Yuxian Gu et.al. | 2410.17215 | link |
2024-10-22 | Self-calibration for Language Model Quantization and Pruning | Miles Williams et.al. | 2410.17170 | null |
2024-10-22 | DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Haowei Zhu et.al. | 2410.16942 | null |
2024-10-22 | Mitigating Vanishing Activations in Deep CapsNets Using Channel Pruning | Siddharth Sahu et.al. | 2410.16908 | link |
2024-10-22 | CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare | Nicholas I-Hsien Kuo et.al. | 2410.16872 | null |
2024-10-22 | AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Yongjian Wu et.al. | 2410.16820 | link |
2024-10-22 | SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation | Jing-Jing Li et.al. | 2410.16665 | null |
2024-10-21 | Pre-training Distillation for Large Language Models: A Design Space Exploration | Hao Peng et.al. | 2410.16215 | null |
2024-10-18 | Interpreting Microbiome Relative Abundance Data Using Symbolic Regression | Swagatam Haldar et.al. | 2410.16109 | link |
2024-10-21 | Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples | Kirill Lukyanov et.al. | 2410.15889 | null |
2024-10-20 | GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Haiwen Diao et.al. | 2410.15266 | link |
2024-10-19 | LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound | Xuechen Guo et.al. | 2410.15074 | null |
2024-10-19 | Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS | Tuan Nam Nguyen et.al. | 2410.14997 | null |
2024-10-18 | EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search | Oliver Sieberling et.al. | 2410.14649 | link |
2024-10-18 | Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Shuai Zhao et.al. | 2410.14425 | link |
2024-10-18 | Preview-based Category Contrastive Learning for Knowledge Distillation | Muhe Ding et.al. | 2410.14143 | null |
2024-10-17 | Leveraging Fine-Tuned Language Models for Efficient and Accurate Smart Contract Auditing | Zhiyuan Wei et.al. | 2410.13918 | link |
2024-10-17 | An Active Learning Framework for Inclusive Generation by Large Language Models | Sabit Hassan et.al. | 2410.13641 | null |
2024-10-18 | Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach | Luyao Zou et.al. | 2410.13602 | null |
2024-10-18 | Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach | Luyao Zou et.al. | 2410.13260 | null |
2024-10-16 | TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant | Guopeng Li et.al. | 2410.12342 | null |
2024-10-16 | Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Guanming Huang et.al. | 2410.12259 | null |
2024-10-16 | TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration | Yiwei Guo et.al. | 2410.12183 | link |
2024-10-17 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et.al. | 2410.12158 | null |
2024-10-15 | MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router | Yanyue Xie et.al. | 2410.12013 | null |
2024-10-15 | Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation | Andong Lu et.al. | 2410.11586 | link |
2024-10-15 | Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL | Qihuang Zhong et.al. | 2410.11371 | null |
2024-10-15 | Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling | Wenda Xu et.al. | 2410.11325 | null |
2024-10-14 | ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Martin Aubard et.al. | 2410.10554 | link |
2024-10-14 | QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models | Zhumazhan Balapanov et.al. | 2410.10318 | link |
2024-10-14 | Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation | Siru Ouyang et.al. | 2410.10141 | null |
2024-10-15 | Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices | Xiaoyu Xia et.al. | 2410.10128 | link |
2024-10-14 | REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Zhiyun Song et.al. | 2410.10097 | null |
2024-10-12 | SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs | Mohammad Mozaffari et.al. | 2410.09615 | link |
2024-10-12 | Distilling Invariant Representations with Dual Augmentation | Nikolaos Giakoumoglou et.al. | 2410.09474 | null |
2024-10-12 | Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets | Thomas Eiter et.al. | 2410.09428 | link |
2024-10-15 | Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI | Muhammet Anil Yagiz et.al. | 2410.09043 | null |
2024-10-11 | Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee et.al. | 2410.09037 | link |
2024-10-11 | Contrastive Knowledge Distillation for Robust Multimodal Sentiment Analysis | Zhongyi Sang et.al. | 2410.08692 | null |
2024-10-11 | GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning | Yubo Peng et.al. | 2410.08634 | null |
2024-10-11 | Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both | Abhijnan Nath et.al. | 2410.08458 | null |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-10 | Non-transferable Pruning | Ruyi Ding et.al. | 2410.08015 | null |
2024-10-10 | A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways | Jing Su et.al. | 2410.07915 | null |
2024-10-10 | SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks | Haiyang Wang et.al. | 2410.07857 | link |
2024-10-12 | Relational Diffusion Distillation for Efficient Image Generation | Weilun Feng et.al. | 2410.07679 | link |
2024-10-10 | CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression | Wenyuan Liu et.al. | 2410.07505 | null |
2024-10-09 | Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing | Ismail Erbas et.al. | 2410.07364 | null |
2024-10-09 | S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning | Weihao Lin et.al. | 2410.07046 | null |
2024-10-09 | Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation | Runze Chen et.al. | 2410.06982 | null |
2024-10-09 | Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Wenqi Niu et.al. | 2410.06561 | null |
2024-10-08 | SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching | Tianyi Zhang et.al. | 2410.06364 | null |
2024-10-08 | QT-DoG: Quantization-aware Training for Domain Generalization | Saqib Javed et.al. | 2410.06020 | link |
2024-10-10 | KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Wenhao Wang et.al. | 2410.05725 | link |
2024-10-07 | Progressive distillation induces an implicit curriculum | Abhishek Panigrahi et.al. | 2410.05464 | null |
2024-10-07 | ESPACE: Dimensionality Reduction of Activations for Model Compression | Charbel Sakr et.al. | 2410.05437 | null |
2024-10-07 | ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation | Yuelyu Ji et.al. | 2410.05168 | null |
2024-10-06 | CAPEEN: Image Captioning with Early Exits and Knowledge Distillation | Divya Jyoti Bajpai et.al. | 2410.04433 | link |
2024-10-06 | DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs | Divya Jyoti Bajpai et.al. | 2410.04424 | link |
2024-10-05 | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Jianze Li et.al. | 2410.04224 | link |
2024-10-05 | Accelerating Diffusion Models with One-to-Many Knowledge Distillation | Linfeng Zhang et.al. | 2410.04191 | null |
2024-10-05 | DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech | Dominika Woszczyk et.al. | 2410.04188 | null |
2024-10-05 | Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher | Yong Guo et.al. | 2410.04140 | null |
2024-10-04 | Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models | Zhuochun Li et.al. | 2410.03663 | null |
2024-10-04 | DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | Sungnyun Kim et.al. | 2410.03061 | null |
2024-10-03 | Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models | Mingxue Xu et.al. | 2410.03040 | null |
2024-10-03 | Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks | Siddharth Joshi et.al. | 2410.02116 | link |
2024-10-02 | Review Non-convex Optimization Method for Machine Learning | Greg B Fotopoulos et.al. | 2410.02017 | null |
2024-10-02 | PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation | Mike Ranzinger et.al. | 2410.01680 | null |
2024-10-04 | HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models | Seanie Lee et.al. | 2410.01524 | link |
2024-10-02 | Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks | Edan Kinderman et.al. | 2410.01483 | link |
2024-10-02 | PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Chao-Wei Huang et.al. | 2410.01383 | link |
2024-10-02 | "No Matter What You Do!": Mitigating Backdoor Attacks in Graph Neural Networks | Jiale Zhang et.al. | 2410.01272 | link |
2024-10-01 | Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Ismail Erbas et.al. | 2410.00948 | null |
2024-10-01 | Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | Mostafa Hajighasemloua et.al. | 2410.00779 | null |
2024-10-01 | Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation | Jiyoon Myung et.al. | 2410.00683 | null |
2024-10-01 | AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Ziyang Luo et.al. | 2410.00558 | link |
2024-10-01 | Self-Updatable Large Language Models with Parameter Integration | Yu Wang et.al. | 2410.00487 | null |
2024-09-30 | Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation | Vlad-Cristian Matei et.al. | 2409.20498 | null |
2024-10-02 | Linear Projections of Teacher Embeddings for Few-Class Distillation | Noel Loo et.al. | 2409.20449 | null |
2024-09-30 | Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | Shalini Sarode et.al. | 2409.20237 | null |
2024-09-30 | Aggressive Post-Training Compression on Extremely Large Language Models | Zining Zhang et.al. | 2409.20094 | null |
2024-10-01 | HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning | Momin Ahmad Khan et.al. | 2409.19912 | null |
2024-09-29 | Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation | Huidong Tang et.al. | 2409.19741 | null |
2024-09-29 | InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries | Mengze Hong et.al. | 2409.19689 | null |
2024-09-28 | Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training | Pihe Hu et.al. | 2409.19391 | null |
2024-09-28 | Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment | Tianyi Liu et.al. | 2409.19366 | null |
2024-09-27 | Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models | Shihua Qin et.al. | 2409.19185 | null |
2024-09-27 | MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | Junyou Zhu et.al. | 2409.18800 | null |
2024-09-27 | Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation | Chaomin Shen et.al. | 2409.18785 | null |
2024-09-27 | Harmonizing knowledge Transfer in Neural Network with Unified Distillation | Yaomin Huang et.al. | 2409.18565 | null |
2024-09-27 | Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Mahdi Morafah et.al. | 2409.18461 | link |
2024-09-26 | EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation | Jiaxiang Tang et.al. | 2409.18114 | null |
2024-09-26 | Weak-To-Strong Backdoor Attacks for LLMs with Contrastive Knowledge Distillation | Shuai Zhao et.al. | 2409.17946 | null |
2024-09-26 | Kendall's $\tau$ Coefficient for Logits Distillation | Yuchen Guan et.al. | 2409.17823 | null |
2024-09-26 | General Compression Framework for Efficient Transformer Object Tracking | Lingyi Hong et.al. | 2409.17564 | null |
2024-09-26 | Shape-intensity knowledge distillation for robust medical image segmentation | Wenhui Dong et.al. | 2409.17503 | link |
2024-09-25 | Search for Efficient Large Language Models | Xuan Shen et.al. | 2409.17372 | link |
2024-09-25 | MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | Xiaoyu Yang et.al. | 2409.17010 | null |
2024-09-25 | Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation | Hanyu Zhou et.al. | 2409.17001 | null |
2024-09-25 | SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling | Laurent Dillard et.al. | 2409.16581 | null |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-25 | Privacy Evaluation Benchmarks for NLP Models | Wei Huang et.al. | 2409.15868 | link |
2024-09-24 | Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization | Lucas Deckers et.al. | 2409.15849 | null |
2024-09-23 | TS-TCD: Triplet-Level Cross-Modal Distillation for Time-Series Forecasting Using Large Language Models | Pengfei Wang et.al. | 2409.14978 | null |
2024-09-23 | DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models | Sangyeon Cho et.al. | 2409.14904 | link |
2024-09-23 | Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation | Li Li et.al. | 2409.14810 | null |
2024-09-23 | An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding | Wei-Bin Kou et.al. | 2409.14737 | null |
2024-09-18 | Applications of Knowledge Distillation in Remote Sensing: A Survey | Yassine Himeur et.al. | 2409.12111 | null |
2024-09-18 | Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | Jin Jie Sean Yeo et.al. | 2409.11964 | null |
2024-09-18 | Distillation-free Scaling of Large SSMs for Images and Videos | Hamid Suleman et.al. | 2409.11867 | null |
2024-09-18 | EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis | Shaojie Li et.al. | 2409.11817 | null |
2024-09-18 | RUIE: Retrieval-based Unified Information Extraction using Large Language Model | Xincheng Liao et.al. | 2409.11673 | link |
2024-09-17 | Time-Series Forecasting, Knowledge Distillation, and Refinement within a Multimodal PDE Foundation Model | Derek Jollie et.al. | 2409.11609 | link |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation | Gerard I. Gállego et.al. | 2409.11003 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | link |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-15 | ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration | Ning-Chi Huang et.al. | 2409.09708 | null |
2024-09-14 | Effective Pre-Training of Audio Transformers for Sound Event Detection | Florian Schmid et.al. | 2409.09546 | link |
2024-09-14 | Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification | Wenhao Yang et.al. | 2409.09389 | null |
2024-09-14 | Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility | Xiaoyu Liu et.al. | 2409.09357 | null |
2024-09-13 | Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection | Dixi Yao et.al. | 2409.08858 | null |
2024-09-13 | An Efficient Privacy-aware Split Learning Framework for Satellite Communications | Jianfei Sun et.al. | 2409.08538 | null |
2024-09-13 | AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation | Zechao Sun et.al. | 2409.08516 | null |
2024-09-12 | DiReDi: Distillation and Reverse Distillation for AIoT Applications | Chen Sun et.al. | 2409.08308 | null |
2024-09-12 | Ruri: Japanese General Text Embeddings | Hayato Tsukagoshi et.al. | 2409.07737 | link |
2024-09-12 | Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios | Xinlei Huang et.al. | 2409.07694 | null |
2024-09-11 | DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis | Ke Chen et.al. | 2409.07584 | null |
2024-09-11 | EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data | Grégoire Petit et.al. | 2409.07566 | link |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | Enhancing CTC-Based Visual Speech Recognition | Hendrik Laux et.al. | 2409.07210 | null |
2024-09-11 | A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption | Marcus Rüb et.al. | 2409.07114 | null |
2024-09-11 | Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | Kangyang Luo et.al. | 2409.06955 | null |
2024-09-10 | Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study | Ilias Siniosoglou et.al. | 2409.06904 | null |
2024-09-10 | EasyST: A Simple Framework for Spatio-Temporal Prediction | Jiabin Tang et.al. | 2409.06748 | link |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-10 | Knowledge Distillation via Query Selection for Detection Transformer | Yi Liu et.al. | 2409.06443 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | Enhancing Long Video Understanding via Hierarchical Event-Based Memory | Dingxin Cheng et.al. | 2409.06299 | null |
2024-09-09 | Joint Input and Output Coordination for Class-Incremental Learning | Shuai Wang et.al. | 2409.05620 | null |
2024-09-09 | LEROjD: Lidar Extended Radar-Only Object Detection | Patrick Palmer et.al. | 2409.05564 | link |
2024-09-09 | Federated Transfer Learning Based Cooperative Wideband Spectrum Sensing with Model Pruning | Jibin Jia et.al. | 2409.05462 | null |
2024-09-09 | Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition | Shiming Ge et.al. | 2409.05384 | null |
2024-09-09 | Application Specific Compression of Deep Learning Models | Rohit Raj Rai et.al. | 2409.05368 | link |
2024-09-09 | FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data | Rasoul Jafari Gohari et.al. | 2409.05359 | link |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-07 | LoCa: Logit Calibration for Knowledge Distillation | Runming Yang et.al. | 2409.04778 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-05 | Experimentation in Content Moderation using RWKV | Umut Yildirim et.al. | 2409.03939 | null |
2024-09-05 | DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture | Qianlong Xiang et.al. | 2409.03550 | link |
2024-09-05 | Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration | Pei Wang et.al. | 2409.03455 | null |
2024-09-05 | Efficient Image Compression Using Advanced State Space Models | Bouzid Arezki et.al. | 2409.02743 | null |
2024-09-04 | CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation | Minhee Cho et.al. | 2409.02699 | null |
2024-09-04 | Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation | Kangkai Zhang et.al. | 2409.02555 | null |
2024-09-04 | A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing | Davi Rodrigues et.al. | 2409.02528 | null |
2024-09-04 | Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation | Yilong Chen et.al. | 2409.02438 | null |
2024-09-03 | Low-Resolution Face Recognition via Adaptable Instance-Relation Distillation | Ruixin Shi et.al. | 2409.02049 | null |
2024-09-03 | Foundations of Large Language Model Compression -- Part 1: Weight Quantization | Sean I. Young et.al. | 2409.02026 | link |
2024-09-03 | Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique | Qiang Zheng et.al. | 2409.02020 | null |
2024-09-03 | Contemporary Model Compression on Large Language Models Inference | Dong Liu et.al. | 2409.01990 | link |
2024-09-03 | Adaptive Explicit Knowledge Transfer for Knowledge Distillation | Hyungkeun Park et.al. | 2409.01679 | null |
2024-08-30 | How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition | Pedro C. Neto et.al. | 2408.17399 | link |
2024-08-30 | HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution | Masoomeh Aslahishahri et.al. | 2408.16959 | link |
2024-08-29 | VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition | Zaiwei Zhang et.al. | 2408.16930 | null |
2024-08-29 | Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling | Hritik Bansal et.al. | 2408.16737 | null |
2024-08-29 | MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition | Eduarda Caldeira et.al. | 2408.16563 | link |
2024-08-29 | Convolutional Neural Network Compression Based on Low-Rank Decomposition | Yaping He et.al. | 2408.16289 | null |
2024-08-28 | LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Fangxun Shu et.al. | 2408.15881 | link |
2024-08-28 | ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation | Tiantian Feng et.al. | 2408.15803 | null |
2024-08-28 | Online pre-training with long-form videos | Itsuki Kato et.al. | 2408.15651 | null |
2024-08-28 | Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation | Lujun Gui et.al. | 2408.15562 | null |
2024-08-27 | Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification | Yiqiang Cai et.al. | 2408.14862 | link |
2024-08-27 | Learning effective pruning at initialization from iterative pruning | Shengkai Liu et.al. | 2408.14757 | link |
2024-08-26 | Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems | Nikhil Khani et.al. | 2408.14678 | null |
2024-08-25 | Variational autoencoder-based neural network model compression | Liang Cheng et.al. | 2408.14513 | null |
2024-08-26 | TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines | Hymalai Bello et.al. | 2408.14146 | null |
2024-08-27 | GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets | Sven Oehri et.al. | 2408.14131 | link |
2024-08-26 | Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection | Yuncheng Jiang et.al. | 2408.14051 | null |
2024-08-25 | Condensed Sample-Guided Model Inversion for Knowledge Distillation | Kuluhan Binici et.al. | 2408.13850 | null |
2024-08-25 | Bring the Power of Diffusion Model to Defect Detection | Xuyi Yu et.al. | 2408.13845 | null |
2024-08-24 | Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic | Yifei He et.al. | 2408.13656 | link |
2024-08-24 | MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning | Seungbeom Hu et.al. | 2408.13482 | null |
2024-08-23 | Growing Deep Neural Network Considering with Similarity between Neurons | Taigo Sakai et.al. | 2408.13291 | null |
2024-08-23 | Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption | Sakhinana Sagar Srinivas et.al. | 2408.13248 | null |
2024-08-23 | A Web-Based Solution for Federated Learning with LLM-Based Automation | Chamith Mawela et.al. | 2408.13010 | null |
2024-08-23 | A Survey on Drowsiness Detection -- Modern Applications and Methods | Biying Fu et.al. | 2408.12990 | null |
2024-08-22 | Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers | Sayed Mohammad Vakilzadeh Hatefi et.al. | 2408.12568 | link |
2024-08-22 | Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models | Meiyun Wang et.al. | 2408.12326 | link |
2024-08-22 | Rebalancing Multi-Label Class-Incremental Learning | Kaile Du et.al. | 2408.12161 | null |
2024-08-22 | Vision-Based Detection of Uncooperative Targets and Components on Small Satellites | Hannah Grauer et.al. | 2408.12084 | null |
2024-08-22 | Aligning (Medical) LLMs for (Counterfactual) Fairness | Raphael Poulain et.al. | 2408.12055 | link |
2024-08-22 | LAKD-Activation Mapping Distillation Based on Local Learning | Yaoze Zhang et.al. | 2408.11478 | null |
2024-08-21 | A Practical Trigger-Free Backdoor Attack on Neural Networks | Jiahao Wang et.al. | 2408.11444 | null |
2024-08-21 | Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Guo Pu et.al. | 2408.11413 | link |
2024-08-21 | Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection | Liang Yao et.al. | 2408.11407 | null |
2024-08-21 | A Unified Framework for Continual Learning and Machine Unlearning | Romit Chatterjee et.al. | 2408.11374 | null |
2024-08-20 | SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection | Huafeng Chen et.al. | 2408.10760 | null |
2024-08-20 | Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation | Md Fahim Sikder et.al. | 2408.10755 | null |
2024-08-20 | Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches | Yanjie Dong et.al. | 2408.10691 | null |
2024-08-20 | LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models | Yupeng Su et.al. | 2408.10631 | link |
2024-08-20 | Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers | Thanh Thi Nguyen et.al. | 2408.10503 | null |
2024-08-19 | Transferring Backdoors between Large Language Models by Knowledge Distillation | Pengzhou Cheng et.al. | 2408.09878 | link |
2024-08-20 | MoDeGPT: Modular Decomposition for Large Language Model Compression | Chi-Heng Lin et.al. | 2408.09632 | null |
2024-08-18 | MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment | Tianyi Liu et.al. | 2408.09465 | null |
2024-08-18 | CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination | Kaicheng Yang et.al. | 2408.09441 | null |
2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
2024-08-17 | RepControlNet: ControlNet Reparameterization | Zhaoli Deng et.al. | 2408.09240 | null |
2024-08-16 | Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition | Muhammad Haseeb Aslam et.al. | 2408.09035 | link |
2024-08-16 | Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase | Yicong Li et.al. | 2408.08684 | null |
2024-08-16 | ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Chao Zeng et.al. | 2408.08554 | link |
2024-08-15 | Computer Vision Model Compression Techniques for Embedded Systems: A Survey | Alexandre Lopes et.al. | 2408.08250 | link |
2024-08-15 | MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU | Yan Li et.al. | 2408.08144 | null |
2024-08-19 | Knowledge Distillation with Refined Logits | Wujie Sun et.al. | 2408.07703 | link |
2024-08-14 | FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher | Alessio Mora et.al. | 2408.07587 | null |
2024-08-14 | Towards Real-time Video Compressive Sensing on Mobile Devices | Miao Cao et.al. | 2408.07530 | link |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-14 | Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Zhonglin Chen et.al. | 2408.07455 | null |
2024-08-13 | Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach | Tong Wang et.al. | 2408.07238 | null |
2024-08-15 | An Event Structure-aware Generative Model for Biomedical Event Extraction | Haohan Yuan et.al. | 2408.06583 | null |
2024-08-12 | Optimizing Vision Transformers with Data-Free Knowledge Transfer | Gousia Habib et.al. | 2408.05952 | null |
2024-08-11 | Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation | Xiaoxiong Zhang et.al. | 2408.05748 | null |
2024-08-11 | Efficient Federated Learning Using Dynamic Update and Adaptive Pruning with Momentum on Shared Server Data | Ji Liu et.al. | 2408.05678 | null |
2024-08-08 | LaDiMo: Layer-wise Distillation Inspired MoEfier | Sungyoon Kim et.al. | 2408.04278 | null |
2024-08-08 | Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement | Runduo Han et.al. | 2408.04267 | null |
2024-08-14 | ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model | Yifan Chen et.al. | 2408.04145 | null |
2024-08-07 | AdapMTL: Adaptive Pruning Framework for Multitask Learning Model | Mingcan Xiang et.al. | 2408.03913 | null |
2024-08-07 | Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection | Xinyue Liu et.al. | 2408.03888 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Iterative Knowledge Distillation through Feedback-Driven Learning Cycles | Yujia Chen et.al. | 2408.03680 | null |
2024-08-07 | Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Zhongyao Luo et.al. | 2408.03647 | link |
2024-08-07 | Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation | Feng Zhou et.al. | 2408.03616 | link |
2024-08-06 | EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures | Teng Liang et.al. | 2408.03449 | link |
2024-08-06 | DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers | Lianwei Yang et.al. | 2408.03291 | null |
2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
2024-08-06 | Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization | Yanghai Zhang et.al. | 2408.03149 | link |
2024-08-06 | Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations | Leo Donisch et.al. | 2408.03130 | null |
2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | link |
2024-08-06 | VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation | Ju-Hyeon Nam et.al. | 2408.02888 | null |
2024-08-05 | An approach to optimize inference of the DIART speaker diarization pipeline | Roman Aperdannier et.al. | 2408.02341 | null |
2024-08-05 | Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution | Hojung Lee et.al. | 2408.02307 | link |
2024-08-05 | Unsupervised Domain Adaption Harnessing Vision-Language Pre-training | Wenlve Zhou et.al. | 2408.02192 | link |
2024-08-03 | Joint Model Pruning and Resource Allocation for Wireless Time-triggered Federated Learning | Xinlu Zhang et.al. | 2408.01765 | null |
2024-08-02 | An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression | Shiyi Luo et.al. | 2408.01534 | null |
2024-08-02 | Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning | Lu Yu et.al. | 2408.01076 | link |
2024-08-02 | Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs | Afia Anjum et.al. | 2408.01008 | null |
2024-08-01 | DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects | Yiheng Huang et.al. | 2408.00337 | null |
2024-08-01 | Clover-2: Accurate Inference for Regressive Lightweight Speculative Decoding | Bin Xiao et.al. | 2408.00264 | null |
2024-08-01 | Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | Kohei Matsuura et.al. | 2408.00205 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-08-02 | Gemma 2: Improving Open Language Models at a Practical Size | Gemma Team et.al. | 2408.00118 | null |
2024-07-31 | Dynamic Object Queries for Transformer-based Incremental Object Detection | Jichuan Zhang et.al. | 2407.21687 | null |
2024-07-31 | Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins | Lukas Gienapp et.al. | 2407.21515 | null |
2024-07-31 | VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning | Yuhang Ming et.al. | 2407.21416 | null |
2024-07-31 | Lifelong Person Search | Jae-Won Yang et.al. | 2407.21252 | null |
2024-07-29 | SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation | Chakkrit Termritthikun et.al. | 2407.20062 | link |
2024-07-29 | ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality | Guoliang Xu et.al. | 2407.19820 | null |
2024-07-29 | Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices | Hayun Lee et.al. | 2407.19644 | null |
2024-07-28 | Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Mohammed Al-Maamari et.al. | 2407.19610 | link |
2024-07-28 | Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Knowledge Distillation and Random Data Erasing | Heejoon Koo et.al. | 2407.19540 | link |
2024-07-28 | LLAVADI: What Matters For Multimodal Large Language Models Distillation | Shilin Xu et.al. | 2407.19409 | null |
2024-07-28 | Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | Dong Chen et.al. | 2407.19405 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-26 | Automatic Detection of Moral Values in Music Lyrics | Vjosa Preniqi et.al. | 2407.18787 | link |
2024-07-26 | Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers | Longkun Zou et.al. | 2407.18534 | link |
2024-07-26 | FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction | Wentao Ouyang et.al. | 2407.18472 | null |
2024-07-26 | Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation | Jiabo Ma et.al. | 2407.18449 | null |
2024-07-25 | Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT | Niels G. Faber et.al. | 2407.18288 | link |
2024-07-25 | Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang et.al. | 2407.18248 | link |
2024-07-25 | How to Train the Teacher Model for Effective Knowledge Distillation | Shayan Mohajer Hamidi et.al. | 2407.18041 | link |
2024-07-25 | Peak-Controlled Logits Poisoning Attack in Federated Distillation | Yuhan Tang et.al. | 2407.18039 | null |
2024-07-25 | Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach | Kangil Lee et.al. | 2407.17909 | null |
2024-07-25 | NC-NCD: Novel Class Discovery for Node Classification | Yue Hou et.al. | 2407.17816 | link |
2024-07-24 | CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis | Muhammad Alberb et.al. | 2407.17620 | link |
2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | null |
2024-07-23 | Strike a Balance in Continual Panoptic Segmentation | Jinpeng Chen et.al. | 2407.16354 | link |
2024-07-23 | OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection | Fan Cui et.al. | 2407.16237 | link |
2024-07-23 | DDK: Distilling Domain Knowledge for Efficient Large Language Models | Jiaheng Liu et.al. | 2407.16154 | null |