Poster: Time-Efficient Sparse and Lightweight Adaptation for Real-Time Mobile Applications
We propose TESLA Time-Efficient Sparse and Lightweight Adaptation strategy for real-time mobile applications, which skips adaptation for specific batches to increase the inference sample rate. Our method balances model accuracy and inference speed by accumulating domain-informative samples from non-adapted batches and sparsely adapting them.
Jun 3, 2024