Nov 05, 2024 Model Compression for Machine Translation in Large Language Models Oct 26, 2024 Optimizing Predictions: Vocabulary Reduction and Contrastive Decoding in LLMs