-
Object-Guided Visual Tokens:
Eliciting Compositional Reasoning
in Multimodal Language ModelsAddressing shortcomings of MLLMs in Compositional Reasoning through Segmentation
-
Confidently_Exiting/blogpost.md at main · joanvelja/Confidently_Exiting · GitHub
Optimizing Predictions: Vocabulary Reduction and Contrastive Decoding in LLMs. Work done as a reserch project for the MSc AI at the University of Amsterdam - Confidently_Exiting/blogpost.md at main · joanvelja/Confidently_Exiting