Building a Vision Mixture-of-Expert Model from several fine-tuned Phi-3-Vision Models • Jun 12, 2024
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Paper • 2501.02393 • Published 8 days ago