TLDR: Covenant-72B scored 67.1 on MMLU zero-shot, beating LLaMA-2-70B’s 65.6 under identical test conditions. SparseLoCo reduced communication overhead by 146x using sparsification, 2-bit quantization, and error feedback across nodes. Gauntlet scored every node’s contribution via loss evaluation and OpenSkill…
Source link



Be the first to comment