Built upon Llama-3.1 70B, NANDA 87B has been trained on a curated Hindi-English dataset with over 65 billion Hindi tokens. A custom Hindi-centric tokenizer boosts efficiency, reducing both training ...
G42 has released NANDA 87B, an open-source Hindi–English large language model developed with MBZUAI, Inception, and Cerebras.