Text, Conversation, Code generation, Translation, Reasoning and understanding
Meta Llama 3.1 70B Instruct represents a significant advancement over its predecessor, incorporating an extended 128K context window that allows for handling larger and more complex text inputs. This upgrade enhances its capabilities in multilingual contexts and improves its overall reasoning and understanding.
The Llama 3.1 series, which includes models in 8B, 70B, and 405B sizes, offers a range of generative models tailored for diverse applications. The instruction-tuned variants are particularly effective in multilingual dialogue tasks, showcasing superior performance compared to many current open-source chat models on key industry benchmarks. These models are suitable for various uses, from conversational AI to natural language generation.
Designed for both commercial and research applications, Llama 3.1 models support a wide range of natural language generation tasks, including synthetic data creation and model refinement. Leveraging an optimized transformer architecture, these models benefit from supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF), ensuring alignment with human preferences for effectiveness and safety.
Below you will find all supported platforms and the related CogniTech AI Credits costs.
Region | Context | TPM | RPM | Input Credits | Output Credits |
---|---|---|---|---|---|
Region | Context | TPM | RPM | Input Credits | Output Credits |
us-west-2 | 128,000 | 300,000 | 400 | 0.1287 / 1000 tokens | 0.1287 / 1000 tokens |