Blog

how-15-top-llms-perform-on-classification-accuracy-vs-cost-breakdown

How 15 top LLMs perform on classification: accuracy vs. cost breakdown

In this post, we explore and compare how LLMs perform on classification tasks.

Idriss Chebak·10/15/2024
how-openai-o1-changes-the-llm-training-picture-part-2

How OpenAI's o1 changes the LLM training picture - Part 2

What makes o1 different from previous state-of-the-art LLMs?

Josh Bauer·10/2/2024
how-openai-o1-changes-the-llm-training-picture-part-1

How OpenAI's o1 changes the LLM training picture - Part 1

What makes o1 different from previous state-of-the-art LLMs?

Josh Bauer·9/27/2024
introducing-ai-classification-and-labeling

Introducing AI-powered classification and labeling

We are launching a new feature to help you label and classify your unstructured datasets.

Emmanuel Turlay·9/13/2024
fineweb-edu-fortified

Introducing Fineweb-Edu-Fortified, an open dataset of high-quality educational web content

The finest collection of educational content the web has to offer, deduplicated and augmented with embeddings.

Josh Bauer·8/14/2024
airtrain-pro-a-comprehensive-toolkit-for-advanced-data-exploration

Airtrain PRO: A Comprehensive Toolkit For Advanced Data Exploration

Learn more about Airtrain PRO, a subscription plan built for data professionals.

Emmanuel Turlay·7/25/2024
how-airtrain-clusters-textual-data

How Airtrain Clusters Textual Data

At Airtrain, we aim to help people make sense of their unstructured text data, whether it is to build datasets for LLM fine-tuning, evaluation sets, or just figuring out how people are using their products.

Josh Bauer·7/18/2024
the-llm-playground-march-2024-release-and-product-hunt-launch

The LLM Playground: March 2024 release and Product Hunt launch

Read about our latest Playground features and how to support our Product Hunt launch.

Emmanuel Turlay·3/8/2024
the-airtrain-playground-match-up-mistral-gemini-gpt-4-phi-2-llama-2-and-more

The Airtrain Playground: Match up Mistral, Gemini, GPT-4, Phi-2, Llama 2 and more

The Airtrain Playground lets you interact with a large selection of open-source and proprietary LLMs.

Emmanuel Turlay·1/17/2024
getting-started-with-the-gemini-pro-api

Getting Started with the Gemini Pro API

This guide explores how to obtain an API key and get started with the Gemini Pro SDKs.

Emmanuel Turlay·1/5/2024
embedding-based-evaluation-metrics

Embedding Based Evaluation Metrics: Strengths and Weaknesses

We explore existing embedding-based evaluation metrics and review their benefits and pitfalls.

Josh Bauer·12/19/2023
beyond-openai-reasons-timing-and-strategies-for-exploring-alternatives

Beyond OpenAI: Reasons, Timing, and Strategies for Exploring Alternatives

OpenAI's recent hurdles is a wake up call. AI leaders must explore alternatives to guarantee sustainable AI strategies.

Emmanuel Turlay·12/9/2023
replicating-academic-benchmarks-with-airtrain-mmlu

Replicating Academic Benchmarks with Airtrain: MMLU

We show how Airtrain can easily replicate the MMLU benchmark results for the Llama 2 family of models.

Emmanuel Turlay·12/1/2023
the-comprehensive-guide-to-llm-evaluation

The comprehensive guide to LLM evaluation

In this extensive guide to LLM evaluation. We cover all existing methods to gauge the quality of a language model.

Emmanuel Turlay·11/16/2023
introducing-airtrain

Introducing Airtrain

We are very excited to introduce to you Airtrain, a no-code platform for evaluation and fine-tuning workloads.

Emmanuel Turlay·10/16/2023