Blog

Insights, tips, and product updates

Learn the latest techniques for building high-quality datasets for better-performing AI.

Behind the Scenes: Evaluating LLM Red Teaming Techniques and LLM Vulnerabilities

October 29, 2024

Ensuring the safety of large language models (LLMs) across languages is crucial as AI becomes more integrated into our lives. This article presents our red teaming study at Kili Technology, evaluating LLM vulnerabilities against adversarial prompts in both English and French. Our findings reveal critical insights into multilingual weaknesses and highlight the need for improved safety measures in AI systems.

Kili Technology

Research and methods on ensuring LLM Safety and AI safety

October 9, 2024

This article explores the latest developments in AI safety, including regulatory frameworks, risk management strategies, and technical approaches to mitigate potential harms. From red teaming and toxicity detection to reinforcement learning from human feedback, we delve into the multifaceted efforts to create AI systems that are not only powerful, but also reliable, ethical, and aligned with human values.

Kili Technology

The Ultimate Guide to Red Teaming LLMs and Adversarial Prompts (Examples and Steps)

September 16, 2024

Explore the latest techniques in red teaming large language models (LLMs) and crafting adversarial prompts. Learn how to evaluate and improve AI safety through automated testing methods, benchmark datasets, and robust defense strategies.

Kili Technology

Webinar recap: Surpass frontier LLM performance using RLHF

September 9, 2024

Discover how to surpass frontier LLM performance using Reinforcement Learning from Human Feedback (RLHF) with this recap of our latest webinar featuring Adaptive ML.

Kili Technology

A Guide to Using Small Language Models for Business Applications

August 30, 2024

Successfully implementing SLMs requires careful planning, from dataset preparation to continuous evaluation and monitoring. Organizations must manage these models proactively so that they keep meeting evolving demands and stay aligned with business goals.

Kili Technology

Natural Language Processing (NLP)

What is SmolLM? A Guide to Hugging Face's small language model

August 28, 2024

Explore SmolLM, a compact yet powerful language model challenging the notion that bigger is always better in AI. Learn how its meticulously curated datasets and efficient design deliver high performance with lower resource demands, making it ideal for applications in education, coding, and customer support.

Kili Technology

Data Labeling

A Guide to GPT-4o Mini: OpenAI's smaller, more efficient language model

August 27, 2024

Explore how GPT-4o Mini delivers advanced AI capabilities at a reduced cost. Learn about its key features, performance comparisons, and real-world applications, making it a top choice for businesses seeking efficient and cost-effective AI solutions.

Kili Technology

Llama 3.1 Guide: What to know about Meta's new 405B model and its data

Llama 3.1 is Meta's latest flagship language model, boasting an impressive 405 billion parameters. This article dives into how the model was trained and fine-tuned, and pulls out new insights for domain-specific LLMs.

Kili Technology

Food + Tool + Team = Redefining Nutritional Analysis Through Data Labeling

Imagine a Michelin-starred restaurant. The success of each dish relies on three key elements: premium ingredients (Food), the right tools (Tool), and a skilled team (Team).

Kili Technology

Data Labeling

What we can learn from DBRX Model Training, Data Quality, and Evaluation

Learn how DBRX was trained and evaluated, and how its focus on data quality shows what goes into building a high-performing, enterprise-level large language model.

Kili Technology

Data Labeling

Building High-Quality Datasets: Insights from Hugging Face's FineWeb

This article explores the best practices and insights from FineWeb's documentation, offering a comprehensive guide for anyone seeking to build high-quality datasets for their language models.

Kili Technology

Data Labeling

Challenges and Solutions: Building Generative AI Datasets

Generative AI (GenAI) is transforming industries by significantly boosting productivity and efficiency. This article explores GenAI's impact, emphasizing the crucial role of high-quality datasets in its success. We discuss challenges in creating these datasets and propose innovative solutions to enhance data processes, ensuring optimal performance of GenAI models.

Kili Technology

Data Labeling

Insights into Data Quality and Evaluation in Gemma 2 LLM

Gemma 2 is an advanced AI model from Google, featuring diverse data integration, rigorous preprocessing, and comprehensive evaluations to ensure high performance, safety, and ethical behavior. Learn how Gemma 2 balances cutting-edge capabilities with robust security and ethical standards.

Kili Technology

A Guide to RAG Evaluation and Monitoring (2024)

Following best practices when deploying a RAG application is essential to its widespread adoption and long-term value. This guide covers how to evaluate and monitor RAG use cases so that they deliver as much of that value as possible.

Kili Technology

Natural Language Processing (NLP)

A Guide to Aligning Large Language Models (LLMs) through Data

Fine-tuning and aligning LLMs can be a challenging and complex process. In this article, we aim to clarify and structure that process, drawing on our experience with clients and on existing examples.

Kili Technology

Data Labeling

How to Build LLM Evaluation Datasets for Your Domain-Specific Use Cases

Assessing and benchmarking LLMs makes it easier for data science teams to select the right model and develop a strategy to adapt it faster. Here's a guide to building an LLM evaluation dataset.

Kili Technology

How to fine-tune large language models (LLMs) with Kili Technology

Learn how to fine-tune large language models (LLMs) for specific tasks using Kili Technology. This tutorial provides a step-by-step guide and example on fine-tuning OpenAI models to categorize news articles into predefined categories.

Kili Technology

Data Labeling

Building Domain-Specific LLMs: Examples and Techniques

Discover examples of, and techniques for, developing domain-specific large language models (LLMs) in this guide.

Kili Technology

Open-Sourced Training Datasets for Large Language Models (LLMs)

We share 9 open-sourced datasets used for training LLMs, along with the key steps in data preprocessing.

Kili Technology

Data Labeling

Natural Language Processing (NLP)

Exploring Reinforcement Learning from Human Feedback (RLHF): A Comprehensive Guide

Reinforcement learning from human feedback (RLHF) helps large language models follow human instructions more naturally. It can allow LLMs with fewer parameters to perform better, and it helps steer the model away from dangerous behavior. Still, RLHF is an evolving technique, with challenges that researchers have yet to overcome.

Kili Technology

Subscribe for updates

Stay updated with the latest news, articles, and updates delivered directly to your inbox