LEarn

This is a resource for legislators, policy makers, journalists, thought leaders, and researchers.

Artificial intelligence can be confusing and overwhelming. We aim to provide clarity and understanding.

The modules, articles, and guides presented here are intended to explain fundamental concepts in artificial intelligence and AI governance in accurate and non-technical language. New articles will be added as the technology and language of AI evolve—and they’re evolving quickly.

topics

AI 101

Your startup guide to understanding Artificial Intelligence.

companion chatbots


What they are, how they work, where the risks and dangers lie.

AI Safeguards

Exploring the foundations of AI safeguards and mitigation.

The ai developer’s Duty of care

Learn about duty of care, product liability, and how these concepts apply to artificial intelligence products.

Training Data transparency

Learn about the foundational ingredients of AI models, and why and how they should be disclosed.

DISCLOSING AI USE

Understand the importance of AI disclosure laws, and how content provenance makes disclosure possible.

TCAI research: further resources

TCAI guides to AI lawsuits, state data privacy laws, and more.

Complete Resource Library

Bruce Barcott Bruce Barcott

Understanding Synthetic Data

In today’s AI ecosystem there are two general types of training data: organic and synthetic.

Organic data describes information generated by actual humans, whether that’s a piece of writing, a numerical dataset, a song, an image, or a video. Synthetic data is created by generative AI models using organic data as a base material.

Read More
Bruce Barcott Bruce Barcott

Synthetic Data and AI ‘Model Collapse’

Just as a photocopy of a photocopy can drift away from the original, when generative AI is trained on its own synthetic data, its output can also drift away from reality, growing further apart from the organic data that it was intended to imitate.

Read More
Bruce Barcott Bruce Barcott

Transparency and Synthetic Data

The use of synthetic data isn’t inherently good or bad. In medical research, for example, it’s a critically important tool that allows scientists to make new discoveries while protecting the privacy of individual patients.

At the Transparency Coalition, we are not calling for limitations on the creation or use of synthetic data. What’s needed is disclosure: Developers should be transparent in their use of synthetic data when using it to train an AI model.

Read More