BiasBench — Political News Bias Classification Platform

Classifier Demo

Classify any article in real time

Paste an article excerpt and the reference model predicts its political bias - Left, Center, or Right - with calibrated confidence scores.

Article text

164 chars

Prediction

Enter an article and click Classify Bias.

Competition

Ready to join the competition?

Evaluate your model on the political news bias task by submitting a prediction.csv file. Compare results against researchers on the public leaderboard.

Season 01 · 12 days remaining

Push the state-of-the-art on political bias

Beat the current 0.66 Accuracy / 0.63 Macro F1 baseline. Earn a top-10 spot on the public leaderboard and get your model featured.

Task Overview

Classify news articles into Left, Center, or Right political bias categories. The task follows a standard supervised classification setup.

Submission Format

Upload a prediction.csv file with two columns: id (matching the test set) and label (one of left, center, right).

Evaluation Metrics

Performance is measured using Accuracy and Macro F1. Leaderboard ranks by Macro F1 with Accuracy as tiebreaker.

Dataset Access

Training, validation and test splits — plus starter code — are distributed via the public BiasBench GitHub repository.

Search leaderboard...

Active ChallengeGlobal Ranking

Political News Bias Classification

Classify 21K+ news articles into Left, Center, or Right bias categories.

Entries

Subs

Metric

Macro F1

Showing 3 of 3 live entries

Rank

User

Team

Model Type

Accuracy

Macro F1

Subs

Status

George Ghiugan

Apr 28, 2026

NumberOne

BERT

0.6679

0.6390

New SOTA

Sarah Chen

Apr 30, 2026

DeBERTa Fine Tune

DeBERTa

0.7000

0.6000

Rising

John Doe

Apr 30, 2026

RoBERTa Demo Baseline

RoBERTa

0.6120

0.5840

Baseline

My Submissions

Track your submitted runs and results

Upload a prediction.csv file, monitor scoring, and compare your best runs at a glance.

Drag & drop your prediction.csv

Or click below to browse. Max 50MB · CSV format with id, label columns.

Best Score

0.63

Macro F1 · RoBERTa Bias v3

Latest Submission

RoBERTa Bias v3 (ensemble)

2 hours ago · Scored

Estimated Rank

#1of 1,247 teams

Submission

File

Date

Accuracy

Macro F1

Status

RoBERTa Bias v3 (ensemble)

prediction_v3_ensemble.csv

Today · 2h ago

0.6600

0.6300

Scored

DeBERTa Political Large

prediction_deberta.csv

Today · 4h ago

0.6418

0.6052

Scored

Distilled RoBERTa + DomainAdapt

prediction_distilled.csv

Yesterday

—

Processing

Baseline TF-IDF + LogReg

prediction_baseline.csv

2 days ago

0.5734

0.5402

Scored

BERT Multilingual (test run)

prediction_mbert.csv

3 days ago

—

Failed

Documentation

Project docs and submission guide

Overview, dataset background, starter code links, setup steps, and the expected prediction file format.

Overview

Participants classify news articles into one of three political bias categories: left, center, or right. This is a supervised document-level classification task where the input is the full article body, not only the headline. Each article receives one political bias label, making the task a three-class, single-label classification problem.

To participate, train and run your own model, generate a prediction.csv file in the correct format, and submit it for evaluation on the held-out test dataset. Valid submissions are scored and placed on the public leaderboard.

GitHub Links

Starter Kitgithub.com/ghiugan/BiasBenchStarter

Starter code for training your own models, including a data folder structured with train, validation, and test splits.

Original Dataset: Qbiasgithub.com/irgroup/Qbias

Original Qbias dataset repository, including source data and scraping tools for future research access.

Dataset

The dataset contains 21,747 news articles collected from AllSides balanced news headline roundups in November 2022. AllSides presents expert-selected U.S. news articles from sources with different political perspectives, including left, center, and right viewpoints, to help readers compare coverage and recognize bias.

Each article was tagged by four expert annotators according to expressed political partisanship: left, right, or neutral/center. The collected data includes headlines, dates, article text, topic tags, publishing outlet, and AllSides' neutral topic description. The original scrape was collected on 2022-11-15 from AllSides headline roundups.

Left

10,273

Right

7,222

Center

4,252

Setup Instructions

Clone the starter repository.
Create a virtual environment and install dependencies.
Train your own model and run it on test.csv.

How to Run Models

Train a baseline, a fine-tuned transformer, or any custom architecture as long as it can produce one valid bias label for each article in the test split. The starter kit is intended as a reference workflow, but participants are expected to run their own model choices, tune them on the training and validation data, and generate predictions for final evaluation.

Goal: run inference on the test data and save your output as prediction.csv for upload.

Generating prediction.csv

Run inference on the provided test inputs and save the predictions as a CSV. Each row should contain the article text and your predicted label. Labels must be one of left, center, or right.

text

label

The article body discusses voting rights legislation and its effect on local communities...

left

The report summarizes the policy proposal, including statements from both parties...

center

The article argues that tax cuts and deregulation will strengthen the national economy...

right

Name the file prediction.csv before submitting it for leaderboard evaluation.

Submission Rules

File must be named exactly prediction.csv.
Labels must be one of left, center, or right in lowercase.
Use one row per test article so every test item receives exactly one prediction.
Up to 5 submissions per day, unlimited during the warmup period.

Evaluation Details

Scoring is performed against the held-out test set using Accuracy and Macro F1. The leaderboard ranks by Macro F1, with Accuracy as tiebreaker. Scored submissions appear on the leaderboard after evaluation; failed submissions receive a validation error.

Frequently Asked Questions

Everything you need to know about BiasBench, the dataset, and how to submit your model.

What is BiasBench and who is it for?

BiasBench is a public benchmark and competition platform for political news bias classification. It is built for NLP researchers, students, and practitioners who want to evaluate their models on a standardized political bias task and compare results with others on a public leaderboard.

How are bias labels defined in the dataset?

Each article in the dataset is annotated as left, center, or right based on the editorial stance of the outlet and a manual review pass. The full annotation guidelines, agreement scores, and edge-case policies are described in the Docs section and in the dataset card on GitHub.

What file format does a submission need?

Your submission must be a file named prediction.csv with exactly two columns: 'id' (matching the test set ids) and 'label' (one of left, center, or right, lowercase). One row per test id — missing ids fail validation immediately.

Which metrics are used for ranking?

We report both Accuracy and Macro F1 on the held-out test split. The leaderboard ranks primarily by Macro F1, with Accuracy as tiebreaker. Macro F1 is preferred to keep all three bias classes equally weighted regardless of class distribution.

How many submissions am I allowed to make?

During the warmup period, submissions are unlimited. Once the official season starts, each team has up to 5 scored submissions per day. Failed submissions caused by format errors do not count against your daily limit.

Can I use pretrained models and external data?

Yes — most pretrained language models (BERT, RoBERTa, DeBERTa, etc.) are allowed, as long as they were not trained on the BiasBench test set. External unsupervised data is permitted; external labeled political bias data must be disclosed in your submission notes for reproducibility.

Understand the bias behind the news

Join researchers pushing the state-of-the-art on political news bias classification and get your model on the public BiasBench leaderboard.