Classifier Demo

Classify any article in real time

Paste an article excerpt and the reference model predicts its political bias - Left, Center, or Right - with calibrated confidence scores.

164 chars
Prediction
Enter an article and click Classify Bias.
Competition

Ready to join the competition?

Evaluate your model on the political news bias task by submitting a prediction.csv file. Compare results against researchers on the public leaderboard.

Season 01 · 12 days remaining

Push the state-of-the-art on political bias

Beat the current 0.66 Accuracy / 0.63 Macro F1 baseline. Earn a top-10 spot on the public leaderboard and get your model featured.

Task Overview

Classify news articles into Left, Center, or Right political bias categories. The task follows a standard supervised classification setup.

Submission Format

Upload a prediction.csv file with two columns: id (matching the test set) and label (one of left, center, right).

Evaluation Metrics

Performance is measured using Accuracy and Macro F1. Leaderboard ranks by Macro F1 with Accuracy as tiebreaker.

Dataset Access

Training, validation and test splits — plus starter code — are distributed via the public BiasBench GitHub repository.

Search leaderboard...
Active ChallengeGlobal Ranking

Political News Bias Classification

Classify 21K+ news articles into Left, Center, or Right bias categories.

Entries
0
Subs
0
Metric
Macro F1
Rank
User
Team
Model Type
Accuracy
Macro F1
Status
No scored submissions yet.
Live leaderboard rows will appear after the first scored upload.
My Submissions

Track your submitted runs and results

Upload a prediction.csv file, monitor scoring, and compare your best runs at a glance.

Drag & drop your prediction.csv

Or click below to browse. Max 50MB · CSV format with id, label columns.

Best Score
0.63
Macro F1 · RoBERTa Bias v3
Latest Submission
RoBERTa Bias v3 (ensemble)
2 hours ago · Scored
Estimated Rank
#1of 1,247 teams
Submission
File
Accuracy
Macro F1
Status
RoBERTa Bias v3 (ensemble)
prediction_v3_ensemble.csv
0.6600
0.6300
Scored
DeBERTa Political Large
prediction_deberta.csv
0.6418
0.6052
Scored
Distilled RoBERTa + DomainAdapt
prediction_distilled.csv
Processing
Baseline TF-IDF + LogReg
prediction_baseline.csv
0.5734
0.5402
Scored
BERT Multilingual (test run)
prediction_mbert.csv
Failed
Documentation

Project docs and submission guide

Overview, dataset background, starter code links, setup steps, and the expected prediction file format.

Overview

Participants classify news articles into one of three political bias categories: left, center, or right. This is a supervised document-level classification task where the input is the full article body, not only the headline. Each article receives one political bias label, making the task a three-class, single-label classification problem.

To participate, train and run your own model, generate a prediction.csv file in the correct format, and submit it for evaluation on the held-out test dataset. Valid submissions are scored and placed on the public leaderboard.

Dataset

The dataset contains 21,747 news articles collected from AllSides balanced news headline roundups in November 2022. AllSides presents expert-selected U.S. news articles from sources with different political perspectives, including left, center, and right viewpoints, to help readers compare coverage and recognize bias.

Each article was tagged by four expert annotators according to expressed political partisanship: left, right, or neutral/center. The collected data includes headlines, dates, article text, topic tags, publishing outlet, and AllSides' neutral topic description. The original scrape was collected on 2022-11-15 from AllSides headline roundups.

Left
10,273
Right
7,222
Center
4,252

Setup Instructions

  1. Clone the starter repository.
  2. Create a virtual environment and install dependencies.
  3. Train your own model and run it on test.csv.

How to Run Models

Train a baseline, a fine-tuned transformer, or any custom architecture as long as it can produce one valid bias label for each article in the test split. The starter kit is intended as a reference workflow, but participants are expected to run their own model choices, tune them on the training and validation data, and generate predictions for final evaluation.

Goal: run inference on the test data and save your output as prediction.csv for upload.

Generating prediction.csv

Run inference on the provided test inputs and save the predictions as a CSV. Each row should contain the article text and your predicted label. Labels must be one of left, center, or right.

text
label
The article body discusses voting rights legislation and its effect on local communities...
left
The report summarizes the policy proposal, including statements from both parties...
center
The article argues that tax cuts and deregulation will strengthen the national economy...
right

Name the file prediction.csv before submitting it for leaderboard evaluation.

Submission Rules

  • File must be named exactly prediction.csv.
  • Labels must be one of left, center, or right in lowercase.
  • Use one row per test article so every test item receives exactly one prediction.
  • Up to 5 submissions per day, unlimited during the warmup period.

Evaluation Details

Scoring is performed against the held-out test set using Accuracy and Macro F1. The leaderboard ranks by Macro F1, with Accuracy as tiebreaker. Scored submissions appear on the leaderboard after evaluation; failed submissions receive a validation error.

Frequently Asked Questions

Everything you need to know about BiasBench, the dataset, and how to submit your model.

What is BiasBench and who is it for?
BiasBench is a public benchmark and competition platform for political news bias classification. It is built for NLP researchers, students, and practitioners who want to evaluate their models on a standardized political bias task and compare results with others on a public leaderboard.
How are bias labels defined in the dataset?
Each article in the dataset is annotated as left, center, or right based on the editorial stance of the outlet and a manual review pass. The full annotation guidelines, agreement scores, and edge-case policies are described in the Docs section and in the dataset card on GitHub.
What file format does a submission need?
Your submission must be a file named prediction.csv with exactly two columns: 'id' (matching the test set ids) and 'label' (one of left, center, or right, lowercase). One row per test id — missing ids fail validation immediately.
Which metrics are used for ranking?
We report both Accuracy and Macro F1 on the held-out test split. The leaderboard ranks primarily by Macro F1, with Accuracy as tiebreaker. Macro F1 is preferred to keep all three bias classes equally weighted regardless of class distribution.
How many submissions am I allowed to make?
During the warmup period, submissions are unlimited. Once the official season starts, each team has up to 5 scored submissions per day. Failed submissions caused by format errors do not count against your daily limit.
Can I use pretrained models and external data?
Yes — most pretrained language models (BERT, RoBERTa, DeBERTa, etc.) are allowed, as long as they were not trained on the BiasBench test set. External unsupervised data is permitted; external labeled political bias data must be disclosed in your submission notes for reproducibility.

Understand the bias behind the news

Join researchers pushing the state-of-the-art on political news bias classification and get your model on the public BiasBench leaderboard.