Paste an article excerpt and the reference model predicts its political bias - Left, Center, or Right - with calibrated confidence scores.
Evaluate your model on the political news bias task by submitting a prediction.csv file. Compare results against researchers on the public leaderboard.
Classify news articles into Left, Center, or Right political bias categories. The task follows a standard supervised classification setup.
Upload a prediction.csv file with two columns: id (matching the test set) and label (one of left, center, right).
Performance is measured using Accuracy and Macro F1. Leaderboard ranks by Macro F1 with Accuracy as tiebreaker.
Training, validation and test splits — plus starter code — are distributed via the public BiasBench GitHub repository.
Classify 21K+ news articles into Left, Center, or Right bias categories.
Upload a prediction.csv file, monitor scoring, and compare your best runs at a glance.
Or click below to browse. Max 50MB · CSV format with id, label columns.
Overview, dataset background, starter code links, setup steps, and the expected prediction file format.
Participants classify news articles into one of three political bias categories: left, center, or right. This is a supervised document-level classification task where the input is the full article body, not only the headline. Each article receives one political bias label, making the task a three-class, single-label classification problem.
To participate, train and run your own model, generate a prediction.csv file in the correct format, and submit it for evaluation on the held-out test dataset. Valid submissions are scored and placed on the public leaderboard.
The dataset contains 21,747 news articles collected from AllSides balanced news headline roundups in November 2022. AllSides presents expert-selected U.S. news articles from sources with different political perspectives, including left, center, and right viewpoints, to help readers compare coverage and recognize bias.
Each article was tagged by four expert annotators according to expressed political partisanship: left, right, or neutral/center. The collected data includes headlines, dates, article text, topic tags, publishing outlet, and AllSides' neutral topic description. The original scrape was collected on 2022-11-15 from AllSides headline roundups.
test.csv.Train a baseline, a fine-tuned transformer, or any custom architecture as long as it can produce one valid bias label for each article in the test split. The starter kit is intended as a reference workflow, but participants are expected to run their own model choices, tune them on the training and validation data, and generate predictions for final evaluation.
prediction.csv for upload.Run inference on the provided test inputs and save the predictions as a CSV. Each row should contain the article text and your predicted label. Labels must be one of left, center, or right.
Name the file prediction.csv before submitting it for leaderboard evaluation.
prediction.csv.left, center, or right in lowercase.Scoring is performed against the held-out test set using Accuracy and Macro F1. The leaderboard ranks by Macro F1, with Accuracy as tiebreaker. Scored submissions appear on the leaderboard after evaluation; failed submissions receive a validation error.
Everything you need to know about BiasBench, the dataset, and how to submit your model.