Text Results Report

A News Category text classification report comparing scalable ML baselines, reduced-feature pipelines, and transformer fine-tuning on held-out test performance.

EDA Results

Overall Comparison

Traditional ML Baselines

1. Input Text: Headline and short description are combined into one document.
2. Vectorize: Word TF-IDF, character TF-IDF, and hashing-based sparse features.
3. Classify: Scalable linear classifiers and ensemble baselines for sparse text features.
4. Evaluate: Accuracy, macro-F1, weighted-F1, precision, and recall on the test split.

Evaluation protocol: train/test split only. These baselines are compared directly on the held-out test set, without a separate validation stage; one such baseline is sketched below.
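As a concrete illustration, here is a minimal sketch of one baseline cell, assuming scikit-learn and a pandas DataFrame with hypothetical column names (headline, short_description, category). The tiny inline stand-in data, the 50/50 split, and the n-gram settings are illustrative assumptions, not values taken from the report.

    # Minimal sketch of one traditional baseline: word TF-IDF into a linear SVM.
    # The stand-in DataFrame and its column names are illustrative assumptions.
    import pandas as pd
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics import classification_report
    from sklearn.model_selection import train_test_split
    from sklearn.pipeline import Pipeline
    from sklearn.svm import LinearSVC

    # Tiny stand-in; real code would load the News Category dataset.
    df = pd.DataFrame({
        "headline": ["Stocks rally", "New vaccine trial", "Markets dip", "Flu season peaks"],
        "short_description": ["Indexes climb", "Phase 3 begins", "Indexes fall", "Cases rise"],
        "category": ["BUSINESS", "HEALTH", "BUSINESS", "HEALTH"],
    })

    # Step 1: combine headline and short description into one document.
    texts = (df["headline"] + " " + df["short_description"]).tolist()
    labels = df["category"].tolist()

    # 50/50 split only so the toy stand-in stratifies; real code would hold out less.
    X_train, X_test, y_train, y_test = train_test_split(
        texts, labels, test_size=0.5, stratify=labels, random_state=42
    )

    # Steps 2-3: word TF-IDF features into a scalable linear classifier.
    baseline = Pipeline([
        ("tfidf", TfidfVectorizer(ngram_range=(1, 2), sublinear_tf=True)),
        ("clf", LinearSVC()),
    ])
    baseline.fit(X_train, y_train)

    # Step 4: accuracy, per-class precision/recall/F1, macro and weighted averages.
    print(classification_report(y_test, baseline.predict(X_test), digits=4))

The hashing-based variant from step 2 would swap TfidfVectorizer for scikit-learn's HashingVectorizer, trading an explicit vocabulary for constant memory.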

Feature Reduction Pipeline Grid

1. Feature Extraction: Bag-of-Words and TF-IDF representations with unigram and bigram terms.
2. Dimensionality Reduction: Chi-square feature selection and TruncatedSVD projections.
3. Classifier: Linear LR/SVC/SGD models, with an MLP evaluated on dense SVD features.
4. Grid Ranking: Candidate pipelines are ranked by macro-F1 on the held-out test set.

Evaluation protocol: train/test split only. The grid is presented as an empirical comparison of reduced-feature pipelines, not as validation-based model selection; two candidate cells are sketched below.
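A sketch of how two cells of this grid might look, assuming scikit-learn and raw-text splits X_train/X_test/y_train/y_test over the full dataset (not the toy stand-in above, since the selection sizes need a large vocabulary). The specific values k=20000 and n_components=300 are illustrative assumptions, not settings from the report.

    # Two reduced-feature candidates, ranked by held-out macro-F1 (step 4).
    from sklearn.decomposition import TruncatedSVD
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.feature_selection import SelectKBest, chi2
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import f1_score
    from sklearn.pipeline import Pipeline

    candidates = {
        "tfidf(1,2)+chi2+lr": Pipeline([
            ("tfidf", TfidfVectorizer(ngram_range=(1, 2))),
            ("select", SelectKBest(chi2, k=20000)),   # keep top chi-square features
            ("clf", LogisticRegression(max_iter=1000)),
        ]),
        "tfidf(1,2)+svd+lr": Pipeline([
            ("tfidf", TfidfVectorizer(ngram_range=(1, 2))),
            ("svd", TruncatedSVD(n_components=300)),  # dense low-rank projection
            ("clf", LogisticRegression(max_iter=1000)),
        ]),
    }

    scores = {}
    for name, pipe in candidates.items():
        pipe.fit(X_train, y_train)
        scores[name] = f1_score(y_test, pipe.predict(X_test), average="macro")

    # Rank the grid by macro-F1 on the held-out test set.
    for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{name}: macro-F1 = {score:.4f}")

Per step 3 of the grid, the MLP classifier would sit only on the TruncatedSVD branch, which produces the dense features it is evaluated on.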

BERT Fine-Tuning Grid

1. Input Text: Combined headline and description, truncated or padded to 128 tokens.
2. Tokenizer: Checkpoint-specific WordPiece tokenization for BERT-family encoders.
3. Encoder: BERT and DistilBERT encoders are fine-tuned end to end.
4. Pooling + Head: CLS, mean, or pooler-style representation with dropout and a linear head.
5. Evaluate: The best validation checkpoint is evaluated once on the final test split.

Evaluation protocol: train/validation/test split. Validation macro-F1 is used for checkpoint selection before final test metrics are reported; a minimal fine-tuning sketch follows.
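A minimal sketch of one grid cell, assuming PyTorch and Hugging Face transformers: the bert-base-uncased checkpoint with mean pooling, dropout, and a linear head. The class-count placeholder and dropout value are assumptions, and the training loop and checkpoint selection are omitted.

    # One grid cell: BERT encoder, mean pooling, dropout, linear head.
    import torch
    import torch.nn as nn
    from transformers import AutoModel, AutoTokenizer

    NUM_CLASSES = 10  # placeholder; set to the News Category label count

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

    class MeanPoolClassifier(nn.Module):
        def __init__(self, checkpoint, num_labels, dropout=0.1):
            super().__init__()
            self.encoder = AutoModel.from_pretrained(checkpoint)
            self.dropout = nn.Dropout(dropout)
            self.head = nn.Linear(self.encoder.config.hidden_size, num_labels)

        def forward(self, input_ids, attention_mask):
            hidden = self.encoder(
                input_ids=input_ids, attention_mask=attention_mask
            ).last_hidden_state
            # Mean pooling over non-padding tokens; CLS or a pooler-style
            # representation are the other pooling options in the grid.
            mask = attention_mask.unsqueeze(-1).float()
            pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1e-9)
            return self.head(self.dropout(pooled))

    model = MeanPoolClassifier("bert-base-uncased", NUM_CLASSES)

    # Step 1: combined headline + description, truncated/padded to 128 tokens.
    batch = tokenizer(
        ["Example headline. Example short description."],
        truncation=True, padding="max_length", max_length=128, return_tensors="pt",
    )
    logits = model(batch["input_ids"], batch["attention_mask"])

In the full grid, a standard cross-entropy training loop would run over checkpoint and pooling combinations, keeping the checkpoint with the best validation macro-F1 for the single final test evaluation.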

Best Model Per-Class Report

Per-Class Metrics

Error Samples
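Not reproduced here, but the per-class table and error samples could be generated along these lines, assuming best_model, the held-out split, and the raw test texts (texts_test is a hypothetical name) come from the winning pipeline above.

    # Per-class metrics and a few misclassified examples for the best model.
    import pandas as pd
    from sklearn.metrics import classification_report

    y_pred = best_model.predict(X_test)

    # Per-class precision, recall, F1, and support, plus macro/weighted rows.
    print(classification_report(y_test, y_pred, digits=4))

    # Up to three misclassified examples per true class for qualitative review.
    errors = pd.DataFrame({"text": texts_test, "true": y_test, "pred": y_pred})
    errors = errors[errors["true"] != errors["pred"]]
    print(errors.groupby("true").head(3).to_string(index=False))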