News_Classification / README.md
Jiahuita
updated readme
4931f6a
metadata
title: News Classification
emoji: 📰
colorFrom: green
colorTo: indigo
sdk: docker
sdk_version: 0.95.2
app_file: app.py
pinned: false
language: en
license: mit
tags:
  - text-classification
  - news-classification
  - LSTM
  - tensorflow
pipeline_tag: text-classification

News Source Classifier

This model classifies news headlines as either Fox News or NBC News using a deep learning LSTM (Long Short-Term Memory) neural network architecture.

Model Description

Architecture

  • Input Layer: Embedding layer (vocab_size=74,934, embedding_dim=128)
  • LSTM Layer 1: 128 units with return sequences
  • Dropout Layer 1: For regularization
  • LSTM Layer 2: 64 units
  • Dropout Layer 2: For regularization
  • Output Layer: Dense layer with 2 units (binary classification)

Technical Details

  • Total Parameters: 9,772,676 (37.28 MB)
  • Training Parameters: 9,772,674 (37.28 MB)
  • Input Shape: (41, ) - sequences of length 41
  • Performance: Achieves binary classification of news sources

Usage

You can use this model through our REST API:

import requests

def predict_news_source(text):
    response = requests.post(
        "https://jiahuita-news-classification.hf.space/predict",
        json={"text": text},
        headers={"Content-Type": "application/json"}
    )
    return response.json()

# Example usage
headline = "Scientists discover breakthrough in renewable energy research"
result = predict_news_source(headline)
print(result)

Example response:

{
    "label": "nbc",
    "score": 0.789
}

Limitations and Bias

This model has been trained on news headlines from specific sources and time periods, which may introduce certain biases:

  • Training data is limited to two news sources
  • Headlines represent a specific time period
  • Model may be sensitive to writing style rather than just content

Training Details

The model was trained using:

  • TensorFlow 2.10.0
  • Binary cross-entropy loss
  • Embedding layer for text representation
  • Dual LSTM layers with dropout for robust feature extraction
  • Dense layer with softmax activation for final classification

License

This project is licensed under the MIT License.