atharv-16's picture
Create README.md
f8c2d8f verified

DETR (DEtection TRansformer) for Object Detection

Model Description

This model is a pre-trained DETR model for object detection. It uses a Transformer architecture to predict bounding boxes and class labels for each object in an image. It was trained on the COCO dataset and is capable of detecting a wide variety of objects in real-world images.

Model Details

  • Model: facebook/detr-resnet-50
  • Framework: PyTorch
  • Task: Object Detection
  • Input: Image (H, W, C)
  • Output: Bounding boxes and class labels for detected objects
  • License: MIT

How to Use

from transformers import DetrForObjectDetection, DetrImageProcessor
from PIL import Image
import torch

# Load the processor and model
processor = DetrImageProcessor.from_pretrained("your-username/detr-object-detection")
model = DetrForObjectDetection.from_pretrained("your-username/detr-object-detection")

# Prepare the image
image = Image.open("path_to_image.jpg")

# Process the image
inputs = processor(images=image, return_tensors="pt")
outputs = model(**inputs)

# Post-process and display the results
target_sizes = torch.tensor([image.size[::-1]])
results = processor.post_process_object_detection(outputs, target_sizes=target_sizes, threshold=0.9)[0]

# Print and visualize detected objects
for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
    box = [round(i, 2) for i in box.tolist()]
    print(f"Detected {model.config.id2label[label.item()]} with confidence {round(score.item(), 3)} at location {box}")