finetuned-llama / README.md

Update README.md

78211e3 verified 19 days ago

4.44 kB

	---
	library_name: transformers
	tags: [retrieval-augmented-generation, finetuning, llm, huggingface]
	---

	# Model Card for Finetuned Llama 3.2 (ROS Query System)

	This model is a finetuned version of Llama 3.2 specifically designed to answer questions related to the Robot Operating System (ROS). It was finetuned on Kaggle using domain-specific data scraped from GitHub repositories and Medium articles. The model powers a Retrieval-Augmented Generation (RAG) pipeline in our AI final project.

	---

	## Model Details

	### Model Description

	- Developed by: Krish Murjani (netid: km6520) & Shresth Kapoor (netid: sk11677)
	- Project Name: CS-GY-6613 AI Final Project: ROS Query System
	- Finetuned From: `sentence-transformers/all-MiniLM-L6-v2`
	- Language(s): English
	- License: Apache 2.0

	---

	### Model Sources

	- Repository: [GitHub Repository](https://github.com/krishmurjani/cs-gy-6613-final-project)

	---

	## Uses

	### Direct Use

	The model is used in a Retrieval-Augmented Generation (RAG) pipeline for answering questions related to the Robot Operating System (ROS). It integrates with a vector search engine (Qdrant) and MongoDB for efficient retrieval and query response generation.

	### Downstream Use

	The model can be extended for other technical domains through additional finetuning or plug-in integration into larger AI systems.

	### Out-of-Scope Use

	The model is not designed for tasks outside of technical documentation retrieval and answering ROS-related queries.

	---

	## Bias, Risks, and Limitations

	- Bias: The model may reflect biases inherent in the scraped ROS documentation and articles.
	- Limitations: Responses are limited to the scraped and finetuned dataset. It may not generalize to broader queries.

	### Recommendations

	- Use the model for educational and research purposes in robotics and ROS-specific domains.
	- Avoid using the model in high-stakes applications where critical decisions rely on the accuracy of generated responses.

	---

	## How to Get Started with the Model

	```python
	from transformers import AutoModel, AutoTokenizer

	model = AutoModel.from_pretrained("your-model-id")
	tokenizer = AutoTokenizer.from_pretrained("your-model-id")

	input_text = "How can I navigate to a specific pose using ROS?"
	inputs = tokenizer(input_text, return_tensors="pt")
	outputs = model(**inputs)
	print(outputs)
	```

	## Training Details

	### Training Data

	- Sources:
	- GitHub repositories related to the Robot Operating System (ROS).
	- Medium articles discussing ROS topics.

	### Training Procedure

	- Preprocessing:
	- Data cleaning, text chunking, and embedding using Sentence-BERT (`all-MiniLM-L6-v2`).
	- Used ClearML orchestrator for ETL and finetuning pipelines.

	- Training Framework:
	- Hugging Face Transformers, PyTorch

	- Training Regime:
	- fp16 mixed precision (for efficiency and memory optimization)

	---

	## Evaluation

	### Testing Data

	- Dataset:
	- Internal evaluation dataset created from project-specific queries and generated question-answer pairs.

	### Factors & Metrics

	- Metrics:
	- Query relevance, answer accuracy, and completeness.

	- Evaluation Results:
	- Achieved high relevance and precision for domain-specific questions related to ROS.

	---

	## Environmental Impact

	- Hardware Type:
	- NVIDIA Tesla T4 (Kaggle)

	- Hours Used:
	- Approximately 15-20 hours of training

	- Compute Region:
	- US Central (Kaggle Cloud)

	- Carbon Emitted:
	- Estimated using the [Machine Learning Impact Calculator](https://mlco2.github.io/impact#compute).

	---

	## Technical Specifications

	- Model Architecture:
	- Transformer-based language model (Llama 3.2)

	- Compute Infrastructure:
	- Kaggle Cloud with NVIDIA Tesla T4 GPUs

	- Frameworks:
	- Hugging Face Transformers, PyTorch, ClearML

	---

	## Citation

	```bibtex
	@misc{kapoor2024rosquery,
	title={ROS Query System: A Retrieval-Augmented Generation Pipeline},
	author={Shresth Kapoor and Krish Murjani},
	year={2024},
	note={CS-GY-6613 AI Final Project, NYU Tandon School of Engineering}
	}
	```

	## Model Card Authors
	- Krish Murjani ([krishmurjani](https://huggingface.co/krishmurjani))
	- Shresth Kapoor ([shresthkapoor7](https://huggingface.co/shresthkapoor7))


	## Model Card Contact

	For any inquiries, please contact us through our ([GitHub Repository](https://github.com/krishmurjani/cs-gy-6613-final-project)).