	---
	license: apache-2.0
	---

	# NCUEatingAI-0.5B-v1

	This repository provides an example of how to use the NCUEatingAI-0.5B-v1 large language model from Hugging Face for chat-based inference. The model can be customized to act like any persona you specify in the system prompt, and it generates conversational responses based on user inputs.

	## Model Information

	- Model: [ZoneTwelve/NCUEatingAI-0.5B-v1](https://huggingface.co/ZoneTwelve/NCUEatingAI-0.5B-v1)
	- Size: 0.5 billion parameters
	- Task: Conversational AI / Chatbot

	## Usage

	### System Prompt

	You can set a system prompt to define how the model should behave during interactions. A simple example format is:

	```
	"You act like $USERNAME"
	```

	Replace `$USERNAME` with the desired persona (e.g., "a helpful assistant" or "a curious learner").
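
As a minimal sketch, the persona template above can be filled in with a small helper. The function name `build_system_prompt` is hypothetical and is not part of the model repository.

```python
def build_system_prompt(persona: str) -> str:
    """Fill the "You act like $USERNAME" template with a persona."""
    persona = persona.strip()
    if not persona:
        raise ValueError("persona must not be empty")
    return f"You act like {persona}"


print(build_system_prompt("a helpful assistant"))
```

The resulting string can be passed as the `content` of the system message.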

	### Inference Example

	Here’s a simple way to perform inference using the model. You’ll need to load the model and tokenizer, define the user and system prompts, and format the input using the `apply_chat_template` method.

	### Code Example

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer
	import torch

	def chat_with_ncueatingai(
	    model_path: str = "ZoneTwelve/NCUEatingAI-0.5B-v1",
	    prompt: str = "What's for lunch?",
	    system_prompt: str = "You act like @ZoneTwelve.",
	    max_tokens: int = 64,
	):
	    # Load the model and tokenizer
	    model = AutoModelForCausalLM.from_pretrained(model_path)
	    tokenizer = AutoTokenizer.from_pretrained(model_path)

	    # Prepare the chat messages
	    messages = [
	        {"role": "system", "content": system_prompt},
	        {"role": "user", "content": prompt},
	    ]

	    # Apply chat template
	    input_text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

	    # Tokenize inputs
	    inputs = tokenizer(input_text, return_tensors="pt")

	    # Generate response; max_new_tokens limits only the generated tokens,
	    # whereas max_length would also count the prompt tokens
	    with torch.no_grad():
	        outputs = model.generate(
	            **inputs,  # passes input_ids and attention_mask
	            max_new_tokens=max_tokens,
	            pad_token_id=tokenizer.eos_token_id,
	        )

	    # Decode only the newly generated tokens, skipping the prompt
	    new_tokens = outputs[0][inputs["input_ids"].shape[1]:]
	    response = tokenizer.decode(new_tokens, skip_special_tokens=True)
	    return response

	# Example usage
	if __name__ == "__main__":
	    response = chat_with_ncueatingai(
	        prompt="What's for lunch?",
	        system_prompt="You act like @ZoneTwelve.",
	    )
	    print("Model Response:", response)
	```
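
The `messages` list above holds a single system turn and a single user turn. For a multi-turn conversation, one possible approach is to keep appending turns to the same list before calling `apply_chat_template` again. The sketch below only manages that list. The helper names `new_conversation` and `add_turn` are hypothetical, and this card does not say how well the model handles longer histories.

```python
from typing import Dict, List

Message = Dict[str, str]


def new_conversation(system_prompt: str) -> List[Message]:
    """Start a history with the system prompt as the first message."""
    return [{"role": "system", "content": system_prompt}]


def add_turn(history: List[Message], role: str, content: str) -> List[Message]:
    """Return a new history with one more user or assistant message."""
    if role not in ("user", "assistant"):
        raise ValueError(f"unexpected role: {role!r}")
    return history + [{"role": role, "content": content}]


history = new_conversation("You act like @ZoneTwelve.")
history = add_turn(history, "user", "What's for lunch?")
# After generation, the reply would be stored so the next turn can see it.
history = add_turn(history, "assistant", "<model reply goes here>")
history = add_turn(history, "user", "And for dinner?")
print([m["role"] for m in history])
```

The resulting `history` has the same shape as `messages` and can be passed to the tokenizer's chat template in its place.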

	### Parameters

	- `model_path`: A local path or Hugging Face Hub model identifier. Defaults to `"ZoneTwelve/NCUEatingAI-0.5B-v1"`.
	- `prompt`: The user’s input prompt, which the model will respond to.
	- `system_prompt`: Defines the behavior or persona of the model.
	- `max_tokens`: The maximum number of tokens in the generated response.
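
One detail worth knowing about length limits: in the `transformers` `generate` method, `max_length` counts the prompt tokens plus the generated tokens, while `max_new_tokens` counts only the generated ones. The arithmetic is sketched below with a hypothetical helper, `generation_budget`. The token counts are made-up illustration values, not measurements from this model.

```python
def generation_budget(prompt_tokens: int, max_length: int) -> int:
    """Tokens left for the reply when the limit is given as max_length."""
    return max(0, max_length - prompt_tokens)


# Illustrative numbers: a 30-token prompt under a 64-token max_length
print(generation_budget(prompt_tokens=30, max_length=64))  # 34
# A prompt longer than the limit leaves no room for a reply
print(generation_budget(prompt_tokens=80, max_length=64))  # 0
```

Passing `max_new_tokens` instead keeps the reply length independent of the prompt length.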

	### Requirements

	Ensure the following Python packages are installed:

	```bash
	pip install torch transformers
	```
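
To confirm that both packages are installed in the current environment, a quick check with the standard library's `importlib.metadata` might look like this. The helper name `installed_versions` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Dict, Iterable, Optional


def installed_versions(packages: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each package name to its installed version, or None if missing."""
    found: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


for name, ver in installed_versions(["torch", "transformers"]).items():
    print(f"{name}: {ver or 'not installed'}")
```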

	### Model Download

	You can download the model directly from Hugging Face using:

	```python
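	from transformers import AutoModelForCausalLM, AutoTokenizer

	# The first call downloads the weights and tokenizer files and caches them locally
	model = AutoModelForCausalLM.from_pretrained("ZoneTwelve/NCUEatingAI-0.5B-v1")
	tokenizer = AutoTokenizer.from_pretrained("ZoneTwelve/NCUEatingAI-0.5B-v1")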
	```

	### License

	This project is licensed under the terms of the Apache 2.0 license, as declared in the model card metadata. See [LICENSE](./LICENSE) for details.

	---

	Enjoy using NCUEatingAI-0.5B-v1 to build your personalized conversational AI!