Spaces: srossitto79/AgentLlama007B

srossitto79 committed on Oct 10, 2023

Commit e6033cc

1 Parent(s): 2e5cd1d

Update README.md

Files changed (1)

README.md +17 -1

@@ -58,11 +58,27 @@ For more details on customization, model configuration, and tool parameters, ref
 AgentLlama007B's core logic is encapsulated in the `RBotAgent` class, which manages the conversational flow and tool integration. The knowledge base tool, `StorageRetrievalLLM`, uses persistent memory with a FAISS index of document embeddings. Various tools are provided, each encapsulating specific skills such as image generation and web search. The modular architecture allows easy replacement of components like the language model.
 ## Credits
 This project was created by Salvatore Rossitto as a passion project and a learning endeavor. Contributions from the community are welcome and encouraged.
 ## License
-AgentLlama007B is an open-source project released under the MIT license. You are free to use, modify, and distribute it as per the terms of the license.

 AgentLlama007B's core logic is encapsulated in the `RBotAgent` class, which manages the conversational flow and tool integration. The knowledge base tool, `StorageRetrievalLLM`, uses persistent memory with a FAISS index of document embeddings. Various tools are provided, each encapsulating specific skills such as image generation and web search. The modular architecture allows easy replacement of components like the language model.
+## Why it matters
+AgentLlama007B demonstrates the power of modern conversational AI in a real-world setting. Unlike many research prototypes, it runs smoothly on consumer hardware: a single 8-core CPU with 16 GB of RAM.
+Remarkably, AgentLlama007B achieves language understanding and task automation using a quantized 7B parameter model. This is more than an order of magnitude smaller than the models that power other conversational agents. For example, GPT-4 reportedly uses a 180B parameter model.
+In practice, this means AgentLlama007B can understand free-form instructions and execute complex workflows, most of the time :-).
 ## Credits
+AgentLlama007B has been evaluated using TheBloke's Mistral-7B-Instruct-v0.1-GGUF model. This 7 billion parameter model was converted from [MistralAI's original Mistral-7B-Instruct-v0.1 model](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1). I personally find this 7B model impressive.
 This project was created by Salvatore Rossitto as a passion project and a learning endeavor. Contributions from the community are welcome and encouraged.
 ## License
+[MistralAI's original Mistral-7B-Instruct-v0.1 model](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1)
+[TheBloke's GGUF conversion of Mistral-7B-Instruct-v0.1](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF)
+AgentLlama007B is an open-source project released under the MIT license.
+You are free to use, modify, and distribute it as per the terms of the license.
+The downloaded LLM is subject to its original author's license.
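
The "Why it matters" lines added in this commit say a quantized 7B parameter model runs on a machine with 16 GB of RAM. A rough back-of-the-envelope check is sketched below. It is only an estimate under stated assumptions: the README does not say which quantization level is used, so the bit widths here are illustrative, and the helper name `estimated_weight_gib` is hypothetical. Real GGUF files mix bit widths across tensors, and inference needs additional memory beyond the weights (context cache, runtime buffers).

```python
def estimated_weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB.

    Assumes every weight is stored at the same bit width, which is a
    simplification of how quantized model files are actually laid out.
    """
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    n_params = 7e9  # "7B parameter model", as stated in the README
    # Bit widths below are assumptions for illustration only.
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        gib = estimated_weight_gib(n_params, bits)
        print(f"7B weights at {label}: about {gib:.1f} GiB")
```

Under these assumptions, unquantized 16-bit weights alone would take up most of a 16 GB machine, while 8-bit or 4-bit weights leave considerable headroom, which is consistent with the README's claim about consumer hardware.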