mike-chat / app.py
Trickshotblaster's picture
Minor update
766c9b2
import gradio as gr
import gpt
"""
For information on how to customize the ChatInterface, peruse the gradio docs: https://www.gradio.app/docs/chatinterface
"""
demo = gr.Interface(fn=gpt.get_response, inputs=["textbox",
gr.Slider(0, 100, value=50, step=1),
gr.Slider(0.1, 2.0, value=1.0),
gr.Dropdown(
["mike-chat", "mike-code", "mike-code-600m"], value="mike-chat"),
], outputs=gr.Markdown(line_breaks=True), title="Mike Chat", article="""
Notice: if you have a GPU, I would highly recommend cloning the space and running it locally. The CPU provided by spaces isn't very fast.
Mike is a small GPT-style language model. It was trained for about 8 hrs on my PC using fineweb-edu and open orca datasets. While it hallucinates a lot, it seems to be about on par with other LMs of its size (about 160M params). Model details:
block_size: 512
n_layers: 12
n_heads: 12
d_model: 768
(Same as gpt-2 but without weight tying)
Architecture for Mike-Code-600m:
block_size: 256
n_layers: 16
n_heads: 12
d_model: 1536""")
if __name__ == "__main__":
demo.launch()