AI & ML interests

AI inference, AI in the cloud, AI on the edge, software acceleration of AI workloads on hardware, efficient AI deployments, GPU-free AI inference, AI model optimization.

Recent Activity

jangrzybek updated a collection 5 days ago
RakutenAI 7B
jangrzybek published a model 5 days ago
AmpereComputing/rakutenai-7b-chat-gguf

AmpereComputing's collections (17)

DeepSeek R1
Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized build of llama.cpp, available at: https://hub.docker.com/r/amperecomputingai/llama.cpp