Novora/CodeClassifier-v1-Tiny

Text Classification

🇪🇺 Region: EU


Impulse2000 committed on Apr 29, 2024

Commit 3d568e9 · unverified · 1 Parent(s): fb29812

Added more tags to Model

Files changed (1)

README.md +5 -0


@@ -1,5 +1,8 @@
 ---
 license: apache-2.0
 ---
 # Introduction
@@ -14,6 +17,8 @@ This model is designed to be able to run on CPU, but optimally runs on GPUs.
 - 64 hidden dimensions
 - 2 linear layers
 - The `snowflake-arctic-embed-xs` model is used as the embeddings model.
 # Architecture

 ---
 license: apache-2.0
+datasets:
+    - Novora/CodeClassifier_v1
+pipeline_tag: text-classification
 ---
 # Introduction
 - 64 hidden dimensions
 - 2 linear layers
 - The `snowflake-arctic-embed-xs` model is used as the embeddings model.
+- Dataset split into 80% training set, 20% testing set.
+- The combined training and test data contains 1,000 chunks per programming language, 31,000 chunks (entries) in total. Each chunk is a code snippet of 512 tokens.
 # Architecture
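
The dataset figures in the lines added by this commit can be sanity-checked with a little arithmetic. The sketch below is only an illustration: the constant and function names are hypothetical, and it assumes a plain 80/20 split over all chunks, since the README does not say whether the split is stratified per language.

```python
# Figures taken from the README lines added in this commit.
CHUNKS_PER_LANGUAGE = 1_000
TOTAL_CHUNKS = 31_000
TRAIN_PERCENT = 80  # 80% training set, 20% testing set


def split_sizes(total: int, train_percent: int) -> tuple[int, int]:
    """Return (train, test) sizes for a simple percentage split.

    Assumption: a plain split over all chunks; the README does not
    state whether the split is done per language.
    """
    train = total * train_percent // 100
    return train, total - train


# Number of languages implied by the total and the per-language count.
num_languages = TOTAL_CHUNKS // CHUNKS_PER_LANGUAGE
train_size, test_size = split_sizes(TOTAL_CHUNKS, TRAIN_PERCENT)

print(f"languages: {num_languages}")
print(f"train chunks: {train_size}, test chunks: {test_size}")
```

Integer percentages are used instead of a float fraction so the sizes are exact and always sum to the total.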