Why am I rejected?
4
#135 opened 5 months ago
by
tttxxxlll
Some parameters are on the meta device because they were offloaded to the cpu and disk.
#134 opened 5 months ago
by
Kermski
Access to repo
#133 opened 5 months ago
by
davidlopezsa
output_hidden_states
#132 opened 5 months ago
by
wegwerf
ValueError: `rope_scaling` must be a dictionary with with two fields
8
#131 opened 5 months ago
by
layor
CUDA out of memory
#130 opened 6 months ago
by
sparsh5
Request: DOI
#129 opened 6 months ago
by
Bang8654
Prompt instructions are not very clear
1
#127 opened 6 months ago
by
chateaux
Supported Language and Finetuning
1
#126 opened 6 months ago
by
Uiji
test
#125 opened 6 months ago
by
Jacoblccccccc
Missing tokenizer.model
#123 opened 6 months ago
by
alice86
Where to update max_prompt_len (to solve max_prompt_len <= params.max_seq_len, preferably using AWS JumpStart)
#121 opened 6 months ago
by
wichofer
Getting "Killed" (out of memory) after shards are loaded
4
#119 opened 6 months ago
by
nitin1607
[IFEVAL Dataset] Inquiry on Performance Metrics Decrease in LLaMA 3.1 Strict Levels Between July 18 and 22 Versions
#118 opened 6 months ago
by
linmoska
Seems overcooked in comparison to Llama 3.0 - short feedback
1
#117 opened 6 months ago
by
Dampfinchen
how should i provide prompts to the model that is locally downloaded and then used?
1
#116 opened 6 months ago
by
ayadav1
Update config.json
#115 opened 6 months ago
by
mohdazlah
Need a little guidance accessing https://huggingface.co/spaces/stevenijacobs/AI4Reading using an API. I'm trying to setup a resource to help students with learning disabilities.
#114 opened 6 months ago
by
stevenijacobs

Add missing space in prompt template
5
#113 opened 6 months ago
by
Rocketknight1

UPDATE README.md
#112 opened 6 months ago
by
Kryslynn93

tokenizer offset_mapping is incorrect
1
#111 opened 6 months ago
by
Aflt98

KeyError: 'llama'
2
#110 opened 6 months ago
by
ronnief1
OutOfMemoryError: CUDA out of memory
2
#109 opened 6 months ago
by
sieudd
Issue with accessing gated repo
6
#107 opened 6 months ago
by
vdcapriles

Deploy error (RuntimeError: weight lm_head.weight does not exist)
1
#106 opened 6 months ago
by
steveleancommerce
"TypeError: Object of type Undefined is not JSON serializable" when tokenizing tool_call inputs
3
#104 opened 6 months ago
by
ztgeng

Formats for prompting the model using Hugging Face
3
#103 opened 7 months ago
by
javalenzuela
Request: DOI
#102 opened 7 months ago
by
guicozmaciel
Time Module issue or Model?
1
#101 opened 7 months ago
by
rkapuaala

Issues with Tools use and Chat templates
#99 opened 7 months ago
by
pyrator
Upgrading Linux Dist
#98 opened 7 months ago
by
rkapuaala

Clone Repository
1
#96 opened 7 months ago
by
clearcash

llama3.1 gguf format
3
#95 opened 7 months ago
by
davidomars
how can i use git clone Meta-Llama-3.1-8B-Instruct
2
#93 opened 7 months ago
by
xiangsuyu
Asking for Pro subscription
6
#92 opened 7 months ago
by
Mayo133
update rope_scaling
#91 opened 7 months ago
by
Arunjith
Update for correct tool use system prompt
3
#90 opened 7 months ago
by
ricklamers
What call() function parameters besides "query" can be used by the model when doing brave_search and wolfram_alpha tool calls?
#89 opened 7 months ago
by
sszymczyk
What form of the built-in brave_search and wolfram_alpha tool call output is expected by the model?
3
#88 opened 7 months ago
by
sszymczyk
ValueError
1
#87 opened 7 months ago
by
Bmurug3
Request: DOI
1
#86 opened 7 months ago
by
sanjeev929
Request: DOI
1
#85 opened 7 months ago
by
moh996
The model repeatedly outputs a large amount of text and does not comply with the instructions.
10
#84 opened 7 months ago
by
baremetal
Llama repo access not approved yet
#83 opened 7 months ago
by
APaul1
Throwing Error for AutoModelForSequenceClassification
1
#82 opened 7 months ago
by
deshwalmahesh
GSM8K Evaluation Result: 84.5 vs. 76.95
17
#81 opened 7 months ago
by
tanliboy

Deploying Llama3.1 to Nvidia T4 instance (sagemaker endpoints)
4
#80 opened 7 months ago
by
mleiter
Different answers are predicted for the same prompt
#79 opened 7 months ago
by
sjainlucky
Efficiency low after adding the adapter_model.safetensors with base model
#78 opened 7 months ago
by
antony-pk
