Raincleared commited on
Commit
5956334
·
verified ·
1 Parent(s): 91db613

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -11,4 +11,18 @@ pipeline_tag: text-generation
11
  This is the original 0.1B BlockFFN checkpoint used in the paper *BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity* for acceleration tests.
12
  You can load and use this model simply by using `AutoTokenizer` and `AutoModelForCausalLM`.
13
 
14
- Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  This is the original 0.1B BlockFFN checkpoint used in the paper *BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity* for acceleration tests.
12
  You can load and use this model simply by using `AutoTokenizer` and `AutoModelForCausalLM`.
13
 
14
+ Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]
15
+
16
+ ### Citation
17
+
18
+ If you find our work useful for your research, please kindly cite our paper as follows:
19
+
20
+ ```
21
+ @article{song2025blockffn,
22
+ title={{BlockFFN}: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity},
23
+ author={Chenyang Song and Weilin Zhao and Xu Han and Chaojun Xiao and Yingfa Chen and Yuxuan Li and Zhiyuan Liu and Maosong Sun},
24
+ journal={arXiv preprint arXiv:2507.08771},
25
+ year={2025},
26
+ url={https://arxiv.org/pdf/2507.08771},
27
+ }
28
+ ```