Spaces:
Running
Running
# Activation Memory: Part 2 | |
Code accompanying the deep-dive [blog post on activation memory](https://determined.ai/blog/act-mem-2). | |
- The main utility code is in `act_mem.py`. | |
- Basic transformer layers are implemented in `layers.py`. | |
- The scripts `{block,mlp}_script.py` demonstrate how replacing `GELU` with `ReLU` affects activation | |
memory. | |
- `attn_script.py` shows the cost of activation memory in the attention layer. | |
- Tests of the code are in `test.py`. | |
- See `requirements.txt` for versions the code was built against. | |
## Contributors | |
- [Garrett Goon](https://github.com/garrett361) |