|
# Triton server |
|
|
|
## Building the image |
|
|
|
``` |
|
cd indicTrans2/inference/ |
|
docker build -f triton_server/Dockerfile -t indictrans2_triton . |
|
``` |
|
|
|
## Running the container |
|
|
|
Place the `en-indic` and `indic-en` checkpoint folders into `indicTrans2/checkpoints` directory |
|
|
|
Then start the server by: |
|
``` |
|
docker run --shm-size=256m --gpus=1 --rm -v ${PWD}/../checkpoints/:/models/checkpoints -p 8000:8000 -t indictrans2_triton |
|
``` |
|
|
|
## Sample client |
|
|
|
- Do `pip install tritonclient[all] gevent` first. |
|
- Then `python3 triton_server/client.py` |
|
|