John6666
/

llama-joycaption-alpha-two-hf-llava-nf4

@@ -14,23 +14,24 @@ This command will caption all the `.jpg` images in the specified directory using
 ## Command-Line Arguments
-**Note**: You must specify either `--glob` or `--filelist` to provide images, and either `--prompt` or `--prompt-file` to provide a prompt for caption generation.
-| Argument           | Description                                                | Default                                                                                                                 |
-| ------------------ | ---------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------- |
-| `--glob`           | Glob pattern to find images                                | N/A                                                                                                                     |
-| `--filelist`       | File containing a list of images                           | N/A                                                                                                                     |
-| `--prompt`         | Prompt to use for caption generation                       | N/A                                                                                                                     |
-| `--prompt-file`    | JSON file containing prompts                               | N/A                                                                                                                     |
-| `--batch-size`     | Batch size for image processing                            | 1                                                                                                                       |
-| `--greedy`         | Use greedy decoding instead of sampling                    | False                                                                                                                   |
-| `--temperature`    | Sampling temperature (used when not using greedy decoding) | 0.6                                                                                                                     |
-| `--top-p`          | Top-p sampling value (nucleus sampling)                    | 0.9                                                                                                                     |
-| `--top-k`          | Top-k sampling value                                       | None                                                                                                                    |
-| `--max-new-tokens` | Maximum length of the generated caption (in tokens)        | 256                                                                                                                     |
-| `--num-workers`    | Number of workers loading images in parallel               | 4                                                                                                                       |
-| `--model`          | Pre-trained model to use                                   | [fancyfeast/llama-joycaption-alpha-two-hf-llava](https://huggingface.co/fancyfeast/llama-joycaption-alpha-two-hf-llava) |
 ### Examples
@@ -40,6 +41,10 @@ This command will caption all the `.jpg` images in the specified directory using
    ```sh
    ./batch-caption.py --glob "images/*.png" --prompt "Write a descriptive caption for this image in a formal tone."
    ```
 2. **Use a JSON file for prompts**

 ## Command-Line Arguments
+**Note**: You must specify either `--glob` or `--filelist` or `--input` to provide images, and either `--prompt` or `--prompt-file` to provide a prompt for caption generation.
+| Argument           | Description                                                | Default                                                                                                                     |
+| ------------------ | ---------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------- |
+| `--input`          | Input images                                               | N/A                                                                                                                         |
+| `--glob`           | Glob pattern to find images                                | N/A                                                                                                                         |
+| `--filelist`       | File containing a list of images                           | N/A                                                                                                                         |
+| `--prompt`         | Prompt to use for caption generation                       | N/A                                                                                                                         |
+| `--prompt-file`    | JSON file containing prompts                               | N/A                                                                                                                         |
+| `--batch-size`     | Batch size for image processing                            | 1                                                                                                                           |
+| `--greedy`         | Use greedy decoding instead of sampling                    | False                                                                                                                       |
+| `--temperature`    | Sampling temperature (used when not using greedy decoding) | 0.6                                                                                                                         |
+| `--top-p`          | Top-p sampling value (nucleus sampling)                    | 0.9                                                                                                                         |
+| `--top-k`          | Top-k sampling value                                       | None                                                                                                                        |
+| `--max-new-tokens` | Maximum length of the generated caption (in tokens)        | 256                                                                                                                         |
+| `--num-workers`    | Number of workers loading images in parallel               | 4                                                                                                                           |
+| `--model`          | Pre-trained model to use                                   | [John6666/llama-joycaption-alpha-two-hf-llava-nf4](https://huggingface.co/John6666/llama-joycaption-alpha-two-hf-llava-nf4) |
+| `--bf16`           | Load model on torch.bfloat16                               | False                                                                                                                       |
 ### Examples
    ```sh
    ./batch-caption.py --glob "images/*.png" --prompt "Write a descriptive caption for this image in a formal tone."
    ```
+   or
+   ```sh
+   ./batch-caption.py --input "images/dog.png" --prompt "Write a descriptive caption for this image in a formal tone."
+   ```
 2. **Use a JSON file for prompts**