Notes:
- API Key: Saved in your browser's local storage for convenience. It is not sent anywhere except to official Google API servers.
- API Usage: Shows an offline estimate of your usage against the free tier limits (RPM: Requests Per Minute, RPD: Requests Per Day). This tracker resets daily based on your local clock.
- Delay: Important for preventing API rate limit errors. You can review rate limits here. This setting is ignored unless "Process Sequentially" is active.
- Generation Type: Choose your output: a descriptive Caption, comma-separated Tags, or a Weight Brief for AI art prompts (e.g., Stable Diffusion).
- Precise Value: Controls the AI's creativity. 'Precise' provides factual, repeatable descriptions (low temperature). 'Creative' allows for more imaginative interpretations (high temperature).
- Processing Mode: By default, images are processed in fast, cost-effective batches. Select "Process Sequentially" for slower, one-by-one processing, which enables the "Delay" setting to help avoid rate limits on heavy usage.
- Trigger Words: Adds specific keywords to the beginning of every generated prompt.
- Prompt Enrichment: Adds extra instructions to the AI's main task, allowing you to guide its focus.
- Export as JSON & Format as Single Paragraph: These options are available only for the 'Caption' generation type. They format the output for technical use cases.
- Auto Rename File Pairs: Renames all output files into a sequential format (e.g., 1.png, 1.txt). Can be combined with Kohya_SS export for numbered training data.
- Export as Kohya_SS structure: Creates a ZIP file with the specific folder structure required for Kohya_SS LoRA training.
Options
Usage Instructions:
- Open Google AI Studio to create an API key.
- Paste the key into the "API Key" box.
- Input one or more images.
- Select your desired mode and model.
- Adjust optional settings if needed.
- Click "Start".