How is this different from image captioning when the model used is a booru model? People already do that to build training data for fine-tuning these models.
It actually works on top of an image captioning model. SD also responds to keywords like "artstation" and "octane render", which standard captioning doesn't cover, so that's the difference between using an off-the-shelf captioning model and this.
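A minimal sketch of the idea: start from a plain caption, then rank candidate style keywords by how closely their embedding matches the image embedding and append the best ones. This is an assumption about how such tools work (CLIP-Interrogator-style keyword banks), not this project's actual API; the vectors below are toy stand-ins for real CLIP embeddings.

```python
import math

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def build_prompt(base_caption, image_emb, keyword_embs, top_k=2):
    """Append the top_k style keywords whose embeddings best match the image."""
    ranked = sorted(keyword_embs,
                    key=lambda kw: cosine(image_emb, keyword_embs[kw]),
                    reverse=True)
    return ", ".join([base_caption] + ranked[:top_k])

# Toy embeddings standing in for real CLIP vectors.
image_emb = [0.9, 0.1, 0.3]
keyword_embs = {
    "artstation":    [0.8, 0.2, 0.4],
    "octane render": [0.1, 0.9, 0.0],
    "watercolor":    [0.0, 0.1, 0.9],
}
print(build_prompt("a castle on a hill", image_emb, keyword_embs))
# → a castle on a hill, artstation, watercolor
```

The point is that the caption model supplies the subject ("a castle on a hill") while the keyword ranking supplies the style tags SD was trained to respond to.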