Enhancing text-specific inpainting using diffusion models and OCR

Chavan, Rutuja; Hirve, Sahil; Shinde, Swati; Virdee, Bal Singh; Khanna, Ashish

London Met Repository

Tools

Lists

Chavan, Rutuja, Hirve, Sahil, Shinde, Swati, Virdee, Bal Singh and Khanna, Ashish (2025) Enhancing text-specific inpainting using diffusion models and OCR. In: 2025 9th International Conference on Computing, Communication, Control and Automation (ICCCBEA), 22-23 August 2025, Pune, India.

Abstract
Documents
Details
Record

[+][-]

Abstract

Text-specific inpainting-masking and replacing particular words and phrases in the different images. It has significant potential for applications such as document redaction, privacy protection, and automated image editing. While previous research has struggled with this task. This paper presents an approach combining Optical Character Recognition (OCR) and diffusion models to address these challenges. We use pytesseract for text detection and Stable Diffusion for inpainting, after using that we aim to accurately replace specific words in images. Our experiments show promising results in simpler cases but reveal limitations when handling intricate backgrounds and fonts. Based on these findings, we suggest improvements to enhance the robustness of the method, specifically in handling complex image environments.

Documents

11157:55946

[+][-]

11157:55946

[thumbnail of DiffusionModel - accepted.pdf]

Preview

DiffusionModel - accepted.pdf - Accepted Version
Available under License Creative Commons Attribution 4.0.

Download (522kB) | Preview

Details

Title:

Enhancing text-specific inpainting using diffusion models and OCR

Creators:

Chavan, Rutuja, Hirve, Sahil, Shinde, Swati, Virdee, Bal Singh and Khanna, Ashish

Official URL:

https://doi.org/10.1109/ICCUBEA65967.2025.11284109

Date:

15 December 2025

Subjects:

000 Computer science, information & general works
600 Technology

Department:

School of Computing and Digital Media

Uncontrolled Keywords:

Text-specific inpainting, OCR (Optical Character Recognition), Pytesseract, Stable Diffusion, Document redaction, Privacy protection, Automated image editing

Additional Information:

No embargo on AAM - see JISC-IEEE agreement for 2025

Record

URI:

https://repository.londonmet.ac.uk/id/eprint/11157

Item Type:

Conference or Workshop Item

Presentation Type:

Paper

Depositing User:

Balbir Virdee

Date Deposited:

05 Jan 2026 10:20

Revision:

Last Modified:

08 Apr 2026 13:24

View Item

CORE (COnnecting REpositories)

London Met Repository London Met Repository London Met Repository

Enhancing text-specific inpainting using diffusion models and OCR

London Met Repository