Key Work Done
Implemented multiple OCR methods for text recognition.
Automated text removal using EasyOCR + OpenCV inpainting.
Explored deep learning pipelines (DBNet + ZITS).
Trained and tested hybrid models (ResNetViT, ClipViT, ConvNeXt).
Tuned loss functions (BCE) to improve accuracy.
Compared outputs of different models.
Results
Conclusion
Explored multiple approaches for image cleaning.
Final models identified and tested:
ResNet-ViT
CBAM
UNet
ClipViT
These models gave the most promising results for preserving diagrams
while removing text and unwanted elements.