OCR Image Processing with PaddleOCR

This project is a web application for Optical Character Recognition (OCR) using PaddleOCR and Streamlit. It allows users to upload an image and extract text from it in multiple languages with confidence scores.

Features

Support for multiple languages (currently English and Arabic).
Displays uploaded image.
Extracts and displays text with confidence scores.
Provides an option to visualize OCR results on the image.

Technologies Used

Streamlit: An open-source app framework for Machine Learning and Data Science.
PaddleOCR: A rich repository for OCR models.
OpenCV: A library for computer vision tasks.
NumPy: A library for numerical computations.
PIL (Pillow): A library for image processing.

Installation

Clone the repository:

git clone https://github.com/Ansarimajid/OCR.git
cd OCR

Install the required packages:
```
pip install -r requirements.txt
```

Download the required PaddleOCR models:

paddleocr --lang en  # for English
paddleocr --lang ar  # for Arabic (if needed)

Usage

Run the Streamlit application:
```
streamlit run app.py
```
Open your web browser and go to http://localhost:8501.
Select the language from the sidebar.
Upload an image in PNG, JPG, or JPEG format.
View the uploaded image and the extracted text with confidence scores.
Optionally, visualize the OCR results on the image by checking the "Draw OCR Results on Image" checkbox.

File Structure

app.py: Main application file.
requirements.txt: Contains the list of Python packages required for the project.
README.md: Project documentation.
temp_img.jpg: Temporary image file created during OCR processing.

Contributing

Contributions are welcome! Please open an issue or submit a pull request for any improvements or bug fixes.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.devcontainer		.devcontainer
fonts		fonts
img		img
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
temp_img.jpg		temp_img.jpg
test.jpg		test.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Image Processing with PaddleOCR

Features

Technologies Used

Installation

Usage

File Structure

Contributing

License

About

Releases

Packages

Languages

Ansarimajid/OCR

Folders and files

Latest commit

History

Repository files navigation

OCR Image Processing with PaddleOCR

Features

Technologies Used

Installation

Usage

File Structure

Contributing

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages