
Developed an Egyptian Cash Reader for banknote recognition: manually collected and annotated the dataset, and integrated object detection with OCR and QR code reading.
Implemented scene detection, applied Llama 2 for title generation, created a caption dataset to fine-tune the GIT model, and integrated the models into a Streamlit demo.
Analyzed PaLM, GPT-2, and FLAN-T5 for document context retrieval and accuracy evaluation, provided an analysis report, and integrated the models into a Gradio demo.
Developed an AI-driven system to enhance training environments by implementing stroke type detection, court and player detection, position estimation, pose estimation, and ball tracking.
Fine-tuned YOLOv8 and Phi-3 models for Arabic text detection and extraction, and integrated them with Gradio for deployment.