Posts by Collection

portfolio

publications

Paper Title Number 4

Published in GitHub Journal of Bugs, 2024

This paper is about fixing template issue #693.

Recommended citation: Your Name, You. (2024). "Paper Title Number 4." GitHub Journal of Bugs. 1(3).
Download Paper

talks

teaching

MNIST classifier

Machine Learning, Automated Classification, 2022

This project implements a simple neural network for classifying MNIST digits using only one dense layer.
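As a rough illustration of what a single dense layer amounts to, the sketch below implements a 784-to-10 softmax layer and one gradient-descent step in NumPy. The framework, initialization, learning rate, and loss are assumptions made for this example and are not taken from the project itself; random arrays stand in for MNIST images so the snippet runs without downloading data.

```python
import numpy as np


def init_params(n_inputs=784, n_classes=10, seed=0):
    """Create small random weights and zero biases for one dense layer."""
    rng = np.random.default_rng(seed)
    W = rng.normal(0.0, 0.01, size=(n_inputs, n_classes))
    b = np.zeros(n_classes)
    return W, b


def softmax(logits):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def forward(X, W, b):
    """Dense layer followed by softmax: class probabilities per image."""
    return softmax(X @ W + b)


def train_step(X, y, W, b, lr=0.01):
    """One gradient-descent step on mean cross-entropy.

    Updates W and b in place and returns the loss measured before the update.
    """
    n = X.shape[0]
    probs = forward(X, W, b)
    loss = -np.log(probs[np.arange(n), y] + 1e-12).mean()
    # Gradient of the mean cross-entropy w.r.t. the logits: (probs - one_hot) / n
    grad_logits = probs.copy()
    grad_logits[np.arange(n), y] -= 1.0
    grad_logits /= n
    W -= lr * (X.T @ grad_logits)
    b -= lr * grad_logits.sum(axis=0)
    return loss


if __name__ == "__main__":
    # Placeholder data: 64 flattened 28x28 "images" with random labels.
    rng = np.random.default_rng(1)
    X = rng.random((64, 784))
    y = rng.integers(0, 10, size=64)
    W, b = init_params()
    for step in range(5):
        print(step, train_step(X, y, W, b))
```

With real MNIST data, `X` would hold the flattened pixel values scaled to [0, 1] and `y` the digit labels.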

Digital image processing, Dr. Gonzalez

Image Processing, Course, 2023

This project involves converting MATLAB code and algorithms from Dr. Rafael C. Gonzalez’s renowned book “Digital Image Processing” into Python.

Digital image processing, Dr. Hasanpour

Image Processing, Course, 2023

This project features the conversion of MATLAB code and algorithms from Dr. Hasanpour’s “Digital Image Processing” book into Python.

Chatbot

Natural Language Processing, NLP, 2023

This project involves developing a chatbot that delivers specific answers to a predefined set of questions.

Natural language processing

Natural Language Processing, Course, 2023

This project covers the foundational concepts and essential techniques in natural language processing (NLP) using Python and Linux.

Soft exudates segmentation

Computer Vision, Segmentation, 2024

This project utilizes a dataset from Kaggle containing original fundus images and their corresponding ground truth annotations.

Detecting a car and its model

Computer Vision, Object Detection, 2024

This project involves training multiple models on Stanford’s car dataset to classify various car types.

Geospatial vehicle detection

Computer Vision, Object Detection, 2024

This project involves fine-tuning a pretrained RetinaNet model on the NWPU-VHR-10 dataset, which contains high-resolution aerial images for object detection tasks.

Virtual environment segmentation

Computer Vision, Semantic Segmentation, 2024

This project, based on Stanford’s computer vision course, utilizes the KITTI dataset, a collection of images and LIDAR data for autonomous driving research.

Subtitle translator

Natural Language Processing, NLP, 2024

This code automatically translates English subtitles into a specified target language.

Instance Segmentation_balloons

Computer Vision, Instance Segmentation, 2024

This project involves training a Mask R-CNN model on a custom dataset using Facebook Research’s Detectron2 framework.

Eye tracker

Computer Vision, Object Tracking, 2024

This project uses CVZONE to detect a face and draw a border around it, then applies a mesh detector limited to a single face (maximum faces = 1).

Hand Gesture-based Volume Control

Computer Vision, Object Tracking, 2024

This project uses CVZONE to detect a hand and recognize finger gestures for volume control. The system tracks the thumb and index finger: bringing the two fingertips together lowers the volume, and spreading them apart raises it. The application demonstrates real-time hand gesture recognition as a practical control mechanism. The code is available for exploration in this GitHub repository.
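The mapping from finger spacing to volume can be sketched in a few lines. This is a minimal illustration, not the repository's code: it assumes the hand tracker already supplies pixel coordinates for the thumb tip and index fingertip, and the names `pinch_distance` and `distance_to_volume` as well as the 30 and 200 pixel calibration bounds are hypothetical.

```python
import math


def pinch_distance(thumb_tip, index_tip):
    """Euclidean distance in pixels between two (x, y) fingertip positions."""
    return math.hypot(index_tip[0] - thumb_tip[0], index_tip[1] - thumb_tip[1])


def distance_to_volume(distance, min_dist=30.0, max_dist=200.0):
    """Linearly map a fingertip distance to a volume percentage in [0, 100].

    min_dist and max_dist are assumed calibration values: distances at or
    below min_dist give 0 %, distances at or above max_dist give 100 %.
    """
    if max_dist <= min_dist:
        raise ValueError("max_dist must be greater than min_dist")
    clamped = max(min_dist, min(max_dist, distance))
    return 100.0 * (clamped - min_dist) / (max_dist - min_dist)


if __name__ == "__main__":
    # Hypothetical fingertip positions, as a hand tracker might report them.
    thumb, index = (320, 300), (400, 240)
    d = pinch_distance(thumb, index)
    print(f"distance = {d:.1f} px, volume = {distance_to_volume(d):.0f} %")
```

Clamping keeps the volume stable when the fingers move outside the calibrated range, which in practice depends on how far the hand is from the camera.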

Hand Gesture-based Mouse Control

Computer Vision, Object Tracking, 2024

This project uses CVZONE to control the mouse through hand gestures. The system tracks the tip of the index finger to move the mouse pointer on the screen, and touching the thumb to the tip of the index finger triggers a mouse click. The project shows how hand gesture recognition can be integrated with interactive computing. The code is available for exploration in this GitHub repository.
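The two steps described here, moving the pointer and detecting a click, might look roughly like the sketch below. It is an illustration under stated assumptions rather than the repository's implementation: fingertip positions are taken as given (x, y) pixel coordinates in the camera frame, and the function names, the 100 pixel margin, and the 40 pixel click threshold are all hypothetical.

```python
import math


def to_screen(finger_xy, frame_size, screen_size, margin=100):
    """Map an index fingertip position in the camera frame to a screen pixel.

    Only the inner rectangle of the frame (inset by `margin` pixels on every
    side) is used, so the pointer can reach the screen edges without the hand
    leaving the camera's view. Positions outside it are clamped to the edge.
    """
    fx, fy = finger_xy
    frame_w, frame_h = frame_size
    screen_w, screen_h = screen_size
    nx = (fx - margin) / (frame_w - 2 * margin)
    ny = (fy - margin) / (frame_h - 2 * margin)
    nx = min(1.0, max(0.0, nx))
    ny = min(1.0, max(0.0, ny))
    return round(nx * (screen_w - 1)), round(ny * (screen_h - 1))


def is_click(thumb_tip, index_tip, threshold=40.0):
    """Treat the gesture as a click when the two fingertips nearly touch."""
    return math.dist(thumb_tip, index_tip) < threshold


if __name__ == "__main__":
    # Hypothetical values: a 640x480 camera frame and a 1920x1080 screen.
    index_tip, thumb_tip = (400, 260), (410, 275)
    print(to_screen(index_tip, (640, 480), (1920, 1080)))
    print(is_click(thumb_tip, index_tip))
```

A real application would pass the resulting coordinates to a mouse-automation library and would usually smooth the pointer position across frames to reduce jitter.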