Computer Vision & Deep Learning Enthusiast
Python Developer focused on backend systems and computer vision, with a solid foundation in AI and deep learning theory. Experienced in building automation pipelines and real-world applications using FastAPI, OpenCV, and MediaPipe. Currently developing projects with PyTorch and TensorFlow to translate theoretical ML concepts into practical, scalable solutions.
Technical Skills
- Languages & Scripting: Python, Bash
- AI/ML & Computer Vision: FastAPI, scikit-learn, OpenCV, MediaPipe, PyTorch, TensorFlow
- Data Handling & Visualization: pandas, NumPy, Matplotlib, Seaborn
- Web Development: NextJS, ReactJS, TailwindCSS, HTML, CSS
Work Experience
OpenAstronomy — Contributor @ SunPy Project (Jan 2025 – Feb 2025)
- Resolved a critical bug in SunPy’s
SRSClient
by replacing outdated NOAA FTP endpoints with HTTPS, restoring ~100% CI/CD pass rate across 30+ builds per week.
- Coordinated directly with NOAA API maintainers to verify updated links, reducing future API breakage risk by ~80%.
- Improved data reliability for 1,000+ monthly users and downstream scientific tools by merging fixes into SunPy’s core codebase.
- Fixed UI layout issues on OpenAstronomy.org, enhancing footer responsiveness, theme consistency, and visual flow across ~10 pages.
Education
-
B.Tech, Computer Science |
Rishihood University (August 2028) |
-
High School Diploma, CS |
National English School (March 2024) |
Projects
whispercast (March 2025 - April 2025)
GitHub | Website
- Built an AI-powered CLI tool for generating podcasts and audiobooks using Python, reducing manual content prep time by ~80%.
- Automated script creation via OpenAI LLMs and text-to-speech conversion using Coqui-TTS, enabling end-to-end audio generation
in under 15 seconds.
- Integrated Wikipedia, Google News, and Reddit APIs into the AI podcasting tool, delivering real-time, topic-specific content
updates for each audio segment and improving content freshness by 99%.
- Designed an interactive learning mode that let users query PDFs and Docs, increasing engagement by approx. 2x in test runs.
- Reduced file clutter by auto-managing playback files and temporary content, keeping storage usage under ~50MB per session.
melody.cli (May 2024 - Jun 2024)
GitHub | Website
- Created a terminal-based music player with ~0.5s average latency from command to playback.
- Integrated a smart caching system that reused previous downloads, cutting network usage by up to 60% across sessions.
- Designed for devs and low-resource systems, keeping RAM usage below 100MB even during continuous playback.
- Tested with 50+ tracks across genres to validate consistency and fault tolerance of autoplay queue logic.
GitHub
- Built a gesture-based controller using OpenCV + MediaPipe that replaced traditional inputs with hand gestures, achieving ~90%
gesture recognition accuracy.
- Enabled full mouse control, volume/brightness adjustment, and air typing with minimal latency (~1s response on average hardware).
- Tuned system to maintain stable tracking at 30 FPS, even in medium-light conditions, improving UX consistency.
- Reduced user reliance on peripherals by offering hands-free operation for basic OS tasks — tested successfully over 20+ live
sessions.