Honey Kotecha

Data Scientist & Analytics Enthusiast

Advancing Knowledge in Data Science & Data Analytics || Evolving through learning & problem solving || Gcet Official Team - Documentation Co-ordinator

3D Data Science Illustration

About Me

Hey there ! I am a computer science student with a strong foundation in data science, data analytics, and AI/ML concepts. Equipped with fundamental programming knowledge across multiple languages, including C, Python, and Java, complemented by proficiency in IoT technologies. Eager to expand this diverse technical foundation in a role that offers growth opportunities.

View My Resume

Skills & Technologies

Programming Languages

Python Python
SQL Java
SQL C
R R
SQL HTML
SQL Javascript

Libraries / Frameworks

Css Css
NumPy NumPy
Pandas Pandas
Matplotlib Matplotlib
Scikit-learn Scikit-learn
Django Django
Tableau Tablueu

IoT

Matplotlib Arduino Programing
Seaborn Arduino IDE

Tools

Pandas SQL
NumPy Power BI
Apache Spark Excel
Apache Spark GitHub

Skills

Apache Spark Public Speaking
Apache Spark Problem Solving
Apache Spark Technical Documentation

Experience

Data Science Intern
@ Code Clause

Jan 2025 - July 2025
  • Gained experience in working with hands-on projects like heart disease risk assessment, EDA on IRIS.
  • Advanced knowledge in Machine Learning through structured learning approaches.

AI Intern
@ AICTE Internship on AI Transformative Learning with TechSaksham

Jan 2025 - Feb 2025
  • Acquired extensive hands-on experience with cutting edge AI technologies by developing AI Health Assistant.
  • Strengthened foundational knowledge in Machine Learning and Natural Language Processing.

Featured Projects

Prediction of Employee Attrition

Analyzed 1,470+ employee records using Python and built data preprocessing pipeline across 18 features with zero missing values.

Addressed class imbalance (15% attrition) using over sampling techniques for improved model performance.

Numpy Pandas Scikit-Learn Seaborn Matplotlib

Heart Disease Risk Assessment

Developed a deep learning model achieving accuracy on 300+ patient clinical records during the training/internship.

Implemented complete workflow handling of 13 clinical features (cholesterol, BP, ECG results).

Tensorflow Python Pandas Numpy Scikit-learn Seaborn Matplotlib

Spotify User Behavior Segmentation

Preprocessed Spotify user data using pandas/numpy and segmented users into Music Enthusiasts, Podcast Lovers, and Casual Listeners through K-means clustering methods.

Identified distinctive user patterns based on listening habits, subscription value, and platform engagement metrics and developed an interactive PowerBI dash board that visualises key metrics.

PowerBI Scikit-learn Numpy Pandas Seaborn Matplotlib

Volunteering

Documentation Co-Ordinator - Subcore
@ ISA - Student Branch

Oct 2023 - Present

Documented various event reports, as well as collaborated with cross-functional teams which helped me improving my verbal communicationa and written documentation.

Documentation Co-Ordinator
(Gcet Social Media Team)

Jan 2024 - Present

Contributed to Gcet Social Media Team's Documentation part