Nahian Arnob

Nahian Arnob

Master of Science in Data Science

Regis University

Expected Graduation: Spring 2026

About Me

I’m a motivated and down-to-earth person who believes in working hard and staying positive. I enjoy learning new things, helping people, and pushing myself to grow both personally and professionally.

Technical Skills

Programming Languages

Python, Java, JavaScript, C, C++, R, MATLAB, SQL

Tools & Frameworks

VSCode, Collab, Jupyter Notebook

Machine Learning

TensorFlow, PyTorch, Scikit-learn, Keras, XGBoost, NumPy, Pandas, Matplotlib, Seaborn, Plotly, NLTK, FastAPI, Flask

Web Development

HTML, CSS, JavaScript, REST APIs, MongoDB, MySQL, Git, GitHub

Practicum Projects

MSDS 692

Credit Card Fraud Detection System using Machine Learning Technique

Machine Learning Data Science Python

This project analyses credit card transaction data to identify and understand fraudulent behaviour using exploratory data analysis and machine learning techniques. The analysis explores transaction patterns across fraud and non-fraud cases, including class imbalance, transaction amounts, and time-based trends. Multiple visualizations are used to highlight differences in spending behavior, and baseline classification models are developed to evaluate fraud detection performance using precision-focused metrics. The project serves as a prototype fraud detection system and demonstrates practical approaches used in real-world financial risk analytics.

Regis University | MSDS 692 | Spring 2025
MSDS 696

Transit Demand Forecasting using Machine Learning for NYC Subway Network

Machine Learning Web Application Streamlit FastAPI Web Development Data Analysis

Millions of people use urban subway transportation systems every day, and they are an important part of modern cities. Subway Transit authorities can make better use of their resources, change the frequency of service, cut down on passenger wait times, and make the whole system more efficient if they can accurately predict how many people will use a stop or route. This paper presents the design, implementation, and deployment of an end-to-end data-driven transit demand forecasting system for the New York City subway network. The system integrates 3 real-world public datasets: MTA Subway Hourly Ridership data (51 million rows, 2022–2024), MTA GTFS Static Network feeds, and NYC Permitted Events data for 2026. The dataset was filtered to focus on subway ridership for the 2024 calendar year, resulting in a clean dataset of 10 million records and a final processed feature set of 1,396,290 station-hour observations across 16 engineered features. 5 machine learning and deep learning models were trained and evaluated: XGBoost, Random Forest, Long Short-Term Memory (LSTM), Prophet, and Temporal Fusion Transformer (TFT). Model performance was assessed using Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE). Amongst all, XGBoost achieved the best overall performance with an MAE of 349.04 riders per station per hour and an RMSE of 923.64, and was selected as the deployment model. The trained model was deployed through a two-tier architecture consisting of a FastAPI RESTful backend serving real-time predictions and a professional Streamlit web application featuring an interactive NYC subway map, forecast controls, confidence interval visualizations, and automated decision support alerts for transit planners. The complete system was implemented and deployed locally within a single development session, demonstrating the practical feasibility of machine learning-powered transit forecasting as a tool for smarter urban transit planning and operational decision-making.

Regis University | MSDS 696 | Spring 2025

Get In Touch

Let's Connect

Feel free to reach out for collaboration opportunities, questions about my projects, or just to connect!

Send a Message

I'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision.

Email Me Download CV