Profile

Analytical and results-driven data science undergraduate with a finance minor, dedicated to turning complex economic and operational datasets into measurable business value. Experienced in end-to-end machine learning, spatial analytics, and predictive modeling. Built data-driven forecasting and vi sualization tool s that reduced stock-outs by 20 %, halved decision time, and cut material costs by 15 %. Proficient in Python, SQL, and Tableau. Fluent in English and Indonesian.

Work Experience

Data Scientist Intern β€” IDX Exchange (Real Estate Data Science Platform)

Sep 2025 – Dec 2025 Β· Remote, USA
  • Built and evaluated machine learning models to predict U.S. residential property prices using large-scale listing data from Cotality’s Trestle API.
  • Owned an end-to-end pipeline (EDA β†’ Feature Engineering β†’ Model Tuning β†’ Packaging).
  • Optimized Ridge/Random Forest/XGBoost; reached strong fit and stable error (MDAPE focus), achieving RΒ² β‰ˆ 0.89 and MDAPE β‰ˆ 6% on held-out data.
  • Produced reproducible Jupyter workflows and model diagnostics to identify major drivers of housing valuation and support iterative model improvement.

Data Scientist Intern - GAF

Jan–Sep 2024 Β· Jakarta
  • Developed ARIMA and decision-tree forecasting models that improved demand forecast accuracy by 15% and reduced inventory stock-outs by 20%.
  • Built Tableau KPI dashboards used to monitor supply-chain performance and support faster operational decision-making.
  • Analyzed sales and inventory patterns to improve planning accuracy and reduce avoidable supply disruptions.

Accounting & Data Analyst Intern - STAL Cooperation

2021–2023 Β· Jakarta
  • Built ARIMA-based forecasting models that reduced material procurement costs by 15%, improved planning accuracy by 10%.
  • Digitized expense and procurement tracking workflows, improving financial visibility and supporting more accurate budgeting.
  • Combined historical sales and external market indicators to strengthen demand planning and purchasing decisions.

Projects

Home Price Predictions β€” Real Estate ML Pipeline (IDX Exchange)

GitHub
  • Built a full ML pipeline to predict U.S. home prices using 80k+ housing records (CRMLS & Trestle API).
  • Cleaned, encoded, and engineered features (log transforms, interaction terms) for model stability.
  • Tuned Ridge & Random Forest models achieving RΒ² = 0.89 and MDAPE β‰ˆ 6 % with diagnostics used to explain the strongest pricing drivers
  • Visualized key drivers using feature importances and residual diagnostics.

TrueFork β€” Recipe Ratings & Nutrition ML

GitHub Β· Live
  • Analyzed 1M + Food.com ratings to test if unhealthy recipes receive higher scores.
  • Engineered a GridSearch-tuned scikit-learn pipeline (ColumnTransformer β†’ RandomForest) with SHAP explainability.
  • Published an interactive HTML report visualizing rating bias patterns and model insights.

Loan Charge Offs β€” Credit Risk Modeling

GitHub
  • Trained Random Forest and Deep-Learning (TensorFlow) models for charge-off prediction.
  • Achieved 94.9 % accuracy (↑ 6 pp over logistic baseline).
  • Delivered interactive notebooks for threshold tuning and misclassification review.

Urban Transit & Crime β€” Geospatial Analytics

GitHub
  • Used GeoPandas + PySAL to analyze 1M + NYC/Chicago records for station-adjacent crime risk.
  • Applied GWR and clustering to map crime hotspots and socioeconomic disparities.
  • Produced interactive maps supporting equitable city-planning insights.

Education

UC San Diego - B.S. Data Science, Minor in Finance

Expected 2026

UC Berkeley Extension - Certificate in Data Analytics

Nov 2023

πŸ“„ View Credential (PDF)