Profile
Analytical and results-driven data science undergraduate with a finance minor,
dedicated to turning complex economic and operational datasets into measurable business value.
Experienced in end-to-end machine learning, spatial analytics, and predictive modeling.
Built data-driven forecasting and vi sualization tool s that reduced stock-outs by 20 %,
halved decision time, and cut material costs by 15 %.
Proficient in Python, SQL, and Tableau. Fluent in English and Indonesian.
Work Experience
Data Scientist Intern β IDX Exchange (Real Estate Data Science Platform)
Sep 2025 β Dec 2025 Β· Remote, USA
- Built and evaluated machine learning models to predict U.S. residential property prices using large-scale listing data from Cotalityβs Trestle API.
- Owned an end-to-end pipeline (EDA β Feature Engineering β Model Tuning β Packaging).
- Optimized Ridge/Random Forest/XGBoost; reached strong fit and stable error (MDAPE focus), achieving RΒ² β 0.89 and MDAPE β 6% on held-out data.
- Produced reproducible Jupyter workflows and model diagnostics to identify major drivers of housing valuation and support iterative model improvement.
GitHub: Home Price Predictions
Data Scientist Intern - GAF
JanβSep 2024 Β· Jakarta
- Developed ARIMA and decision-tree forecasting models that improved demand forecast accuracy by 15% and reduced inventory stock-outs by 20%.
- Built Tableau KPI dashboards used to monitor supply-chain performance and support faster operational decision-making.
- Analyzed sales and inventory patterns to improve planning accuracy and reduce avoidable supply disruptions.
Accounting & Data Analyst Intern - STAL Cooperation
2021β2023 Β· Jakarta
- Built ARIMA-based forecasting models that reduced material procurement costs by 15%, improved planning accuracy by 10%.
- Digitized expense and procurement tracking workflows, improving financial visibility and supporting more accurate budgeting.
- Combined historical sales and external market indicators to strengthen demand planning and purchasing decisions.
Projects
Home Price Predictions β Real Estate ML Pipeline (IDX Exchange)
GitHub
- Built a full ML pipeline to predict U.S. home prices using 80k+ housing records (CRMLS & Trestle API).
- Cleaned, encoded, and engineered features (log transforms, interaction terms) for model stability.
- Tuned Ridge & Random Forest models achieving RΒ² = 0.89 and MDAPE β 6 % with diagnostics used to explain the strongest pricing drivers
- Visualized key drivers using feature importances and residual diagnostics.
TrueFork β Recipe Ratings & Nutrition ML
GitHub Β·
Live
- Analyzed 1M + Food.com ratings to test if unhealthy recipes receive higher scores.
- Engineered a GridSearch-tuned scikit-learn pipeline (ColumnTransformer β RandomForest) with SHAP explainability.
- Published an interactive HTML report visualizing rating bias patterns and model insights.
Loan Charge Offs β Credit Risk Modeling
GitHub
- Trained Random Forest and Deep-Learning (TensorFlow) models for charge-off prediction.
- Achieved 94.9 % accuracy (β 6 pp over logistic baseline).
- Delivered interactive notebooks for threshold tuning and misclassification review.
Urban Transit & Crime β Geospatial Analytics
GitHub
- Used GeoPandas + PySAL to analyze 1M + NYC/Chicago records for station-adjacent crime risk.
- Applied GWR and clustering to map crime hotspots and socioeconomic disparities.
- Produced interactive maps supporting equitable city-planning insights.
Education
UC San Diego - B.S. Data Science, Minor in Finance
Expected 2026