Data Engineering · Machine Learning

QUOC (LEO) HO

Building reliable data pipelines and production-ready ML systems.

Data Engineer and Machine Learning Engineer who turns raw data into deployable, high-impact solutions across ingestion, modeling, and delivery.

50%Workflow efficiency gain
40%Model size reduction
95%Accuracy in model selection

Experience

Shipping data and ML that scale

From research to production APIs, I align data pipelines, modeling, and developer experience to deliver measurable impact.

Jan 2025 – Present

Research Assistant

Grand Valley State University

  • Documented and maintained data collection protocols for AI experiments, boosting reproducibility and efficiency by 30%.
  • Built automated data collection and analysis workflows with Python, Pandas, and Scikit-learn, halving processing time and reaching 95% accuracy in model selection.
  • Applied knowledge distillation, quantization, and fine-tuning to shrink model size by 40% and accelerate inference by 50%.

Aug 2025 – Dec 2025

Software Developer

Grand Valley State University

  • Designed and optimized REST/GraphQL routers to cut response latency and improve throughput under production-like load.
  • Built reusable API adapters and middleware for auth, validation, and error handling to speed up integration for new endpoints.
  • Aligned backend contracts and typing with frontend adapters to deliver reliable, typed client-facing features.

Aug 2024 – Dec 2024

Data Analyst Intern

Yiddish Arts and Academics Association of North America

  • Implemented scalable data collection and storage on Firebase for 500+ user-generated records across accounts, media, and messaging.
  • Produced clear technical documentation that streamlined onboarding and reduced knowledge transfer time.
  • Partnered with cross-functional stakeholders to convert requirements into analytical frameworks and data visualizations.

Projects

Selected builds

Experiments and production-minded prototypes that explore data systems, graph analytics, and efficient ML.

arXiv Citation Network Analysis

Transformed citation data with Python (Pandas, NumPy) and built a Neo4j graph for visualization and influence analysis.

Highlighted key academic influencers and trend insights.

Skills

Data & ML toolkit

A stack tuned for building, shipping, and observing data-intensive systems.

Programming & DevOps

PythonSQLGitLinuxAPIsVS CodeJupyter

Data Engineering

Apache AirflowETL/ELTSnowflakeData Modeling

Machine Learning

Scikit-learnPyTorchTensorFlow/Keras

Platforms & Databases

AWSNeo4jFirebase

Education

Grand Valley State University

B.S. in Computer Science · Minor in Mathematics · GPA: 3.82 · Dean’s List · Graduation: December 2025

Let's build

Ready to design and deploy your next data or ML system.

I love hard problems at the intersection of data infrastructure, modeling, and product impact.