Data Engineering · Machine Learning
QUOC (LEO) HO
Building reliable data pipelines and production-ready ML systems.
Data Engineer and Machine Learning Engineer who turns raw data into deployable, high-impact solutions across ingestion, modeling, and delivery.
50%Workflow efficiency gain
40%Model size reduction
95%Accuracy in model selection
- Documented and maintained data collection protocols for AI experiments, boosting reproducibility and efficiency by 30%.
- Built automated data collection and analysis workflows with Python, Pandas, and Scikit-learn, halving processing time and reaching 95% accuracy in model selection.
- Applied knowledge distillation, quantization, and fine-tuning to shrink model size by 40% and accelerate inference by 50%.
- Designed and optimized REST/GraphQL routers to cut response latency and improve throughput under production-like load.
- Built reusable API adapters and middleware for auth, validation, and error handling to speed up integration for new endpoints.
- Aligned backend contracts and typing with frontend adapters to deliver reliable, typed client-facing features.
- Implemented scalable data collection and storage on Firebase for 500+ user-generated records across accounts, media, and messaging.
- Produced clear technical documentation that streamlined onboarding and reduced knowledge transfer time.
- Partnered with cross-functional stakeholders to convert requirements into analytical frameworks and data visualizations.
Highlighted key academic influencers and trend insights.
Reduced inference latency by 40% while keeping accuracy deployment-ready.
Programming & DevOps
PythonSQLGitLinuxAPIsVS CodeJupyter
Data Engineering
Apache AirflowETL/ELTSnowflakeData Modeling
Machine Learning
Scikit-learnPyTorchTensorFlow/Keras
Platforms & Databases
AWSNeo4jFirebase
Let's build
Ready to design and deploy your next data or ML system.
I love hard problems at the intersection of data infrastructure, modeling, and product impact.