Cheng-Hsiu (Jash) Lee

📞 917-971-9551  |  Portfolio  |  LinkedIn  |  Brooklyn, NY
Summary

Engineering leader with 20+ years of experience building, scaling, and optimizing global platforms for top tech companies. Expert in architecting resilient, large-scale distributed systems and in leading and mentoring high-performing teams across the US and APAC. Proven track record in people management, cross-functional leadership, and applying LLMs, agentic workflows, and automation to deliver 10x productivity, multimillion-dollar cost savings, and industry-leading reliability. Passionate about empowering organizations to innovate at scale, accelerate AI adoption, and develop future leaders.

Core Competencies
  • Large-Scale Big Data/AI/ML Infrastructure & Platform Engineering
  • Site Reliability Engineering (SRE)
  • Cloud Architecture (AWS, Azure)
  • Generative AI & LLM Integration
  • Global Team Leadership & Mentoring
  • DevOps & CI/CD Automation
  • Big Data/AI/ML Systems & Observability
  • Cost Optimization & Scalability
  • Incident Response & Disaster Recovery
  • Open Source Contributions
Selected Achievements
  • Reduced cloud infrastructure costs by $3M+/year at Microsoft through automation and hybrid cloud strategy.
  • Architected Big Data/AI/ML platforms processing 300TB+ of data daily at global scale.
  • Launched AI-powered apps featured on Amazon and Google Play, driving user engagement and business growth.
  • Built and led global teams (US, Europe, APAC) delivering 24/7/365 reliability and rapid innovation.
  • Implemented LLM/agentic workflows to achieve 10x engineering velocity and accelerate AI adoption.
Education

New York University, USA

Computer Science

Open Source Contribution
GitHub  |  GitBook  |  DockerHub  |  SoundCloud  |  MixCloud
Skills
  • Languages: Bash, Python
  • Backend: Go, Python/Django
  • Frontend: TypeScript, React Native, jQuery, HTML5, CSS3, Bootstrap
  • Load Balancing: LVS, HAProxy, F5, ProxySQL
  • Configuration Management: Ansible, Puppet
  • Virtualization: Xen, KVM, Vagrant, Docker, ESXi
  • Operating Systems: Linux, Windows, Mac OS, CoreOS
  • Monitoring: Munin, Nagios, Zenoss, Zabbix, Graphite, New Relic, Collectd, OpsGenie
  • Web Servers: Apache, JBoss, Wildfly, Nginx
  • High Availability: DRBD, Keepalived, Heartbeat
  • Storage: GlusterFS, Ceph, ZFS, NFS, Samba
  • CI/CD: Jenkins, Concourse, Spinnaker, Azure DevOps
  • Databases: PostgreSQL, MySQL, SQLite, MongoDB, Redis, Elasticsearch, Aerospike, OpenLDAP, ClustrixDB, IRONdb
  • Distributed Systems: Hadoop, Spark, Yarn, Kafka, Zookeeper, Mesos, Kubernetes, Cassandra, ScyllaDB, HBase, Hive, Presto, Zeppelin, QFS, Imply, Druid, AB-Initio, TigerGraph
  • Networking: CDN, LAN/WAN, DNS, NAT, Firewall, TCP/IP, IPSec, VPN, Routing, Switching, SDN, Cisco, Juniper
  • Cloud: AWS, Azure
  • Version Control: Git, SVN, Bitbucket
  • Security: LDAP, Kerberos, SSL, HTTPS
  • Generative AI: LLMs (ChatGPT, Claude, Ollama, Phi3), Stable Diffusion (SDXL, RealisticVision, GFPGAN, InSwapper), txt2img, img2img, txt2video, inpainting, ControlNet, Extract, Train, Convert
Generative AI Experience
YouTube Content Creation
  • Produced high-impact thumbnails using Stable Diffusion and inpainting for enhanced engagement.
  • Scripted and refined video content with LLMs (ChatGPT, Ollama) for audience retention.
  • Automated video segment creation with text-to-video (txt2video) tools.
Coding & Prototyping
  • Accelerated development cycles using LLMs for code generation, debugging, and review.
  • Created rapid prototypes and UI mockups with generative models.
  • Fine-tuned AI models using Extract and Train for project-specific needs.
Image & Video Generation
  • Generated unique visuals and memes with img2img/txt2img for digital campaigns.
  • Applied ControlNet for advanced image manipulation and creative workflows.
Work Experience

Microsoft (AI - Copilot)  |  Member of Technical Staff, Mar 2025 - Present

  • Optimizing data platforms and AI/ML infrastructure to enhance Copilot and Applied/Gen AI experiences.
  • Leveraging LLMs, AI agents, and agentic workflows to drive 10x engineering efficiency.

Scientific Gambling App  |  Founder, Dec 2024 - Present

Microsoft (AI)  |  Principal Software Engineering Manager, Jan 2023 - Mar 2025

  • Directed global teams delivering 24/7 Big Data infrastructure for internal partners.
  • Architected big data systems (300TB+/day), achieving 99.99% uptime and 40% performance gains.
  • Reduced operational overhead by 60% and infrastructure costs by $2M+ via automation.

Microsoft (Advertising)  |  Sr. Manager, Technology, Jun 2022 - Jan 2023

  • Led hybrid cloud migration, reducing costs by 45% and tripling scalability.
  • Managed distributed engineering teams to ensure 24/7 service delivery.

Xandr  |  Sr. Manager, Technical Operations (Big Data), Dec 2021 - Jun 2022

  • Scaled big data infrastructure to 1M+ QPS with 99.95% uptime.
  • Reduced data pipeline latency by 70% and storage costs by $1M+.

Xandr  |  Manager, Technical Operations (Big Data), Apr 2019 - Dec 2021

  • Built and led a high-performing team, achieving 100% project delivery.
  • Implemented monitoring and alerting, reducing MTTR by 80%.
  • Established CI/CD pipelines, cutting deployment time from 4 hours to 15 minutes.

AppNexus  |  Senior Systems Engineer (Big Data), Dec 2015 - Apr 2019

  • Led initiatives for a data platform handling 50B+ daily requests with sub-10ms latency.
  • Designed auto-scaling infrastructure, reducing manual ops by 90%.
  • Mentored junior engineers and improved system reliability to 99.9%.

New York University (NYU)  |  Technical Consultant (Part-Time), Jun 2015 - Dec 2015

  • Researched unikernel solutions and resolved private cloud challenges.
  • Streamlined IT operations for academic environments.

Nielsen  |  Site Reliability Engineer, Apr 2015 - Dec 2015

  • Automated deployments, reducing manual effort by 95%.
  • Integrated LVS load balancing, saving $500K+ annually.
  • Migrated Python services to Go, improving performance by 60%.

eXelate  |  Site Reliability Engineer, Jan 2015 - Apr 2015

  • Diagnosed and resolved infrastructure issues across 7 data centers.
  • Developed automation tools and integrated Twilio API for alerts.

New York University (NYU)  |  Technical Lab Manager, Aug 2014 - Jan 2015

  • Led a cross-functional team to deliver private cloud and high-availability solutions.
  • Managed departmental data center and IT operations.

New York University (NYU)  |  Developer, Feb 2013 - Aug 2014

  • Engineered and maintained virtual lab infrastructure for research and education.
  • Developed automation scripts and managed server/network operations.
  • Collaborated on redesign and training for NSF-funded Vlab platform.

China Engineering & Mercantile Co., Ltd. (CEMCL)  |  Senior MIS Engineer, Sep 2002 - Aug 2012

  • Developed and maintained web-based customer feedback and quotation systems (ASP.NET, Python).
  • Redesigned and implemented company website (PHP5 + jQuery, HTML5, LAMP stack).
  • Administered Linux, Windows, and Mac OS X servers and managed virtualization platforms.
  • Oversaw network infrastructure and ensured high availability and security.
  • Provided comprehensive IT support and led internal automation initiatives.

The Republic of China Army (Taiwan)  |  Second Lieutenant, Aug 2010 - Jul 2011

  • Directed daily operations and training as Deputy Company Commander.
  • Led and managed two platoons, ensuring operational readiness and discipline.
Hobbies
  • Electronic music composer and DJ; passionate about classical music.
  • Zone 2 training practitioner.
  • Cat owner.