CV
Paul Nguyen (Luong Nguyen)
Summary
Applied AI engineer building safety-critical AI products. Ship generative AI into a mass-market automotive environment. Author of Autodidact (pip install autodidact) — an open-source self-evolving AI agent that learns from its cloud queries; v1.0.x shipping on PyPI, grounded in my zero-shot confidence research (arXiv:2605.02241). PhD in adversarial detection (IEEE TVLSI, IEEE TDSC).
Education
- Ph.D. in Electrical and Computer Engineering — Thesis: Novel machine learning based techniques using side-channel analysis for hardware Trojan detectionGeorgia Institute of Technology
- BSc + MSc in Electrical and Computer Engineering (Korean Government Scholarship Program) — Thesis: Redundancy Reduction in Interpolation Calculation for HEVC Video Compression's Fractional Motion EstimationSeoul National University
Work Experience
- Senior Machine Learning Engineer2022-08 - PresentAmazon — Alexa AutoApplied ML engineer shipping generative AI into automotive Alexa. Work spans the full stack — fine-tuning and evaluation, cloud infrastructure, device-side integration, and cross-team coordination across the domain teams (music, car control, navigation, search) whose features ride on the GenAI stack.
- Led the development of speech-to-speech Edge AI model fine-tuning, evaluation, and profiling infrastructure for on-device automotive voice assistants
- Led cross-functional integration of generative AI into the Alexa Auto experience, contributing across the entire ML engineering stack from model fine-tuning to scalable cloud infrastructure
- Fine-tuned and evaluated LLMs to improve in-car voice assistant capabilities and contextual understanding; owned evaluation frameworks and benchmarks for edge-environment deployment
- Built tooling and infrastructure for seamless deployment of GenAI features across Alexa Auto clients, coordinating with music, car control, navigation, and search domain teams to turn generic LLM capability into automotive-specific experts
- Led multi-team initiatives to design and deliver distributed systems powering Alexa Auto's next-generation AI experiences
- Designed and delivered voice- and text-based search and navigation experiences tailored for automotive environments, including ranking models and autocomplete algorithms for Alexa Auto text search
- Built scalable indexing pipelines using Apache Spark, Amazon EMR, OpenSearch, and S3 to power efficient ingestion, transformation, and retrieval for product features
- Architected experimentation frameworks on AWS Bedrock (including Anthropic's Claude models) to explore LLM-powered improvements in automotive search and navigation
- Root-caused a permanent-offline regression by analyzing multi-million-line device logs, tracing the bug to stale state-machine persistence in the auth subsystem — corrected my own initial hypothesis twice based on evidence before landing the fix
- Diagnosed recurring AI-system degradation that had been misdiagnosed as a network issue by correlating timing patterns across shared event loops, identifying resource contention as the actual root cause
- Created a team-wide decision-trace methodology to capture non-obvious engineering reasoning, making systemic debugging reusable across team members
- R&D Engineer, Senior I2020-07 - 2022-08PrimeTime, SynopsysR&D on signoff products (timing and power analysis) for chip designs, applying machine learning and distributed computing to speed up and improve established EDA flows.
- Developed new flows and features for Synopsys signoff products using machine learning and distributed computing
- Built C/C++ algorithms for timing and power analysis of complex chip designs
- Graduate Research Assistant2016-08 - 2020-05CompArch Lab, Georgia Institute of TechnologyPhD research on detecting hidden malicious behaviors in complex systems through electromagnetic side-channel analysis and machine learning. Published in IEEE TVLSI and IEEE TDSC; the broader research program attracted $5M in NSF funding.
- Developed the first off-chip electromagnetic side-channel technique capable of detecting dormant hardware trojans as small as 0.31% of the original circuit with 100% accuracy and 0% false positives — this program received $5M in NSF funding for further development
- Built a framework that uses electromagnetic side-channel signals to detect malicious software activity on embedded and cyber-physical systems — achieved 100% detection at under 1% false positive rate from distances up to 3 meters
- Developed a novel method for profiling program execution without instrumenting or otherwise affecting the profiled system
- Research Assistant2014-09 - 2016-06Computer Architecture and Parallel Processing Lab, Seoul National UniversityComplexity reduction, hardware design, and FPGA implementation for HEVC fractional motion estimation and real-time computer vision; recommender systems on big data.
- Reduced more than 80% of the complexity of the HEVC fractional motion estimation algorithm; awarded Best Paper at the 2016 SoC Conference of Korea
- Frame-memory reduction and hardware optimization on FPGA for real-time Lucas-Kanade Optical Flow
- Built an e-recommender system for an e-commerce website using machine learning on big data — ranked 50th out of thousands of teams in the 2015 RECSYS challenge
Skills
Languages
- Python
- C++14/17
- Java
- Kotlin
- TypeScript
- Scala
- Chisel (HDL)
Applied AI
- GenAI Product Integration
- LLM Fine-tuning & Evaluation
- AWS Bedrock
- Anthropic Claude
- Prompt Engineering
- Model Distillation
- Ranking / Autocomplete
- PyTorch
- HuggingFace Transformers
- Ollama
- llama.cpp
- FAISS
Systems & Infrastructure
- Apache Spark
- Amazon EMR
- OpenSearch
- Amazon S3
- AWS Services
- Android SDK / NDK
- On-Device Deployment
- Distributed Systems Debugging
- Safety-Critical Deployment
Domains
- AI Safety
- Automotive Voice AI
- Speech / Audio Systems
- Search & Information Retrieval
- Adversarial Detection
- Responsible AI Deployment
Publications
- Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren't Worth Training2026arXiv preprintDemonstrated that average token log-probability matches or beats supervised routing (RouteLLM) in-distribution and significantly outperforms it under distribution shift — tested across 3 model families, 2 datasets, ~4,500 queries
- Creating a backscattering side channel to enable detection of dormant hardware trojans2019IEEE Transactions on Very Large Scale Integration (VLSI) SystemsApplied ML-based electromagnetic side-channel analysis to detect hidden adversarial behaviors in complex hardware systems — directly analogous to detecting misaligned behavior in opaque AI systems
- IDEA: Intrusion detection through electromagnetic-signal analysis for critical embedded and cyber-physical systems2019IEEE Transactions on Dependable and Secure ComputingML-based anomaly detection via side-channel signals in embedded systems — foundational work in detecting hidden behaviors through indirect observation, a core challenge in AI interpretability
- Malware detection in embedded systems using neural network model for electromagnetic side-channel signals2019Journal of Hardware and Systems SecurityNeural network approach to detecting malicious software execution patterns via electromagnetic emanations
- Exploiting switching of transistors in digital electronics for RFID tag design2019IEEE Journal of Radio Frequency IdentificationNovel approach to passive RFID tag design leveraging transistor switching characteristics (Best Poster Award, RFID 2018)
- A Novel Golden-Chip-Free Clustering Technique Using Backscattering Side Channel for Hardware Trojan Detection2020IEEE International Symposium on Hardware Oriented Security and Trust (HOST)Golden-chip-free unsupervised hardware trojan detection using electromagnetic backscattering side channels (Best Paper Award, HOST 2020)
- A Reduction of Interpolation Redundancy for Fractional Motion Estimation in HEVC20162016 SoC Conference of KoreaReduced redundant interpolation computations in HEVC fractional motion estimation (Best Paper Award)
- Advanced Decision of PU Partition and CU Depth for Fractional Motion Estimation in HEVC2016International Conference on Electronics, Information and Communication (ICEIC)Early termination algorithm for PU partition and CU depth decisions in HEVC fractional motion estimation
Honors & Awards
- Best Student Paper Award2020IEEE International Symposium on Hardware Oriented Security and Trust (HOST)
- Second Best Hardware Demo Award2019IEEE International Symposium on Hardware Oriented Security and Trust (HOST)
- TechConnect Innovation Award2018TechConnect World Innovation Conference & Expo
- Second Prize2018Southeastern USA Korean Speech Contest
- Best Paper Award20162016 SoC Conference of Korea
- Korean Government Scholarship ProgramMinistry of Education, Republic of KoreaGovernment scholarship recipient
Portfolio
- Autodidact — Self-Evolving AI Agent2026Open sourceAuthor and maintainer. Open-source local-first AI agent that learns from its cloud queries. v1.0.x shipping on PyPI (`pip install autodidact`). Confidence-based routing (logprob_uncertainty, AUROC 0.65–0.83), hybrid BM25+FAISS retrieval via RRF, document synthesis, 5 setup modes (Ollama / OpenAI-compatible / AWS Bedrock), zero-friction setup wizard, single-file SQLite portability. Routing layer grounded in companion paper (arXiv:2605.02241).