Applied AI Researcher
Donghyeon Kim
AI Researcher Portfolio
Applied AI Researcher
Applied AI researcher and AI engineer focused on deployable conversational AI and medical AI systems
I am interested in applied AI problems grounded in real-world data, and in connecting them to models, evaluation, and deliverables that collaborators can actually use. Recently, I have been working across conversational AI, medical AI, and ML systems with a focus on improving reproducibility and practical delivery.
I am a graduate researcher with experience in NLP, medical AI, deep learning, and end-to-end AI project implementation. I am particularly interested in turning experimental AI work into trustworthy systems through stronger evaluation, clear documentation, and a practical MLOps perspective.
A working principle that guides how research becomes implementation.
A working principle that guides how research becomes implementation.
A working principle that guides how research becomes implementation.
This portfolio is structured in a reading- and print-friendly format that first presents a short overview, followed by case study sheets for each project.
The selected work is organized around applied AI problems in which modeling, interpretation, and delivery were all important.
2024.03 - 2026.02
2024.03 - 2025.12
2025.09 - 2025.12
2024.01 - 2024.02
2019.03 - 2025.02
2024.03 - 2026.02
GPA 4.14/4.5. Intelligent Data Analytics Lab. Advisor - OkRan Jeong. Focused on clinical AI, evaluation, and implementation-oriented research delivery.
2019.03 - 2024.02
Gained leadership and community management experience by participating in an official programming club and leading various study groups, including basic ML, advanced ML, financial ML, and GNN.
A study on implementing Keyword Extraction and NER with KoBERT
A study on building buy recommendation models from financial statements and indicators
A study on recommendation systems using knowledge graphs and GNNs
A study on the fundamentals of data science and machine learning
The ability to refine real-world data for reliable modeling through analysis and preprocessing
Data Preparation · Feature Engineering · Data Wrangling · Data Manipulation
The ability to turn domain-specific NLP into concrete tasks through experimentation and implementation
Conversational AI · Medical AI · Knowledge Graph · Logic-based AI
The ability to build repeatable workflows and scale them through deployment and automation
Cloud · Docker · Continuous Training · Feature Store
An operational perspective for tracing model behavior to improve reproducibility and maintenance
Logging · Monitoring · Artifacts · Provenance
2026
This study proposes a hierarchical clinical decision support framework that estimates diagnostic context via partial-label automated ICD coding and reinjects it into irregular ICU time-series forecasting through context-adaptive gating for mechanical ventilation transition prediction. By conditioning temporal interpretation on diagnostic context, the framework substantially improves rare-event detection.
Clinical Decision Support System · Automated ICD Coding · ICU Time-series · Mechanical Ventilation Prediction · Partial-Label Learning · Extreme Multi-Class Classification · TCN · Gating · Rare Event Detection
2025
This study proposes an automatic ICD coding model for nursing surveillance of abdominal surgery patients by integrating EMR-based test data, patient information, and nursing notes. A stacking architecture combining dual KM-BERT, XGBoost, and PCA outperformed both a single KM-BERT model and simpler ensemble variants.
Medical AI · Nursing Surveillance · EMR · Automatic ICD Coding · Deep Learning · KM-BERT · XGBoost · Ensemble · Abdominal Surgery
2025
This study proposes an empathetic dialogue generation model using reinforcement learning with AI-based feedback (RLAIF) to address limited diversity and reliance on human feedback. By leveraging an LLM as a reward evaluator and integrating it into EmpRL, the model generates more diverse empathetic responses.
Empathetic Dialogue · Reinforcement Learning · RLAIF · RLHF Alternative · LLM · Dialogue Generation · NLP · AI Feedback
Certificates & Qualifications
2023.11
Completed practical training in Elastic-based DevOps monitoring and testing.
Certificates & Qualifications
2023.12
Completed HashiCorp-based multi-cloud orchestration and IaC training.
Certificates & Qualifications
2024.02
Participated in industry-linked practical training focused on security and DevOps engineering.
Certificates & Qualifications
2024.02
Completed a micro-degree program for training software specialists.
Case Study 1
Built an automatic ICD coding pipeline for nursing surveillance of abdominal surgery patients using core EMR data.
Graduate Researcher · AI Engineer · Data Scientist
Case Study 2
Served as the DevSecOps lead for an internal employee-only commerce platform, building the delivery, operations, security, and observability foundation behind the service.
DevSecOps Engineer · Infra Engineer · Platform Engineer
Case Study 3
Built an LLM prototype that helps solo developers receive contextualized feedback from different professional roles.
LLM Product Builder · Prototype Engineer
Case Study 4
Contributed to the AI side of a React + Spring blog project, focusing on polarity recognition and style transfer for Korean text.
AI Engineer · NLP Engineer
Case Study 5
Contributed to the machine learning pipeline for a conversational diary app, with a primary focus on emotion recognition and Korean language model integration.
AI Engineer · NLP Engineer
Case Study 1
2025Built an automatic ICD coding pipeline for nursing surveillance of abdominal surgery patients using core EMR data.
Nursing surveillance required diagnosis-related classification, but key clinical signals were fragmented across heterogeneous EMR sources.
I integrated structured EMR features and nursing text, then used a dual KM-BERT, PCA, and XGBoost stacking architecture for ICD prediction.
The final model achieved 0.9245 accuracy and strong rare-class recall without depending on physician-centered post-hoc documents.
Case Study 2
2024Served as the DevSecOps lead for an internal employee-only commerce platform, building the delivery, operations, security, and observability foundation behind the service.
The project needed more than frontend and backend implementation; it required cloud infrastructure and platform foundations that could reliably support search, notifications, admin workflows, and ongoing operations.
I focused on the DevSecOps and infrastructure/platform layer, organizing CI/CD, cloud deployment flow, operational setup, and security-aware service foundations for a React + Spring commerce system.
The project demonstrated how infrastructure and platform engineering directly shape the reliability and readiness of a production-style internal commerce service.
Case Study 3
2024Built an LLM prototype that helps solo developers receive contextualized feedback from different professional roles.
Developers working alone often need UI, performance, or code quality feedback, but they rarely have an easy way to gather role-specific input at the right time.
I framed the service as a role-aware feedback bridge and focused on role constraints, and contextual input design so the generated responses would stay relevant and scoped.
The prototype demonstrated how structured prompting and role-specific constraints could turn LLM output into more useful, contextualized project feedback.
Case Study 4
2023Contributed to the AI side of a React + Spring blog project, focusing on polarity recognition and style transfer for Korean text.
The product needed ML components that could classify emotional polarity and generate stylistic variations of text to support richer blog content workflows.
I focused on the machine learning side by implementing a KoBERT-based polarity recognition model and a KoBART-based style transfer pipeline, along with dataset handling and repeated model experiments.
The project gave the team practical AIs for blog-oriented text processing and strengthened my experience in integrating classification and generation models into a web product context.
Case Study 5
2023Contributed to the machine learning pipeline for a conversational diary app, with a primary focus on emotion recognition and Korean language model integration.
The product needed ML components that could recognize user emotion and support a diary-generation workflow from daily conversation logs.
I focused on the ML side by implementing and refining the emotion recognition and by helping connect chatbot and summarization models into the overall AI workflow.
The project delivered an MVP-level conversational diary experience and gave me hands-on experience in integrating Korean NLP models for emotion-aware user interaction.