Donghyeon Kim

This portfolio covers work ranging from data analysis to conversational AI and medical AI.

Portfolio Document

Applied AI Researcher

Donghyeon Kim

AI Researcher Portfolio

Portfolio / 2026
I build AI that is accurate, understandable, and reusable.
Location Incheon, South Korea
Email eastlighting1@gachon.ac.kr
GitHub github.com/eastlighting1
LinkedIn www.linkedin.com/in/동현-김-350b4b29b
Philosophy

Way of Working

I build AI that is accurate, understandable, and reusable.

I treat evaluation as part of the deliverable, not an appendix.

I connect experimentation and delivery beyond individual ownership.

Core Domain

Data Science

The ability to refine real-world data for reliable modeling through analysis and preprocessing

Data Preparation · Feature Engineering · Data Wrangling

Applied AI

The ability to turn domain-specific NLP into concrete tasks through experimentation and implementation

Conversational AI · Medical AI · Knowledge Graph

MLOps

The ability to build repeatable workflows and scale them through deployment and automation

Cloud · Docker · Continuous Training

Observability in AI

An operational perspective for tracing model behavior to improve reproducibility and maintenance

Logging · Monitoring · Artifacts

Experience

Recent Experience

Intelligent Data Analytics Lab., Gachon University

2024.03 - 2026.02

Graduate Researcher

  • Led graduate research on EMR-based nursing surveillance decision support and diagnostic classification.
  • Built end-to-end modeling pipelines using KM-BERT ensembles, XGBoost, and both structured and text data.

National Research Foundation of Korea (NRF)

2024.03 - 2025.12

Research Project Participant

  • Contributed to an NRF-funded clinical AI project centered on nursing surveillance decision support using EMR data.
  • Implemented workflows for clinical text understanding, including keyword extraction, dependency parsing-based preprocessing, topic modeling, and similarity analysis.

Institute of Information & Communications Technology Planning & Evaluation (IITP)

2025.09 - 2025.12

Research Project Participant

  • Implemented evaluation-related code in a human-centered multimodal AI project.
  • Bridged evaluation requirements with actual code and reviewable deliverables.
Selected Projects

Featured Projects

EMR-Based Nursing Surveillance for Automatic ICD Coding

2025

Clinical AI Research | Graduate Researcher, AI Engineer, Data Scientist

Nursing surveillance required diagnosis-related classification, but key clinical signals were fragmented across heterogeneous EMR sources.

Medical AI · Nursing Surveillance · EMR · Automatic ICD Coding

FRIMO: Conversational AI for Emotional Support and Diary Generation

2023

Conversational AI Project | AI Engineer, NLP Engineer

The product needed ML components that could recognize user emotion and support a diary-generation workflow from daily conversation logs.

Conversational AI · Emotion Recognition · KoBERT · KoGPT2

Education and Study

Education

Gachon University

M.S. in Artificial Intelligence

2024.03 - 2026.02

Gachon University

B.S. in Software

2019.03 - 2024.02

Study

NEKA

NLP Study

A study on implementing Keyword Extraction and NER with KoBERT

SUNAROUM

Financial AI Study

A study on building buy recommendation models from financial statements and indicators

I-DA

Recommender Systems Study

A study on recommendation systems using knowledge graphs and GNNs

MLADS

Basic Machine Learning Study

A study on the fundamentals of data science and machine learning

Research

A Context-Adaptive Gated Embedding Framework for Advanced Clinical Decision-Making

2026

Mathematics (submitted)

This study proposes a hierarchical clinical decision support framework that estimates diagnostic context via partial-label automated ICD coding and reinjects it into irregular ICU time-series forecasting through context-adaptive gating for mechanical ventilation transition prediction. By conditioning temporal interpretation on diagnostic context, the framework substantially improves rare-event detection.

Clinical Decision Support System · Automated ICD Coding · ICU Time-series

Deep Learning based Automatic ICD Coding for Nursing Surveillance of Abdominal Surgery Patients

2025

Journal of The Korea Society of Computer and Information

This study proposes an automatic ICD coding model for nursing surveillance of abdominal surgery patients by integrating EMR-based test data, patient information, and nursing notes. A stacking architecture combining dual KM-BERT, XGBoost, and PCA outperformed both a single KM-BERT model and simpler ensemble variants.

Medical AI · Nursing Surveillance · EMR

Empathetic Dialogue Generation Model Using Reinforcement Learning with AI-Based Feedback

2025

Korea Computer Congress (KCC)

This study proposes an empathetic dialogue generation model using reinforcement learning with AI-based feedback (RLAIF) to address limited diversity and reliance on human feedback. By leveraging an LLM as a reward evaluator and integrating it into EmpRL, the model generates more diverse empathetic responses.

Empathetic Dialogue · Reinforcement Learning · RLAIF
Certificate & Award

Completion and Awards

Certificates & Qualifications

Practical Implementation of Monitoring and Testing in DevOps Environments

2023.11

LLOYDK

Completed practical training in Elastic-based DevOps monitoring and testing.

  • Studied DevOps-related services and operational environments provided by Elastic.
  • Learned DevOps concepts applicable to practical work from monitoring and testing perspectives.

Multi Cloud Orchestration Program

2023.12

5Works

Completed HashiCorp-based multi-cloud orchestration and IaC training.

  • Studied mainly HashiCorp IaC tools such as Packer and Terraform.
  • Learned the concepts of infrastructure automation and orchestration in multi-cloud environments.

Company-Led Intensive Project Training

2024.02

DK Techin

Participated in industry-linked practical training focused on security and DevOps engineering.

  • Conducted practical projects while working on cloud, CI/CD, and security-related tasks.
  • Served as an advisor and build assistant, as one of the more experienced members, during recommender system development.

Micro Degree in Software Specialist Training

2024.02

Gachon University

Completed a micro-degree program for training software specialists.

  • Completed a program aimed at strengthening software and practice-oriented digital capabilities.
  • Built a record of completion that connects undergraduate-based education with practice-oriented project experience.
Case Study 1

EMR-Based Nursing Surveillance for Automatic ICD Coding

2025

Built an automatic ICD coding pipeline for nursing surveillance of abdominal surgery patients using core EMR data.

Problem

Nursing surveillance required diagnosis-related classification, but key clinical signals were fragmented across heterogeneous EMR sources.

Approach

I integrated structured EMR features and nursing text, then used a dual KM-BERT, PCA, and XGBoost stacking architecture for ICD prediction.

Outcome

The final model achieved 0.9245 accuracy and strong rare-class recall without depending on physician-centered post-hoc documents.

0.9245 Accuracy · 0.9157 Weighted F1 · Strong Rare-class Recall
Keywords
Medical AI · Nursing Surveillance +5
Links
Paper
Case Study 2

FRIMO: Conversational AI for Emotional Support and Diary Generation

2023

Contributed to the machine learning pipeline for a conversational diary app, with a primary focus on emotion recognition and Korean language model integration.

Problem

The product needed ML components that could recognize user emotion and support a diary-generation workflow from daily conversation logs.

Approach

I focused on the ML side by implementing and refining the emotion recognition component and by helping connect the chatbot and summarization models into the overall AI workflow.

Outcome

The project delivered an MVP-level conversational diary experience and gave me hands-on experience in integrating Korean NLP models for emotion-aware user interaction.

Emotion Recognition · Korean NLP Pipeline · MVP Contribution
Keywords
Conversational AI · Emotion Recognition +4
Case Study 3

Devridge: LLM-Based Feedback Bridge for Developers

2024

Built an LLM prototype that helps solo developers receive contextualized feedback from different professional roles.

Problem

Developers working alone often need UI, performance, or code quality feedback, but they rarely have an easy way to gather role-specific input at the right time.

Approach

I framed the service as a role-aware feedback bridge and focused on role constraints and contextual input design so that the generated responses would stay relevant and scoped.

Outcome

The prototype demonstrated how structured prompting and role-specific constraints could turn LLM output into more useful, contextualized project feedback.

Hackathon Prototype · Role-Constrained Prompts · Contextual Feedback
Keywords
LLM · Prompt Engineering +4

Applied AI Researcher

Donghyeon Kim

Applied AI researcher and AI engineer focused on deployable conversational AI and medical AI systems

I am interested in applied AI problems grounded in real-world data, and in connecting them to models, evaluation, and deliverables that collaborators can actually use. Recently, I have been working across conversational AI, medical AI, and ML systems with a focus on improving reproducibility and practical delivery.

I am a graduate researcher with experience in NLP, medical AI, deep learning, and end-to-end AI project implementation. I am particularly interested in turning experimental AI work into trustworthy systems through stronger evaluation, clear documentation, and a practical MLOps perspective.

Location. Incheon, South Korea
Email. eastlighting1@gachon.ac.kr
GitHub. github.com/eastlighting1
LinkedIn. www.linkedin.com/in/동현-김-350b4b29b
Applied AI Researcher · Data Science · Applied AI · MLOps · Observability in AI

Working Principles

I build AI that is accurate, understandable, and reusable.

A working principle that guides how research becomes implementation.

I treat evaluation as part of the deliverable, not an appendix.

I connect experimentation and delivery beyond individual ownership.

Background & Foundations

This portfolio is structured in a reading- and print-friendly format that first presents a short overview, followed by case study sheets for each project.

The selected work is organized around applied AI problems in which modeling, interpretation, and delivery were all important.

Experience.

Intelligent Data Analytics Lab., Gachon University

Graduate Researcher

2024.03 - 2026.02

  • Led graduate research on EMR-based nursing surveillance decision support and diagnostic classification.
  • Built end-to-end modeling pipelines using KM-BERT ensembles, XGBoost, and both structured and text data.
  • Explored conversational AI topics, including empathetic dialogue generation with AI-based feedback.

National Research Foundation of Korea (NRF)

Research Project Participant

2024.03 - 2025.12

  • Contributed to an NRF-funded clinical AI project centered on nursing surveillance decision support using EMR data.
  • Implemented workflows for clinical text understanding, including keyword extraction, dependency parsing-based preprocessing, topic modeling, and similarity analysis.
  • Provided interpretable results and web-based analysis interfaces for collaborative researchers.
  • Achieved 92%+ diagnostic prediction accuracy in a multi-label task by ensembling text and structured data models (KM-BERT, XGBoost).

Institute of Information & Communications Technology Planning & Evaluation (IITP)

Research Project Participant

2025.09 - 2025.12

  • Implemented evaluation-related code in a human-centered multimodal AI project.
  • Bridged evaluation requirements with actual code and reviewable deliverables.

DK Techin

Industry-led Intensive Program Trainee

2024.01 - 2024.02

  • Participated in an industry-led intensive program focused on cloud, CI/CD, security, and DevOps practices.
  • Contributed to implementation tasks with a focus on security and DevOps roles in project-based training.

Gachon University / Notion Community Program

Student Leader and Community Organizer

2019.03 - 2025.02

  • Held multiple leadership roles in the official university programming club and served as president in 2022.
  • Planned and led study groups on machine learning, big data, financial ML, and GNNs.
  • Supported campus learning communities and resource sharing as part of the Notion Campus Leader program (second semester of 2024 through first semester of 2025).

Education & Learning Foundation

Education.

Gachon University

M.S. in Artificial Intelligence

2024.03 - 2026.02

GPA 4.14/4.5. Intelligent Data Analytics Lab. Advisor - OkRan Jeong. Focused on clinical AI, evaluation, and implementation-oriented research delivery.

Gachon University

B.S. in Software

2019.03 - 2024.02

Gained leadership and community management experience by participating in an official programming club and leading various study groups, including basic ML, advanced ML, financial ML, and GNN.

Study.

NEKA

NLP Study

A study on implementing Keyword Extraction and NER with KoBERT

SUNAROUM

Financial AI Study

A study on building buy recommendation models from financial statements and indicators

I-DA

Recommender Systems Study

A study on recommendation systems using knowledge graphs and GNNs

MLADS

Basic Machine Learning Study

A study on the fundamentals of data science and machine learning

Capability Summary

Data Science

The ability to refine real-world data for reliable modeling through analysis and preprocessing

Data Preparation · Feature Engineering · Data Wrangling · Data Manipulation

Applied AI

The ability to turn domain-specific NLP into concrete tasks through experimentation and implementation

Conversational AI · Medical AI · Knowledge Graph · Logic-based AI

MLOps

The ability to build repeatable workflows and scale them through deployment and automation

Cloud · Docker · Continuous Training · Feature Store

Observability in AI

An operational perspective for tracing model behavior to improve reproducibility and maintenance

Logging · Monitoring · Artifacts · Provenance

Research

A Context-Adaptive Gated Embedding Framework for Advanced Clinical Decision-Making

2026

Mathematics (submitted)

This study proposes a hierarchical clinical decision support framework that estimates diagnostic context via partial-label automated ICD coding and reinjects it into irregular ICU time-series forecasting through context-adaptive gating for mechanical ventilation transition prediction. By conditioning temporal interpretation on diagnostic context, the framework substantially improves rare-event detection.

Clinical Decision Support System · Automated ICD Coding · ICU Time-series · Mechanical Ventilation Prediction · Partial-Label Learning · Extreme Multi-Class Classification · TCN · Gating · Rare Event Detection

Deep Learning based Automatic ICD Coding for Nursing Surveillance of Abdominal Surgery Patients

2025

Journal of The Korea Society of Computer and Information

This study proposes an automatic ICD coding model for nursing surveillance of abdominal surgery patients by integrating EMR-based test data, patient information, and nursing notes. A stacking architecture combining dual KM-BERT, XGBoost, and PCA outperformed both a single KM-BERT model and simpler ensemble variants.

Medical AI · Nursing Surveillance · EMR · Automatic ICD Coding · Deep Learning · KM-BERT · XGBoost · Ensemble · Abdominal Surgery

Empathetic Dialogue Generation Model Using Reinforcement Learning with AI-Based Feedback

2025

Korea Computer Congress (KCC)

This study proposes an empathetic dialogue generation model using reinforcement learning with AI-based feedback (RLAIF) to address limited diversity and reliance on human feedback. By leveraging an LLM as a reward evaluator and integrating it into EmpRL, the model generates more diverse empathetic responses.

Empathetic Dialogue · Reinforcement Learning · RLAIF · RLHF Alternative · LLM · Dialogue Generation · NLP · AI Feedback

Completion and Awards

Certificates & Qualifications

Practical Implementation of Monitoring and Testing in DevOps Environments

LLOYDK

2023.11

Completed practical training in Elastic-based DevOps monitoring and testing.

  • Studied DevOps-related services and operational environments provided by Elastic.
  • Learned DevOps concepts applicable to practical work from monitoring and testing perspectives.

Multi Cloud Orchestration Program

5Works

2023.12

Completed HashiCorp-based multi-cloud orchestration and IaC training.

  • Studied mainly HashiCorp IaC tools such as Packer and Terraform.
  • Learned the concepts of infrastructure automation and orchestration in multi-cloud environments.

Company-Led Intensive Project Training

DK Techin

2024.02

Participated in industry-linked practical training focused on security and DevOps engineering.

  • Conducted practical projects while working on cloud, CI/CD, and security-related tasks.
  • Served as an advisor and build assistant, as one of the more experienced members, during recommender system development.

Micro Degree in Software Specialist Training

Gachon University

2024.02

Completed a micro-degree program for training software specialists.

  • Completed a program aimed at strengthening software and practice-oriented digital capabilities.
  • Built a record of completion that connects undergraduate-based education with practice-oriented project experience.

Project Archive

Case Study 1

EMR-Based Nursing Surveillance for Automatic ICD Coding

Built an automatic ICD coding pipeline for nursing surveillance of abdominal surgery patients using core EMR data.

2025 Clinical AI Research

Graduate Researcher · AI Engineer · Data Scientist

Case Study 2

Dalkom Shop: Internal Employee Mileage Commerce Platform

Served as the DevSecOps lead for an internal employee-only commerce platform, building the delivery, operations, security, and observability foundation behind the service.

2024 Internal Commerce Platform

DevSecOps Engineer · Infra Engineer · Platform Engineer

Case Study 3

Devridge: LLM-Based Feedback Bridge for Developers

Built an LLM prototype that helps solo developers receive contextualized feedback from different professional roles.

2024 LLM Application Prototype

LLM Product Builder · Prototype Engineer

Case Study 4

BloGeek: AI Modules for a React + Spring Blog Project

Contributed to the AI side of a React + Spring blog project, focusing on polarity recognition and style transfer for Korean text.

2023 Collaborative NLP Project

AI Engineer · NLP Engineer

Case Study 5

FRIMO: Conversational AI for Emotional Support and Diary Generation

Contributed to the machine learning pipeline for a conversational diary app, with a primary focus on emotion recognition and Korean language model integration.

2023 Conversational AI Project

AI Engineer · NLP Engineer

Case Study 1

2025

EMR-Based Nursing Surveillance for Automatic ICD Coding

Built an automatic ICD coding pipeline for nursing surveillance of abdominal surgery patients using core EMR data.

Clinical AI Research | Graduate Researcher, AI Engineer, Data Scientist

Problem

Nursing surveillance required diagnosis-related classification, but key clinical signals were fragmented across heterogeneous EMR sources.

Approach

I integrated structured EMR features and nursing text, then used a dual KM-BERT, PCA, and XGBoost stacking architecture for ICD prediction.

Outcome

The final model achieved 0.9245 accuracy and strong rare-class recall without depending on physician-centered post-hoc documents.

Metrics

0.9245 Accuracy
0.9157 Weighted F1
Strong Rare-class Recall
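
As a reading aid for the metrics above: accuracy and weighted F1 can both look strong while a rare class is still missed, which is why rare-class recall is reported separately. The sketch below applies the standard definitions to made-up toy labels. The function names and the data are illustrative assumptions, not the project's evaluation code or results.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall_for(label, y_true, y_pred):
    """Share of true `label` samples that were predicted as `label`."""
    hits = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    total = sum(t == label for t in y_true)
    return hits / total if total else 0.0


def weighted_f1(y_true, y_pred):
    """Per-class F1, averaged with weights equal to each class's support."""
    support = Counter(y_true)
    weighted_sum = 0.0
    for label, n in support.items():
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = n - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        weighted_sum += f1 * n
    return weighted_sum / len(y_true)


# Toy labels (hypothetical): 8 samples of a common class, 2 of a rare class.
y_true = ["common"] * 8 + ["rare"] * 2
# The classifier gets every common sample right but misses one rare sample.
y_pred = ["common"] * 8 + ["rare", "common"]

print("accuracy    :", round(accuracy(y_true, y_pred), 4))
print("weighted F1 :", round(weighted_f1(y_true, y_pred), 4))
print("rare recall :", round(recall_for("rare", y_true, y_pred), 4))
```

On this toy data the overall scores stay high even though half of the rare-class samples are missed, so a separate rare-class recall figure carries information the aggregate numbers hide.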

Tech & Tags

Medical AI · Nursing Surveillance · EMR · Automatic ICD Coding · KM-BERT · XGBoost · Ensemble

Links

Paper https://doi.org/10.9708/jksci.2025.30.05.021

Case Study 2

2024

Dalkom Shop: Internal Employee Mileage Commerce Platform

Served as the DevSecOps lead for an internal employee-only commerce platform, building the delivery, operations, security, and observability foundation behind the service.

Internal Commerce Platform | DevSecOps Engineer, Infra Engineer, Platform Engineer

Problem

The project needed more than frontend and backend implementation; it required cloud infrastructure and platform foundations that could reliably support search, notifications, admin workflows, and ongoing operations.

Approach

I focused on the DevSecOps and infrastructure/platform layer, organizing CI/CD, cloud deployment flow, operational setup, and security-aware service foundations for a React + Spring commerce system.

Outcome

The project demonstrated how infrastructure and platform engineering directly shape the reliability and readiness of a production-style internal commerce service.

Metrics

Internal Commerce Operations
CI/CD and Cloud Infrastructure
Security and Observability

Tech & Tags

DevSecOps · Infrastructure · Platform Engineering · React · Spring Boot · Docker · GitHub Actions · Elasticsearch · AWS

Case Study 3

2024

Devridge: LLM-Based Feedback Bridge for Developers

Built an LLM prototype that helps solo developers receive contextualized feedback from different professional roles.

LLM Application Prototype | LLM Product Builder, Prototype Engineer

Problem

Developers working alone often need UI, performance, or code quality feedback, but they rarely have an easy way to gather role-specific input at the right time.

Approach

I framed the service as a role-aware feedback bridge and focused on role constraints and contextual input design so that the generated responses would stay relevant and scoped.
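
One way to picture role constraints and contextual input design is a prompt template that fixes each role's focus and exclusions, then attaches the project context. The following is a minimal sketch under assumptions: the `Role` fields, the three example roles, and the prompt wording are hypothetical and are not taken from the Devridge prototype.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Role:
    """A reviewer persona with an explicit scope (hypothetical structure)."""
    name: str
    focus: str
    out_of_scope: str


# Example roles mirroring the feedback types named in the problem statement.
ROLES = {
    "ui": Role("UI designer",
               "layout, readability, and interaction flow",
               "backend performance and code style"),
    "performance": Role("performance engineer",
                        "latency, memory use, and scalability",
                        "visual design and naming conventions"),
    "code_quality": Role("code reviewer",
                         "structure, naming, and test coverage",
                         "visual design and infrastructure sizing"),
}


def build_prompt(role: Role, project_context: str, question: str) -> str:
    """Combine role constraints with project context into one prompt string."""
    lines = [
        f"You are reviewing a solo developer's project as a {role.name}.",
        f"Comment only on: {role.focus}.",
        f"Do not comment on: {role.out_of_scope}.",
        "",
        "Project context:",
        project_context.strip(),
        "",
        "Developer's question:",
        question.strip(),
    ]
    return "\n".join(lines)


prompt = build_prompt(
    ROLES["ui"],
    "A to-do list web app with a single-page dashboard.",
    "Is the task creation flow easy to follow?",
)
print(prompt)
```

Keeping the scope in a small data structure rather than in free-form text makes it easy to request feedback from several roles on the same context and compare the answers.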

Outcome

The prototype demonstrated how structured prompting and role-specific constraints could turn LLM output into more useful, contextualized project feedback.

Metrics

Hackathon Prototype
Role-Constrained Prompts
Contextual Feedback

Tech & Tags

LLM · Prompt Engineering · Role-Based Feedback · AWS · PartyRock · Prototype

Case Study 4

2023

BloGeek: AI Modules for a React + Spring Blog Project

Contributed to the AI side of a React + Spring blog project, focusing on polarity recognition and style transfer for Korean text.

Collaborative NLP Project | AI Engineer, NLP Engineer

Problem

The product needed ML components that could classify emotional polarity and generate stylistic variations of text to support richer blog content workflows.

Approach

I focused on the machine learning side by implementing a KoBERT-based polarity recognition model and a KoBART-based style transfer pipeline, along with dataset handling and repeated model experiments.

Outcome

The project gave the team practical AI modules for blog-oriented text processing and strengthened my experience in integrating classification and generation models into a web product context.

Metrics

Polarity Recognition
Style Transfer
Web AI Contribution

Tech & Tags

Polarity Recognition · Style Transfer · KoBERT · KoBART · Korean NLP · PyTorch

Case Study 5

2023

FRIMO: Conversational AI for Emotional Support and Diary Generation

Contributed to the machine learning pipeline for a conversational diary app, with a primary focus on emotion recognition and Korean language model integration.

Conversational AI Project | AI Engineer, NLP Engineer

Problem

The product needed ML components that could recognize user emotion and support a diary-generation workflow from daily conversation logs.

Approach

I focused on the ML side by implementing and refining the emotion recognition component and by helping connect the chatbot and summarization models into the overall AI workflow.

Outcome

The project delivered an MVP-level conversational diary experience and gave me hands-on experience in integrating Korean NLP models for emotion-aware user interaction.

Metrics

Emotion Recognition
Korean NLP Pipeline
MVP Contribution

Tech & Tags

Conversational AI · Emotion Recognition · KoBERT · KoGPT2 · KoBART · NLP