Skip to content
View greeksharifa's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report greeksharifa

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
greeksharifa/README.md

You Won Jang

Research Engineer / Applied Scientist
Integrated M.S.-Ph.D. Candidate in Computer Science and Engineering, Seoul National University
Expected Graduation: August 2026

Reliable Multimodal Reasoning Video Understanding Evaluation Reliability

Email GitHub LinkedIn Google Scholar About Blog

I build multimodal AI systems that reason more reliably over images, videos, and language. My work focuses on reliable multimodal reasoning, video understanding, and evaluation reliability, with recurring use of structured intermediate representations, confidence-aware inference, internal knowledge graphs, temporal grounding, and LLM-as-a-Judge protocol design.

🧭 Research Focus

  • Reliable multimodal reasoning for language-vision systems
  • Video understanding and structured video-language pipelines
  • Evaluation reliability for LLM-based assessment systems

πŸ“„ Selected Work

I also contributed to earlier foundational work on character-centered video story understanding through DramaQA.

πŸ“œ Patents

  • 4 patent families in multimodal reasoning and video-story understanding
  • Includes a self-questioning-based visual question answering patent family
  • Includes DramaQA-related question answering and character-centered video story understanding patent families with confirmed Korean and international records

πŸš€ Projects & Leadership

  • Development of Uncertainty-Aware Agents Learning by Asking Questions (2022-present)
    Student responsible researcher on a long-horizon project about uncertainty-aware agents that improve by asking questions.

  • LA4IRA@RO-MAN 2023
    Workshop organizer for research on learning by asking for intelligent robots and agents.

  • DramaQA / Video Turing Test activities
    Organizer across workshops, challenges, and competition activities related to video story understanding and benchmark building.

πŸ’Ό Experience

  • BioIntelligence Lab, Seoul National University β€” Integrated M.S./Ph.D. Researcher
    Mar. 2019 - Expected Aug. 2026
    Research on multimodal reasoning, video understanding, and LLM evaluation.

  • KT Corporation β€” Research Intern
    Jul. 2023 - Aug. 2023
    Worked on instruction-tuned self-questioning for multimodal reasoning.

  • NAVER Corp β€” Research Intern
    Jul. 2018 - Aug. 2018
    Worked on hierarchical category classification and large-scale category structure.

πŸŽ“ Education

  • Seoul National University β€” Integrated M.S. and Ph.D. in Computer Science and Engineering
    Mar. 2019 - Expected Aug. 2026

  • Seoul National University β€” B.S. in Computer Science and Engineering
    Mar. 2015 - Feb. 2019

  • Seoul Science High School (SSHS) β€” High School
    Mar. 2012 - Feb. 2015

πŸ› οΈ Skills

  • Programming: Python, C++
  • Frameworks & Tools: PyTorch, Hugging Face, Git, Linux, LaTeX
  • Research Areas: Reliable multimodal reasoning, video understanding, vision-language models, LLM evaluation, temporal grounding

🎯 Interests

I am particularly interested in Research Engineer and Applied Scientist roles in multimodal AI, video understanding, and evaluation-reliability problems, especially in industry research labs that value both research depth and system-building ability.

Popular repositories Loading

  1. greeksharifa.github.io greeksharifa.github.io Public

    SCSS 8 2

  2. LBA_LAVIS LBA_LAVIS Public

    Forked from salesforce/LAVIS

    LAVIS - A One-stop Library for Language-Vision Intelligence

    Jupyter Notebook 3 2

  3. ps_code ps_code Public

    Algorithm

    C++ 1

  4. Tutorial.code Tutorial.code Public

    C++, Python, Web

    Jupyter Notebook 1 16

  5. LBA_Integration_2022 LBA_Integration_2022 Public

    for SNU LBA task

    Python 1

  6. Command-set Command-set Public

    Batchfile 1