cv

Basics

Name Nick Oh
Label Researcher
Email nick.sh.oh@gmail.com
Url https://github.com/socius-org
Summary summary...

Work

  • 2023.01 - Present
    Principal Researcher
    socius
    socius is an independent research lab endorsed by the London School of Economics and Political Science. The lab is rooted in social science principles, believing that AI development and deployment must be guided by robust theories and philosophies. We create tools and frameworks that both technical and social science researchers can confidently use and interpret.
    • NLP
  • 2021.09 - 2023.01
    Junior NLP Engineer
    Numen Capital
    Worked on the in-house Knowsis research team, leveraging NLP and machine learning for alternative data insights. (1) Developed NLP pipelines for alternative data insights, including ESG tweet classifier based on transformer architecture (97% accuracy), rule-based sentiment analyser, and domain-specific dictionary. (2) Analysed alternative data signals derived from NLP pipelines using time series models. This analysis led to accurate forecasts of market movements, directly assisting quantitative investment strategies.
    • NLP

Education

  • 2022.04 - 2023.12

    London, UK

    MSc
    University of London
    Data Science and Artificial Intelligence
    • Big data analysis
    • Data programming in Python
    • Statistics and statistical mining
    • Machine learning
    • Data science research topics
    • Mathematics of financial markets
    • Artificial intelligence
    • Natural language processing
    • Neural networks
    • Social media and network science
  • 2016.09 - 2020.07

    London, UK

    BSc
    London School of Economics and Political Science
    Politics and Economics
    • Macroeconomic Principles
    • Microeconomic Principles I
    • Public Choice and Politics
    • Research Design in Political Science
    • Monetary Economics
    • Applied Quantitative Methods for Political Science
    • Algorithms and Programming
    • Game Theory I
    • Politics of Money and Finance in Comparative Perspective

Certificates

Big Data to Decisions: AI and Machine Learning
London Business School 2019-03-11

Publications

  • 2024
    sentibank: A Unified Resource of Sentiment Lexicons and Dictionaries
    International AAAI Conference on Web and Social Media
    Sentiment analysis is critical across computational social science domains, but faces challenges in interpretability. Rule-based methods relying on expert lexicons enable transparency, yet applying them is hindered by resource fragmentation and lack of validation. This paper introduces sentibank, a large-scale unified database consolidating 15 original sentiment dictionaries and 43 preprocessed dictionaries, spanning 7 genres and 6 domains.

Languages

Korean
Fluent
English
Fluent

Interests

Applied AI/ML
Natural Language Processing
Domain-specific Sentiment Analysis
Interpretable AI/ML System
Social Data Science
Explainable AI in Social Contexts
Prediction and forecasting models
Social Media Analytics
Political Discourse Analysis
Text Mining

Projects

  • 2023.12 - Present
    RedditHarbor
    Open-source Python toolkit designed to simplify the process of collecting and archiving Reddit data for research purposes
    • Reddit Data Collection
    • Research Data Toolkit
    • Research Data Pipeline
    • Reddit Data API
    • Data Crawler