
Hi, I am Karina

Karina Hu

Computational Linguist at Glossika

I am a computational linguist and data science researcher interested in integrating data from different language sources. I code efficiently and apply my knowledge of data mining and machine learning to build NLP algorithms.

Leadership
Teamwork
Communication
Hardworking
Fast Learner
Problem Solving

Skills

Experiences

Computational Linguist
Glossika

Mar 2020 - Present, Remote

Glossika helps you speak a new language fluently in the least possible time by using smart technology, adaptive learning algorithms and structured content

Responsibilities:
  • Optimized the syntax corpus, used phrase structure rules to assign parts of speech, and analyzed the theta roles in the text
  • Developed a torch.nn module for word embeddings, parameterized by vocabulary size and embedding dimensionality
  • Applied seq2seq models to NLP modeling tasks and worked with the team to preprocess large amounts of data (Arabic, Russian, Polish)
  • Optimized per-language datasets and prepared them for training; built LSTM RNN (Seq2Seq), CBOW, and Skip-gram models; and set up a Docker container on an AWS EC2 instance for deep learning

Research Assistant
National Tsing Hua University Natural Language Processing Lab

Jul 2020 - Present, Remote

Our main areas of research include computer-assisted language learning, word alignment, information retrieval, and machine translation

Responsibilities:
  • Collaborated with Python team members to develop and optimize English lesson plans that help users improve their grammar
  • Developed a grammar detection system that detects Cambridge grammar rules using spaCy, regex, and PyTorch
  • Developed the detection system API in Flask and collaborated with the frontend (React) team to show the level-up graph to users on Linggle

Data Tagging Specialist
iTutorGroup

Sep 2016 - Apr 2017, Taipei

iTutorGroup is the premier online education platform and largest English-language learning institution in the world

Responsibilities:
  • Extracted suitable English-Mandarin teaching materials from the database and delivered targeted training sessions to call center members on request to improve their performance
  • Collaborated with team members to create a motivation-oriented culture

Education

Master of Arts in Computational Linguistics
CGPA: 3.88 out of 4
Bachelor of Arts in English
CGPA: 3.65 out of 4

Projects

Deep Learning for Reddit Posts
Developer Feb 2020

Sentiment analysis for social media networks.

Brill Tagger Project
Developer Oct 2019

Implemented a rule-based tagging algorithm.

The Ups and Downs of Writing Proficiency Level
Research Assistant Mar 2021

Develop an online system that uses Hugging Face to help learners improve their language proficiency level.