Yubin Kim
Publications
- Yubin Kim, Reyyan Yeniterzi, Jamie Callan.
Overcoming Vocabulary Limitations in Twitter Microblogs. In
Proceedings of the Twenty-First Text REtrieval Conference (TREC
2012). National Institute of Standards and Technology, special
publication. To appear.
- George Beskales, Mohamed A. Soliman, Ihab F. Ilyas, Shai Ben-David,
Yubin Kim. ProbClean: A probabilistic duplicate detection
system. In IEEE 26th International Conference on Data Engineering
(ICDE 2010), pp.1193-1196. 2010.
Research Experience
- Research Assistant at Carnegie Mellon University
(09/2011 - current)
- Advised by Prof. Jamie Callan in the Language Technologies
Institute
- Investigated microblog ad-hoc search methodologies using Twitter
data
- Participated in the Microblog Track of TREC 2012
- Research Assistant at University of
Waterloo (09/2010 - 12/2010)
- Full-time position under Prof. Ihab Ilyas in the Database Systems group
- Implemented a database system that natively handles unstructured
text
- Research Assistant at University of
Waterloo (05/2010 - 09/2010)
- Part-time position under Prof. Charles Clarke in the Information Retrieval group
- Designed and implemented a Ruby system to detect and summarize events in online news media
- Research Intern at Primal Fusion Inc. (01/2010 - 04/2010)
- Designed and implemented a prototype of the next-generation semantic
engine that serves as the back-end for all of Primal Fusion's products
- Presented the work to the company's top officers
- Research Assistant at University of
Waterloo (09/2009 - 12/2009)
- Part-time position under Prof. Ihab Ilyas in the Database Systems research group
- Contributed to the implementation of a duplicate data detection system
written in Java
Relevant Project
- Fame to Flame (2010 - 2011)
- Designed and implemented a product review aggregator that provides
metrics such as sentiment analysis and salient terms
- Awarded 3rd place overall at the design project symposium - $500 prize
Related Work Experience
- Software Developer Intern at A9.com, Inc. (05/2009 - 08/2009)
- Improved Amazon.com's product search engine
- Developed a C++ tool that displays the contents of a search index for
debugging and QA purposes
- Revamped the index metadata files to use XML, working in Python and C++
- Researched an open source search server called Solr and prepared a presentation comparing it to A9.com's search
- Software Developer Intern at Google, Inc.
(09/2008 - 12/2008)
- Developed a Java system that allows users and radio stations to interact
by SMS via mobile phones
- Designed and launched new features for the system:
- Java servlets in the back-end to handle requests generated by the new features
- Web control panels and analytic dashboard implemented in GWT and Java
- Software Developer at Sybase iAnywhere, Inc.
(01/2008 - 04/2008)
- Revamped and fully automated the test framework utilizing Java, C++ and HTTP
- Built a multi-threaded database extraction tool from scratch utilizing J2ME, C++ and HTTP
- Fixed several bugs in UltraLiteJ, a lightweight database for the BlackBerry, and wrote test cases for each fix
- Software Developer at Encom Information Systems,
Inc. (07/2007 - 08/2007)
- Communicated directly with the client to debug and develop a staff scheduling system written in Progress 4GL
- Rebuilt the defunct clinic scheduling system and enabled it to go
live at the beta site
Awards and Honours
- Peter Jackson Fellowship - $17625
(2012)
- Microsoft Research Graduate Women's Scholarship - $17000
(2011)
- NSERC Postgraduate Scholarship - $17300
(2011)
- Alexander Graham Bell Canada Graduate Scholarship - $17500
(declined)
(2011)
- Ontario Graduate Scholarship - $15000 (declined)
(2011)
- Graduated with distinction on the Dean's Honours List
(2011)
- NSERC Undergraduate Student Research Award - $4500
(2010)
- Faculty of Engineering Upper-Year Scholarship - $400
(2010)
- Software Engineering Entrance Scholarship - $4000
(2006)
- President's Scholarship - $2000
(2006)
- Queen Elizabeth II Aiming for the Top Scholarship - $3500 x 4
(2006 - 2011)
- Dean's Honours List
(2006 - 2011)
- Governor General's Academic Medal for first in graduating class
(2006)
Teaching Experience
- Teaching Assistant for Text Analytics (95-865) (Fall 2011)
- Large-scale text analysis course for information science students
- Prepared and marked assignments, held office hours for assignment-related
questions
Professional Activities
- Sub-reviewer for Program Committee member Jamie Callan
(2012)
- International Symposium on String Processing and Information
Retrieval (SPIRE)
- International Conference on Web Search and Data Mining (WSDM)
Interests and Activities
- Member of the Activities Committee for the Language Technologies Institute
(2011 - current)
- Voluntary position; organized graduate department wine and cheese
event
- Co-Head Delegate for CUSEC 2008 representing University of
Waterloo (01/2008)
- Marketed the event to Software Engineering students
- Organized a 3-day trip from Waterloo to the conference in Montreal
- Successfully presented the conference to the Math Endowment Fund for funding
- Orientation Week Leader (09/2007 - 09/2009)
- Led a group of first-years through various activities during Orientation Week
- Head leader in 09/2009; organized a team of leaders
- Facilities Representative of Software Engineering class of 2011
- Elected position, responsible for managing interpersonal and
technical issues in the labs
- Writer and proofreader for the university's Math Society newspaper
Education
- PhD Candidate in Computer Science
Carnegie Mellon University, Pittsburgh, Pennsylvania (2011 - current)
- Bachelor of Software Engineering
University of Waterloo, Waterloo, Ontario (2006 - 2011)
- Graduated with distinction on the Dean's Honours List
Technical Proficiency
- Proficient with Java, C/C++, Ruby, and Python; some experience with Haskell
- Comfortable with Linux and Windows XP/7 environments
- Experienced in modifying search index and database internals
References available on request.