About Me

I am a graduate student in the School of Computer Science at Carnegie Mellon University,
in the Master of Very Large Information Systems (MSIT-VLIS) program, directed by Anthony Tomasic.
Before that, I graduated from Beihang University in Beijing, China, with a bachelor's degree in Software Engineering.
My study and research interests include Information Retrieval, Large-Scale Databases, Large-Scale Machine Learning, and Statistical NLP.
I will join the CBS Interactive Business Intelligence (BI) group in Somerville as a full-time engineer in February 2012. Here is my resume.
- Phone: (412) 916-7917
- Email: wpang AT cs DOT cmu DOT edu
- Office: A104, 300 S Craig Street, Pittsburgh, PA 15213
- Micro Blogging: My Sina Weibo Show (Chinese)
Experience
July 2011 - 2011, CBS Interactive, Somerville, MA
Intern
- Scalable user cluster analysis based on Apache Mahout and Hadoop;
- Applied Canopy and K-Means clustering on an SVD dimension-reduced subspace;
- Scalable click-stream analysis using Parallel FP-Growth;
- Experimented with other cluster computing frameworks (Spark and GraphLab).
July 2009 – June 2010, Tsinghua-Sohu Joint Research Lab of Searching Technology (THUIR), Beijing, China
Research Intern
- Worked on a joint engineering project with Sohu.com Inc. (SOHU, a top search engine and browser provider in China);
- Independent research on a vertical crawler for data in XMLHttpRequest-enabled (AJAX) web pages;
- The resulting program crawled real-time user comments from 12 mainstream video-sharing websites in China and was easy to extend;
- Preliminary research on Deep Web information crawling based on user browsing logs.
January 2009 – April 2009, Tata Consultancy Services Limited (TCS), Chennai, India
Intern
- Worked at the Motorola Offshore Development Center (ODC) at the TCS Chennai branch;
- Worked on an internal information system project in a team of five; responsible for system design and core component implementation;
- Released the system online, improving the efficiency of weekly project status reporting;
- Selected from 198 students for the first cooperation program with TCS.
Coursework
Fall 2011
10-710 | Structured Prediction | William Cohen and Noah Smith

Spring 2011
11-741 | Information Retrieval | Jamie Callan and Yiming Yang
10-701* | Machine Learning | Tom Mitchell
11-761 | Language and Statistics | Roni Rosenfeld

Fall 2010
08-741 | Very Large Information System | Anthony Tomasic
36-705 | Intermediate Statistics | Larry Wasserman
11-791 | Software Engineering | Eric Nyberg | Project
11-711 | Algorithms for NLP | Alon Lavie

(* indicates CSD Ph.D. "starred" course.)
Awards and Other Projects
October 2005 – December 2007, ACM-ICPC Participant
Beihang University Programming Contest Team
- Silver Medal, 2007 ACM-ICPC Asian Regional, Chengdu Site;
- Bronze Medal, 2007 ACM-ICPC Asian Regional, Nanjing Site;
- Served as a judge for the 2007 ACM-ICPC Asian Regional, Beijing Site.
April 2006 – August 2007, Beihang University Online Judge (Bianchengla)
- A TopCoder-like online programming competition platform;
- Responsible for the interface and the back-end system management tool;
- Released the first version of the system online, which was well received by students and teachers;
- Served as a training platform for ACM-ICPC and as teaching-assistant software for programming courses at Beihang University.