dwlee-personal-website.netlify.app
2a05:d014:275:cb02::c8
Public Scan
Submitted URL: http://dwlee-personal-website.netlify.com/
Effective URL: https://dwlee-personal-website.netlify.app/en/
Submission: On October 25 via api from US — Scanned from DE
Form analysis
0 forms found in the DOM
Text Content
DA-WEI LEE
MACHINE LEARNING RESEARCHER
JINGLE.AI

BIOGRAPHY
My name is Da-Wei Lee (David Lee). I am a former Data & Applied Scientist at Microsoft and currently work at a quant company, doing trading strategy research with machine learning. I enjoy being a maker: thinking of a creative idea and working hard to make it come true. I love playing music and other cool stuff. I have great enthusiasm for learning and curiosity about discovery. I am also very willing to help people and share what I have learned.

INTERESTS
* Artificial Intelligence
* Natural Language Processing
* Quant
* Recommender System
* Embedded System Design

EDUCATION
* MEng in Software Engineering, 2021, Peking University
* BSc in Electronic and Computer Engineering, 2017, National Taiwan University of Science and Technology

SKILLS
Learn everything I am interested in and master it.

PROGRAMMING
* Python for Machine Learning
* C/C++ for Embedded System Design
* Node.js for Back-end Design
* Java for Android App. Dev.
* C# for Unity Game Design
* Verilog HDL for FPGA Design
* Matlab, R for Math Calculation
* Fluent in Vim

MUSIC
* Drum Kit
* Guitar
* Piano
* Wind Band Percussion
* Home Studio

OTHER HOBBIES
* Photography
* Coffee (Latte Art & Pour-Over Coffee)
* Skateboarding
* Rubik’s Cube
* 3D Printing
* Rollerskating
* Bicycling
* Skiing
* Motorcycling
* Drone

PUBLICATIONS

OPEN RELATION EXTRACTION VIA QUERY-BASED SPAN PREDICTION
QORE utilizes a Transformers-based language model to derive a representation of the interaction between arguments and context, and can …
Huifan Yang, Da-Wei Lee, Zekun Li, Donglin Yang, Jinsheng Qi, Bin Wu
Links: PDF, Video

OPEN RELATION EXTRACTION WITH NON-EXISTENT AND MULTI-SPAN RELATIONSHIPS
We proposed a Query-based Multi-head Open Relation Extractor (QuORE) to extract single/multi-span relations and detect non-existent …
Huifan Yang, Da-Wei Lee, Zekun Li, Donglin Yang, Jinsheng Qi, Bin Wu
Links: PDF, Code

TOWARDS TOPIC-AWARE SLIDE GENERATION FOR ACADEMIC PAPERS WITH UNSUPERVISED MUTUAL LEARNING
Generates slides from papers using extractive summarization techniques and unsupervised mutual learning to deal with the lack of data.
Da-Wei Lee, Danqing Huang, Tingting Ma, Chin-Yew Lin
Links: PDF, Code, Dataset, Project, Poster, Video

LIFE
You only live once, so YOLO!

MICROSOFT SUZHOU BAND - RETURN TRUE
We formed at the FY22 Kickoff (Aug 2021) and continue to this day. I act as drummer, acoustic guitar player, keyboardist, vocalist, recordist, mixer, … in this band.

EXPERIENCE
Job / Intern

MACHINE LEARNING RESEARCHER
JINGLE.AI
May 2023 – Present, Shanghai, China
T0 Quant Machine Learning Strategy Research

DATA & APPLIED SCIENTIST
MICROSOFT SOFTWARE TECHNOLOGY CENTER ASIA - WEBXT BING MULTIMEDIA
Jul 2021 – Mar 2023, Suzhou, China
Recommender System for Video Recommendation.
ALGORITHM INTERN
MICROSOFT SOFTWARE TECHNOLOGY CENTER ASIA - WEBXT BING NLP CARINA
Jul 2020 – Jun 2021, Beijing, China
Worked on Writing Assistant related projects in two main parts:
* AI Writer: An application that aims to increase the diversity of an article and reduce the human effort of writing “filling text”. (This was the project used for the intern conversion.)
  * Continuous Writing: GPT2
  * Rewriting:
    * Paraphrasing: UniLM (SimBERT)
    * Back-translation: Bing Translator, Google Translation
    * Information-Retrieval-based: Elastic Search, Approximate Nearest Neighbor (annoy)
    * Style Transfer: Style Transformer
* Value Understanding: Built a numerical extractor that extracts quantity facts from raw text.
  * Designed an annotation guideline specifically for Chinese quantity extraction.
  * Communicated with the labeling company and annotated more than 2000 articles to construct the training dataset from scratch.
  * Designed two major approaches, namely “NER Combine” and “Quantity MRC”.
    * NER Combine: Combines labeled spans extracted by the NER model using a scope-based, rule-based algorithm.
    * Quantity MRC: Constructs a query for each slot based on the extracted quantity.
  * Built post-processing modules that can handle complex sentences, especially the “respectively” cases.
  * Used as the back end of three different projects:
    * Writing Assistant (mainly finance): Including value consistency and value recommendation
    * Medical thesis analyser
    * A WeChat mini program
Interviewed 5 internship candidates (after getting the return offer).

RESEARCH INTERN
MICROSOFT RESEARCH ASIA - KNOWLEDGE COMPUTING
Dec 2019 – May 2020, Beijing, China
Took over two research-oriented NLP projects:
* Generation of slides from academic papers
* Math word problem generation

RESEARCH INTERN (LABORATORY)
PEKING UNIVERSITY NATIONAL ENGINEERING RESEARCH CENTER OF SOFTWARE ENGINEERING
Jul 2019 – Jun 2021, Beijing, China
Worked on anti-healthcare-fraud and medical record analysis cases.
Including research on:
* Information Extraction
* Named-entity Recognition
* Relation Classification
* Knowledge Graph
> PKU Thesis: Design and Implementation of Chinese Document Numerical Fact Extraction

EMBEDDED SYSTEM DESIGN SOFTWARE INTERN
INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE (ITRI)
Jul 2016 – Aug 2016, Hsinchu, Taiwan
I was in the self-driving group, where I mainly handled the STV0991 development board, which was going to run the computer vision algorithms.

PIECEWORK
Freelance / Personal Case

EEG ANALYSIS
NTUST DEPARTMENT OF BUSINESS ADMINISTRATION PROFESSOR
Sep 2018 – May 2019, Remote
I used Matlab to process and analyze raw EEG data, and produced visualizations and animations of it.

LEAPSY AR GLASSES VIDEO STREAM PAN/TILT HEAD
ALL JOINT
Jul 2017 – Oct 2017, Taipei, Taiwan
I collected sensor data on Android-based AR glasses to capture their current attitude and sent it to a Raspberry Pi to synchronize the direction of the camera's pan/tilt head, then streamed the video back to the glasses over Wi-Fi. I also built the pan/tilt head structure from a 3D-printed model that holds the camera and two servo motors, and designed the power supply circuit for both the motors and the Raspberry Pi.

ECG ANALYSIS
NTU ON-THE-JOB PH.D. STUDENT
Nov 2015 – Feb 2016, Remote
I used Matlab to apply a Fourier transform to the ECG (electrocardiogram) signal, filter out the high-frequency noise, and finally predict its trend.

OIL MONITOR SYSTEM
BELTON
Jul 2014 – Oct 2014, Remote
My college roommate Tom and I built a Windows application that reads the machine’s sensor values, displays them, and stores them in a database. We were asked to use Visual Basic for this project.

COMPETITION

BECHANGEMAKER
WORLD SKILLS
Mar 2023 – Sep 2023, Online
Ecojoy
We want to solve the problem of “toy waste”. Excessive pollution not only affects the physical environment of future generations but also raises children who do not cherish resources, which has a major impact on the world.
We hope that, in a very simple way, every old toy will no longer pile up at home or end up in a landfill, but will instead become a resource for others. Our team has backgrounds in software engineering, social education, and economics. We have observed that the problem of toy waste is becoming more and more serious: toys are readily available and cheap, which makes them a quick solution for most parents dealing with their children. We believe that as long as sharing and acquiring toys is simple enough, the situation of excessive waste can improve immediately. By subscribing as members of the Ecojoy app, users can easily share excess toys at home, and through the app's comprehensive toy information and rating system, they can easily find suitable toys that meet their needs, achieving toy sharing and reuse.
Links: Facebook Page

JIGSAW UNINTENDED BIAS IN TOXICITY CLASSIFICATION
KAGGLE
Feb 2019 – May 2019, Online
This competition aims to classify whether a comment is toxic. Our team designed different models such as BERT and ELMo as classifiers and finally ensembled them. Our team reached the top 1% of the ranking.

FAILURE PREDICTION OF CONCRETE PISTON FOR CONCRETE PUMP VEHICLES
DIGITAL CHINA INNOVATION CONTEST 2019
Jan 2019 – Mar 2019, Online
In this competition, each sample is a time series from a concrete pump vehicle. The goal is to predict, for each sequence, the likelihood that the machine will fail. I used LightGBM and reached the top 5% of the ranking.
Links: Source Code

ARM DESIGN CONTEST
ARM
Apr 2016 – Nov 2016, Hsinchu, Taiwan
Based on my department independent-study project, the quadcopter project. We used the specified STM32F4 development board to drive the quadcopter. We placed in the top 10 in the final.

HOLTEK MCU DESIGN CONTEST
HOLTEK
Apr 2016 – Nov 2016, Taichung, Taiwan
Based on my department independent-study project, the quadcopter project. We used the specified STM32F4 development board to drive the quadcopter. Finally, we received an honorable mention.
NTU SYSTEM APP CONTEST
NATIONAL TAIWAN UNIVERSITY SYSTEM
May 2015 – Aug 2015, Taipei, Taiwan
Designed a platform called Skill Exchange, a talent and skill exchange platform that matches people by what they know and what they want to learn. Finally, we received an honorable mention.

NSYSU LED DESIGN CONTEST
NSYSU EE
Oct 2014 – May 2015, Kaohsiung, Taiwan
An installation-art LED grid ball that combines sound and light. This project was a collaboration with design department students. Game buttons trigger MIDI signals sent to a computer to produce sound, and an Arduino controls the LED grid. Finally, we received a merit award.

NTU TAIWAN 2048 BOT CONTEST
NTU
May 2014 – Jul 2014, Taipei, Taiwan
My friend Tom and I built an AI bot for the 2048 game. We used Monte Carlo Tree Search (MCTS) with alpha-beta pruning to select the best action, and scored each state (board) with an evaluation function of our own design. Finally, we received an honorable mention.

ACCOMPLISHMENTS
Certifications

INTERMEDIATE BARISTA
Jiangsu Skilled Talent Evaluation Center
Feb 2023
Jiangsu Vocational Skills Certificate, No. S000032050806234001680
Links: See certificate

TOEIC 785/990
Educational Testing Service
Jan 2016
Test of English for International Communication: Advanced
Links: See certificate

TECHNICIAN CERTIFICATE: COMPUTER MAINTENANCE CLASS B
Workforce Development Agency, Ministry of Labor
Mar 2013
Links: See certificate

TECHNICIAN CERTIFICATE: COMPUTER MAINTENANCE CLASS C
Workforce Development Agency, Ministry of Labor
Jun 2012
Links: See certificate

PROJECTS
Side Projects / Courseworks / Source Code

STANFORD CS224N NLP WITH DL
Self-study of the course, including projects on word2vec, dependency parsing, machine translation, and question answering.

SEMEVAL-2013 WORD SENSE INDUCTION
SemEval-2013 Task 13 Word Sense Induction for Graded and Non-Graded Senses.

SEMEVAL-2018 RELATION CLASSIFICATION
SemEval-2018 Task 7 Semantic Relation Extraction and Classification in Scientific Papers.
OPERATING SYSTEM
PKU OS course project and notes based on Nachos and XV6.

2048 AI BOT
An AI bot for the 2048 game. Built the MCTS version in 2014. Rebuilt it as an RL version in 2018.

RASPBERRY PI CLUSTER
An efficient quick-start tool for building a Raspberry Pi cluster with popular ecosystems such as Hadoop and Spark.

DEEP LEARNING PRACTICE
Neural network implementations. Course projects covering NLP, RL, and CV topics.

MACHINE LEARNING PRACTICE
Implementations of machine learning algorithms from scratch, including course projects and notes related to statistical machine learning.

MODULARIZED QUADCOPTER ARCHITECTURE WITH COMPUTER VISION CONTROL
My department independent study. Built a quadcopter from scratch, running on different platforms and combined with CV. Earn school …

LEADERSHIP AND EXTRACURRICULAR ACTIVITIES

PKU OSA
The PKU Open Source Association. Core member of the ML & NLP department and participant in the Web Full Stack department.
Links: Code

STUDENT ASSOCIATION OF ECE DEPARTMENT
Served as artistic designer. Handled poster design and Facebook fan page operation.

JUNIOR HIGH SCHOOL ALUMNI WIND BAND
Principal percussionist in the junior high school alumni wind band during the summers of 2016, 2017, 2018, and 2019.

INTEL IOT ROADSHOW
The Intel IoT Roadshow Hackathon. Developed on Intel Edison. Teamed up with two friends I met at SITCON 2016.

STUDENTS’ INFORMATION TECHNOLOGY CONFERENCE
Attended SITCON in the spring of 2016 and 2018.

HACKNTU
Attended the NTU Hackathon. Teamed up with two of my friends and two strangers.

HACKATHON TAIWAN
Attended the 6th and 8th Hackathon Taiwan. At the second one, designed a Google dinosaur AI (because of the poor internet).
Links: Code

HSINCHU ALUMNI ASSOCIATION
Served as photographer and social media manager. Served elementary school students in my hometown in 2014.

CYCLING AROUND TAIWAN
Cycled counter-clockwise around Taiwan with a senior high school classmate in ten days.

© Da-Wei Lee. All rights reserved.
Powered by the Academic theme for Hugo.