Bio

May.2023 – Present

TikTok Software Engineer

  • Global Payment

May. 2022 – Aug. 2022

TikTok Software Engineer Intern

  • Project: TikTok Search Product R&D Trace Robot
  • Designed the structures for tracing logs and a solution for persistence covering 90% of search business logic
  • Created an HTTP service for analysis, attribution, and location of search-related issues using above tracing logs
  • Developed a lark (instant message) robot to help developers around the world fix issues in production efficiently
  • Related Techniques: GoLang, Python, Gin(http), Kite(rpc), Redis, Loghouse, ByteFaas, Lark Applet/Robot

May. 2021 – May. 2023

Northeastern University

M.S. in Information Systems (Under Computer Software Engineering Category)

Jul.2018 – Aug.2021

Trip.com Group – Senior Machine Learning Engineer

  • Project: Machine Translation Service
  • Built a translation engine providing mutual translation of 54 languages and the identification of 180 languages
  • Ensured average service response within 100ms and concurrent service capacity over 50K chars per second
  • Translated 500M characters daily and served 200+ business scenarios with lower cost and competent quality
  • Built an corpus database on the top of HBase with 3B+ parallel sentences labeled with multiple scenario tags
  • Developed a multiple language text processing toolbox for text tokenization, normalization and splitting
  • Implemented an request batching module with batchsize and timeout options by asynchronous frameworks
  • Proposed an load balance mechanism for NLP tasks by using text split algorithm and asynchronous frameworks
  • Implemented transformer based on FT2.0, 7 times faster than native TF implementation. GTC 2020 [CNS20729]
  • • Project: Machine learning Platform (Task Life Cycle Management Part)
  • Management of computing and accelerator resources for 10+ kinds of ML tasks
  • Suspend, migrate, terminate or report a task according to the requests of scheduler node
  • Publications & Presentations & Open Source Contributions
  • 3 Patents CN2020113749199 | CN2020106787482 | CN2020107397065, 2 Software Copyrights
  • Author of Chapter 4.4 of Trip.com Artificial Technology Practices [ISBN9787121384400]
  • Qcon+ 2020, 2020.08.13, ”Application of Machine Translation Technology in Trip.com Group”
  • Personal Contributor Google Tensorflow/Tensor2Tesnor PR1295 | PR1302 | PR1303
  • Related Techniques: Hive, HBase, CRedis, CLog, Kafka, Flask, FastText, Text Tokenizer, Gitlab CI, Ceph/Swift,
  • InfluxDB/Grafana, ElasticSearch/Kibana, Load Balance/Nginx, Rquest Batching/Python Async, TensorFlow/CUDA

Nov.2017 – Oct.2018

GEC Academy – Remote Teach Assistant

  • Signal and Image Processing & Feature Extraction – Munib W., Ph.D. Harvard University
  • Applied Machine Translation Learning: Fake News Detection – Mike T., Ph.D. University of California Berkeley
  • Real-time face recognition – Dr.Nakul Verma Columbia University

Sep.2017 – Nov.2017

University of California Berkeley, I School, Remote Lab

  • Supervisor: Mike T., Ph.D. Lecturer for UC Berkeley I School
  • Team Leader of Project Zestimate
  • Top 1 Team Among Six

Oct.2016 – Jul.2017

Institute of Shipping Information Technology (IOTLAB)

  • Supervisor: Liu Jin Professor Ph.D. Supervisor for Shanghai Maritime University (SMU)
  • Domain Machine Translation Research

Oct.2013 – Jul.2017

Shanghai Maritime University (SMU) Bachelor of Engineer (B. Eng. or BE)

  • Top 6 Bachelor’s Theses Among 120 Students