Yong Jiang

Alibaba DAMO Academy

Biography

Hi! I currently work at Alibaba DAMO Academy. I received my Ph.D from the joint program of ShanghaiTech University and University of Chinese Academy of Sciences. I was very fortunate to be advised by Prof. Kewei Tu. I am interested in machine learning and natural language processing.

My current research mainly focuses on entity understanding tasks, information retrieval (query/doc understanding), language model pretraining, multilingual NLP, structured prediction and so on. Furthermore, I also ship these cutting-edge technologies to real products and platforms.

In my PhD time, I mainly worked on learning latent variable models for NLP problems and ML problems.

Spotlight of our recent work:

Incorporating various kinds of knowledge to improve named entity recognition: embedding combination, ACE, retrieval guided learning, sparse retrieval, multi-modal NER.
Knowledge distillation for learning multilingual models: structure-level KD, structural KD.
Improving sequence labeling methods: designing powerful potential functions, speeding up CRF training & inference.
Leveraging source models to improve cross-lingual ability: risk minimization, multi-view learning, word reordering.
Unsupervised grammar induction: the first neural-based unsupervised parser, discriminative autoencoder, 2nd order parsing, EACL tutorial and empirical study.
Multi-view learning for NER, entity linking and cross-lingual learning.
Fun with KL divergence: KL(p(*|a, b, c) || p(*|d, e)), KL(P || p), KL(p || q), KL(tractable || intractable?), KL (different modality).

We have some research intern positions available in Alibaba DAMO Academy. If you are interested in NLP and ML, please feel free to contact me: jiangyong.ml@gmail.com.

Interests

Natural Language Processing
Machine Learning
Deep Learning

Education

PhD in Computer Science, 2019

ShanghaiTech University
PhD in Computer Science, 2019

University of Chinese Academy of Sciences

Experience

Research Intern

Tencent AI Lab

Jan 2018 – Oct 2018 Shenzhen

Research Intern

Tencent

Jul 2017 – Sep 2017 Shanghai

Visiting Scholar

UC Berkeley

Aug 2016 – Feb 2016 California

Selected Publications

** indicates the intern/student author, * denotes equal contribution, the list might change over time.

Chengyue Jiang**, Yong Jiang, Weiqi Wu, Pengjun Xie, Kewei Tu

October 2022 EMNLP 2022 Entity Typing, NER, Graphical Model

Modeling Label Correlations for Ultra-Fine Entity Typing with Neural Pairwise Conditional Random Field

A SOTA system that can perform entity typing tasks of 10k entity types.

Xinyu Wang**, Yongliang Shen, Jiong Cai, Tao Wang, Xiaobin Wang, Pengjun Xie, Fei Huang, Weiming Lu, Yueting Zhuang, Kewei Tu, Wei Lu, Yong Jiang

June 2022 SemEval 2022 Sequence Labeling, NER, Multi-lingual NLP

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition

We utilize the wikipedia to improve the RaNER model, which wins the SemEval 2022 competition and obtains the best system paper award.

PDF Code

Xinyin Ma**, Yong Jiang, Nguyen Bach, Tao Wang, Fei Huang, Weiming Lu

September 2021 EMNLP 2021 Entity Linking

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

Our first work on entity linking. Stay tuned for follow-up works.

PDF Code

Xinyu Wang**, Yong Jiang, Nguyen Bach, Tao Wang, Fei Huang, Kewei Tu

May 2021 ACL 2021 Sequence Labeling, Structured Prediction, NER, Multi-lingual NLP

Automated Concatenation of Embeddings for Structured Prediction

This paper achieves SOTA performance over 24 datasets of 6 tasks, spanning over NER, POS, chunking, dependency parsing, semantic parsing, aspect extraction, following the More Embeddings, Better Sequence Labelers paper.

PDF Code

Xinyu Wang**, Yong Jiang, Nguyen Bach, Tao Wang, Fei Huang, Kewei Tu

May 2021 ACL 2021 Sequence Labeling, NER