I joint Information Sciences Institute, University of Southern California in Fall 2020 as a computer scientist.

I got my Ph.D degree in Language Technologies Institute at Carnegie Mellon University, where I was working with Prof. Eduard Hovy.
Before coming to CMU, I was a master student of Center for Brain-like Computing and Machine Intelligence (BCMI), Shanghai Jiao Tong University, Shanghai, China.
I received my Bachelor degree in Computer Science from Shanghai Jiao Tong University, where I was a member of ACM Class, now part of Zhiyuan College in SJTU.

I believe that representation learning techniques based on deep learning methods can fundamentally transform the conventional feature designing paradigm.
Representation learning can, in principle, automatically learn representations that are mathematically and computationally convenient to process.
Furthermore, beyond learning representations for specific tasks, representation learning allows us to identify and disentangle the underlying causal factors,
to tease apart the underlying dependencies of the data, so that it becomes easier to understand, to classify, or to perform other tasks such as, even, controllable and interpretable data generation or manipulation.
**My research focuses on fulfilling this transformation to enhance the effectiveness, efficiency, interpretablility and robustness of representation learning,
by developing and analyzing deep learning techniques
**.
The key contributions of my research are as follows:

**Representation learning via deep generative models**- We developed deep generative models to improve both data density estimation and latent representation learning for text and image data.
**Interpretable and robust systems via disentangled representation learning**- We enhance deep learning systems' interpretability and robustness by building models upon disentangled representations to tease apart underlying dependencies of data and connect output representations with input causal factors.
**Applications of representation learning techniques**- We apply (unsupervised and/or weakly-supervised) representation learning techniques to NLP and CV tasks in different domains to advance the state-of-the-art and/or to reduce requirement of human annotated resources.

Most recent publications on Google Scholar.

**Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization**
ArXiv
Code

**Xuezhe Ma**

*Preprint*

**Decoupling Global and Local Representations from/for Image Generation**
PDF
Code

**Xuezhe Ma**, Xiang Kong, Shanghang Zhang, Eduard Hovy

*Preprint*

**MaCow: Masked Convolutional Generative Flow**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Xiang Kong, Shanghang Zhang, Eduard Hovy

*Proceddings of Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019)*

**FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow**
PDF
Bib
ArXiv
Code

**Xuezhe Ma*******, Chunting Zhou*****, Xian Li, Graham Neubig, Eduard Hovy

*Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)*

**MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Chunting Zhou, Eduard Hovy

*Proceedings of 7th International Conference on Learning Representations (ICLR 2019)*
(**Oral**)

**An Empirical Investigation of Structured Output Modeling for Graph-based Neural Dependency Parsing**
PDF
Bib
Code

Zhisong Zhang, **Xuezhe Ma**, Eduard Hovy

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*

**Density Matching for Bilingual Word Embedding**
PDF
Bib
ArXiv
Video
Code

Chunting Zhou, **Xuezhe Ma**, Di Wang, Graham Neubig

*Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)*

**On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing**
PDF
Bib
ArXiv
Code

Wasi Uddin Ahmad*****, Zhisong Zhang*****, **Xuezhe Ma**, Eduard Hovy, Kai-Wei Chang, Nanyun Peng

**Stack-Pointer Networks for Dependency Parsing**
PDF
Bib
ArXiv
Slides
Code

**Xuezhe Ma**, Zecong Hu, Jingzhou Liu, Nanyun Peng, Graham Neubig, Eduard Hovy

*Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)*
(**Oral**)

**Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML**
ArXiv

**Xuezhe Ma**, Pengcheng Yin, Jingzhou Liu, Graham Neubig, Eduard Hovy

*ArXiv Preprint*

**Dropout with Expectation-Linear Regularization**
PDF
Bib
ArXiv

**Xuezhe Ma**, Yingkai Gao, Zhiting Hu, Yaoliang Yu, Yuntian Deng, Eduard Hovy

*Proceedings of 5th International Conference on Learning Representations (ICLR 2017)*

**Neural Probabilistic Model for Non-projective MST Parsing**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Eduard Hovy

*Proceedings of 5th International Joint Conference on Natural Language Processing (IJCNLP 2017)*

**An Interpretable Knowledge Transfer Model for Knowledge Base Completion**
PDF
Bib
ArXiv
Video

Qizhe Xie, **Xuezhe Ma**, Zihang Dai, Eduard Hovy

*Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017)*
(**Oral**)

**End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Eduard Hovy

*Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)*

**Harnessing Deep Neural Networks with Logic Rules**
PDF
Bib
ArXiv

Zhiting Hu, **Xuezhe Ma**, Zhengzhong Liu, Eduard Hovy, Eric P. Xing

*Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)*
(**Outstanding Paper Award**)

**Unsupervised Ranking Model for Entity Coreference Resolution**
PDF
Bib
ArXiv

Xuezhe Ma, Zhengzhong Liu, Eduard Hovy

*Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016)*
(**Oral**)

**Efficient Inner-to-outer Greedy Algorithm for Higher-order Labeled Dependency Parsing**
PDF
Bib

**Xuezhe Ma**, Eduard Hovy

*Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015)*

**Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization**
ArXiv
Code

**Xuezhe Ma**

*Preprint*

**Decoupling Global and Local Representations from/for Image Generation**
PDF
Code

**Xuezhe Ma**, Xiang Kong, Shanghang Zhang, Eduard Hovy

*Preprint*

**MaCow: Masked Convolutional Generative Flow**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Xiang Kong, Shanghang Zhang, Eduard Hovy

*Proceddings of Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019)*

**FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow**
PDF
Bib
ArXiv
Code

**Xuezhe Ma*******, Chunting Zhou*****, Xian Li, Graham Neubig, Eduard Hovy

*Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)*

**MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Chunting Zhou, Eduard Hovy

*Proceedings of 7th International Conference on Learning Representations (ICLR 2019)*
(**Oral**)

**Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation**
PDF
Bib
Code

Zhiting Hu, Haoran Shi, Bowen Tan, Wentao Wang, Zichao Yang, Tiancheng Zhao, Junxian He, Lianhui Qin, Di Wang, **Xuezhe Ma**, Zhengzhong Liu, Xiaodan Liang Wangrong Zhu, Devendra Singh Sachan, Eric P. Xing

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*
(**Best Demo Paper Nomination**)

**An Empirical Investigation of Structured Output Modeling for Graph-based Neural Dependency Parsing**
PDF
Bib
Code

Zhisong Zhang, **Xuezhe Ma**, Eduard Hovy

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*

**Choosing Transfer Languages for Cross-Lingual Learning**
PDF
Bib

Yu-Hsiang Lin, Chian-Yu Chen, Jean Lee, Zirui Li, Yuyan Zhang, Mengzhou Xia, Shruti Rijhwani, Junxian He, Zhisong Zhang, **Xuezhe Ma**, Antonios Anastasopoulos, Patrick Littell, Graham Neubig

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*

**Density Matching for Bilingual Word Embedding**
PDF
Bib
ArXiv
Video
Code

Chunting Zhou, **Xuezhe Ma**, Di Wang, Graham Neubig

**On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing**
PDF
Bib
ArXiv
Code

Wasi Uddin Ahmad*****, Zhisong Zhang*****, **Xuezhe Ma**, Eduard Hovy, Kai-Wei Chang, Nanyun Peng

**Stack-Pointer Networks for Dependency Parsing**
PDF
Bib
ArXiv
Slides
Code

**Xuezhe Ma**, Zecong Hu, Jingzhou Liu, Nanyun Peng, Graham Neubig, Eduard Hovy

*Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)*
(**Oral**)

**Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML**
ArXiv

**Xuezhe Ma**, Pengcheng Yin, Jingzhou Liu, Graham Neubig, Eduard Hovy

*ArXiv Preprint*

**Dropout with Expectation-Linear Regularization**
PDF
Bib
ArXiv

**Xuezhe Ma**, Yingkai Gao, Zhiting Hu, Yaoliang Yu, Yuntian Deng, Eduard Hovy

*Proceedings of 5th International Conference on Learning Representations (ICLR 2017)*

**Neural Probabilistic Model for Non-projective MST Parsing**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Eduard Hovy

*Proceedings of 5th International Joint Conference on Natural Language Processing (IJCNLP 2017)*

**An Interpretable Knowledge Transfer Model for Knowledge Base Completion**
PDF
Bib
ArXiv
Video

Qizhe Xie, **Xuezhe Ma**, Zihang Dai, Eduard Hovy

*Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017)*
(**Oral**)

**CMU System for Entity Discovery and Linking at TAC-KBP**
PDF

**Xuezhe Ma** and Nicolas R Fauceglia, Yiu-Chang Lin, Eduard Hovy

*Proceedings of Text Analytics Conference (TAC 2017)*

**End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Eduard Hovy

*Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)*

**Harnessing Deep Neural Networks with Logic Rules**
PDF
Bib
ArXiv

Zhiting Hu, **Xuezhe Ma**, Zhengzhong Liu, Eduard Hovy, Eric P. Xing

*Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)*
(**Outstanding Paper Award**)

**Unsupervised Ranking Model for Entity Coreference Resolution**
PDF
Bib
ArXiv

Xuezhe Ma, Zhengzhong Liu, Eduard Hovy

*Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016)*
(**Oral**)

**Efficient Inner-to-outer Greedy Algorithm for Higher-order Labeled Dependency Parsing**
PDF
Bib

**Xuezhe Ma**, Eduard Hovy

*Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015)*

**Word Sense Disambiguation via PropStore and OntoNotes for Event Mention Detection**
PDF
Bib

Nicolas R Fauceglia, Yiu-Chang Lin, **Xuezhe Ma** and Eduard Hovy

*Proceedings of the 3rd Workshop on Events: Definition, Detection, Coreference and Representation (NAACL 2015)*

**Unsupervised Dependency Parsing with Transferring Distribution via Parallel Guidance and Entropy Regularization**
PDF
Bib

**Xuezhe Ma** and Fei Xia

*Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014)*
(**Oral**)

**Dependency Parser Adaptation with Subtrees from Auto-Parsed Target Domain Data**
PDF
Bib

**Xuezhe Ma** and Fei Xia

*Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013)*

**Fourth-Order Dependency Parsing**
PDF
Bib

**Xuezhe Ma** and Hai Zhao

*Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012)*

**Probabilistic Models for Higher-order Projective Dependency Parsing**
ArXiv

**Xuezhe Ma** and Hai Zhao

*Technical Report: ArXiv Preprint*

**Decoupling Global and Local Representations from/for Image Generation**
PDF
Code

**Xuezhe Ma**, Xiang Kong, Shanghang Zhang, Eduard Hovy

*Preprint*

**FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow**
PDF
Bib
ArXiv
Code

**Xuezhe Ma*******, Chunting Zhou*****, Xian Li, Graham Neubig, Eduard Hovy

*Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)*

**MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Chunting Zhou, Eduard Hovy

*Proceedings of 7th International Conference on Learning Representations (ICLR 2019)*
(**Oral**)

**Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML**
ArXiv

**Xuezhe Ma**, Pengcheng Yin, Jingzhou Liu, Graham Neubig, Eduard Hovy

*ArXiv Preprint*

**FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow**
PDF
Bib
ArXiv
Code

**Xuezhe Ma*******, Chunting Zhou*****, Xian Li, Graham Neubig, Eduard Hovy

*Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)*

**Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation**
PDF
Bib
Code

Zhiting Hu, Haoran Shi, Bowen Tan, Wentao Wang, Zichao Yang, Tiancheng Zhao, Junxian He, Lianhui Qin, Di Wang, **Xuezhe Ma**, Zhengzhong Liu, Xiaodan Liang Wangrong Zhu, Devendra Singh Sachan, Eric P. Xing

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*
(**Best Demo Paper Nomination**)

**An Empirical Investigation of Structured Output Modeling for Graph-based Neural Dependency Parsing**
PDF
Bib
Code

Zhisong Zhang, **Xuezhe Ma**, Eduard Hovy

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*

**Choosing Transfer Languages for Cross-Lingual Learning**
PDF
Bib

Yu-Hsiang Lin, Chian-Yu Chen, Jean Lee, Zirui Li, Yuyan Zhang, Mengzhou Xia, Shruti Rijhwani, Junxian He, Zhisong Zhang, **Xuezhe Ma**, Antonios Anastasopoulos, Patrick Littell, Graham Neubig

*Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)*

**Density Matching for Bilingual Word Embedding**
PDF
Bib
ArXiv
Video
Code

Chunting Zhou, **Xuezhe Ma**, Di Wang, Graham Neubig

**On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing**
PDF
Bib
ArXiv
Code

Wasi Uddin Ahmad*****, Zhisong Zhang*****, **Xuezhe Ma**, Eduard Hovy, Kai-Wei Chang, Nanyun Peng

**Stack-Pointer Networks for Dependency Parsing**
PDF
Bib
ArXiv
Slides
Code

**Xuezhe Ma**, Zecong Hu, Jingzhou Liu, Nanyun Peng, Graham Neubig, Eduard Hovy

*Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)*
(**Oral**)

**Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML**
ArXiv

**Xuezhe Ma**, Pengcheng Yin, Jingzhou Liu, Graham Neubig, Eduard Hovy

*ArXiv Preprint*

**Neural Probabilistic Model for Non-projective MST Parsing**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Eduard Hovy

*Proceedings of 5th International Joint Conference on Natural Language Processing (IJCNLP 2017)*

**An Interpretable Knowledge Transfer Model for Knowledge Base Completion**
PDF
Bib
ArXiv
Video

Qizhe Xie, **Xuezhe Ma**, Zihang Dai, Eduard Hovy

*Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017)*
(**Oral**)

**CMU System for Entity Discovery and Linking at TAC-KBP**
PDF

**Xuezhe Ma** and Nicolas R Fauceglia, Yiu-Chang Lin, Eduard Hovy

*Proceedings of Text Analytics Conference (TAC 2017)*

**End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF**
PDF
Bib
ArXiv
Code

**Xuezhe Ma**, Eduard Hovy

*Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)*

**Harnessing Deep Neural Networks with Logic Rules**
PDF
Bib
ArXiv

Zhiting Hu, **Xuezhe Ma**, Zhengzhong Liu, Eduard Hovy, Eric P. Xing

*Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)*
(**Outstanding Paper Award**)

**Unsupervised Ranking Model for Entity Coreference Resolution**
PDF
Bib
ArXiv

Xuezhe Ma, Zhengzhong Liu, Eduard Hovy

*Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016)*
(**Oral**)

**Efficient Inner-to-outer Greedy Algorithm for Higher-order Labeled Dependency Parsing**
PDF
Bib

**Xuezhe Ma**, Eduard Hovy

*Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015)*

**Word Sense Disambiguation via PropStore and OntoNotes for Event Mention Detection**
PDF
Bib

Nicolas R Fauceglia, Yiu-Chang Lin, **Xuezhe Ma** and Eduard Hovy

**Unsupervised Dependency Parsing with Transferring Distribution via Parallel Guidance and Entropy Regularization**
PDF
Bib

**Xuezhe Ma** and Fei Xia

*Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014)*
(**Oral**)

**Dependency Parser Adaptation with Subtrees from Auto-Parsed Target Domain Data**
PDF
Bib

**Xuezhe Ma** and Fei Xia

*Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013)*

**Fourth-Order Dependency Parsing**
PDF
Bib

**Xuezhe Ma** and Hai Zhao

*Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012)*

**Probabilistic Models for Higher-order Projective Dependency Parsing**
ArXiv

**Xuezhe Ma** and Hai Zhao

*Technical Report: ArXiv Preprint*

**Invited Talk at ( USC ISI, Duke CS, UMich CSE)** (3/2020 - 4/2020): Towards Structured-Infused and Disentangled Representation Learning

**Microsoft Research AI Breakthroughs** (9/2019): FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow

This website uses the website design and template by Martin Saveski