Biography
Haifeng WANG is Chief Technology Officer of Baidu, Managing Director of Baidu Research and Head of the National Engineering Laboratory for Deep Learning Technology and Applications. He leads Baidu’s Artificial Intelligence Group (AIG), overseeing research and development of artificial intelligence technologies.
Dr. Wang served as president of the Association for Computational Linguistics (ACL) in 2013 and became the founding chair of the Asia-Pacific Chapter of the ACL in 2018. He has served as area chair, tutorial chair, and program chair, among other roles, at top AI conferences including IJCAI, KDD, and ACL. He is also an adjunct professor at several top universities.
Dr. Wang is an ACL Fellow, IEEE Fellow, CAAI Fellow and Academician of the International Eurasian Academy of Science. He was awarded the National Technological Invention Award, National Science and Technology Progress Award, China Patent Gold Award, Guanghua Engineering Science and Technology Award, and the Outstanding Contribution Award of the Wu Wenjun AI Science and Technology Award.
Dr. Wang has published more than 100 academic papers in top journals and at AI conferences, including IJCAI (5 papers), AAAI (12 papers), Nature Machine Intelligence, ACL, KDD, and CVPR, and holds over 100 Chinese and international patents. He has given more than 100 invited talks at academic conferences and industry events.
Research interests
Natural Language Processing, Machine Translation, Knowledge Graph, Search, Deep Learning
Activities
- President, Beijing Artificial Intelligence Industry Alliance, since 2021
- Vice President, Artificial Intelligence Industry Alliance, since 2018
- Member, IEEE Industry Advisory Board, since 2018
- Founding chair, Asia-Pacific Chapter of the ACL (AACL), 2018-2020
- Vice President, Chinese Information Processing Society of China, since 2016
- Vice President, Chinese Institute of Electronics, since 2016
- Executive committee member, Asian Federation of Natural Language Processing (AFNLP), 2015-2016
- Chair of coordinating committee, ACL 2014
- Area co-chair, IJCAI 2013
- President, ACL, 2013
- Industrial Liaison co-chair, KDD 2012
- Associate Editor, ACM Transactions on Intelligent Systems and Technology (TIST), since 2012
- Executive committee member, ACL, 2011-2014
- Program co-chair, IJCNLP 2011
- Program chair, CWMT 2011
- Industrial Track co-chair, SIGIR 2011
- Tutorial co-chair, ACL 2010
- Workshop co-chair, COLING 2010
- Associate Editor, Transactions on Asian Language Information Processing (TALIP), 2010-2011
- Area co-chair, ACL-IJCNLP 2009
Honors
- Fellow, IEEE, 2022, for contributions and leadership in natural language processing and AI technologies.
- National Technological Invention Award of China, 2021
- China Patent Gold Award, 2021
- Academician, International Eurasian Academy of Science, 2021
- Guanghua Engineering Science and Technology Award, 2020, for significant contributions to large-scale industrial applications of AI technologies.
- Fellow, Chinese Association for Artificial Intelligence, 2018
- Outstanding Contribution Award of the Wu Wenjun Artificial Intelligence Science and Technology Award, 2018
- Fellow, ACL, 2016, for significant contributions to MT, NLP and search engines in both academia and industry, and to the growth of the ACL in Asia.
- National Science & Technology Progress Award of China, 2015
Publications (Selected)
- Geometry-enhanced Molecular Representation Learning for Property Prediction. Nature Machine Intelligence. 2022.
- TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations. AAAI-2022.
- ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora. EMNLP-2021.
- DuRecDial 2.0: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Conversational Recommendation. EMNLP-2021.
- GEDIT: Geographic-Enhanced and Dependency-Guided Tagging for Joint POI and Accessibility Extraction at Baidu Maps. CIKM-2021.
- Progress in Machine Translation. Engineering. 2021.
- Discovering Dialog Structure Graph for Coherent Dialog Generation. ACL/IJCNLP-2021.
- ERNIE-Doc: A Retrospective Long-Document Modeling Transformer. ACL/IJCNLP-2021.
- Link Prediction on N-ary Relational Facts: A Graph-based Approach. ACL/IJCNLP-2021.
- ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding. NAACL-2021.
- HGAMN: Heterogeneous Graph Attention Matching Network for Multilingual POI Retrieval at Baidu Maps. KDD-2021.
- ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graph. AAAI-2021.
- A Unified Pre-training Framework for Conversational AI. AAAI-2021.
- Learning to Select External Knowledge with Multi-Scale Negative Sampling. AAAI-2021.
- The Development of Deep Learning Technologies. Springer Nature. 2020.
- Enhancing Dialog Coherence with Event Graph Grounded Content Planning. IJCAI-2020.
- ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation. IJCAI-2020.
- Multi-Task Learning for Entity Recommendation and Document Ranking in Web Search. ACM TIST. 2020.
- Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation. ACL-2020.
- Towards Conversational Recommendation over Multi-Type Dialogs. ACL-2020.
- ERNIE 2.0: A Continual Pre-training Framework for Language Understanding. AAAI-2020.
- Knowledge Graph Grounded Goal Planning for Open-Domain Conversation Generation. AAAI-2020.
- Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding. AAAI-2020.
- Proactive Human-Machine Conversation with Explicit Conversation Goals. ACL-2019.
- Joint Extraction of Entities and Overlapping Relations using Position-Attentive Sequence Labeling. AAAI-2019.
- Modeling Coherence for Discourse Neural Machine Translation. AAAI-2019.
- Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs. EMNLP-2019.
- Multi-agent Learning for Neural Machine Translation. EMNLP-2019.
- MONOPOLY: Learning to Price Public Facilities for Revaluing Private Properties with Large-Scale Urban Data. CIKM-2019.
- Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification. ACL-2018.
- Improving Entity Recommendation with Search Log and Multi-Task Learning. IJCAI-2018.
- Entity Highlight Generation as Statistical and Neural Machine Translation. IEEE/ACM TASLP. 2018.
- Learning to Explain Entity Relationships by Pairwise Ranking with Convolutional Neural Networks. IJCAI-2017.
- Multi-task Attention-based Neural Networks for Implicit Discourse Relationship Representation and Identification. EMNLP-2017.
- Active Learning for Dependency Parsing with Partial Annotation. ACL-2016.
- Generating Recommendation Evidence Using Translation Model. IJCAI-2016.
- A Universal Framework for Inductive Transfer Parsing across Multi-typed Treebanks. COLING-2016.
- Improved Neural Machine Translation with SMT Features. AAAI-2016.
- A Representation Learning Framework for Multi-Source Transfer Parsing. AAAI-2016.
- Cross-lingual Dependency Parsing Based on Distributed Representations. ACL/IJCNLP-2015.
- Multi-Task Learning for Multiple Language Translation. ACL/IJCNLP-2015.
- Exploiting Collective Hidden Structures in Webpage Titles for Open Domain Entity Extraction. WWW-2015.
- On the Granularity of Dialog Strategies: Insights from Large-scale Analyses of Two Commercial Travel Information Spoken Dialog Systems. AAAI-2015.
- Learning Semantic Hierarchies via Word Embeddings. ACL-2014.
- Learning Sense-specific Word Embeddings By Exploiting Bilingual Resources. COLING-2014.
- Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Count of Phrase Pairs. EMNLP-2014.
- Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality. EMNLP-2014.
- Improving Pivot-Based Statistical Machine Translation Using Random Walk. EMNLP-2013.
- Introduction to Special Section on Paraphrasing. ACM TIST. 2013.
- Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information. ACL-2012.
- Improve SMT Quality with Automatically Extracted Paraphrase Rules. ACL-2012.
- User Behaviors Lend a Helping Hand: Learning Paraphrase Query Patterns from Search Log Sessions. COLING-2012.
- Two-word Collocation Extraction Using Monolingual Word Alignment Method. ACM TIST. 2011.
- Automatically Generating Questions from Queries for Community-based Question Answering. IJCNLP-2011.
- Enriching SMT Training Data via Paraphrasing. IJCNLP-2011.
- Reordering with Source Language Collocations. ACL-2011.
- Improving Statistical Machine Translation with Monolingual Collocation. ACL-2010.
- Paraphrasing with Search Engine Query Logs. COLING-2010.
- Collocation Extraction Using Monolingual Word Alignment Method. EMNLP-2009.
- Revisiting Pivot Language Approach for Machine Translation. ACL/IJCNLP-2009.
- Dependency Based Chinese Sentence Realization. ACL/IJCNLP-2009.
- The TCH Machine Translation System for IWSLT 2008. IWSLT-2008.
- Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora. COLING-2008.
- Dependency-Based N-Gram Models for General Purpose Sentence Realisation. COLING-2008.
- Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora. ACL-2008.
- Pivot Language Approach for Phrase-Based Statistical Machine Translation. Machine Translation. 2007.
- Comparative Study of Word Alignment Heuristics and Phrase-Based SMT. MT Summit XI, 2007.
- Log-linear Generation Models for Example-based Machine Translation. MT Summit XI, 2007.
- Recovering Non-Local Dependencies for Chinese. EMNLP/CoNLL-2007.
- Pivot Language Approach for Phrase-Based Statistical Machine Translation. ACL-2007.
- Example-Based Machine Translation Based on Tree-string Correspondence and Statistical Generation. Machine Translation. 2006.
- Boosting Statistical Word Alignment Using Labeled and Unlabeled Data. COLING/ACL-2006.
- Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs. COLING/ACL-2006.
- Alignment Model Adaptation for Domain-Specific Word Alignment. ACL-2005.
- Improving Statistical Word Alignment with a Rule-Based Machine Translation System. COLING-2004.
- Towards a Next Generation Search Engine. PRICAI-2000.
- A Unified Approach to Statistical Language Modeling for Chinese. ICASSP-2000.