I am a Machine Learning Engineer at Canva working on search and recommender systems. Prior to that, I worked as a Data Scientist at SEEK Ltd, and before that as a Research Fellow at RMIT University. I have a background in natural language processing and information retrieval, with recent work focused mostly on representation, neural models, ranking, and NLP applications. As an industry researcher, I review for academic conferences in related areas, such as SIGIR, WSDM, KDD, and EMNLP.

I received my doctorate from National Taiwan University in 2013. A detailed track record can be found in my CV.

I'm also on Twitter and GitHub.

Publications

BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM
Zhewen Shen, Aditya Joshi, Ruey-Cheng Chen
CMCL '24 (ACL '24 Workshop) [pdf]

Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng Chen, Chia-Jung Lee
EMNLP '20 [pdf]

Correcting for Recency Bias in Job Recommendation
Ruey-Cheng Chen, Qingyao Ai, Gaya Jayasinghe, W. Bruce Croft
CIKM '19 [pdf, poster]

Joint Optimization of Cascade Ranking Models
Luke Gallagher, Ruey-Cheng Chen, Roi Blanco, J. Shane Culpepper
WSDM '19 [pdf, code]

Ranking Documents by Answer-Passage Quality
Evi Yulianti, Ruey-Cheng Chen, Falk Scholer, W. Bruce Croft, Mark Sanderson
SIGIR '18 [pdf, slides, code]

Document Summarization for Answering Non-Factoid Queries
Evi Yulianti, Ruey-Cheng Chen, Falk Scholer, W. Bruce Croft, Mark Sanderson
IEEE TKDE vol. 30, no. 1 [pdf]

RMIT at the NTCIR-13 We Want Web Task
Luke Gallagher, Joel Mackenzie, Rodger Benham, Ruey-Cheng Chen, Falk Scholer, J. Shane Culpepper
NTCIR-13 [pdf]

RMIT at the TREC CORE Track
Rodger Benham, Luke Gallagher, Joel Mackenzie, Tadele T. Damessie, Ruey-Cheng Chen, Falk Scholer, Alistair Moffat, J. Shane Culpepper
TREC 2017 [pdf]

An Empirical Analysis of Pruning Techniques
Ruey-Cheng Chen, Leif Azzopardi, Falk Scholer
CIKM '17 [pdf, poster, code]

On the Benefit of Incorporating External Features in a Neural Architecture for Answer Sentence Selection
Ruey-Cheng Chen, Evi Yulianti, Mark Sanderson, W. Bruce Croft
SIGIR '17 [pdf, poster, code]

Efficient Cost-Aware Cascade Ranking for Multi-Stage Retrieval
Ruey-Cheng Chen, Luke Gallagher, Roi Blanco, J. Shane Culpepper
SIGIR '17 [pdf, code]

RMIT at the TREC 2016 LiveQA Track
Joel Mackenzie, Ruey-Cheng Chen, J. Shane Culpepper
TREC 2016 [pdf]

Using Semantic and Context Features for Answer Summary Extraction
Evi Yulianti, Ruey-Cheng Chen, Falk Scholer, Mark Sanderson
ADCS 2016 [pdf, poster]

RMIT at the NTCIR-12 MobileClick-2: iUnit Ranking and Summarization Subtasks
Kevin Ong, Ruey-Cheng Chen, Falk Scholer
NTCIR-12 [pdf]

Beyond Factoid QA: Effective Methods for Non-Factoid Answer Sentence Retrieval
Liu Yang, Qingyao Ai, Damiano Spina, Ruey-Cheng Chen, Liang Pang, W. Bruce Croft, Jiafeng Guo, Falk Scholer
ECIR '16 [pdf, code]

RMIT at the TREC 2015 LiveQA Track
Ruey-Cheng Chen, J. Shane Culpepper, Tadele Tadela Damessie, Timothy Jones, Ahmed Mourad, Kevin Ong, Falk Scholer, Evi Yulianti
TREC 2015 [pdf]

Harnessing Semantics for Answer Sentence Retrieval
Ruey-Cheng Chen, Damiano Spina, W. Bruce Croft, Mark Sanderson, Falk Scholer
ESAIR '15 (CIKM '15 Workshop) [pdf, slides]

On Divergence Measures and Static Index Pruning
Ruey-Cheng Chen, Chia-Jung Lee, W. Bruce Croft
ICTIR '15 [pdf, slides, code]

Incremental Learning for Fully Unsupervised Word Segmentation Using Penalized Likelihood and Model Selection
Ruey-Cheng Chen
arXiv, 2014 [pdf]

An Improved MDL-Based Compression Algorithm for Unsupervised Word Segmentation
Ruey-Cheng Chen
ACL '13 [pdf, poster]

An Information-Theoretic Account of Static Index Pruning
Ruey-Cheng Chen, Chia-Jung Lee
SIGIR '13 [pdf, slides, code]

Information Preservation and Its Application to Natural Language Processing
Ruey-Cheng Chen
National Taiwan University, 2013 [pdf]

Information Preservation in Static Index Pruning
Ruey-Cheng Chen, Chia-Jung Lee, Chiung-Min Tsai, Jieh Hsiang
CIKM '12 [pdf, poster, code]

A Regularized Compression Method to Unsupervised Word Segmentation
Ruey-Cheng Chen, Chiung-Min Tsai, Jieh Hsiang
SIGMORPHON '12 (NAACL '12 Workshop) [pdf]

Sampling the Web as Training Data for Text Classification
Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, Pu-Jen Cheng
International Journal of Digital Library Systems, 1(4) [publisher]

Relevance Model Revisited: With Multiple Document Representations
Ruey-Cheng Chen, Chiung-Min Tsai, Jieh Hsiang
AIRS 2010 [pdf]

An Adaptation of Topic Modeling to Sentences
Ruey-Cheng Chen, Reid Swanson, Andrew S. Gordon
arXiv, 2010 [pdf]

Query Formulation by Selecting Good Terms
Chia-Jung Lee, Yi-Chun Lin, Ruey-Cheng Chen, Pei-Sen Liu, Pu-Jen Cheng
ROCLING 2009 [pdf]

Web Mining for Unsupervised Classification
Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, Pu-Jen Cheng, Pei-Sen Liu
ROCLING 2009 [pdf]

Selecting Effective Terms for Query Formulation
Chia-Jung Lee, Yi-Chun Lin, Ruey-Cheng Chen, Pu-Jen Cheng
AIRS 2009 [pdf]

A Term Dependency-Based Approach for Query Terms Ranking
Chia-Jung Lee, Ruey-Cheng Chen, Shao-Hang Kao, Pu-Jen Cheng
CIKM '09 [pdf]

LiveImage: Organizing Web Images by Relevant Concepts
Shuo-Peng Liao, Pu-Jen Cheng, Ruey-Cheng Chen, Lee-Feng Chien
Workshop on the Sciences of the Artificial 2005 [pdf]

Translating Unknown Queries with Web Corpora for Cross-Language Information Retrieval
Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, Lee-Feng Chien
SIGIR '04 [pdf]

On the Verification of Wireless Transaction Protocol Using SGM and RED
Pao-Ann Hsiung, Farn Wang, Ruey-Cheng Chen
RTCSA '00 [pdf]