ResumeNet : a learning-based framework for automatic resume quality assessment

Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resu...

Full description

Bibliographic Details
Main Authors: Luo, Yong, Zhang, Huaizheng, Wang, Yongjie, Wen, Yonggang, Zhang, Xinwen
Other Authors: School of Computer Science and Engineering
Format: Conference Paper
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/143040
_version_ 1826113431431806976
author Luo, Yong
Zhang, Huaizheng
Wang, Yongjie
Wen, Yonggang
Zhang, Xinwen
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Luo, Yong
Zhang, Huaizheng
Wang, Yongjie
Wen, Yonggang
Zhang, Xinwen
author_sort Luo, Yong
collection NTU
description Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resume quality assessment (RQA). This motivates us to develop a method for automatic RQA. Since there is also no public dataset for model training and evaluation, we build a dataset for RQA by collecting around 10K resumes, which are provided by a private resume management company. By investigating the dataset, we identify some factors or features that could be useful to discriminate good resumes from bad ones, e.g., the consistency between different parts of a resume. Then a neural-network model is designed to predict the quality of each resume, where some text processing techniques are incorporated. To deal with the label deficiency issue in the dataset, we propose several variants of the model by either utilizing the pair/triplet-based loss, or introducing some semi-supervised learning technique to make use of the abundant unlabeled data. Both the presented baseline model and its variants are general and easy to implement. Various popular criteria including the receiver operating characteristic (ROC) curve, F-measure and ranking-based average precision (AP) are adopted for model evaluation. We compare the different variants with our baseline model. Since there is no public algorithm for RQA, we further compare our results with those obtained from a website that can score a resume. Experimental results in terms of different criteria demonstrate effectiveness of the proposed method. We foresee that our approach would transform the way of future human resources management.
first_indexed 2024-10-01T03:23:07Z
format Conference Paper
id ntu-10356/143040
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:23:07Z
publishDate 2020
record_format dspace
spelling ntu-10356/1430402020-07-22T07:48:25Z ResumeNet : a learning-based framework for automatic resume quality assessment Luo, Yong Zhang, Huaizheng Wang, Yongjie Wen, Yonggang Zhang, Xinwen School of Computer Science and Engineering 2018 IEEE International Conference on Data Mining (ICDM) Engineering::Computer science and engineering Resume Quality Assessment Dataset and Features Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resume quality assessment (RQA). This motivates us to develop a method for automatic RQA. Since there is also no public dataset for model training and evaluation, we build a dataset for RQA by collecting around 10K resumes, which are provided by a private resume management company. By investigating the dataset, we identify some factors or features that could be useful to discriminate good resumes from bad ones, e.g., the consistency between different parts of a resume. Then a neural-network model is designed to predict the quality of each resume, where some text processing techniques are incorporated. To deal with the label deficiency issue in the dataset, we propose several variants of the model by either utilizing the pair/triplet-based loss, or introducing some semi-supervised learning technique to make use of the abundant unlabeled data. Both the presented baseline model and its variants are general and easy to implement. Various popular criteria including the receiver operating characteristic (ROC) curve, F-measure and ranking-based average precision (AP) are adopted for model evaluation. We compare the different variants with our baseline model. Since there is no public algorithm for RQA, we further compare our results with those obtained from a website that can score a resume. Experimental results in terms of different criteria demonstrate effectiveness of the proposed method. We foresee that our approach would transform the way of future human resources management. National Research Foundation (NRF) Accepted version This work is supported by Singapore NRF2015ENCGDCR01001-003, administrated via IMDA and NRF2015ENCGBICRD001-012, administrated via BCA. 2020-07-22T07:48:25Z 2020-07-22T07:48:25Z 2018 Conference Paper Luo, Y., Zhang, H., Wang, Y., Wen, Y., & Zhang, X. (2018). ResumeNet : a learning-based framework for automatic resume quality assessment. Proceedings of 2018 IEEE International Conference on Data Mining (ICDM), 307-316. doi:10.1109/icdm.2018.00046 978-1-5386-9160-1 https://hdl.handle.net/10356/143040 10.1109/icdm.2018.00046 2-s2.0-85061375946 307 316 en © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/icdm.2018.00046 application/pdf
spellingShingle Engineering::Computer science and engineering
Resume Quality Assessment
Dataset and Features
Luo, Yong
Zhang, Huaizheng
Wang, Yongjie
Wen, Yonggang
Zhang, Xinwen
ResumeNet : a learning-based framework for automatic resume quality assessment
title ResumeNet : a learning-based framework for automatic resume quality assessment
title_full ResumeNet : a learning-based framework for automatic resume quality assessment
title_fullStr ResumeNet : a learning-based framework for automatic resume quality assessment
title_full_unstemmed ResumeNet : a learning-based framework for automatic resume quality assessment
title_short ResumeNet : a learning-based framework for automatic resume quality assessment
title_sort resumenet a learning based framework for automatic resume quality assessment
topic Engineering::Computer science and engineering
Resume Quality Assessment
Dataset and Features
url https://hdl.handle.net/10356/143040
work_keys_str_mv AT luoyong resumenetalearningbasedframeworkforautomaticresumequalityassessment
AT zhanghuaizheng resumenetalearningbasedframeworkforautomaticresumequalityassessment
AT wangyongjie resumenetalearningbasedframeworkforautomaticresumequalityassessment
AT wenyonggang resumenetalearningbasedframeworkforautomaticresumequalityassessment
AT zhangxinwen resumenetalearningbasedframeworkforautomaticresumequalityassessment