NKCorpus: Extracting High Quality Large Chinese Dataset from Web Data
LI Dongwen,ZHONG Zhenyu,SHEN Junyu,WANG Haotian,SUN Yufei,ZHANG Yuzhi
Frontiers of Data and Computing . 2022, (3): 30 -45 .  DOI: 10.11871/jfdc.issn.2096-742X.2022.03.003