考虑文件 example.html
example.html
<đầu> link1content link2more contend no interest 1 no interest 2 no interest 3 keyword1: unkown_content keyword2: unkown_content not interested not interested not interested keyword1: unkown_content keyword2: unkown_content đầu>