FURI | Spring 2021

Understanding Machine vs Human Generated Text in News Articles

Data icon, disabled. Four grey bars arranged like a vertical bar chart.

The aim of this research is to create a model that can take a text input to classify the content of the message into two categories, Machine Generated and Human Written. This research uses Hierarchical Attention Networks (HAN) trained with PolitiFact data to create the predictive model. This model can be run by websites for helping prevent disinformation. In the future, HAN can be trained to infer not only from words and sentences but also from the paragraphs, which can potentially provide more insight.

Student researcher

Mertay Dayanc

Computer science

Hometown: Ankara, Incek, Turkey

Graduation date: Fall 2021