Detecting AI-Generated Text: NUARI and Norwich University Researchers Publish Study

Detecting AI-Generated Text: NUARI and Norwich University Researchers Publish Study

 

We are happy to share an article authored by NUARI and Norwich University researchers for the IEEE Transactions on Artificial Intelligence journal.

Dr. Ali Al Bataineh, Director, Artificial Intelligence Center at Norwich University, and Dr. Kristen Pedersen, Chief Research Officer, NUARI, Rachel Sickler, Senior Developer and Machine Learning Engineer, NUARI, and Data Scientist Kerry Kurcz.

From the paper's abstract – "Artificial Intelligence (AI) is increasingly embedded in our everyday lives. With the introduction of ChatGPT in November 2022 by OpenAI, people can now ask a bot to generate comprehensive writeups in seconds. This new transformative technology also introduces ethical, safety, and other general concerns. It is important to harness the power of AI to understand whether a body of text is generated by AI or whether it is organically human. In this paper, we create and curate a medium-sized dataset of 10,000 records containing both human and machine-generated text and utilize it to train a reliable model to accurately distinguish between the two."

Additionally, some points that should be highlighted from the paper are as follows:

  • The dataset that our researchers created was one that other researchers can use – they automated that process so that fellow researchers could create their datasets using our team's process.
  • Models that can legitimately identify AI-written text will become more important as time goes on and more of the internet (big tech's training ground) becomes saturated with this text – it will make LLMs worse (model collapse) if we don't remove it from training data.
  • The literature review found that the models that generated the text are most apt at identifying whether it was AI-generated. Still, our team's results showed that simple machine learning models (XGBoost, logistic regression, and random forest) did better for our team's dataset. People think that only deep learning can do a good job. Still, our research showed that these less computationally expensive and less complex models can do the job too, maybe even better, given the right scenario.

 

The full paper can be downloaded from the IEEE website.

 

Jakon Hays

Jakon Hays

Jakon is the Senior Marketing and Strategic Communications Specialist for Norwich University Applied Research Institutes (NUARI). He develops and executes digital and social media awareness initiatives promoting NUARI's mission of enabling a resilient society through rapid research, development, and education in cybersecurity, defense technologies, and information warfare.

More posts by Jakon Hays

Latest News

New NCPC Course Launch: FEMA-Certified AWR-432 Now Available

NUARI and Norwich University -  Pioneering Innovation at the Cyber Fusion Center

On April 25, 2025, Norwich University officially broke ground on the new 13,000-square-foot Cyber...

NUARI and Cyber Florida Announce Strategic Partnership to Enhance Cybersecurity and Infrastructure Resilience

We are pleased to announce a new partnership between NUARI and The Florida Center for...

Questions?

GET IN TOUCH
Back To Top

You are being redirected to an external site.

NUARI You are now leaving NUARI - Norwich University Applied Research Institutes provides links to websites of other organizations for convenience and for informational purposes. A link does not constitute an endorsement of the content, viewpoint, policies, products, or services of that website. Once you link to another website not maintained by NUARI, you are subject to the terms and conditions of that website, including but not limited to its privacy policy.

You will be redirected to

Click the link above to continue or CANCEL