Featured Publications

AI-Generated vs. Human Text: Introducing
a New Dataset for Benchmarking and Analysis

Having the power to differentiate between human and machine generated text is crucial. Due to the rapid proliferation of AI-generated content, which poses significant challenges in distinguishing it from human-authored text, our paper provides the research community with a balanced dataset of 10,000 records that include both AI-generated and human-generated text. The dataset offers a standardized benchmark for training and evaluating AI models that detect AI-generated content and contributes to the ongoing discussions in AI ethics and the field of natural language processing. This paper both combats plagiarism and encourages the education of writing among young learners.

The AI Mindset: Thriving Within Civilization's Next Big Disruption

Find out how AI can help you and your business thrive from global experts in a variety of AI domains. Prepare yourself and your company with the best advice on AI in law, ethics, privacy, implementation, product, and more. Available on Amazon (disponible en français ici)!


Using Natural Language Processing for an Artificially Intelligent Wine Connoisseur

How do various vectorization and machine learning methods compare to ChatGPT as a wine recommendation system?


See how Doc2Vec outscored OpenAI's ChatGPT, Google's BERT, and other embedding strategies in 2022 from the matrix shown here.


LDA Topic Modeling on Product Reviews

Best paper award received from the Electronic Commerce Research using topic modeling on consumer feedback reviews in both English and Mandarin to increase consumer trust in products to maximize revenues in collaboration with many talented authors, including Jian Mou. Read more here.