Navigating Artificial Intelligence (AI) Integration in English Language Teaching: Challenges, Opportunities and Future Directions

Main Article Content

Daniel Ginting
Rudi Hartono
Iskandar Iskandar

Abstract

This article overviews a study on integrating generative AI tools into education. English language education is critical for AI integration due to its reliance on text-based learning, communication skills and adaptive instructional approaches. Employing a systematic literature review to analyse existing research on AI tools in English language education, it focuses on their benefits and challenges. Articles from Scopus-indexed journals and other scholarly sources published between 2021 and 2024 were reviewed. The study discusses four significant findings: the classification of generative AI, key research topics on AI, the role of AI in cognitive offloading and academic dishonesty and the phenomenon of AI-generated hallucinations. The findings highlight the need for educators to stay updated with AI advancements and adapt their teaching practices accordingly. While AI offers opportunities for personalised learning, it also raises concerns about academic integrity and the reliability of AI-generated content. The review is limited to studies published between 2021 and 2024 and may not encompass all relevant research. Additionally, the focus is primarily on English language education, which may not fully represent the impact of AI tools across other educational contexts. Future research could explore the broader implications of AI integration across different subject areas and examine its long-term effects on pedagogy and student learning outcomes.

Article Details

Section
Articles

References

V., BehnamGhader, P., Lu, X. H., Meade, N., & Reddy, S. (2024). Evaluating correctness and faithfulness of instruction-following models for question answering. Transactions of the Association for Computational Linguistics, 12, 681–699. https://doi.org/10.1162/tacl_a_00667

Agomuoh, F. (2023, March 14). The best ChatGPT alternatives (according to ChatGPT). DigitalTrends. https://www.digitaltrends.com/computing/the-best-chatgptalternatives-according-to-chatgpt/

Ahsan, K., Akbar, S., & Kam, B. (2022). Contract cheating in higher education: A systematic literature review and future research agenda. Assessment & Evaluation in Higher Education, 47(4), 523–539. https://doi.org/10.1080/02602938.2021.1931660

Alharbi, M. (2024). The role of artificial intelligence in advancing English as a foreign language teaching at Saudi universities. World Journal on Educational Technology: Current Issues, 16(3), 181–200. https://doi.org/10.18844/wjet.v16i3.9311

Ali, N. B., & Petersen, K. (2014). Evaluating strategies for study selection in systematic literature studies. In Proceedings of the 8th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM ‘14). (Article 45, pp. 1–4). New York: Association for Computing Machinery. https://doi.org/10.1145/2652524.2652557

Alkaissi, H., & McFarlane, S. I. (2023). Artificial hallucinations in ChatGPT: Implications in scientific writing. Cureus Journal of Medical Science 15(2): e35179. https://doi.org/10.7759/cureus.35179

Assarroudi, A., Heshmati Nabavi, F., Armat, M. R., Ebadi, A., & Vaismoradi, M. (2018). Directed qualitative content analysis: The description and elaboration of its underpinning methods and data analysis process. Journal of Research in Nursing, 23(1), 42–55. https://doi.org/10.1177/1744987117741667

Aydın, Ö., & Karaarslan, E. (2023). Is ChatGPT leading generative AI? What is beyond expectations? Academic Platform Journal of Engineering and Smart Systems, 11(3), 118–134. https://doi.org/10.21541/apjess.1293702

Beasley, E. M. (2014). Students reported for cheating explain what they think would have stopped them. Ethics & Behavior, 24(3), 229–252. https://doi.org/10.1080/10508422.2013.845533

Bowman, E. (2023, January 9). A college student created an app to tell whether AI wrote an essay. NPR. https://www.npr.org/2023/01/09/1147549845/gptzero-ai-chatgptedward-tian-plagiarism

Cooper, K. (2021, November 1). OpenAI GPT-3: Everything you need to know. Springboard. https://www.springboard.com/blog/data-science/machine-learning-gpt-3-open-ai/

Cotton, D. R., Cotton, P. A., & Shipway, J. R. (2023, March 13). Chatting and cheating: Ensuring academic integrity in the era of ChatGPT. EdArXiv. https://doi.org/10.35542/osf.io/mrz8h

Eysenbach, G. (2023). The role of ChatGPT, generative language models, and artificial intelligence in medical education. JMIR Medical Education, 9, e46885. https://doi.org/10.2196/46885

Fowler, G. A. (2023, August 7). AI is acting ‘pro-anorexia’, and tech companies are not stopping it. Washington Post. Retrieved from https://link.gale.com/apps/doc/A759935629/AONE?u=anon~6a1f13d5&sid=googleScholar&xid=fb70a03f

Gao, C. A., Howard, F. M., Markov, N. S., Dyer, E. C., Ramesh, S., Luo, Y., Pearson, A. T., & Knight, K. R. (2023). Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers. npj Digital Medicine, 6, 75. https://doi.org/10.1038/s41746-023-00819-6

García-Peñalvo, F.J. (2023). The perception of artificial intelligence in educational contexts after the launch of ChatGPT: Disruption or panic? Education in the Knowledge Society, 24, e31279. https://doi.org/10.14201/eks.31279

Gayed, J. M., Carlon, M. K. J., Oriola, A. M., & Cross, J. S. (2022). Exploring an AI-based writing assistant’s impact on English language learners. Computers & Education: Artificial Intelligence, 3, 100055. https://doi.org/10.1016/j.caeai.2022.100055

Gillard, C., & Rorabaugh, P. (2023, February 3). You are not going to like how colleges respond to ChatGPT. Slate. https://slate.com/technology/2023/02/chat-gpt-cheatingcollege-ai-detection.html

Ginting, D., & Saukah, A. (2016). Tests of writing in the school examination in upper secondary schools. SAGE Open, 6(4), 1–7. https://doi.org/10.1177/2158244016673130

Gregorcic, B., & Pendrill, A.-M. (2023). ChatGPT and the frustrated Socrates. Physics Education, 58(3), 035021. https://doi.org/10.1088/1361-6552/acc299

Haensch, A. C., Ball, S., Herklotz, M., & Kreuter, F. (2023). Seeing ChatGPT through students’ eyes: An analysis of TikTok data. In 2023 Big Data Meets Survey Science (BigSurv) (pp. 1–8). IEEE. https://doi.org/10.1109/BigSurv59479.2023.10486710

Hofmann, R., Wimmer, C., Schlecht, J., Eskofier, B. M., & Stanzel, F. (2023). ChatGPT, Bing, and medical education: Evaluating the performance of AI models on the German Medical State Examination of 2022. JMIR Medical Education, 9, e46482. https://doi.org/10.2196/46482

Holmes, W., Bialik, M., & Fadel, C. (2019). Artificial intelligence in education: Promises and implications for teaching and learning. Center for Curriculum Redesign. https://unesdoc.unesco.org

Howarth, J. (2025, August 11). ChatGPT-5 has arrived: What OpenAI’s most powerful AI model can actually do. Exploding Topics. https://explodingtopics.com/blog/newchatgpt-release-date

Hu, J., Liu, F., Chu, C., & Chang, Y. (2023). Rapid survey study: Health care trainees’ and professionals’ perceptions of ChatGPT in improving medical knowledge training. Journal of Medical Internet Research, 25, e49385. https://doi.org/10.2196/49385

Hutson, M. (2021). Robo-writers: The rise and risks of language-generating AI. Nature, 591(7848), 22–25. https://doi.org/10.1038/d41586-021-00530-0

Ism, I. (2025, February 13). GPT-5: Everything we know about OpenAI’s new model. Chatbase. https://www.chatbase.co/blog/gpt-5

Jaboob, M. (2024). Integration of generative AI techniques and applications in student behavior and cognitive achievement in Arab higher education. International Journal of Human-Computer Interaction, 41(1), 353–366. https://doi.org/10.1080/10447318.2023.2300016

Kabilan, M. K., & Veratharaju, K. (2013). Professional development needs of primary school English-language teachers in Malaysia. Professional Development in Education, 39(3), 330-351. https://doi.org/10.1080/19415257.2012.762418

Kaddour, J., Harris, J., Mozes, M., Bradley, H., Raileanu, R., & McHardy, R. (2023). Challenges and applications of large language models. ArXiv. https://arxiv.org/abs/2307.10169

Kaswan, K. S. (2024). AI in personalised learning. Advances in Technological Innovations in Higher Education: Theory and Practices, 103–117. https://doi.org/10.1201/9781003376699-9

Khalil, M., & Er, E. (2023). Will ChatGPT get you caught? Rethinking plagiarism detection. arXiv preprint. Retrieved on April 9, 2023, from https://arxiv.org/abs/2302.04335

Knopf, J. W. (2006). Doing a literature review. PS: Political Science & Politics, 39(1), 127–132.

LaCour, M., Cantú, N. G., & Davis, T. (2019). When calculators lie: A demonstration of uncritical calculator usage among college students and factors that improve performance. PLOS ONE, 14(10), e0223736. https://doi.org/10.1371/journal.pone.0223736

LazyProgrammer. (2025). GPT-5: What we know and why it is delayed. Retrieved from https://lazyprogrammer.me/gpt-5/

Li, K., Patel, O., Vi´egas, F., Pfister, H., & Wattenberg, M. (2023). Inference-time intervention: Eliciting truthful answers from a language model. ArXiv. https://arxiv.org/abs/2306.03341

Liu, J., Liu, A., Lu, X., Welleck, S., West, P., Le Bras, R., Choi, Y., & Hajishirzi, H. (2022). Generated knowledge prompting for commonsense reasoning [Paper presentation]. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 3154–3169, Dublin, Ireland: Association for Computational Linguistics.

Lodge, J. M. (2023, May 21). Cheating with generative AI: Shifting focus from means and opportunity to motive [LinkedIn page]. LinkedIn. https://www.linkedin.com/pulse/cheating-generative-ai-shifting-focus-from-means-motive-lodge/

Min, B., Ross, H., Sulem, E., Veyseh, A. P. B., Nguyen, T. H., Sainz, O., ... & Roth, D. (2023). Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys, 56(2), 1–40.

Muhlgay, D., Ram, O., Magar, I., Levine, Y., Ratner, N., Belinkov, Y., Abend, O., Leyton-Brown, K., Shashua, A., & Shoham, Y. (2023). Generating benchmarks for factuality evaluation of language models. ArXiv. https://arxiv.org/abs/2307.06908

Ng, D. T. K. (2021). Conceptualising AI literacy: An exploratory review. Computers and Education: Artificial Intelligence, 2. https://doi.org/10.1016/j.caeai.2021.100041

Okonkwo, C. W., & Ade-Ibijola, A. (2021). Chatbots applications in education: A systematic review. Computers and Education: Artificial Intelligence, 2, 100033. https://doi.org/10.1016/j.caeai.2021.100033

OpenAI. (2023). GPT-4 technical report. Retrieved from https://openai.com/research/gpt-4

OpenAI. (n.d.). GPT-4 Turbo in the OpenAI API. OpenAI Help Center. https://help.openai.com/en/articles/8555510-gpt-4-turbo-in-the-openai-api

Oppenlaender, J. (2022). The creativity of text-to-image generation. Proceedings of the 25th International Academic Mindtrek conference (Academic Mindtrek ‘22). ACM, 11. https://doi.org/10.1145/3569219.3569352

Ray, P. P. (2023). ChatGPT: A comprehensive review of background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems, 3, 121–154. Retrieved from www.keaipublishing.com/en/journals/internet-of-things-and-cyber-physical-systems

Rossa, N.. (2024, March 22). Waduh! Peneliti temukan banyak jurnal akademis yang ternyata ditulis ChatGPT. DetikEdu. Retrieved from https://www.detik.com/edu/edutainment/d-7256390/waduh-peneliti-temukan-banyak-jurnal-akademis-yangternyata-ditulis-chatgpt

Stewart, J. (2023a, February 17). Noam Chomsky says ChatGPT is a form of “high-tech plagiarism”. My Modern Met. https://mymodernmet.com/noam-chomsky-chat-gpt/

Stewart, I. (2023b, March 8). Noam Chomsky: The false promise of ChatGPT. The New York Times. https://www.nytimes.com/2023/03/08/opinion/noam-chomsky-chatgpt.html

Sullivan, M., Kelly, A., & McLaughlin, P. (2023). ChatGPT in higher education: Considerations for academic integrity and student learning. Journal of Applied Learning & Teaching, 6(1), 1–10. https://doi.org/10.37074/jalt.2023.6.1.17

UNESCO. (2021). Recommendation on the ethics of artificial intelligence. UNESCO. https://unesdoc.unesco.org/ark:/48223/pf0000381137

Vincent, J. (2022, December 5). AI-generated answers are temporarily banned on coding Q&A sites. The Verge. https://www.theverge.com/2022/12/5/23493932/chatgpt-aigenerated-answers-temporarily-banned-stack-overflow-llms-dangers

Ward, P. (2023). Choice, uncertainty, and decision superiority: Is less AI-enabled decision support more? IEEE Transactions on Human-Machine Systems, 53(4), 781–791. https://doi.org/10.1109/THMS.2023.3279036

Weissman, J. (2023). “ChatGPT is a Plague Upon Education,” Inside Higher Ed. Retrieved April 26, 2023. Available at https://www.insidehighered.com/opinion/views/2023/02/08/chatgpt-plague-upon-education-opinion

Welborn, A. (2023, March 9). ChatGPT and fake citations. Duke University Libraries. https://blogs.library.duke.edu/blog/2023/03/09/chatgpt-and-fake-citations/

Zawacki-Richter, O., Marín, V. I., Bond, M., & Gouverneur, F. (2019). Systematic review of research on artificial intelligence applications in higher education – where are the educators? International Journal of Educational Technology in Higher Education, 16, 39. https://doi.org/10.1186/s41239-019-0171-0

Zhai, C., Wibowo, S., & Li, L. D. (2024). The effects of over-reliance on AI dialogue systems on students’ cognitive abilities: a systematic review. Smart Learning Environments, 11(1), 28.

Zhang, S., Pan, L., Zhao, J., & Wang, W. Y. (2023). The knowledge alignment problem: Bridging human and external knowledge for large language models. ArXiv. https://arxiv.org/abs/2305.13669