
DEVELOPMENT OF A CLOUD SERVICE FOR AUTOMATIC ANALYSIS OF TEXT DOCUMENTS OF DISTANCE LEARNING SYSTEMS

15.02.2026 22:37

[1. Information systems and technologies]

Author: Oleksandr Viunenko, Candidate of Economic Sciences, Associate Professor of the Department of Cybernetics and Informatics, Sumy National Agrarian University, Sumy



The growth in the volume and complexity of text data in various industries highlights the urgent need to implement effective and accurate document analysis methods. Traditional approaches to text analysis are often insufficient for processing the modern scale and complexity of text documents. In contrast, the integration of deep neural networks and cloud services shows promise in the field of document analysis automation. This approach allows users to obtain samples, generate explanations, and engage in interactive dialogue with neural network models. The development of a web service for automated text analysis meets this need by providing a powerful tool for improving productivity and knowledge discovery in various fields.

The main problem lies in the complexity and inaccessibility of existing text document analysis services, such as Google Natural Language AI, Amazon Comprehend, and Lexalytics. These services, although powerful, are designed for developers and require a certain level of technical knowledge to use effectively. In addition, these services lack a convenient interactive interface, which makes it difficult for ordinary users to analyse text documents effectively.

To solve this problem, it is necessary to develop a convenient web service that uses the capabilities of deep neural networks to analyse and explain text documents in interactive mode. This service should offer a chat interface that allows users to interact with the system in a natural, conversational manner, asking questions and receiving answers in real time. This will make the text analysis process more engaging and intuitive, removing the barriers often associated with more technical systems. The integration of modern deep neural network models, such as OpenAI's GPT-4 model, into the web service is an important aspect of enabling automated analysis and interpretation of text documents. This process involves several key steps to ensure effective interaction between the web service and the neural network model, as well as the generation of accurate and contextually relevant responses. Other models, such as BERT, may also be considered depending on specific requirements [1]. As with any external API integration, it is necessary to have error handling mechanisms in place to address issues such as network failures, API rate limits, or unexpected model errors. Implementing retry strategies and fallback mechanisms can help ensure service reliability. 
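The retry strategy mentioned above can be sketched as follows. The helper `call_with_retries` and the simulated flaky endpoint are illustrative assumptions, not part of any particular SDK; a real integration would catch the specific exception types raised by the API client in use.

```python
import random
import time


def call_with_retries(request_fn, max_attempts=4, base_delay=0.5):
    """Call an external API function, retrying transient failures
    with exponential backoff plus jitter (hypothetical helper)."""
    for attempt in range(1, max_attempts + 1):
        try:
            return request_fn()
        except (ConnectionError, TimeoutError):
            if attempt == max_attempts:
                raise  # exhausted retries: let the caller fall back
            # delay doubles each attempt: base, 2*base, 4*base, ...
            delay = base_delay * (2 ** (attempt - 1)) + random.uniform(0, 0.1)
            time.sleep(delay)


# Usage with a stub endpoint that fails twice, then succeeds,
# standing in for a model-inference call.
calls = {"n": 0}

def flaky_model_call():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("simulated network failure")
    return "model response"
```

The jitter term spreads retries out in time, which helps avoid synchronized retry bursts when many requests fail at once (e.g. during a rate-limit window).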

Effective data management is crucial to ensuring the smooth operation of a web service, especially when processing user documents, requests, and responses. Data management encompasses several key components, including storage, search, and integration with external services. The service relies on third-party storage providers to hold PDF documents uploaded by users. After a document is uploaded, the service generates a unique URL, which is then stored along with additional metadata in a relational database. This metadata includes information such as the user ID, document title, upload timestamp, and any relevant tags or categories associated with the document.
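The metadata table described above might look like the following minimal sketch, here using an in-memory SQLite database. The table and column names, the example URL, and the comma-separated tag encoding are illustrative assumptions, not a fixed schema.

```python
import sqlite3
from datetime import datetime, timezone

# In-memory database for illustration; a production service would use
# a persistent relational database.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE documents (
        id          INTEGER PRIMARY KEY,
        user_id     INTEGER NOT NULL,
        title       TEXT NOT NULL,
        url         TEXT NOT NULL,   -- unique URL from the storage service
        uploaded_at TEXT NOT NULL,   -- ISO 8601 upload timestamp
        tags        TEXT             -- comma-separated tags/categories
    )
""")

# Record the metadata for one uploaded document (hypothetical values).
conn.execute(
    "INSERT INTO documents (user_id, title, url, uploaded_at, tags)"
    " VALUES (?, ?, ?, ?, ?)",
    (42, "Lecture 3 notes", "https://storage.example.com/docs/abc123.pdf",
     datetime.now(timezone.utc).isoformat(), "lectures,ml"),
)

# Later, look up a user's documents by metadata.
row = conn.execute(
    "SELECT title, url FROM documents WHERE user_id = ?", (42,)
).fetchone()
```

Keeping only the URL and metadata in the database, while the PDF itself lives in external storage, keeps the relational store small and fast to query.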

Semantic understanding is a key aspect of the web service, ensuring accurate interpretation of the meanings of text documents and user queries. The web service uses the OpenAI Embeddings API to convert text documents and user queries into semantic vectors [1]. A semantic vector is a dense, high-dimensional representation of text that reflects the semantic relationships between words and phrases. By converting text into vector space, the web service can perform semantic operations such as similarity comparisons and contextual analysis.

When a user uploads a document, its content is processed to extract textual information. This text is then passed through the Embeddings model to create a semantic vector representation. The vector encapsulates the semantic meaning of the document, reflecting its key concepts, themes, and relationships between words.

Semantic search methods are used to match the semantic vectors of user queries with the content of documents, involving the calculation of similarity scores between vectors using methods such as cosine similarity. Documents with vectors most similar to the query vector are found and considered as potential context for generating responses.

Once relevant documents have been identified, their semantic vectors are used to provide context to deep neural network models. By incorporating this context into the analysis process, models can generate more accurate and contextually relevant responses to user queries [2].

Semantic understanding is the basis for a web service's ability to analyse text documents and provide meaningful explanations. Using the Embeddings model and semantic search methods, the service can effectively bridge the semantic gap between user queries and document content, enabling more accurate and in-depth analysis. Effectively solving these problems allows for the development of a reliable cloud-based educational web service capable of providing automated analysis and explanations of text documents in an interactive and in-depth manner.
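The similarity matching described above can be sketched in a few lines. The toy three-dimensional vectors stand in for real embedding vectors, which typically have hundreds or thousands of dimensions; the function names are illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors:
    dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def top_k(query_vec, doc_vecs, k=2):
    """Return the indices of the k document vectors most similar
    to the query vector, best match first."""
    ranked = sorted(range(len(doc_vecs)),
                    key=lambda i: cosine_similarity(query_vec, doc_vecs[i]),
                    reverse=True)
    return ranked[:k]


# Toy vectors: documents 0 and 1 point in nearly the same direction
# as the query, document 2 is nearly orthogonal to it.
docs = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]
query = [1.0, 0.05, 0.0]
```

The indices returned by `top_k` identify the documents whose content is then passed as context to the language model.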

Text document analysis using deep neural networks is a powerful tool for extracting meaningful information from large volumes of educational text data. Text document analysis methods include information retrieval, text summarization, topic modelling, sentiment analysis, named entity recognition, and text classification. Deep neural networks can perform these tasks with high accuracy, scalability, and versatility. Web services of this kind for analysing educational text documents also help solve the problem of forming individual learning trajectories for students. They can be designed to meet the needs of a general audience, enabling users not only to obtain text analysis but also to interact with the data in the most convenient and effective way.

References

1. Horev R. BERT Explained: State of the art language model for NLP. Towards Data Science. URL: https://towardsdatascience.com/bert-explained-state-of-the-art-language-model-for-nlp-f8b21a9b6270 (date of access: 20.01.2026).

2. Tripathi R. What are Vector Embeddings. Pinecone. URL: https://www.pinecone.io/learn/vector-embeddings/ (date of access: 20.01.2026).



This work is licensed under a Creative Commons Attribution 4.0 International License.