DETECTING THREATS IN THE JAVASCRIPT CODE OF WEB APPLICATIONS
[1. Information systems and technologies]
Authors: Yaroslav Chuiko, master's student, National Technical University “Kharkiv Polytechnic Institute”, Kharkiv; Viacheslav Karpenko, Candidate of Technical Sciences, National Technical University “Kharkiv Polytechnic Institute”, Kharkiv
JavaScript is a dynamic programming language used by the vast majority of websites and supported by all modern web browsers. Its prevalence is one of the reasons why, in recent years, JavaScript has become the most common and successful vehicle for web attacks. Recent cyberattacks regularly exploit JavaScript weaknesses and sometimes obfuscate their malicious intent to avoid detection. Attackers can embed malicious JavaScript in a web page, and it will be executed automatically when the page is loaded in any browser. All of this makes the task of detecting threats in JavaScript code highly important.
There are different approaches to malware detection. For this problem, it is advisable to use static analysis (also known as source code analysis), which tests and evaluates a program by examining its code without executing it. Static analysis is usually applied to the syntax of the code, with the goal of checking whether it contains suspicious keywords or code fragments [1].
There are various methods of static source code analysis for detecting potential vulnerabilities; after comparing them, lexical analysis was chosen. Lexical analysis transforms the source code into “tokens” of information, abstracting the source code and making it easier to manipulate. The analysis is aimed at recognizing patterns, anomalies, and suspicious content in the data.
To find threats in JavaScript code, the following five-stage algorithm was proposed: obtaining a URL; loading the HTML page; finding the <script> HTML elements and extracting the JavaScript code from them; searching for potentially dangerous JavaScript code; and classifying the malicious code.
The first stage involves obtaining the URL of the page the user wants to analyze. A URL is simply the address of a specific, unique resource on the Internet. Such a resource can be an HTML page, a CSS document, an image, etc.
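As an illustration, a minimal TypeScript sketch of this stage is given below; the function name parseTargetUrl and the restriction to HTTP(S) schemes are assumptions made for the example, not part of the described solution.

```typescript
// Sketch of the URL-obtaining stage (hypothetical helper).
// The built-in URL constructor throws on malformed input,
// which serves as a simple validity check before fetching the page.
export function parseTargetUrl(input: string): URL | null {
  try {
    const url = new URL(input);
    // Only HTTP(S) resources are meaningful for this analysis.
    if (url.protocol !== "http:" && url.protocol !== "https:") {
      return null;
    }
    return url;
  } catch {
    return null; // not a valid URL
  }
}
```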
The purpose of the second stage is to obtain the HTML page for further analysis. The page is retrieved by sending an HTTP GET request to the URL obtained earlier. HTTP defines a set of request methods that indicate the desired action to be performed on a given resource; the GET method requests a representation of the specified resource. Since the result of a GET request is not necessarily an HTML page, the extension of the received file is also checked at this stage.
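A possible sketch of this stage, assuming a Node.js/TypeScript implementation with the built-in fetch API, is shown below; here the Content-Type header is checked as a practical stand-in for the extension check described above, and the function name is illustrative.

```typescript
// Sketch of the page-loading stage: send an HTTP GET request and
// verify that the response is actually an HTML document.
export async function loadHtmlPage(url: URL): Promise<string | null> {
  const response = await fetch(url.toString(), { method: "GET" });
  if (!response.ok) {
    return null; // request failed
  }
  // The result of a GET request is not necessarily an HTML page,
  // so the response type is checked before further analysis.
  const contentType = response.headers.get("content-type") ?? "";
  if (!contentType.includes("text/html")) {
    return null;
  }
  return await response.text();
}
```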
The next stage searches for all the JavaScript used on the downloaded HTML page. On an HTML page, both embedded JavaScript and references to external JavaScript files can appear only inside special <script> tags. The search is performed by parsing the HTML document without rendering it or applying styles, as browsers do, since that would significantly slow down the process. Once all the <script> elements are found, they are divided into two groups: those that contain embedded JavaScript code and those that reference external JavaScript files. The division is based on whether the <script> element has a src attribute holding the URI of an external file. For the <script> elements without a src attribute, the internal content, i.e. the JavaScript code, is extracted without being executed; for those with a src attribute, GET requests are made to download the external JavaScript files for further analysis.
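The sketch below illustrates this stage; the use of the cheerio library to parse the HTML without rendering it, and the function name, are assumptions made for the example.

```typescript
import * as cheerio from "cheerio";

// Sketch of the <script>-extraction stage. Elements with a src attribute
// reference external files and are downloaded with additional GET requests;
// the remaining elements contain embedded JavaScript code.
export async function extractScripts(html: string, baseUrl: URL): Promise<string[]> {
  const $ = cheerio.load(html); // parse without rendering or applying styles
  const inline: string[] = [];
  const externalUrls: URL[] = [];

  $("script").each((_, el) => {
    const src = $(el).attr("src");
    if (src) {
      externalUrls.push(new URL(src, baseUrl)); // resolve relative links
    } else {
      inline.push($(el).html() ?? "");          // embedded JavaScript code
    }
  });

  const external = await Promise.all(
    externalUrls.map(async (url) => {
      const res = await fetch(url.toString());
      return res.ok ? res.text() : "";
    })
  );

  return [...inline, ...external];
}
```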
At the stage of searching for potentially unsafe JavaScript code, the extracted code is scanned for standard JavaScript functions and Web API functions that are known to be potentially dangerous. Such functions and the vulnerabilities they can introduce are listed in [2]. The search is performed with regular expressions: from the previous stage the JavaScript code is available as plain text, in which the fragments that may contain potential vulnerabilities are located.
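A simplified sketch of this stage follows; the pattern list is illustrative only, and a real rule set would follow the functions catalogued in [2].

```typescript
// Sketch of the search for potentially dangerous code using regular
// expressions over the JavaScript source text. The patterns below are
// examples, not the complete rule set.
export interface Finding {
  pattern: string; // which rule matched
  index: number;   // position in the source text
}

const DANGEROUS_PATTERNS: { name: string; regex: RegExp }[] = [
  { name: "eval",            regex: /\beval\s*\(/g },
  { name: "Function ctor",   regex: /\bnew\s+Function\s*\(/g },
  { name: "document.write",  regex: /\bdocument\.write\s*\(/g },
  { name: "innerHTML",       regex: /\.innerHTML\s*=/g },
  { name: "setTimeout(str)", regex: /\bsetTimeout\s*\(\s*["'`]/g },
];

export function findDangerousCode(jsCode: string): Finding[] {
  const findings: Finding[] = [];
  for (const { name, regex } of DANGEROUS_PATTERNS) {
    for (const match of jsCode.matchAll(regex)) {
      findings.push({ pattern: name, index: match.index ?? 0 });
    }
  }
  return findings;
}
```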
At the last stage, if potentially dangerous JavaScript functions are found, the algorithm proceeds to classifying the malicious code. The code found earlier is classified by the type of attack it can lead to and by its overall danger level: each detected fragment is assigned its own weight and danger level, and after the analysis the presence of potentially dangerous functions and the level of potential vulnerability are determined.
After classification, the result is presented as the overall danger level of the site and a list of possible attacks.
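The sketch below shows one possible form of the classification step; the weights, attack types, and thresholds are illustrative assumptions, not the values used in the described solution.

```typescript
// Sketch of the classification stage: each matched pattern carries an
// assumed weight and an associated attack type; the overall danger level
// is derived from the accumulated weight.
type AttackType = "XSS" | "code injection" | "DOM manipulation";

const RULE_INFO: Record<string, { weight: number; attack: AttackType }> = {
  "eval":            { weight: 5, attack: "code injection" },
  "Function ctor":   { weight: 5, attack: "code injection" },
  "document.write":  { weight: 3, attack: "XSS" },
  "innerHTML":       { weight: 3, attack: "XSS" },
  "setTimeout(str)": { weight: 2, attack: "code injection" },
};

export function classify(findings: { pattern: string }[]) {
  let total = 0;
  const attacks = new Set<AttackType>();
  for (const f of findings) {
    const info = RULE_INFO[f.pattern];
    if (!info) continue;
    total += info.weight;
    attacks.add(info.attack);
  }
  // Illustrative thresholds for the overall danger level.
  const level =
    total === 0 ? "safe" : total < 5 ? "low" : total < 10 ? "medium" : "high";
  return { level, possibleAttacks: [...attacks] };
}
```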
Based on this algorithm, a corresponding software solution was developed. It consists of three main components: a server side responsible for the application's business logic, a client side responsible for the user interface, and a database server responsible for storing data. On the client side, the user enters the URL of the site they want to check. Once a valid URL is entered, it is sent to the server, where the code-scanning process begins. When the scan completes successfully, the user is redirected to a page displaying the results.
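As an illustration of how the server side could expose the scanning pipeline, a minimal sketch is given below; it assumes an Express-based Node.js server and imports the hypothetical helpers from the earlier sketches, so the framework, route, and module names are assumptions rather than details of the developed solution.

```typescript
import express from "express";
// Hypothetical module containing the helpers sketched above.
import {
  parseTargetUrl,
  loadHtmlPage,
  extractScripts,
  findDangerousCode,
  classify,
} from "./scanner";

const app = express();
app.use(express.json());

// The client submits a URL; the server runs the scanning pipeline
// and returns the overall danger level and the list of possible attacks.
app.post("/scan", async (req, res) => {
  const target = parseTargetUrl(req.body.url ?? "");
  if (!target) {
    res.status(400).json({ error: "Invalid URL" });
    return;
  }
  const html = await loadHtmlPage(target);
  if (html === null) {
    res.status(422).json({ error: "Resource is not an HTML page" });
    return;
  }
  const scripts = await extractScripts(html, target);
  const findings = scripts.flatMap((code) => findDangerousCode(code));
  res.json(classify(findings));
});

app.listen(3000);
```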
Thus, the proposed algorithm and the corresponding software solution allow an ordinary user to significantly increase the security of using third-party web applications.
References
1. Dynamic Analysis vs. Static Analysis. URL: https://www.intel.com/content/www/us/en/docs/inspector/user-guide-windows/2022/dynamic-analysis-vs-static-analysis.html (accessed 02.10.2023).
2. CSSXC: Context-sensitive Sanitization Framework for Web Applications against XSS Vulnerabilities in Cloud Environments. URL: https://www.researchgate.net/publication/303745888_CSSXC_Context-sensitive_Sanitization_Framework_for_Web_Applications_against_XSS_Vulnerabilities_in_Cloud_Environments (accessed 24.10.2023).