The paper describes a web application developed for managing and presenting experiment data of the WIDER UPTAKE project funded by Horizon Europe. The project's goal is to promote water-smart and sustainable solutions among stakeholders in multiple countries. The application enhances data management and stakeholder engagement through the use of a third-party large language model. It integrates data from demonstration case studies with real-time sensor measurements and laboratory tests, into a comprehensive cloud-based platform. It facilitates data visualization, regulatory compliance checks, and risk assessments for chemical and microbial hazards. The application significantly aided in the coordination and communication of project findings among stakeholders. Key functionalities include interactive diagrams, risk assessment tools, and automated report generation using artificial intelligence (AI). The AI-generated reports, while maintaining confidentiality of data in most cases, provided clear and informative summaries of data compliance with regulatory standards. The web application extended stakeholder engagement, democratized access to complex data, and supported decision-making processes for implementing sustainable water management solutions. However, the development encountered significant challenges surrounding transparency, fairness, accountability, and privacy, which impedes the refinement and scalability of this approach for broader use. Future research should focus on overcoming these obstacles to ensure a more effective and ethical application.

  • Artificial intelligence (AI)-enhanced water management: Large language model (LLM) integration for better communication and decision-making in a water-smart circular economy project.

  • Advanced data management: Web app for efficient visualization of experimental data, improving stakeholder engagement.

  • User-friendly interface: interface with customizable visualization options and automated report generation.

  • AI-driven risk assessment: Augments risk analysis with AI-generated commentary and regulatory compliance checks.

Water supports diverse ecosystems across the planet and holds our societies together (Dudgeon et al. 2006; Navarro-Ortega et al. 2015; Tahir et al. 2018; Morseletto et al. 2022). However, continual industrialization, urban growth, and climate change significantly worsen its availability and quality (Andrews et al. 2011; Colella et al. 2021). Moreover, the same factors jeopardizing the water sector favour further production of waste (Collivignarelli et al. 2019). This imbalance between resource consumption and replenishment – accentuated by the perceivably irresponsible use of potable water for tasks that do not require it (e.g., certain irrigation practices, street washing, and toilet flushing) – created demand for more sustainable water management approaches as Wanner et al. (2023) demonstrates. Due to the scale of the problem, the proposed approaches became increasingly complex, involving a wide array of interconnected stakeholders, policymakers, and institutions. Initially, however, it was a simpler idea – the need to preserve limited natural resources – that drove different actors to work together in closing the loop through reuse and recycling. Over time, these collaborative efforts and interlinked approaches coalesced into what is now known as the Circular Economy (CE) framework, which aims to minimize resource waste and promote sustainable practices (Merli et al. 2018; Morseletto et al. 2022).

Recently, a great number of research and innovation projects dedicated to water reuse have sprung up, including those funded by Horizon Europe, a EU Framework Programme for research and innovation until 2027 (European Commission 2021). Given the disruptive nature of CE and the spirit of the responsible research, these projects tend to include a wide array of stakeholders ranging from research institutions and large industries to small- and medium-sized enterprises (SMEs) and societies at large (Mazzonetto & Simone 2018). In other words, embracing CE principles in the water sector usually requires transformative efforts across the entire value chain, from researchers to end-users (Kirchherr et al. 2023).

Yet, the potential of these efforts can be hindered by inefficient data management and stakeholder engagement (Bellini & Bang 2022; Bellini et al. 2024). This is especially true for case studies, including pilot tests and demo sites, since they are designed to demonstrate the feasibility, viability, and safety of innovative water reuse solutions to a broader, external audience. Recent Horizon Europe initiatives have generated an extensive amount of valuable knowledge, which was delivered as various assessment frameworks/toolkits, analytic platforms, and technology databases as seen in works of Matta et al. (2021), Afghani et al. (2022), Akinsete et al. (2022), Bhambhani et al. (2022), Ghafourian et al. (2022) and Bouziotas et al. (2023). Consequently, while some projects, such as Water2Return (European Commission 2024b), SMART-Plant (European Commission 2024c), the AWESOME project by PRIMA (European Commission 2024a) relied heavily on these technical factsheets, special deliverables, and frameworks for dissemination, others took a more innovative approach by developing digital tools to promote their findings, enhance dissemination efforts and engage more intensely with stakeholders (Table 1).

Table 1

Digital dissemination tools used in horizon Europe projects

Dissemination toolProject acronymDescriptionReference
Decision analytic platform (DAP) Project Ô Light data science cloud-based platform for polluted wells and water distribution network analyses. Corti et al. (2022)  
Replication tool HYDROUSA The tool quickly assesses the replicability of six HYDROUSA solutions based on user-provided information through questionnaires. Fatone et al. (2023)  
HYDROUSA Game HYDROUSA A browser game for increased public awareness. Tsiropoulos et al. (2024)  
Serious game NextGen An educational browser game aiming to demonstrate the roles of water, energy and materials in the urban water cycle. Khoury & Evans (2023)  
Augmented reality-enhanced demo cases WATER-MINING Educational and gamified augmented reality (AR) presentation of the project's case studies. Katika & Tsiakou (2023)  
AR Tool NextGen An AR tool for engaging citizens on the topic of the CE. Campos (2022)  
Water Europe Marketplace NextGen, B-WaterSmart, ULTIMATE A hub for innovative circular economy solutions in water, energy, and materials Frijns (2023)  
Dissemination toolProject acronymDescriptionReference
Decision analytic platform (DAP) Project Ô Light data science cloud-based platform for polluted wells and water distribution network analyses. Corti et al. (2022)  
Replication tool HYDROUSA The tool quickly assesses the replicability of six HYDROUSA solutions based on user-provided information through questionnaires. Fatone et al. (2023)  
HYDROUSA Game HYDROUSA A browser game for increased public awareness. Tsiropoulos et al. (2024)  
Serious game NextGen An educational browser game aiming to demonstrate the roles of water, energy and materials in the urban water cycle. Khoury & Evans (2023)  
Augmented reality-enhanced demo cases WATER-MINING Educational and gamified augmented reality (AR) presentation of the project's case studies. Katika & Tsiakou (2023)  
AR Tool NextGen An AR tool for engaging citizens on the topic of the CE. Campos (2022)  
Water Europe Marketplace NextGen, B-WaterSmart, ULTIMATE A hub for innovative circular economy solutions in water, energy, and materials Frijns (2023)  

However, despite these advancements, there remains untapped potential in utilizing large language models (LLMs) to get complex topics across to a broader audience – such as end-users, communities, stakeholders, potential clients, and policymakers – a method that has yet to be widely adopted within these resources. LLMs use a particular architecture that enables computers to understand and generate natural language (Rogers & Luccioni 2024), which is useful to help broader audiences to grasp engineering and environmental topics that are often riddled with jargon and complex data. Artificial intelligence (AI) and machine learning (ML) are closely associated with LLMs and have been used in water management before, albeit sparingly compared to other engineering fields. They have been successfully utilized to model wastewater treatment systems, optimize process parameters, predict performance outcomes, and detect and identify contamination as demonstrated in the extensive study by Wang et al. (2023). Moreover, it falls in line with a recent trend for data-based approaches to water management, as there is an established benefit from using technologies such as sensor networks, smart meters, and real-time monitoring systems to significantly enhance its efficiency, risk assessment, and sustainability (Eggimann et al. 2017). Nevertheless, the trend stays purely technical in nature despite notorious communication issues with stakeholders and end-users in the field (Elelman et al. 2021). LLMs have every potential to bridge that gap. Furthermore, as Doorn (2021) notes, water professionals already cooperate closely with data scientists and social sciences experts, presenting an opportunity to address ethical concerns associated with stakeholder engagement in data-driven decision-making in other sectors.

LLMs have been shown to help democratize complex data access and facilitate decision-making. They address a key challenge in implementing ML for water management: democratizing data assets in a socially justified manner (Sun & Scanlon 2019) and supporting decision-making (Kamyab et al. 2023). By introducing interactivity and enhancing the communicability of data, LLMs can make data more accessible. In healthcare, for example, educational chatbots – which allow users to interact with LLMs – have demonstrated significant potential when supplementing rather than supplanting human expertise (Reddy 2024). To the best of the authors' knowledge, an AI-driven tool used to convey regulated pollution indicators, technical parameters, and technological principles of a water reuse demonstration case study to peripheral stakeholders is still an area to explore.

This article focuses on developing a web application for storing, processing, and showcasing the experimental and conceptual data generated in the WIDER UPTAKE project (hosted at https://database.wider-uptake.eu/). Funded by Horizon Europe, WIDER UPTAKE is a multi-disciplinary project dedicated to water-smart CE solutions spanning five countries. It supports wastewater treatment plants in resource recovery, water reuse, and energy sustainability by addressing technological, regulatory, organizational, social, and economic barriers (Mannina et al. 2021). The case study in Prague was tasked with demonstrating the technical and chemical feasibility of water reuse in the urban context. The team has decided to utilize a LLM-powered web application to bridge the gap in sustainable water management communication, providing a valuable example for other similar Horizon Europe projects. The web application grants access to data, data visualization, and detailed information about case studies, as well as qualitative chemical risk assessment (QCRA) and qualitative microbial risk assessment (QMRA) conducted using the measured data. Consequently, it uses AI to process this and other data to generate a report containing a regulation compliance check for the measured parameters. This allows the data to be interpreted not only by trained professionals but also by indirect stakeholders involved in the project. Furthermore, it opens the possibility of user data input, potentially turning the web application into a full-fledged decision-making tool for those interested in water recycling.

This article covers the role of the web application in WIDER UPTAKE project, the web application's architecture and development, as well as necessary considerations for future research.

The WIDER UPTAKE project aims to demonstrate the feasibility of CE solutions through case studies in Ghana, Italy, Norway, the Czech Republic, and the Netherlands (Mannina et al. 2021). In Prague, the Czech Republic, the project explored using treated effluent from the Central wastewater treatment plant for urban irrigation. Four irrigation boxes (see Figure 1) demonstrated the effect of different water sources (i.e., river water, centralwastewater treatment plant effluent, the same effluent additionally treated with ultraviolet disinfection, and the same effluent treated with ultraviolet disinfection and ultrafiltration) on plant growth, soil quality, and water safety, showcasing the feasibility of using reclaimed water for sustainable city landscaping.
Figure 1

A photo of one of the ‘rain garden’ irrigation boxes, which includes sections of lawn, plants, and shrubs.

Figure 1

A photo of one of the ‘rain garden’ irrigation boxes, which includes sections of lawn, plants, and shrubs.

Close modal

From the beginning of April until the end of November, during the 2022–2023 seasons, the experiment was conducted to study the short- and long-term effects of wastewater reuse for irrigation. To assess these effects accurately, a wide range of parameters was closely monitored. Some parameters were measured in real-time, while others were evaluated through offline methods, such as monthly or semi-monthly sampling and laboratory tests (see Table 1).

The full construction process of the irrigation boxes – including further details on the sensors and the camera – is described in publicly available deliverables of the WIDER UPTAKE project (Pollert 2021; Pollert et al. 2023).

From sensor data to web application

The real-time data acquisition system for the irrigation boxes was designed by utilizing the Internet of Things (IoT) technology (see Figure 2). The sensors were connected to an ESP8266 Wi-Fi microcontroller, while a camera transferred hourly snapshots of the vegetation during daylight through an Ethernet connection. A local network, established by a router connected to the internet via a modem, supported the system. Aggregated data were continuously transferred to the Home Assistant, an open-source platform for smart home automation and data management, installed on a single-board computer Raspberry Pi for data management and coordination. Subsequently, the data was sent to the WIDER UPTAKE cloud on the Microsoft Azure platform. This cloud platform also served as the entry point for batch-processed data from laboratory tests, relevant thresholds values from the regulations (see Supplementary material, Appendix 1), and other case studies within the project. Naturally, the data from laboratory tests were manually and/or automatically processed to comply with the predefined standard. Both types of data from the experiment (Table 2) formed a data lake (i.e., a centralized repository that stores structured and unstructured data) consisting of adjusted comma-separated value (CSV) files.
Figure 2

Web application and IoT measurement infrastructure overview.

Figure 2

Web application and IoT measurement infrastructure overview.

Close modal

The primary purpose of data aggregation was to create a comprehensive project database. This database not only had to aggregate data but also facilitates its retrieval for visualization and risk assessment. The cloud storage was chosen as the best intermediary for the job due to its scalability, accessibility, and robust security features, ensuring that data could be efficiently managed, accessed from various locations, and protected against unauthorized access. The overarching goal was to enable project participants to ensure compliance with regulatory standards, assess risks for workers and end-users of the recycled water, and present data to stakeholders using advanced visualization techniques.

These goals implied the development of a web application that would authenticate the users, fetch the data, and allow the users to interact with it in a certain way. Moreover, as web applications generally do not require installing anything on the user's device, they are accessible from anywhere, regardless of the operating system, ensuring user-friendliness and intuitiveness. It was decided to host it on a Synology-brand server at the Czech Technical University (CTU) in Prague. The backend was written in PHP programming language to manage data creation, storage, and changed. To safeguard data integrity and security, rigorous measures such as firewall rules, automated IP blocking for suspicious activities, and daily backups to a secondary server are implemented. From the architecture perspective of the web application, the server served two main purposes: user authentication and data fetching. Furthermore, the data fetching is limited by the role of a user. The core functionalities for interacting with the fetched data was completely transferred to the frontend as the application evolved as a small-scale extension of the database, making a simple frontend implementation more practical given the limited audience. Thus, the frontend became responsible for processing the data, performing computations, and providing an interactive user interface, effectively offloading these tasks from the server and enhancing overall system performance in case of a large number of simultaneous active sessions. However, this advantage comes with higher security risks, especially if the application is opened to external users.

Frontend and functionalities

The visualization scripts, as well as QCRA and QMRA algorithms are client-side, i.e., the data are received and processed in a browser's memory by running the TypeScript code to initiate the web interface of the application. An authenticated user is welcomed by an uncluttered interface with several menus on the side to piece together the visualization of his choosing. Alternatively, they can toggle one of the predefined templates for some of the case studies (e.g., ‘Prague – Heavy Metals: Pb, Cd, Cr, Fe’ or ‘Stavanger – Cd, Ni and As, Hg – Across All Measured Sources’). In any case, the interaction with the data is guided by 9 independent widgets (i.e., an interface component that functions independently and executes its own code) including the Report widget that gathers the data from five other widgets for automatic report generation (see Table 3). Each widget allows for further clarification of the data to be shown if available, i.e., place of sampling, measured parameter, or additional legal and situational context.

Table 2

Parameters measured during the Prague case experiment

Data collectionGroup of parametersExamples
Online Related to water Irrigation system flow rate, turbidity, conductivity, pH 
Soil and air Soil moisture and temperature, relative humidity and air temperature 
Camera feed Snapshots of vegetation 
Offline Basic parameters BOD5, CODCr, TOC, N-, N-, P-, Ptot., Mg2+, etc. 
Trace elements B, Cd, Pb, As, Cr, Cu, Ni, Zn, Fe, Mn 
Microbiologic Parameters Escherichia coli, Clostridium perfringens, thermotolerant coliform bacteria, etc. 
Micropollutants 50+ pharmaceuticals, personal care products, and other organic substances 
Data collectionGroup of parametersExamples
Online Related to water Irrigation system flow rate, turbidity, conductivity, pH 
Soil and air Soil moisture and temperature, relative humidity and air temperature 
Camera feed Snapshots of vegetation 
Offline Basic parameters BOD5, CODCr, TOC, N-, N-, P-, Ptot., Mg2+, etc. 
Trace elements B, Cd, Pb, As, Cr, Cu, Ni, Zn, Fe, Mn 
Microbiologic Parameters Escherichia coli, Clostridium perfringens, thermotolerant coliform bacteria, etc. 
Micropollutants 50+ pharmaceuticals, personal care products, and other organic substances 
Table 3

Names and descriptions of widgets developed for the web application

Widget nameDescription
Utilized by report widget Overview of treatment plant design Interactive diagram of the plant at the monitoring site. 
Risk assessment Quantifying chemical and biological exposure risks and probabilities via QCRA and QMRA developed during the project. 
Visualization of processes and parameters Entire process breakdown for a selected case study and studied parameters. 
End-state snapshot The last picture of the process taken at the end of chosen time range. 
Measured values Data presentation of measured values in line graph form over current time range. 
Not utilized by report widget Roadmap information Sustainable water management roadmap, showcasing models and key findings. 
Process time-lapse Time-lapse of the process over the chosen time range. 
Data export Tabular data export from Database. 
– Report Case study summary with an AI-generated text report. 
Widget nameDescription
Utilized by report widget Overview of treatment plant design Interactive diagram of the plant at the monitoring site. 
Risk assessment Quantifying chemical and biological exposure risks and probabilities via QCRA and QMRA developed during the project. 
Visualization of processes and parameters Entire process breakdown for a selected case study and studied parameters. 
End-state snapshot The last picture of the process taken at the end of chosen time range. 
Measured values Data presentation of measured values in line graph form over current time range. 
Not utilized by report widget Roadmap information Sustainable water management roadmap, showcasing models and key findings. 
Process time-lapse Time-lapse of the process over the chosen time range. 
Data export Tabular data export from Database. 
– Report Case study summary with an AI-generated text report. 

For the sake of example, below are two screenshots representing the general interface structure (Figures 3 and 4).
Figure 3

The web application's interface: ‘Prague – BOD5’ template.

Figure 3

The web application's interface: ‘Prague – BOD5’ template.

Close modal
Figure 4

The web application's interface: custom layout with report (left-hand side), risk analysis (top right corner) and measured values (bottom right corner) widgets.

Figure 4

The web application's interface: custom layout with report (left-hand side), risk analysis (top right corner) and measured values (bottom right corner) widgets.

Close modal

In Figure 4, the Report widget is displayed on the left side of the screen. The Report widget can be considered the backbone of the entire application, as it meets user expectations by providing an intelligible report on the data of greatest interest. Its primary function is to guide the user through the most informative widgets (e.g., End-State Snapshot, Measured Values, Visualization of Processes and Parameters, Overview of Treatment Plant Design, and Risk Assessment for the Prague case), focusing only on the parameters chosen in the initial step. The main advantage of the Report widget is the AI-generated description accompanying each used widget. An API call to OpenAI's ChatGPT-4 (previously ChatGPT-3) is initiated on the client side of the application, containing the relevant data, regulations, and standards for the selected parameters. Using AI for regulatory compliance checks allowed for storing legal documents as PDFs in the database without the need to transcribe them into separate CSV files. Although the documents were later injected in the prompt (see Supplementary material, Appendix 2), this process was also facilitated by AI. This approach not only eliminated unnecessary work and enabled the integration of various legislative texts but also provided essential context for the LLM to make recommendations for possible applications.

The prompt provided for the AI-based report generation is designed to ensure the confidentiality of the measured values while delivering an accurate and informative analysis. It includes specific instructions to avoid displaying the raw numerical values of the measurements directly in the generated report to prevent potential legal issues associated with disclosing confidential data. The prompt includes various standards with their names, levels, and units for reference. However, since the measured values are kept confidential, there is no need to output these standards directly in the report. The focus is on discussing the compliance of the measured values with relevant standards (such as European and Czech standards) while maintaining the confidentiality of the measured data. This approach focused on the use of AI in regulatory compliance and risk assessment within the WIDER UPTAKE project, balancing data confidentiality and informative reporting.

The Risk Assessment widget was developed in collaboration with colleagues from the Norwegian University of Science and Technology. Upon launch, it runs algorithms based on the selected type of water and exposure scenario (e.g., brief contact for a ‘visitor’ or prolonged exposure for a ‘worker’) to assess the risk posed by a specific compound (in the case of QCRA) or microbial contamination (in the case of QMRA). The widget presents the results as either the potential intake dose for chemicals or the Disability-Adjusted Life Years (DALY) per case for microbial contaminants. These procedures allow for a transparent and comprehensive risk evaluation by government bodies or public end-users. Therefore, the widget was at the core of the application and the key component of the Report widget. For more information about the risk assessment see Ardiyanti et al. (2021) and Ardiyanti et al. (2024).

Once the user reviews the information, they can download the automated report in PDF format, which includes pictures, graphs, and snapshots. The report also contains an AI-generated assessment of the measured data based on regulations and recommendations for possible water reuse applications. The primary goal of generating this report is to test the technology, which could later assist potential external clients in making decisions for sustainable water-smart solutions based on their measurements. However, it was also found useful during the project for keeping colleagues from other cases, as well as the Prague case, informed about the status of the respective experiments. If a team member wanted to analyse the data for a chosen period using their own methods and software, the data could be easily exported using the Export widget (see Figure 5).
Figure 5

Export widget example.

Figure 5

Export widget example.

Close modal

In summary, a system was implemented that uses sensors to collect real-time data on plant growth using treated wastewater for irrigation. This data is transmitted to a cloud platform for storage and management. A web application was developed to visualize this data, assess potential risks using custom QCRA and QMRA algorithms, and generate reports. This approach allows the evaluation of the feasibility and safety of using treated wastewater for urban agriculture.

The web application served as the primary database throughout the entire duration of the WIDER UPTAKE project. It accompanied the Prague case experiment during the 2021–2022 measurement seasons and acted as a repository of knowledge for other cases and key findings of the project (e.g., the Roadmap widget). Specifically, it helped the Prague team coordinate efforts to overcome legal barriers in urban water reuse in the Czech Republic. Additionally, stakeholders who were approached with water reuse pilot solutions had the opportunity to familiarize themselves with and interact with the data and the explanations provided through the Report widget in their browsers (an example of a report for the Prague case is provided in Supplementary material, Appendix 3). It was especially valuable given the legal status of water reuse in the Czech Republic. It is not generally popular among the population and there is no legal act that would regulate it (Ramm & Smol 2023). Among other efforts, in the result of the Prague case study demonstration efforts (among which the web application was the main knowledge hub and demonstration unit), the ministry of agriculture has decided to update their ‘Circular Czechia Action Plan’ (Ministry of the Environment of the Czech Republic 2022) strategy to emphasize water reuse and to accept the European water reuse regulation (i.e., Regulation 2020/741) in the future. Thus, the web application achieved its goal of extended stakeholder engagement, including legal bodies and peripheral stakeholders such as parks and one of Prague's district administrations.

The web application served as the foundational platform for managing data from the large experiment involving continuous real-time measurements and periodic laboratory tests. It structured the data within the Visualization of Processes and Parameters widget, allowing measured parameters to be traced across multiple levels, such as water, plants, soil, and filtrate. Additionally, it generated ideal material for AI analysis, which was instrumental in achieving the application's other two goals: ensuring compliance with regulations and generating risk assessment conclusions. At the same time, this analysis is unique in the field of data management for circular water economy projects, so it is the most important functionality to describe and assess its performance here.

AI conclusion inference for risk assessment

The QCRA and QMRA algorithms were specifically developed to account for potential adverse effects of the hazards contained in recycled water (Ardiyanti et al. 2021, 2024). The methodology was incorporated into Risk Analysis and Report widgets. The latter provided a conclusion generated on the basis of the results of the assessment on the right-hand side of the widget. The commentary usually had the following structure (as exemplified by the QMRA for thermotolerant coliform bacteria TCOLI in Supplementary material, Appendix 4):

  • introductory phrases,

  • description of QMRA methodology,

  • analysis of TCOLI values in different water sources,

  • visualization comment,

  • implications and standards comparison,

  • conclusion.

As seen in Supplementary material, Appendix 2, the prompt instructed the LLM to avoid displaying confidential values to the user. Therefore, the ‘Analysis of TCOLI values’ section of the report begins with a polite disclaimer regarding the impossibility of disclosing the full set of data. The commentary then proceeds to describe the relationship with all the legislative documents included in the prompt, indicating whether the values exceed or remain within the limits. Note that the standards are referred to by code names, a remnant of the data-processing pipeline. Further study of the provided commentary might leave the reader with a feeling of uncertainty, as most of the answers the LLM provides are very cautious.

AI response consistency analysis for risk assessment and regulatory compliancy check

Due to the nature of the LLM, this commentary varied each time the user generated it. For the sake of analysing the perceived consistency of AI-generated reports, 20 reports were generated for the same set of parameters (TCOLI and Escherichia coli) and empirically assessed according to the following qualities:

  • 1. compliance status (indicating whether the values are above or below regulation limits);

  • 2. structural consistency (ensuring they have a common text structure and flow);

  • 3. tone consistency (investigating how invariable is the overall perceived tone of the texts);

  • 4. consistent terminology (ensuring the same terms are used across texts);

  • 5. quantitative disclosure (indicating how many texts reveal numerical data).

The analysis revealed that all AI-generated reports uniformly indicated that the TCOLI levels were within the limits set by relevant regulatory standards, which is true. Moreover, the structural consistency across the reports was remarkably high. Each report typically included an introduction to the QMRA process, an analysis of TCOLI levels in different water sources, a comparison with regulatory standards, and a conclusion discussing the health implications – ultimately closely following the example mentioned above (see Supplementary material, Appendix 4). This uniform structure combined with a consistently formal tone observed in all the reportsdemonstrates the AI's ability to organize content coherently and systematically, which is essential for readability and comprehension in scientific reporting. The tone of the reports also remained consistently formal and scientific, appropriate for professional and academic contexts. Maintaining a consistent tone is important for ensuring that the reports are taken seriously and understood correctly by their intended audience.

While the AI consistently used key terms such as ‘QMRA,’ ‘TCOLI,’ and references to specific standards, there was some variation in the exact wording. This resulted in approximately 80% consistency in word usage. Such variation can be beneficial as it introduces slight differences in phrasing but may pose inconsistency risks in the future. Additionally, the inclusion of explicit numerical data varied across the reports. Only about 30% of the reports included specific numerical values (e.g., ‘DALY per case’), likely due to confidentiality concerns. It should be noted that the values used in the reports were not confidential and were displayed in the graph of the widget. This cautious approach to handling sensitive information indicates that the AI can adapt its content to respect privacy and legal considerations, although it does result in less uniformity in presenting quantitative data (see Figure 6).
Figure 6

Spider chart for the response consistency analysis. Each tick represents a 20% step with 0% in the centre.

Figure 6

Spider chart for the response consistency analysis. Each tick represents a 20% step with 0% in the centre.

Close modal

Overall, the AI-generated QMRA reports exhibited high consistency in qualifying risk, maintaining structure, and tone, which underscores the reliability of AI in producing coherent and professional scientific reports. On the other hand, inconsistencies did arise in revealing some of the less important numbers and word choices. In other words, they continue to be the source of doubt. Overall, it may be concluded that the use of text generated by LLM rather benefited the result presentation. Not one instance of changing the overall conclusion completely (e.g., recommending a certain use case in one report that was discouraged in subsequently generated reports) was observed during the experiment. That is not to say that there is no possibility that it might happen.

Concerns and future outlook

The use of recycled water in public spaces like parks and streets should be a decision made by the public, based on thorough risk analyses to understand potential health and environmental risks. Ideally, the participation of people affected by the decision ever so slightly ought to be facilitated by democratic use of data and extended stakeholder involvement. The database web application discussed in this article is the first step toward developing a publicly available consultancy solution, enabling people to collaborate with scientists in the data science process. Despite AI and LLMs paving a way for more democratic data-driven decision-making of the kind, the use of this still emerging technology comes along with certain concerns one has to address before implementing it into a publicly available solution, namely transparency, justice and fairness, responsibility and accountability, privacy, and non-maleficence (Doorn 2021).

Transparency in LLM-powered applications is crucial for stakeholder engagement, addressing concerns about understanding model behaviour, accountability, and communication of capabilities and limitations. The complexity and unpredictability of LLMs, along with their massive and opaque architectures, make it challenging for stakeholders to fully grasp their functions. Proprietary models such as ChatGPT-4 further restrict independent verification (Liao & Wortman Vaughan 2024), which means there is no way for a third-party user to ensure that they are being trained on diverse and representable datasets. Moreover, in the case of the WIDER UPTAKE database web application, users may either over trust or distrust LLM outputs, leading to overreliance on flawed AI or rejection of water reuse. Although our response consistency analysis did not show any signs of AI hallucination or non-compliance with regulations, such issues could still arise in the worst-case scenarios due to the nature of LLMs. One way to address this issue is by referring users of the application to the extensive repository of deliverables and milestone reports published in the project, emphasizing that the Report widget is merely a tool for obtaining a high-level overview of the results. However, it is encouraging to see the tool reliability during the consistency test.

Justice and fairness address the issues arising from bias in the training data used for publicly available AI chatbots or ML algorithms. In other words, the commentary provided by the Report widget might be overly positive, despite data indicating potential concerns. To ensure a balanced perspective, it is crucial to encourage cross-referencing the widget's commentary with the actual data and critically evaluating any discrepancies by users. A supplementary documentation or a hand-out can inform the user of the possible risks associated with the use of LLMs.

The issue of responsibility and accountability is largely evaded in the project because the descriptions provided by the web application do not carry any legal significance. Nonetheless, if the technology is to be scaled and opened for the public, the current lack of accountability could lead to significant ethical and legal challenges. Without clear guidelines and mechanisms for responsibility, users might misuse the application, resulting in misinformation, privacy violations, or even harm that overlaps with the maleficence issue. To address these concerns, it is imperative to work together with legal teams to establish robust frameworks that define the scope and limitations of the technology and finds ways to incorporate them into the answers provided by the application.

Finally, the biggest concern associated with the use of third-party AI applications is privacy. Sending data to OpenAI servers does not provide the necessary level of privacy required for projects with high degrees of confidentiality. However, Responsible Research and Innovation (RRI) programmes such as WIDER UPTAKE can allow for some degree of freedom in this regard. After all, it is similar to allowing an external person to view the data and draw their own conclusions. Nevertheless, within the project, some data were confidential, and due to the nature of the AI API call, access to certain data had to be restricted. Moreover, even though the prompt was engineered to prevent the display of raw data, it still occurred in approximately 30% of cases, as demonstrated in the response consistency analysis. Consequently, for projects where data confidentiality is a top priority, a custom or locally run LLM is necessary. Additionally, the application's scripts should be moved to the backend, as keeping them in the frontend undermines security. However, in the current implementation, access has been granted strictly on a need-to-know basis, limited only to the project stakeholders.

Simply enumerating the weak points of using LLMs in similar projects reveals a range of challenges that need to be addressed and areas for improvement in future research. Unfortunately, projects of this size might not have enough time to fine-tune the data management applications, train a custom LLM or even host a computationally demanding models for the project or experiments. This makes publications like this, which showcase the drawbacks and imperfections of certain implementations, all the more valuable. Despite the numerous areas for improvement, the application did serve its purpose and achieved all the stated goals, including extended stakeholder engagement, democratized data access, risk analysis commentary, and regulatory compliance checks.

The development and deployment of a web application within the WIDER UPTAKE project have proven crucial in managing and visualizing complex datasets generated from case studies. This application not only facilitates real-time monitoring and risk assessment but also enhances stakeholder engagement by making data accessible and comprehensible to a broader audience. The use of LLMs for generating AI-driven reports has shown promise in democratizing data interpretation, ensuring compliance with regulations, and fostering informed decision-making among stakeholders.

One of the critical implications of this work is the potential for AI and machine learning to bridge the gap between technical data and practical application. The AI-generated reports, despite some variability, consistently provided accurate risk assessments and regulatory compliance checks, demonstrating the feasibility of using LLMs in environmental monitoring and decision-making processes. However, the study also highlights the need for transparency, accountability, and data privacy when employing AI in sensitive domains. Future research should focus on developing custom or locally-run LLMs to address privacy concerns and improve the consistency and reliability of AI-generated outputs.

The nature of the WIDER UPTAKE project underscores the importance of collaborative efforts across various sectors, including research institutions, industries, and local communities. By involving a wide array of stakeholders, the project has created a more inclusive approach to water management, ensuring that the benefits of sustainable practices are widely recognized and adopted. This holistic approach can serve as a model for other Horizon Europe projects and similar initiatives.

The integration of CE principles in water management, supported by advanced technological solutions and stakeholder collaboration, offers a promising pathway toward sustainable urban development. The WIDER UPTAKE project has laid a strong foundation for future research and innovation in this field, highlighting the need for continuous improvement in data management practices and AI applications. By addressing the challenges identified in this study, future projects can build on these insights to achieve more resilient and sustainable water management systems.

Special thanks to M. Slezák and T. Burger for their invaluable contributions to this research. Their expertise and dedication were instrumental in the successful completion of this work. This research was supported by the Horizon 2020 project: Achieving wider uptake of water-smart solutions (H2020-SC5-2019-2), Grant agreement ID: 869283.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Afghani
N.
,
Hamhaber
J.
&
Frijns
J.
(
2022
)
An integrated assessment framework for transition to water circularity
,
Sustainability
,
14
(
14
),
8533
.
https://doi.org/10.3390/su14148533
.
Akinsete
E.
,
Stergiopoulou
L.
,
El Said
N.
&
Koundouri
P.
(
2022
)
Multi-actor working groups as fora for WEF nexus innovation and resilience
,
Environmental Sciences Proceedings
,
15
(
1
),
69
.
https://doi.org/10.3390/environsciproc2022015069
.
Andrews
M.
,
Berardo
P.
&
Foster
D.
(
2011
)
The sustainable industrial water cycle – a review of the economics and approach
,
Water Supply
,
11
(
1
),
67
77
.
https://doi.org/10.2166/ws.2011.010
.
Ardiyanti
R.
,
Hallé
C.
&
Azrague
K.
(
2021
)
Procedure for Monitoring and Control of Health and Quality Risks
. .
Ardiyanti
R.
,
Azrague
K.
,
Medema
G.
&
Hallé
C.
(
2024
)
Navigating quality, health and environmental risk: a novel framework for wastewater resource recovery products
,
Journal of Cleaner Production
,
434, 140260. https://doi.org/10.1016/j.jclepro.2023.140260
.
Bellini
A.
&
Bang
S.
(
2022
)
Barriers for data management as an enabler of circular economy: an exploratory study of the Norwegian AEC-industry, IOP Conference Series: Earth and Environmental Science. IOP Publishing
.
Bellini
A.
,
Tadayon
A.
,
Andersen
B.
&
Klungseth
N. J.
(
2024
)
The role of data when implementing circular strategies in the built environment: a literature review
,
Cleaner Environmental Systems
,
13
,
100183
.
https://doi.org/10.1016/j.cesys.2024.100183
.
Bhambhani
A.
,
van der Hoek
J. P.
&
Kapelan
Z.
(
2022
)
Life cycle sustainability assessment framework for water sector resource recovery solutions: strengths and weaknesses
,
Resources, Conservation and Recycling
,
180
,
106151
.
https://doi.org/10.1016/j.resconrec.2021.106151
.
Bouziotas
D.
,
Stofberg
S.
,
Frijns
J.
,
Nikolopoulos
D.
&
Makropoulos
C.
(
2023
)
Assessing the Resilience of Circularity in Water Management: A Modeling Framework to Redesign and Stress-Test Regional Systems Under Uncertainty
,
Taylor & Francis, Oxford
.
Campos
L. B.
(
2022
)
Report on Editorial, Video and Visual Content and Distribution
,
Nieuwegein, the Netherlands. ESCI
. .
Colella
M.
,
Ripa
M.
,
Cocozza
A.
,
Panfilo
C.
&
Ulgiati
S.
(
2021
)
Challenges and opportunities for more efficient water use and circular wastewater management. The case of Campania Region, Italy
,
Journal of Environmental Management
,
297
,
113171
.
https://doi.org/10.1016/j.jenvman.2021.113171
.
Collivignarelli
M. C.
,
Canato
M.
,
Abbà
A.
&
Carnevale Miino
M.
(
2019
)
Biosolids: what are the different types of reuse?
,
Journal of Cleaner Production
,
238
,
117844
.
https://doi.org/10.1016/j.jclepro.2019.117844
.
Corti, S., Davide, C., Marco, M. & Enrico, W.
(
2022
)
Decision Analytic Platform [Software Modules]
.
Zenodo, Geneva, Switzerland
.
Doorn
N.
(
2021
)
Artificial intelligence in the water domain: opportunities for responsible use
,
Science of The Total Environment
,
755
,
142561
.
https://doi.org/10.1016/j.scitotenv.2020.142561
.
Dudgeon
D.
,
Arthington
A. H.
,
Gessner
M. O.
,
Kawabata
Z.-I.
,
Knowler
D. J.
,
Lévêque
C.
,
Naiman
R. J.
,
Prieur-Richard
A.-H.
,
Soto
D.
,
Stiassny
M. L. J.
&
Sullivan
C. A.
(
2006
)
Freshwater biodiversity: importance, threats, status and conservation challenges
,
Biological Reviews
,
81
,
163
182
.
https://doi.org/10.1017/S1464793105006950
.
Eggimann
S.
,
Mutzner
L.
,
Wani
O.
,
Schneider
M. Y.
,
Spuhler
D.
,
Moy de Vitry
M.
,
Beutler
P.
&
Maurer
M.
(
2017
)
The potential of knowing more: a review of data-driven urban water management
,
Environmental Science & Technology
,
51
(
5
),
2538
2553
.
https://doi.org/10.1021/acs.est.6b04267
.
Elelman
R.
,
Wencki
K.
,
Chen
A.
& Englund, A., Anzaldua, G. (
2021
)
The Need for Digital Water in A Green Europe – EU H2020 Projects’ Contribution to the Implementation and Strengthening of EU Environmental Policy
.
Publications Office
, .
European Commission
(
2021
)
Horizon Europe, the EU Research and Innovation Programme (2021–27): for A Green, Healthy, Digital and Inclusive Europe
.
Publications Office of the European Union, Luxembourg
.
Available at: https://data.europa.eu/doi/10.2777/052084 (Accessed: 24 July 2024)
.
European Commission
(
2024a
)
Advanced Wind Energy Systems Operation and Maintenance Expertise (AWESOME)
.
Publications Office of the European Union, Luxembourg. Available at: https://cordis.europa.eu/project/id/642108 (Accessed: 8 October 2024)
.
European Commission
(
2024b
)
REcovery and REcycling of Nutrients TURNing wasteWATER Into Added-Value Products for A Circular Economy in Agriculture (Water2REturn)
.
Publications Office of the European Union, Luxembourg. Available at: https://cordis.europa.eu/project/id/730398 (Accessed: 8 October 2024)
.
European Commission
(
2024c
)
Scale-Up of Low-Carbon Footprint Material Recovery Techniques in Existing Wastewater Treatment Plants (SMART-Plant)
.
Publications Office of the European Union, Luxembourg. Available at: https://cordis.europa.eu/project/id/690323 (Accessed: 8 October 2024)
.
Fatone
F.
,
De Simoni
L.
&
Camejo
J. G.
(
2023
)
HYDROUSA Service Including Transferability and Replication Plan
.
Ancona, Italy. UNIVPM
. .
Frijns
J.
(
2023
)
D7.6 NextGen Final Report
.
Nieuwegein. KWR
. .
Ghafourian
M.
,
Nika
C.-E.
,
Mousavi
A.
,
Mino
E.
,
Al-Salehi
M.
&
Katsou
E.
(
2022
)
Economic impact assessment indicators of circular economy in a decentralised circular water system – case of eco-touristic facility
,
Science of The Total Environment
,
822
,
153602
.
https://doi.org/10.1016/j.scitotenv.2022.153602
.
Kamyab
H.
,
Khademi
T.
,
Chelliapan
S.
,
SaberiKamarposhti
M.
,
Rezania
S.
,
Yusuf
M.
,
Farajnezhad
M.
,
Abbas
M.
,
Hun Jeon
B.
&
Ahn
Y.
(
2023
)
The latest innovative avenues for the utilization of artificial intelligence and big data analytics in water resource management
,
Results in Engineering
,
20
,
101566
.
https://doi.org/10.1016/j.rineng.2023.101566
.
Katika
T.
&
Tsiakou
D.
(
2023
)
Three AR-‘Enhanced’ Demo Cases
.
Athens, Greece. ICCS
. .
Khoury
M.
&
Evans
B.
(
2023
)
NEXTGEN Serious Game About Circular Economy for Water (1.0), Zenodo
.
Elsevier, Amsterdam. (Accessed 24 July 2024)
.
Kirchherr
J.
,
Yang
N.-H. N.
,
Schulze-Spüntrup
F.
,
Heerink
M. J.
&
Hartley
K.
(
2023
)
Conceptualizing the circular economy (Revisited): an analysis of 221 definitions
,
Resources, Conservation and Recycling
,
194
,
107001
.
https://doi.org/10.1016/j.resconrec.2023.107001
.
Liao
Q. V.
&
Wortman Vaughan
J.
(
2024
)
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap, Harvard Data Science Review
,
MIT Press, Cambridge, Massachusetts, USA (Special Issue 5)
.
Mannina
G.
,
Badalucco
L.
,
Barbara
L.
,
Cosenza
A.
,
Di Trapani
D.
,
Gallo
G.
,
Laudicina
V. A.
,
Marino
G.
,
Muscarella
S. M.
,
Presti
D.
& Helness, H. (
2021
)
Enhancing a transition to a circular economy in the water sector: the EU project WIDER UPTAKE
,
Water
,
13
(
7
),
946
.
https://doi.org/10.3390/w13070946
.
Matta
E.
,
Giuliani
M.
,
Palatnik
R.
,
Shechter
M.
,
Pyka
C.
,
Cekin
D.
,
Hassanen
M.
,
Rocco
M. V.
,
Rulli
M. C.
,
Kondouri
P.
,
Vergalli
S.
&
Castelletti
A.
(
2021
)
The AWESOME project: a decision analytic framework for managing water energy food and ecosystems across sectors and scales in the South Mediterranean
,
Copernicus GmbH
.
Copernicus GmbH, Göttingen. https://doi.org/10.5194/egusphere-egu21-10411
.
Mazzonetto
M.
&
Simone
A.
(
2018
)
Introduction to ‘Science, society and citizens: suggestions (and hopes) on how to foster RRI in Horizon Europe’
,
JCOM
,
17
(
03
),
C01
.
https://doi.org/10.22323/2.17030301
.
Merli
R.
,
Preziosi
M.
&
Acampora
A.
(
2018
)
How do scholars approach the circular economy? A systematic literature review
,
Journal of Cleaner Production
,
178
,
703
722
.
doi:10.1016/j.jclepro.2017.12.112
.
Ministry of the Environment of the Czech Republic
(
2020
)
Akční plan Cirkulární Česko 2040 pro období 2022–2027 (Action Plan Circular Czechia for A Period of 2022–2027)
.
Ministry of the Environment of the Czech Republic, Prague. Available at: https://www.mzp.cz/cz/akcni_plan_cirkularni_cesko_2040 (Accessed: 8 October 2024)
.
Morseletto
P.
,
Mooren
C. E.
&
Munaretto
S.
(
2022
)
Circular economy of water: definition, strategies and challenges
,
Circular Economy and Sustainability
,
2
,
1463
1477
.
https://doi.org/10.1007/s43615-022-00165-x
.
Navarro-Ortega
A.
,
Acuña
V.
,
Bellin
A.
,
Burek
P.
,
Cassiani
G.
,
Choukr-Allah
R.
,
Dolédec
S.
,
Elosegi
A.
,
Ferrari
F.
,
Ginebreda
A.
,
Grathwohl
P.
,
Jones
C.
,
Rault
P. K.
,
Kok
K.
,
Koundouri
P.
,
Ludwig
R. P.
,
Merz
R.
,
Milacic
R.
,
Muñoz
I.
, Nikulin, G., Paniconi, C., Paunović, M., Petrovic, M., Sabater, L., Sabater, S., Skoulikidis, N. T., Slob, A., Teutsch, G., Voulvoulis, N. & Barceló, D. (
2015
)
Managing the effects of multiple stressors on aquatic ecosystems under water scarcity. The GLOBAQUA project
,
Science of The Total Environment
,
503–504
,
3
9
.
https://doi.org/10.1016/j.scitotenv.2014.06.081
.
Pollert
J.
(
2021
)
3D Printed Rain Garden Boxes
.
CTU in Prague
, .
Pollert
J.
,
Wanner
J.
,
Puškáčová
A.
,
Strogonov
V.
&
Kovařík
J.
(
2023
)
Functioning of the Rain Gardens
.
CTU in Prague
.
Athens, Greece, Prague. Available at: https://wider-uptake.eu/wp-content/uploads/2023/10/Attachment_0-6.pdf (Accessed: 24 July 2024)
.
Ramm
K.
&
Smol
M.
(
2023
)
Water reuse – analysis of the possibility of using reclaimed water depending on the quality class in the European countries
,
MDPI AG: Sustainability
,
15 (17), 12781. doi:10.3390/su151712781
.
Rogers
A.
&
Luccioni
S.
(
2024
). '
Position: key claims in LLM research have a long tail of footnotes
’,
Proceedings of the 41st International Conference on Machine Learning
, Vol.
235
, pp.
42647
42665
.
Sun
A. Y.
&
Scanlon
B. R.
(
2019
)
How can big data and machine learning benefit environment and water management: a survey of methods, applications, and future directions
,
Environmental Research Letters
,
14
(
7
),
073001
.
https://doi.org/10.1088/1748-9326/ab1b7d
.
Tahir
S.
,
Steichen
T.
&
Shouler
M.
(
2018
)
Water and Circular Economy: A White Paper
.
Arup, Antea Group
:
Ellen MacArthur Foundation
.
Tsiropoulos
Z.
,
Tsiropoulou
A.
&
Zafeiri
E.
(
2024
)
HYDROUSA Game for Increased Public Awareness
.
Athens, Greece. AGENSO
. .
Wang
Y.
,
Cheng
Y.
,
Liu
H.
,
Guo
Q.
,
Dai
C.
,
Zhao
M.
&
Liu
D.
(
2023
)
A review on applications of artificial intelligence in wastewater treatment
,
Sustainability
,
15
(
18
),
13557
.
https://doi.org/10.3390/su151813557
.
Wanner
J.
,
Srb
M.
&
Beneš
O.
(
2023
)
Water reuse in the frame of circular economy
. In:
Ngo, H. H., Guo, W., Pandey, A., Varjani, S., Tsang, D. C. W. (eds.). Current Developments in Biotechnology and Bioengineering. Amsterdam,
Elsevier
, pp.
221
266
.
(Accessed: 24 July 2024)
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).