Journal Archive

Johnson Matthey Technol. Rev., 2022, 66, (4), 466
doi: 10.1595/205651322X16594453018855

Chemical Networks: A Methodology to Rapidly Assess the Environmental Impact of Chemical Processes

Applying graph theory principles to chemical industry data enables early-stage decision making for optimum decarbonisation solutions

  • Joseph Staddon*, Joost Smit
  • Johnson Matthey, 10 Eastbourne Terrace, London, W2 6LG, UK
  • Zinovia Skoufa
  • Johnson Matthey, PO Box 1 Belasis Avenue, Billingham, TS23 1LB, UK
  • David Watson
  • Johnson Matthey Technology Centre, Princeton Drive, Stockton-on-Tees, TS17 6PY, UK
  • *Email:


Submitted 14th January 2022; Revised 10th July 2022; Accepted 2nd August 2022; Online 2nd August 2022

Article Synopsis

As the chemicals industry transitions towards a net zero future, rapid assessment of the sustainability metrics of different process results will be essential to support investment decisions in innovation and deployment. Life cycle analysis (LCA) offers the gold standard for process assessment, but LCA can take weeks or months to complete, with incomplete databases and inflexibility in comparing different chemical pathways. In this study, we demonstrate an alternative and complementary methodology. By simplifying the metrics used to describe chemical processes, each process may be linked to another by its feedstocks and products. This generates a network of the chemical industry, which may be investigated using graph theory principles. A case study of the plastics industry is provided, using publicly available information to quantitatively compare with a more formalised and detailed LCA approach. This methodology proves useful for quickly estimating the carbon intensity and water footprint of thousands of routes. Further development, such as including Scope 3 emissions and additional industrial data, may further improve the methodology.


In its “Climate Change 2021: The Physical Science Basis” report, the Intergovernmental Panel on Climate Change (IPCC) laid out a clear and grim prediction of the future climate (1). In even the lowest greenhouse gas emissions scenario considered, it is more likely than not that a 1.5°C global warming level will be exceeded in the mid-21st century. And only in the lowest emissions scenario does that level fall back under 1.5°C by the century’s end.

Meeting this lowest emissions requirement, in line with the Paris Agreement pledge (2), requires unprecedented transformation across all facets of society. The chemicals industry has a vital role to play in enabling this transition, by reducing emissions of processes currently using fossil feedstocks, and transitioning to emerging carbon feedstocks such as waste, biomass and carbon dioxide.

Several major industrial players have made net zero pledges (3) and the magnitude of this task should not be overlooked. Based on the International Energy Agency’s recent roadmap to net zero by 2050 (4, 5), the CO2 emissions of the chemical industry must fall by an approximate average of 9.1% year-on-year between 2020 and 2050 to reach net zero targets, a substantial step-up from the 1.4% year-on-year decrease under current announced pledges (5). These data are summarised in Figure 1.

Fig. 1.

Estimated annual CO2 emissions of the chemical industry in an announced pledges and a net zero scenario. Based on data from International Energy Agency, 2021 (4) as modified by Johnson Matthey

Many chemical companies are unsure how to meet their net zero pledges, highlighted by a recent survey from Black & Veatch, USA (6). This survey showed that while 80% of companies with revenues over US$250 million set greenhouse gas reduction targets, 25% of them set goals at such a level that they were unsure how to meet them.

While this task is daunting, it is not impossible. In fact, there are many possible decarbonisation pathways proposed by the chemicals industry, such as the use of bio-based feedstocks, carbon capture and utilisation (CCUS), electrification of processes using renewable electricity, incorporation of low carbon hydrogen and more (7). The optimum solution varies for different chemical processes, different regions and different infrastructure, so selecting which technologies to invest and innovate in at an early stage is a challenge in of itself. This is also a key consideration when considering what incentives, credits or funding schemes are applicable to support development and implementation.

To address this challenge, the chemicals industry requires data-driven analysis approaches to help inform early-stage decision making. These approaches should be accessible to a diverse group of stakeholders (technologists, project developers, financiers), flexible to the wide array of different process solutions available, and quick enough to adapt to the fast-changing industry.

Life Cycle Analysis

“All models are wrong, but some are useful” is an appropriate aphorism when discussing sustainability assessments (8). The most common form of process and product environmental sustainability assessment is LCA. Considerable effort has been made to standardise LCA methodologies and these offer the most accurate estimates for carbon intensity of products. They are of course still only models, so many assumptions must still be made. The main issues with the LCA approach for this work are its site and process specificity and its labour-intensity.

While LCA databases like ecoinventTM (9) provide equivalent CO2 figures for various chemical products such as plastics, this data lacks granularity. The specific process steps used are often unclear and it is non-trivial to alter specific steps. Finding sources of data points takes considerable time (with full LCAs taking weeks or months to complete) and interacting with the data through programs such as SimaProTM (10) can be non-intuitive.

The authors would like to be clear that the time-consuming nature of LCA processes is necessary to allow for adequate quality control and the involvement of external practitioners, both of which are critical to prevent methodology misuse and deception. However, to meet these necessary standards the full LCA process must be repeated for every path from raw material to final product, which is unfeasible for early-stage, high-level investigations. And so, a complementary and alternative tool to guide early pathway thinking and innovation decisions is highly desirable.

The Chemical Networks Approach

A novel approach, as discussed here, involves invoking the principles of graph theory to model chemical processes. Graph theory is a complex field of mathematics and will not be covered in detail here. However, a brief introduction of terms is presented in the Supplementary Information accompanying the online version of this article.

The abstract concept of graph theory underpins networks that are essential to our day-to-day lives, from the internet to satellite navigation. Networks have previously been used to describe chemical reaction phenomena, notably for the dynamic behaviour of biochemical systems (1113).

Graph theory may also be used to explore chemical processes, where chemical feedstocks and products are nodes, linked by chemical reactions or processes. In doing so, process data (and the associated sustainability metrics) may be linked, allowing a ‘big picture’ view of complex, multi-step process routes. Chemists and chemical engineers intuitively link the feedstocks and products of these routes, so it makes sense to link the data in the same way.

A concept of chemical networks was taken from conception through to commercialisation by Grzybowski et al., who created a vast chemical network of organic chemistry from the Beilstein Database (14, 15). Investigating the topology of this network elucidated some interesting insights into its structure, but the data lacked sufficient detail for more interesting questions, such as where to find novel and more efficient retrosynthetic pathways, to be answered.

Subsequent research (16) plus considerably more data and complex algorithms allowed the team to demonstrate the concept’s real-world application with predicted and validated one-pot reactions (17), optimised synthesis pathways (18) and management of chemical threats (19). The resulting ‘chemical internet’ allowed for considerable synthetic insight, with the associated Chematica program used to autonomously design synthetic pathways to medicinally relevant targets (20, 21). This software was subsequently purchased by Merck, where it exists as the purchasable SYNTHIATM software package (22).

While this is an impressive body of work, SYNTHIATM’s prime use is for the complex synthetic organic chemistry of the pharmaceutical industry. The same conceptual framework may also be used for evaluating large-scale industrial chemical processes in the commodities space. The methodology described herein was developed independently but shares the same base premise and philosophy.

In this work, industrial chemical feedstocks and products are represented as nodes in the chemical network, with the edges between the nodes containing industrial chemical process data provided from various literature, market or commercial sources. This concept is illustrated in Figure 2. These edges may be weighted by industrially relevant data such as CO2 equivalence (CO2e), water usage or feedstock consumption.

Fig. 2.

Illustration of methodology with the example of formaldehyde production from methanol, with various industrial process available

Similar methodologies for evaluating sustainability metrics have been employed in the past, using linear programming and ‘superstructure’ network analysis to evaluate the chemical industry (2329). The work herein differs in its approach, in designing an explorable network of every possible route and an interactive user interface to allow users to explore all possible routes to all products in real time, as well as crude approximations of the effects of different geographies.

The analysis carried out aims to apply the principles of graph theory to industrial chemical process data in order to rapidly assess the environmental metrics of different pathways from raw material to final chemical product. These methods are then compared with existing assessment methods. In this research, the plastics production value chain was chosen as a case study. This was chosen because emissions from the plastic industry are sizeable (potentially reaching 15% of the global carbon budget by 2050 (30)), with a large proportion of industrially relevant processes involved from raw material to final product.

Materials and Methods

The first step required to represent the sustainability metrics of chemical process data as edge weights in a network is to define the simplified parameters used to represent the metrics. The goal here is to apply universal and quick-to-compute rules to all processes. Such simplification admittedly reduces the accuracy of the data provided and therefore should only be used for preliminary screening and comparison of different process routes, rather than for detailed sustainability data for an individual process or process routes.

Defining Equivalent Carbon Dioxide

This methodology defines CO2e of a chemical process using three parameters, namely ‘direct process’, ‘direct utilities’ and ‘indirect utilities’ which are defined in detail in the following section and shown in Equation (i). These definitions are similar to those presented by IHS Markit in its chemical footprint methodology (31). A fourth parameter, relating to the CO2 equivalence of sourcing the feedstock, is presented for discussion and further work.


where mtarget product > 0.

The first parameter discussed is ‘direct process’. Direct process represents any CO2 that originates from the feedstocks. It is calculated from the sum of the net equivalent mass of CO2 formed per mass of all carbon containing useful products, subtracted from the net equivalent mass of CO2 formed per mass of all carbon containing feedstocks, assuming complete oxidation to CO2. This is represented in Equation (ii):


where mtarget product > 0, if component = feedstock, mfeedstock > 0, if component = useful product, museful product < 0.

For a defined chemical component, with a defined molar mass, calculating the CO2e is trivial, as shown in Equation (iii):


For more ill-defined chemical feedstocks, which consist of mixtures of chemicals, CO2e may be estimated by approximating as one component (for example, pyrolysis gasoline may be reasonably approximated as benzene due to its high aromatic content); a mixture of defined components (for example, BTX may have a known percentage of benzene, toluene and xylenes, from which CO2e can be estimated); or from literature (for example, coal’s carbon content will vary, so literature values can be used to give an approximation).

Exceptions may be made for bio-based feedstocks. As the CO2 used to form these feedstocks is directly removed from the atmosphere via photosynthesis, CO2ebio-feedstock = 0. The same approach is taken with direct air capture CO2. Further complexity may be introduced by including the CO2 contribution of sourcing the raw materials, which is particularly important for bio-feedstocks. This will be discussed in a later section.

All useful products, defined as products which have sufficient value as to be taken as a negative cost in the process economics, are factored in, even if they are used in the process. As an example, fuel gas is produced in several petrochemical processes and then burned to cover the process heat requirements. The museful product value here is still negative, as the direct utilities CO2e is a gross value, so will cancel this out.

The second variable is ‘direct utilities’. Direct utilities represents any CO2 that originates from onsite combustion of fuels to generate heat, or conversely any heat energy exported from exothermic processes. Value represented in process data must be converted to a useable CO2e. Typically, the values are split into two variables: ‘fuel consumption’ and ‘steam consumption’. Fuel consumption is defined in units of energy per mass of target product, so the CO2e value can be calculated by multiplying this energy by the CO2e value for the regional energy source (Table I), which is estimated from the US Environmental Protection Agency. Steam consumption is defined in units of mass per mass of target product. The energy required to generate this steam is calculated from the latent heat of saturated steam at the given pressure, multiplied by the mass of steam consumed. The CO2e is then calculated (Equation (iv)) from the energy according to the regional values supplied in Table I.


where mtarget product > 0.

Table I

Assumed Regional Direct Utilities CO2 Equivalents (32)

Region Direct utilities fuel Direct utilities CO2 coefficient, tonneCO2e MWh–1
USGC Natural gas 0.181
Germany Natural gas 0.181
China Coal 0.326

The third value is ‘indirect utilities’, where CO2e is calculated in a similar way as direct utilities (Equation (v)), except here the utilities energy (for electricity and refrigeration in the process) is supplied from offsite power. Again, this is region-specific, depending on the energy mix in the regions. This is taken into account by employing the CO2 coefficients shown in Table II. These coefficients account for differences in regional power mixes.


where mtarget product > 0.

Table II

Assumed Regional Indirect Utilities CO2 Coefficients (3337)

Region Indirect utilities CO2 coefficient, tonneCO2e MWh–1
USGC 0.388
Germany 0.414
China 0.670

As mentioned earlier, adding a fourth variable representing the CO2 equivalent of sourcing the feedstock would improve the accuracy of the estimated total CO2e. This variable is tentatively titled ‘indirect process’ or ‘indirect material sourcing’. For example, chlorine (which has no direct process equivalent CO2) would have a value of ~1 tonneCO2e tonne–1 due to the energy requirements of the chloralkali process (38). The addition of this variable would be especially helpful when assessing bio-based processes, where the carbon footprint associated with land use for growing the biomass plays an important role. Data can be found by consulting the databases available for LCA assessment (9).

The largest degree of uncertainty in these data, assuming that the process data input is accurate, is in assigning a CO2e value to ill-defined chemical feedstock compositions, such as coal. Literature searches reveal highly variable values for the carbon density of coal, naphtha, crude oil and other raw materials. More accurate data, tabulated in a database and perhaps regionally specific, would help in defining the direct process carbon equivalence. It is helpful to use a small number of values for the carbon intensity of these feedstocks (rather than making them process specific), as this allows for easier comparison between processes and simplifies the chemical networks. The accuracy of this data is sufficient for preliminary screening and comparison of several routes based on the same feedstock.

As an example of a reasonable compromise, in this study synthesis gas (syngas) has been simplified into ‘Syngas 1:1’, ‘Syngas 2:1’ and ‘Syngas 3:1’, representing the different ratios of hydrogen and carbon monoxide. Though different processes will produce different compositions of gas, grouping in this way allows for easier comparison in chemical networks. These approximate ratios are cited in process report data, so the assumption is unlikely to add significant uncertainty to results. It should be noted that other gas products produced, such as methane, are explicitly accounted for.

Defining Other Variables

Other variables, such as the E-factor, energy and water consumption, are defined in the Supplementary Information.

Building the Network

The data for individual process steps can be built into network graphs and the chemical pathways created are used to estimate CO2e for a particular chemical pathway. Details on how the network is built, and the rules applied, can be found in the Supplementary Information.

In order to assess the large amount of industrial chemical processes required, IHS Markit Process Economics Program (PEP) reports were primarily used for process data (39). A table of processes cited, with their associated primary feedstock, primary product and reference, are available in the Supplementary Information.

By applying these rules to a database of process data, linking each process by its chemical feedstocks and products, every possible chemical pathway from a feedstock to product may be found, and the CO2e for this pathway can be calculated. The workflow is summarised in Figure 3, with the process data transferred to a database, the chemical network formed from the data and the individual pathway data extracted from the network.

Fig. 3.

Illustration of chemical networks workflow

Expanding the network is trivial as new process data may be entered into the database and the network expanded by the algorithm of linking processes by feedstocks and products.

Results and Discussion

Visualising the Network

An illustration of the chemical network corresponding to the plastics production value chain is provided in Figure 4. In this graph, each node is a chemical feedstock or product and the edges between each node are weighted by the equivalent CO2 of the lowest carbon route from that feedstock to 1 tonne of the next product. Equivalent CO2 from other sources such as materials sourcing, transport and leaks are not included. The selected region is the US Gulf Coast (USGC), with direct utilities supplied via natural gas and indirect utilities sourced from the regional energy mix. No special accounts of renewable energy use are used. The graph is constructed from over 200 processes, creating over 23,000 possible routes to the final polymers. The process data are sourced from literature articles, internal analysis and IHS Markit PEP reports. See Supplementary Information for full list of external reports. Some process data includes integrated steps, leading to intermediate nodes being excluded, for example lignite to propylene. The final products selected are the polymers: low density polyethylene (LDPE), linear low-density polyethylene (LLDPE), bimodal high-density polyethylene (HDPE), polypropylene (PP), polyethylene terephthalate (PET), polyvinyl chloride (PVC), polybutylene terephthalate (PBT) and polyacrylic acid (PAA). Plotting a chemical network in this way allows for quick insights, crucially on where the most carbon intensive steps lie in the plastics value chain. Typically, these are the first or second chemical transformations, with the final polymerisation steps being less carbon intensive. Therefore, more investment and scientific effort should be placed on reducing the carbon intensity of the initial key chemical feedstocks such as ethylene and methanol.

Fig. 4.

A representative chemical network for the plastics industry, with the targeted polymers LDPE, LLDPE, bimodal HDPE, PP, PET, PVC, PBT and PAA for the USGC region. The colour and thickness of the edges represent the edge weighting, the colour and size of each node represent the average shortest path length and the out degree of the nodes respectively. Analysis by the authors using the Python library NetworkX (40). The graph is plotted using Cytoscape (41, 42) and arranged using the yFiles hierarchic layout

This graph is limited in its use, however, as it does not account for the feedstock consumption of each step toward the targeted products. Furthermore, the above visualisation does not show every possible route (of which there are over 23,000 to the polymer end products from the over 200 processes included), instead only showing the least carbon intensive step between each feedstock and product.

Interacting with the Network

Further complexity (and more information) may be introduced by selecting specific chemical pathways via an interactive user interface, such as the one shown in Figure 5. This user interface allows the user to select a region, a target product from the network, a starting feedstock, any intermediate products to include in the pathway and any specific processes to include as well. The output is a breakdown of the metrics of every possible route in the network that fulfils the user restrictions. This can be used to quickly overview different process routes and see where the most carbon intensive steps are. Searching for process routes to a specific product, with the option to select starting feedstocks, intermediate products and specific processes, takes less than one minute. Data entry for a new process, assuming the data is available, takes less than five minutes. This compares favourably with much more detailed LCA approaches, which can take weeks or months to complete.

Fig. 5.

The interactive chemical networks user interface, allowing user to interact with and explore the network in real time. In this example, the user is investigating different routes to PVC from ethane in China

An example of using this network for high-level, early-stage process assessment is shown in Figure 6. Here, the effect of changing the raw material when making PP via methanol is investigated. For each process step, the lowest equivalent CO2 process was selected, with Steps 2 and 3 identical for Figure 6(a), 6(b) and 6(c). All equivalent CO2 values were calculated according to the methodology described. Equivalent CO2 from other sources, such as materials sourcing, transport and leaks, are not included. The selected region is the USGC, with direct utilities supplied via natural gas, and indirect utilities sourced from the regional energy mix, except for Figure 6(c) where indirect utilities is approximated as 0 tonneCO2e kWh–1. Direct air capture CO2 is assumed to have a component CO2e of 0 tonneCO2e tonneCO2–1.

Fig. 6.

Graphs showing outputs from the chemical networks tool for different routes to PP from: (a) natural gas; (b) coal; and (c) CO2 captured from the air. The process data are sourced from literature articles, internal analysis and IHS Markit PEP reports. See Supplementary Information for full list of external reports

From this analysis, some initial conclusions may be drawn. Firstly, the final polymerisation step has a lower net CO2e than the propylene formation step and the methanol formation step in Figure 6(a) and 6(b). In addition, the vast majority of the polymerisation net CO2e stems from the indirect utilities used. By using renewable electricity, this value could be drastically reduced. This is also shown in full LCA analysis of PP production, with cradle-to-propylene production accounting for 82% of the CO2e of PP production and the majority of utilities in a PP resin plant used for electricity generation (43).

Secondly, the dramatic effect of changing the feedstock used can be seen, with the use of coal over natural gas leading to an over six-times increase in the overall carbon intensity. On the other hand, using direct air capture CO2, green hydrogen and renewable electricity to make methanol leads to a significantly carbon negative process.

Finally, the chemical networks approach allows the user to view the ‘big picture’ effect of their first process step choices. By choosing to use a green methanol process, the carbon intensity of the propylene formation and polymerisation steps may be cancelled out, leading to overall carbon-negative PP. What’s more, captured CO2 produced from the natural gas utilities could then be reused in the first step in an integrated plant, reducing the electricity requirements.

These conclusions could help direct resourcing efforts to the most impactful changes to reduce the CO2e of PP and therefore reach the industry’s net zero goals.

Comparison with Literature

While a comparison with literature for every process analysed using this methodology is not feasible, Figure 7 presents a comparison for a variety of processes used to make ethylene. As shown in Figure 7, the chemical networks methodology presented in this paper provides comparable results with other data available in the literature for the ethylene production processes.

Fig. 7.

Comparison of the CO2e per tonne of high value products for varying industrial ethylene production processes in the USGC. The results of this methodology (with error bars associated with different process routes), are compared with literature results (with error bars associated with the range of results) (4446)

A weakness of the current methodology is highlighted in Figure 8. Applying the chemical networks methodology to estimate the CO2 per tonne for the production of PVC from ethane in the USGC results in a substantially lower CO2e value compared to the values estimated by LCA using data from ecoinventTM (1.35 tonnesCO2e tonnePVC–1 vs. 2.59 tonnesCO2e tonnePVC–1 respectively). However, if the indirect material sourcing equivalent CO2 for the chlorine is included (1.09 tonnesCO2e tonnePVC–1), then the results produced by the two methods are comparable (2.44 tonnesCO2e tonnePVC–1 vs. 2.59 tonnesCO2e tonnePVC–1). It is expected that the chemical networks result would be lower than LCA, as variables such as transport are not factored in. This indicates the importance of including this variable in further work.

Fig. 8.

Comparison of the CO2e per tonne of PVC in the USGC between a route selected using the chemical networks tool and LCA from ecoinventTM (9). Indirect material sourcing equivalent CO2, attributed to sourcing the 1.7 tonne of chlorine required, is estimated from the electricity requirement of the chloralkali process (38). The route has been selected to as closely match the LCA route as possible (ethane steam cracking; ethylene dichloride using a fix-bed reactor; VCM from ethylene dichloride; and PVC via emulsion polymerisation)

Advantages of the Chemical Networks Methodology

The Chemical Networks methodology was designed to quickly assess the sustainability metrics of process routes from raw material to final product and it is successful in rapidly assessing the environmental impact of tens of thousands of pathways in minutes. Such speed is valuable for early-stage decision making for investment or policy decisions and where reliable LCA data is scarce. The ease of use of the tool also encourages users to consider the sustainability consequences of their decisions at the start of new project cycles, as timely initial assessments are no longer a barrier to do so.

What’s more, if the process data is available (or can be reasonably estimated), adding new processes and expanding the network is trivial. This is essential as new, sustainable processes (such as bio-based or electrochemical processes) are commercialised and require quick comparison with existing routes.

Further Work

While this work serves as an initial proof of concept for the methodology, further development will enhance the utility and accuracy of the tool developed. A notable weakness of the current network is its inability to meaningfully handle multiple feedstocks and products as branching paths. By creating a binodal graph, where feedstocks and products act as one node and processes as another, this could be overcome. This is illustrated in Figure 9, where factoring in multiple products is greatly simplified through the binodal approach. By containing all the process data in nodes, rather than in both nodes and edges, branching path computation is greatly simplified. This approach was undertaken by Grzybowski et al. in their research on retrosynthesis of organic reactions using a chemical networks approach (1421). The user could interact with the network in the same manner as the current implementation but limit the length of paths unrelated to the target product to prevent the computation reaching unfeasibly expensive levels.

Fig. 9.

An example of a binodal, multidirected graph, where u, v, w, x, y and z represent chemical feedstocks or products, and i, ii, iii and iv represent industrial chemical processes

Another addition, discussed earlier, is the inclusion of indirect material sourcing metrics to generate more realistic CO2e levels for different chemical feedstocks. Additional process metrics, such as cost, may be added to provide even more meaningful comparison between different chemical pathways. Adding additional variables to the network is trivial as it simply requires the addition of information to the starting database.

In their current state, the methodology and results do not meet the LCA standards of ISO 14044:2006 (47). This was out of scope for this initial proof of concept. Instead, this work is designed to complement LCA studies by providing an initial, high-level view for early-stage investigations. While meeting the LCA International Organization for Standardization (ISO) standard steps (goal and scope definition, life cycle inventory analysis, life cycle impact assessment and interpretation) for each pathway is not feasible, the methodology could be adapted and used with established LCA software, data and methodologies to meet these standards while maintaining the intuitive design.

Finally, the chemical network may be vastly expanded to include a further range of chemical industries. While this work has focused on the plastics industry, through the simple methodology provided and with the process data available, adding additional processes and routes to other chemicals is trivial. As the number of processes used in large-scale industrial chemical synthesis is rather more limited than, for example, the pharmaceutical industry, providing a meaningful picture of this industry is not implausible.


To meet the Paris Agreement pledge and net zero targets, the chemical industry requires data-driven analysis methods for early-stage decision making, to better target resources and accelerate delivery on sustainability commitments. Current LCA methods, though acting as the gold-standard for data accuracy and transparency, are limited for this application by their necessary high labour intensity (taking weeks or months to complete) and their site or process specificity.

In this work, the sustainability metrics (equivalent CO2, water consumption, feedstock consumption, E-factor) of over 23,000 chemical pathways in the plastics industry were investigated using graph theory principles. The resulting chemical network allows for rapid (~1 min) insights for early-stage sustainability assessments, saving considerable time compared to more in-depth LCA methods and indicating promising low-carbon routes or areas where more investment and innovation into low carbon solutions is needed.

The analysis produced reasonable results when compared to literature, but further work on including indirect, or Scope 3, emissions and adapting to LCA ISO standards is required to improve the accuracy and quality. The interactive tool, while accessible to a wide range of users, would benefit from branching paths for multi-product processes, which could be achieved via a binodal graph structure. Further data such as costs may also be added in future development.

The chemical networks tool has great future potential, with the design allowing new processes to be quickly and easily added. The scope could therefore feasibly be expanded to include nearly all large-scale industrial chemical processes, allowing for a high-level overview of the entire industry. As the need for drastic reductions in emissions to reach net zero targets grows, such insight may prove invaluable.


  1. 1.
    “Climate Change 2021: The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change”, Intergovernmental Panel on Climate Change, Geneva, Switzerland, 2021 LINK
  2. 2.
    ‘The Paris Agreement’, United Nations, New York, USA, 2015, 27 pp LINK
  3. 3.
    ‘Companies Taking Action’, Science Based Targets: (Accessed on 15th October 2021)
  4. 4.
    “Net Zero by 2050: A Roadmap for the Global Energy Sector”, International Energy Agency, Paris, May, 2021, 224 pp LINK
  5. 5.
    “World Energy Outlook 2021”, International Energy Agency, Paris, France, October, 2021, 386 pp LINK
  6. 6.
    “Corporate Sustainability Goal Setting and Measurement”, GreenBiz Group Inc, Oakland, USA, and Black & Veatch, Overland Park, USA, May, 2021, 15 pp LINK
  7. 7.
    D. Saygin and D. Gielen, Energies, 2021, 14, (13), 3772 LINK
  8. 8.
    G. E. P. Box, ‘Robustness in the Strategy of Scientific Model Building’, in “Robustness in Statistics”, eds. R. L. Launer and G. N. Wilkinson, Academic Press Inc, New York, USA, 1979, pp. 201–306 LINK
  9. 9.
    ‘ecoinvent Database’, ecoinvent, Zurich, Switzerland: (Accessed on 5th September 2022)
  10. 10.
    ‘SimaPro: LCA Software for Information Change-Makers’, Pré Sustainability, Amersfoort, The Netherlands: (Accessed on 10th November 2021)
  11. 11.
    M. Feinberg, Chem. Eng. Sci., 1987, 42, (10), 2229 LINK
  12. 12.
    C. I. Sandefur, M. Mincheva and S. Schnell, Mol. BioSyst., 2013, 9, (9), 2189 LINK
  13. 13.
    G. N. Simm and M. Reiher, J. Chem. Theory Comput., 2017, 13, (12), 6108 LINK
  14. 14.
    M. Fialkowski, K. J. M. Bishop, V. A. Chubukov, C. J. Campbell and B. A. Grzybowski, Angew. Chem. Int. Ed., 2005, 44, (44), 7263 LINK
  15. 15.
    K. J. M. Bishop, R. Klajn and B. A. Grzybowski, Angew. Chem. Int. Ed., 2006, 45, (32), 5348 LINK
  16. 16.
    B. A. Grzybowski, K. J. M. Bishop, B. Kowalczyk and C. E. Wilmer, Nat. Chem., 2009, 1, (1), 31 LINK
  17. 17.
    C. M. Gothard, S. Soh, N. A. Gothard, B. Kowalczyk, Y. Wei, B. Baytekin and B. A. Grzybowski, Angew. Chem. Int. Ed., 2012, 51, (32), 7922 LINK
  18. 18.
    M. Kowalik, C. M. Gothard, A. M. Drews, N. A. Gothard, A. Weckiewicz, P. E. Fuller, B. A. Grzybowski and K. J. M. Bishop, Angew. Chem. Int. Ed., 2012, 51, (32), 7928 LINK
  19. 19.
    P. E. Fuller, C. M. Gothard, N. A. Gothard, A. Weckiewicz and B. A. Grzybowski, Angew. Chem. Int. Ed., 2012, 51, (32), 7933 LINK
  20. 20.
    T. Klucznik, B. Mikulak-Klucznik, M. P. McCormack, H. Lima, S. Szymkuć, M. Bhowmick, K. Molga, Y. Zhou, L. Rickershauser, E. P. Gajewska, A. Toutchkine, P. Dittwald, M. P. Startek, G. J. Kirkovits, R. Roszak, A. Adamski, B. Sieredzińska, M. Mrksich, S. L. J. Trice and B. A. Grzybowski, Chem, 2018, 4, (3), 522 LINK
  21. 21.
    B. A. Grzybowski, S. Szymkuć, E. P. Gajewska, K. Molga, P. Dittwald, A. Wołos and T. Klucznik, Chem, 2018, 4, (3), 390 LINK
  22. 22.
    ‘SYNTHIA TM Retrosynthesis Software’, Merck KGaA, Darmstadt, Germany: (Accessed on 15th October 2021)
  23. 23.
    G. K. Al-Sharrah, I. Alatiqi, A. Elkamel and E. Alper, Ind. Eng. Chem. Res., 2001, 40, (9), 2103 LINK
  24. 24.
    J. Duque, A. P. F. D. Barbosa-Póvoa and A. Q. Novais, Ind. Eng. Chem. Res., 2010, 49, (9), 4230 LINK
  25. 25.
    R. Calvo-Serrano and G. Guillén-Gosálbez, ACS Sustain. Chem. Eng., 2018, 6, (5), 7109 LINK
  26. 26.
    A. Gonzalez-Garay and G. Guillen-Gosalbez, Chem. Eng. Res. Des., 2018, 137, 246 LINK
  27. 27.
    D. Schack, L. Rihko-Struckmann and K. Sundmacher, Ind. Eng. Chem. Res., 2018, 57, (30), 9889 LINK
  28. 28.
    R. Ahmed, S. Shehab, D. M. Al-Mohannadi and P. Linke, Chem. Eng. Sci., 2020, 227, 115922 LINK
  29. 29.
    I. Ioannou, S. C. D’Angelo, Á. Galán-Martín, C. Pozo, J. Pérez-Ramírez and G. Guillén-Gosálbez, React. Chem. Eng., 2021, 6, (7), 1179 LINK
  30. 30.
    J. Zheng and S. Suh, Nat. Clim. Chang., 2019, 9, (5), 374 LINK
  31. 31.
    R. J. Chang and J. Lacson, “Accounting for Carbon Emission Cost in Chemical Production Economics PEP Review”, PEP Review 2017-12, IHS Chemical, London, UK, October, 2017, 51 pp
  32. 32.
    “Inventory of U.S. Greenhouse Gas Emissions and Sinks: 1990-2019”, EPA-430-R-21-005, US Environmental Protection Agency, Washington, DC, USA, 2021, 791 pp LINK
  33. 33.
    ‘How Much Carbon Dioxide is Produced per Kilowatthour of U.S. Electricity Generation?’, US Energy Information Administration, Washington, DC, USA, 4th November, 2021 LINK
  34. 34.
    ‘National Greenhouse Gas Inventories (IPCC Common Reporting Format sector classification)’, European Environment Agency, Copenhagen, Denmark: (Accessed on 5th September 2022)
  35. 35.
    ‘Production: Gross Electricity Production in Germany’, Federal Statistical Office, Wiesbaden, Germany, 18th July, 2021 LINK
  36. 36.
    X. Tao, P. Wang and B. Zhu, Sustainability, 2016, 8, (6), 506 LINK
  37. 37.
    “China Energy Statistical Yearbook: 2015”, China Statistics Press, Beijing, China, 2015 LINK
  38. 38.
    P. W. Griffin, G. P. Hammond and J. B. Norman, Appl. Energy, 2018, 227, 587 LINK
  39. 39.
    ‘Process Economics Program (PEP) Yearbook Database’, IHS Markit, London, UK, 2020
  40. 40.
    A. A. Hagberg, D. A. Schult, and P. J. Swart, ‘Exploring Network Structure, Dynamics, and Function using NetworkX’, Proceedings of the 7th Python in Science conference (SciPy 2008), Pasedena, USA, 19th–24th August, 2008, eds. G. Varoquaux, T. Vaught and J. Millman, SciPy, Austin, USA, 2008, pp. 11–15 LINK
  41. 41.
    P. Shannon, A. Markiel, O. Ozier, N. S. Baliga, J. T. Wang, D. Ramage, N. Amin, B. Schwikowski and T. Ideker, Genome Res., 2003, 13, (11), 2498 LINK
  42. 42.
    D. Otasek, J. H. Morris, J. Bouças, A. R. Pico and B. Demchak, Genome Biol., 2019, 20, 185 LINK
  43. 43.
    Franklin Associates, “Cradle-to-Gate Life Cycle Analysis of Polypropylene (PP) Resin”, American Chemistry Council, Washington, DC, USA, February, 2021, 44 pp LINK
  44. 44.
    T. Ren, M. Patel and K. Blok, Energy, 2006, 31, (4), 425 LINK
  45. 45.
    T. Ren, B. Daniëls, M. K. Patel and K. Blok, Resour. Conserv. Recycl., 2009, 53, (12), 653 LINK
  46. 46.
    I. Amghizar, L. A. Vandewalle, K. M. Van Geem and G. B. Marin, Engineering, 2017, 3, (2), 171 LINK
  47. 47.
    ‘Environmental Management – Life Cycle Assessment – Requirements and Guidelines’, ISO 14044:2006, International Organization for Standardization, Geneva, Switzerland, 2006 LINK


The user interface tool shown in this work was co-developed with Inawisdom Ltd, UK. The authors would also like to thank Paul Clark, Connor Longland (Johnson Matthey) and Ben Hancox (formerly Johnson Matthey) for their assistance in the development of the chemical networks tool; Dominic Winch (Johnson Matthey) and Cathy Tway (Johnson Matthey) for support in the article messaging; and Sue Ellis (Johnson Matthey) for valuable discussion and direction throughout the project.

Supplementary Information

Download the Supplementary Information for this article (PDF, 806 KB)

The Authors

Joseph Staddon is a Technical Strategy Analyst at Johnson Matthey, UK, where he works in the Hydrogen Technologies business. He has previously worked on researching electrochemical synthesis of metal complexes, designing electrocatalysts for fuel cells and electrolysers, evaluating opportunities for sustainable growth in the chemical industry and facilitating open innovation collaborations in cleantech. Currently, he is working on strategic evaluation and technical analysis of hydrogen technology opportunities.

Joost Smit is a Technology Development Manager at Johnson Matthey working in the Process Development Group and is based in The Netherlands. He joined Johnson Matthey in 2014 and has been working on the commercialisation of Johnson Matthey’s proprietary gold catalyst for acetylene-based vinyl chloride monomer (VCM) production, the Johnson Matthey acetylene to VCM process as well as several other early-stage process developments and evaluations.

Zinovia Skoufa is a Business Development Manager at Johnson Matthey. She is a Chemical Engineer with a PhD in the area of heterogeneous catalysis from Aristotle University of Thessaloniki, Greece. She joined Johnson Matthey in 2015 and worked on several step-out catalysis programmes before joining the Business Development and Innovation team.

David Watson is a chemical engineer with over 20 years of experience with Johnson Matthey in process technology development and commercialisation (technologies including ethyl acetate, with lead roles in COMETTM, dimethyl ether and monoethylene glycol), and now heads Johnson Matthey’s process development team.

Related articles

Guest Editorial: Industrial Biotechnology

Microbial Production of Hydrogen

Find an article