New AI system extracts numerical data from academic texts, freeing researchers from routine tasks

The Quinex framework automatically structures quantitative data and is designed to help manage the growing flood of data

21-Apr-2026

Symbolic image

AI-generated image

3D Raman microscopes with unequalled speed, sensitivity and resolution

AQ700 - A Discrete Analyser with high throughput and walk away time

High-Voltage, and High-Resolution CT Scanner for Non-Destructive Research and Industrial Inspection

Numbers are the language of science—yet in research articles, they are often buried within the text and difficult to analyze. Researchers at Jülich have developed an AI system that automatically identifies these numbers, categorizes them, and converts them into structured data. The Quinex framework thus eliminates the need for time-consuming manual work.

Whether in energy, climate, or materials research—scientific papers are full of numbers—or, more precisely, quantitative data: efficiencies, temperatures, costs, emissions. These are often crucial for improving models or identifying trends. At the same time, the number of scientific publications is growing rapidly. For many research questions, it is now virtually impossible to manually evaluate all relevant publications—the time and resources required would be enormous.

The Quinex (“Quantitative Information Extraction”) framework, developed by researchers at Jülich, is based on language models and automates this process: Artificial intelligence identifies numerical values, assigns them to appropriate units, and recognizes what was measured, when, where, and how. Thus, a sentence like “Efficiency levels of 63 to 71 percent are assumed for 2025” is transformed into a structured dataset containing all relevant contextual information—from the year and measurement method to the source.

AI start-up helps companies to use their data to make the best possible decision under any circumstances

Scavenger AI secures €1.1m pre-seed funding

Read news

Open and Efficient AI

Unlike many proprietary AI solutions, Quinex is based entirely on open, relatively small, and thus efficient language models. These have been specifically trained to recognize and classify quantitative information in scientific texts. Compared to similar systems, Quinex delivers more precise results, captures contextual information in a more nuanced way, and also takes implicit characteristics into account.

Despite its compact size, Quinex achieves a recognition accuracy (F1) of around 98 percent for numbers and associated units, and approximately 87 and 82 percent for the classification of quantified properties and entities. These high accuracy rates were achieved through specially created training datasets and methodological improvements.

“We wanted to develop a tool that is powerful, yet also transparent and resource-efficient,” explains Dr. Jann Weinand, head of the Integrated Scenarios Department at Jülich System Analysis. “Quinex makes artificial intelligence more accessible for data analysis in science.”

Successful Practical Test

To test Quinex’s practical suitability, the system was applied to thousands of scientific abstracts from various fields. It successfully extracted data on electricity production costs for various energy technologies, on maximum oxygen uptake in humans, on earthquake magnitudes and locations, and on the band gaps of photovoltaic materials.

The automatically derived values closely matched the respective reference data. This demonstrates that Quinex is well-suited for analyzing large volumes of academic literature across a wide range of research fields and deriving reliable trends from it.

New Perspectives for Research

“Language models open up new perspectives for science and help maintain an overview of entire research fields,” says lead author Jan Göpfert. “They enable automated literature searches, the creation of uniformly structured research databases, and trend analyses that reveal developments in science and technology at an early stage.”

“Our goal is to relieve researchers of routine work,” says Dr. Patrick Kuckertz, head of the Research Data Management Group. “Quinex is designed to help them arrive at insights more quickly and manage the growing flood of data in science.”

Limitations and future improvements

Quinex isn’t entirely error-free either—but transparency is part of its design. “The system recognizes numbers and units very reliably,” says Jan Göpfert. “Since they are taken directly from the text, they cannot be ‘hallucinated.’ However, misinterpretations sometimes occur, for example when important references are scattered throughout the text.”

Thus, Quinex remains a tool that supports people but does not replace them. “We recommend using Quinex where it informs and relieves researchers—but the responsibility for interpreting the results remains with them,” says Göpfert. Every recognized number can be traced back to its source and, where possible, is highlighted in the original text.

The team is working to further develop Quinex with additional domain-specific datasets and models, making it even more efficient and flexible enough to adapt to various research requirements.

Open Collaboration Welcome

Forschungszentrum Jülich is making Quinex available as an open-source project.

This is intended to give researchers worldwide the opportunity to test, expand, and adapt the system to their own fields—from energy research to chemistry and biomedicine.

Original publication

Jan Göpfert, Patrick Kuckertz, Gian Müller, Luna Lütz, Celine Körner, Hang Khuat, Detlef Stolten, Jann Michael Weinand; "Quinex: Quantitative information extraction from text using open and lightweight LLMs"; The Innovation

https://www.chemeurope.com/en/news/1188534/new-ai-system-extracts-numerical-data-from-academic-texts-freeing-researchers-from-routine-tasks.html

Original publication

Topics

artificial intelligence data analysis technical literature data analysis

Show all

Organizations

Forschungszentrum Jülich

So close that even
molecules turn red...

NIR spectrometer manufacturer

Last viewed contents

Researchers show how simple magnets can help solve a complex problem - Magnetic fields assist in recovering valuable metals from waste

Go to page

Geopolitical price pressure hits fertilizer and flame retardant market - Natural calcium carbonate and magnesium hydroxide as reliable alternatives

Go to page

More from the department science Subscribe to newsletter

New AI system extracts numerical data from academic texts, freeing researchers from routine tasks

The Quinex framework automatically structures quantitative data and is designed to help manage the growing flood of data

AI start-up helps companies to use their data to make the best possible decision under any circumstances

Open and Efficient AI

Successful Practical Test

New Perspectives for Research

Limitations and future improvements

Open Collaboration Welcome

Original publication

Machine learning boosts search for new materials

Other news from the department science

Efficient Production of Solar Hydrogen Through Direct Coupling of Concentrating Solar Cells and Electrolyzer

Plastic bottles could find new life in batteries as graphite

Researchers discover an unexpected synthetic route: A new route to climate-neutral methane

Breakthrough in tailor-made enzyme design

Like a miniature lunar rocket: Researchers develop modular nanorobot

Bonding at the push of a button

Hydrogen Research on an Industrial Scale

The 2026 Wolf Prize Goes to a Berlin Chemist

From cleaner “cracking” to black gold

New membrane technology could transform hydrocarbon processing by slashing energy use

Artificial intelligence evaluates chemical spectra in minutes

Carbon nanotubes make the electronic nose suitable for everyday use for the first time

Tailor-made functionalized gelatin – manufactured with reproducible results

DECHEMA Research Institute opens new site in Bad Homburg

Rare-earth-free zinc oxide achieves a first in stress-to-light conversion

Designable van der Waals crystal realizes artificial neuronal cell mimicking with light

Chemists achieve breakthrough: Editing molecules instead of rebuilding them

Extending cryo-electron microscopy beyond water

On the trail of the missing hydrogen atoms

Interpretable AI in materials discovery: Uncovering how models make predictions

Most read news

Atomic reshuffle paves way for record-breaking catalysts for hydrogen production

Festo is cutting approximately 1,300 jobs in Germany

Focused Energy secures US$240 million: the world’s first laser fusion power plant is set to be built in Germany

Water splitting catalyst creates hydrogen at low temperatures

German plastics recycling on the brink of collapse

Cooking plastics into oil

Magnetic field during catalyst synthesis triples ammonia yield

Making Chemistry Greener: The 2026 Gerhard Ertl Lecture Award goes to Professor Marc Koper

Bacterial factories: A key to climate-friendly chemistry

Faster and more energy-efficient: Catalysts boost hydrogen-based steel production

MIT researchers develop a low-cost technique to get lithium out of rocks

A Step Forward for Solar-Driven Ammonia Production

More news from our other portals

It may not just be what’s in ultra-processed foods, but how they’re made

Nordzucker is revising its beet pricing model and investing €160 million in its factories

Detecting heavy metals in soil and water: New method for on-site analysis

New drug could slow the development of Alzheimer’s

New research finds that almost all plant-based meat alternatives contain mycotoxins

Merck expands life science portfolio with $11.3 billion Bio-Techne deal

New antibiotics discovered to treat multi-resistant germs

Could cultured chocolate unlock the next food revolution?

Why doesn't coffee taste like caffeine?

Mini-Brains from Patient Cells Point to Vitamin B3 as Treatment for Rare Childhood Disease

Nestlé to acquire smart food pioneer yfood to accelerate the brand’s international expansion

Holography meets spectroscopy: Ultrafast microscopy method for optical processes

Egg consumption is associated with a lower risk of Alzheimer’s Disease

Less hunger, more environmental problems?

Researchers solve a 50-year-old mystery: how acid removes water from proteins

Cytospire Therapeutics announces oversubscribed £61 million Series A financing

Carbon nanotubes make the electronic nose suitable for everyday use for the first time

Artificial intelligence evaluates chemical spectra in minutes

Insect larvae as a screening tool

According to the report, one in five cups of coffee contains toxic pesticide residues

Haga Bioscience raises $2.3m in oversubscribed seed round to bridge spatial biomarker discovery and clinical translation

First European biotech with CAR-T and LNP technology under one roof

Green light for Arla Foods and DMK Group merger ​

Common structural analysis of interfacial water is inadequate, according to a new study

Fewer animal experiments thanks to virtual mouse

Researchers find fructose sends a weaker “I’m full” signal to the brain than glucose

Pyrolysis oil instead of crude oil: Faster fluorine analysis reduces the risk for refineries

Daily glass of 100% fruit juice could help support mental wellbeing

Turning food waste into carbon captors

PFAS detection in minutes rather than weeks: deep-tech start-up Grapheal secures €2.5 million in EU funding

So close that even molecules turn red...

Last viewed contents

Researchers show how simple magnets can help solve a complex problem - Magnetic fields assist in recovering valuable metals from waste

Geopolitical price pressure hits fertilizer and flame retardant market - Natural calcium carbonate and magnesium hydroxide as reliable alternatives

Green light for Arla Foods and DMK Group merger

So close that even
molecules turn red...