尊敬的 微信汇率:1円 ≈ 0.046239 元 支付宝汇率:1円 ≈ 0.04633元 [退出登录]
SlideShare a Scribd company logo
Extracting accurate materials
data from research papers with
conversational language models
and prompt engineering
Background
• Growing effort to replace manual extraction with automated
extraction.
• It still requires large effort, expertise and coding.
• ChatExtract can fully automate very accurate data extraction with
minimal initial effort and background.
• Close to 90% of precision.
• Databases for critical cooling rates of metallic glasses and yield
strength are developed.
• Prompt engineering has now become a standard practice in the field
image generation.
The data extraction – two main stages
• 1. Initial classification with simple prompt, clear out all the sentences that
do not contain data.
• 2. A series of prompts that control the data extraction are categorized.
• 2.1. Split data into single and multi valued (single entry are more likely to
be extracted properly)
• 2.2. Include the possibility that some data may be missing from the text.
• 2.3. Use uncertainty-inducing prompts to let model re-analyze the text.
• 2.4. Embed all the questions in a single conversation.
• 2.5. Enforce a strict Yes or No to reduce uncertainty.
Single or multiple values
• Texts with only a single value are much simpler.
• Texts with multiple values need a careful analysis of the relations
between words to determine the correspondance.
• Material properties: Material, Value, Unit
• Then the text is analyzed.
• For a single-valued text, can directly ask questions about the data and ask for separations.
• If nagative answer is given, the text is discarded and no data is extracted.
• For multi-valued sentence, asking the model to provide structured data in a form of a
table.
• Repetitively provide the text with each prompt.
• The repetition helps in maintaining all the details about the text that is being analyzed.
• Enforcing Yes or No will enable the automation of the data extraction process.
1. Blue boxes represent prompts given to the model.
2. Gray boxes are instructions to the user, ‘Yes’, ‘No’, ‘None’.
3. The bold text in ‘[ ]’ are to be replaced with appropriate
values of the named item: material, value or unit.
• Investigated the performance of ChatExtract approach on multiple
property examples: bulk modulus, metallic glass critical cooling rate,
high entropy alloy yield stress
• The reason of choosing bulk modulus: very often report other elastic
properties, such as Young’s modulus or shear modulus with similar
name and range of values.
• Other measurements have similar units as bulk modulus.
1. Single-value sentences have higher precisions and recalls
than multi-valued sentences for the same models.
2. Two core features of ChatGPT that can be in ChatExtract
2.1. Use of redundant prompts that introduce the possibility of
uncertainty about the previous extracted data.
2.2. The information about previous prompts and answers is
kept. This will allow the follow-up questions to relate to the
entire conversation.
3. Removing the follow-up questions will decrease the overall
precision to just 42.7% and 26.5% for ChatGPT-4 and ChatGPT-
3.5, from values of 90.8% and 70.1%.
4. Other models: LLaMA2-chat, ChemDataExtractor2(CDE2)
have been tested as well.
Application to tables and figures
• Data can be found in tables and figures.
• Analyzing figures is still an ongoing challenge.
• For table, the same method as in the general
ChatExtract workflow.
• For figure, only the figure caption is used in the
classification. If positive, figure can be downloaded
for later manual data extraction.
• The precision is very high for extracting tables (98%)
• Assessment of accuracy for figure classification is
more difficult
• For example, 436 figures, manually classified 45
containing bulk modulus data. (80% precision)
• Some figures requires expertise to extract data
Results of real-life data extraction
• Critical cooling rates of metallic glasses
• Databases presented in raw, cleaned, and standardized forms via manual post-processing and
machine learning.
• Despite challenges, ChatExtract demonstrated reasonable precision and recall in data extraction.
• Final standardized database contained 557 datapoints, with duplicates retained.
• A separate database focusing on metallic materials generated, containing 298 unique datapoints.
• ChatExtract showed efficiency compared to manual methods, with consistent performance.
Results of real-life data extraction
• Yield strength of high entropy alloys
• Extracted 10269 raw data points, resulting in 8900 cleaned datapoints.
• Data ranged from 12 MPa to 19.16 GPa, with a peak around 400 MPa.
• Extracted 2456 datapoints from tables and classified 1848 figures as relevant.
• ChatExtract's general and transferable approach proves effective for diverse data extraction tasks.
• Proposed expansions to ChatExtract workflow to handle additional constraints or data types.
• Assessing accuracy of expanded ChatExtract functionalities would require further refinement and
testing.
Conclusions
• ChatGPT extracts high-quality materials data from research texts with engineered prompts.
• Achieved over 90% precision and 87.7% recall on bulk modulus data, and 91.6% precision and
83.6% recall on critical cooling rates.
• Success attributed to purposeful redundancy, uncertainty, and information retention within
conversation.
• Developed two databases: critical cooling rates for metallic glasses and yield strengths for high
entropy alloys.
• Quality of data and simplicity suggest potential to replace labor-intensive methods.
• ChatExtract's independence suggests potential for improvement with newer LLMs.

More Related Content

Similar to Literature review for prompt engineering of ChatGPT.pptx

Overview of DuraMat software tool development
Overview of DuraMat software tool developmentOverview of DuraMat software tool development
Overview of DuraMat software tool development
Anubhav Jain
 
Data structures and algorithms Module-1.pdf
Data structures and algorithms Module-1.pdfData structures and algorithms Module-1.pdf
Data structures and algorithms Module-1.pdf
DukeCalvin
 
Machine learning systems for engineers
Machine learning systems for engineersMachine learning systems for engineers
Machine learning systems for engineers
Cameron Joannidis
 
Entity embeddings for categorical data
Entity embeddings for categorical dataEntity embeddings for categorical data
Entity embeddings for categorical data
Paul Skeie
 
Cs 331 Data Structures
Cs 331 Data StructuresCs 331 Data Structures
Agile Experiments in Machine Learning
Agile Experiments in Machine LearningAgile Experiments in Machine Learning
Agile Experiments in Machine Learning
mathias-brandewinder
 
Deep Learning Made Easy with Deep Features
Deep Learning Made Easy with Deep FeaturesDeep Learning Made Easy with Deep Features
Deep Learning Made Easy with Deep Features
Turi, Inc.
 
Pre-Processing and Data Preparation
Pre-Processing and Data PreparationPre-Processing and Data Preparation
Pre-Processing and Data Preparation
Umair Shafique
 
An LSTM-Based Neural Network Architecture for Model Transformations
An LSTM-Based Neural Network Architecture for Model TransformationsAn LSTM-Based Neural Network Architecture for Model Transformations
An LSTM-Based Neural Network Architecture for Model Transformations
Lola Burgueño
 
PERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES - Choice of Information Encapsulation (IE) TechniquePERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES_FP7
 
Test data documentation ss
Test data documentation ssTest data documentation ss
Test data documentation ss
AshwiniPoloju
 
Data visualization via Tableau solving an excel problem
Data visualization via Tableau solving an excel problemData visualization via Tableau solving an excel problem
Data visualization via Tableau solving an excel problem
VivAde1
 
Prepare your data for machine learning
Prepare your data for machine learningPrepare your data for machine learning
Prepare your data for machine learning
Ivo Andreev
 
NUS-ISS Learning Day 2018- Application of analytics in manufacturing sector
NUS-ISS Learning Day 2018- Application of analytics in manufacturing sectorNUS-ISS Learning Day 2018- Application of analytics in manufacturing sector
NUS-ISS Learning Day 2018- Application of analytics in manufacturing sector
NUS-ISS
 
Dimensionality Reduction in Machine Learning
Dimensionality Reduction in Machine LearningDimensionality Reduction in Machine Learning
Dimensionality Reduction in Machine Learning
RomiRoy4
 
artificial intelligence.pptx
artificial intelligence.pptxartificial intelligence.pptx
artificial intelligence.pptx
rithika858339
 
Keysum - Using Checksum Keys
Keysum - Using Checksum KeysKeysum - Using Checksum Keys
Keysum - Using Checksum Keys
David Walker
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Anubhav Jain
 
DB
DBDB
Agile experiments in Machine Learning with F#
Agile experiments in Machine Learning with F#Agile experiments in Machine Learning with F#
Agile experiments in Machine Learning with F#
J On The Beach
 

Similar to Literature review for prompt engineering of ChatGPT.pptx (20)

Overview of DuraMat software tool development
Overview of DuraMat software tool developmentOverview of DuraMat software tool development
Overview of DuraMat software tool development
 
Data structures and algorithms Module-1.pdf
Data structures and algorithms Module-1.pdfData structures and algorithms Module-1.pdf
Data structures and algorithms Module-1.pdf
 
Machine learning systems for engineers
Machine learning systems for engineersMachine learning systems for engineers
Machine learning systems for engineers
 
Entity embeddings for categorical data
Entity embeddings for categorical dataEntity embeddings for categorical data
Entity embeddings for categorical data
 
Cs 331 Data Structures
Cs 331 Data StructuresCs 331 Data Structures
Cs 331 Data Structures
 
Agile Experiments in Machine Learning
Agile Experiments in Machine LearningAgile Experiments in Machine Learning
Agile Experiments in Machine Learning
 
Deep Learning Made Easy with Deep Features
Deep Learning Made Easy with Deep FeaturesDeep Learning Made Easy with Deep Features
Deep Learning Made Easy with Deep Features
 
Pre-Processing and Data Preparation
Pre-Processing and Data PreparationPre-Processing and Data Preparation
Pre-Processing and Data Preparation
 
An LSTM-Based Neural Network Architecture for Model Transformations
An LSTM-Based Neural Network Architecture for Model TransformationsAn LSTM-Based Neural Network Architecture for Model Transformations
An LSTM-Based Neural Network Architecture for Model Transformations
 
PERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES - Choice of Information Encapsulation (IE) TechniquePERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES - Choice of Information Encapsulation (IE) Technique
 
Test data documentation ss
Test data documentation ssTest data documentation ss
Test data documentation ss
 
Data visualization via Tableau solving an excel problem
Data visualization via Tableau solving an excel problemData visualization via Tableau solving an excel problem
Data visualization via Tableau solving an excel problem
 
Prepare your data for machine learning
Prepare your data for machine learningPrepare your data for machine learning
Prepare your data for machine learning
 
NUS-ISS Learning Day 2018- Application of analytics in manufacturing sector
NUS-ISS Learning Day 2018- Application of analytics in manufacturing sectorNUS-ISS Learning Day 2018- Application of analytics in manufacturing sector
NUS-ISS Learning Day 2018- Application of analytics in manufacturing sector
 
Dimensionality Reduction in Machine Learning
Dimensionality Reduction in Machine LearningDimensionality Reduction in Machine Learning
Dimensionality Reduction in Machine Learning
 
artificial intelligence.pptx
artificial intelligence.pptxartificial intelligence.pptx
artificial intelligence.pptx
 
Keysum - Using Checksum Keys
Keysum - Using Checksum KeysKeysum - Using Checksum Keys
Keysum - Using Checksum Keys
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
DB
DBDB
DB
 
Agile experiments in Machine Learning with F#
Agile experiments in Machine Learning with F#Agile experiments in Machine Learning with F#
Agile experiments in Machine Learning with F#
 

Recently uploaded

❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
nainakaoornoida
 
Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...
Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...
Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...
IJCNCJournal
 
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
simrangupta87541
 
🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...
🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...
🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...
adhaniomprakash
 
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
sydezfe
 
Technological Innovation Management And Entrepreneurship-1.pdf
Technological Innovation Management And Entrepreneurship-1.pdfTechnological Innovation Management And Entrepreneurship-1.pdf
Technological Innovation Management And Entrepreneurship-1.pdf
tanujaharish2
 
🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...
🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...
🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...
sonamrawat5631
 
Online train ticket booking system project.pdf
Online train ticket booking system project.pdfOnline train ticket booking system project.pdf
Online train ticket booking system project.pdf
Kamal Acharya
 
Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdfSri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
Balvir Singh
 
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort ServiceCuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
yakranividhrini
 
DELTA V MES EMERSON EDUARDO RODRIGUES ENGINEER
DELTA V MES EMERSON EDUARDO RODRIGUES ENGINEERDELTA V MES EMERSON EDUARDO RODRIGUES ENGINEER
DELTA V MES EMERSON EDUARDO RODRIGUES ENGINEER
EMERSON EDUARDO RODRIGUES
 
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASICINTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
GOKULKANNANMMECLECTC
 
Data Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdfData Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdf
Kamal Acharya
 
My Airframe Metallic Design Capability Studies..pdf
My Airframe Metallic Design Capability Studies..pdfMy Airframe Metallic Design Capability Studies..pdf
My Airframe Metallic Design Capability Studies..pdf
Geoffrey Wardle. MSc. MSc. Snr.MAIAA
 
Covid Management System Project Report.pdf
Covid Management System Project Report.pdfCovid Management System Project Report.pdf
Covid Management System Project Report.pdf
Kamal Acharya
 
TENDERS and Contracts basic syllabus for engineering
TENDERS and Contracts basic syllabus for engineeringTENDERS and Contracts basic syllabus for engineering
TENDERS and Contracts basic syllabus for engineering
SnehalChavan75
 
Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...
Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...
Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...
Dr.Costas Sachpazis
 
Cricket management system ptoject report.pdf
Cricket management system ptoject report.pdfCricket management system ptoject report.pdf
Cricket management system ptoject report.pdf
Kamal Acharya
 
Butterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdfButterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdf
Lubi Valves
 
paper relate Chozhavendhan et al. 2020.pdf
paper relate Chozhavendhan et al. 2020.pdfpaper relate Chozhavendhan et al. 2020.pdf
paper relate Chozhavendhan et al. 2020.pdf
ShurooqTaib
 

Recently uploaded (20)

❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
 
Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...
Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...
Particle Swarm Optimization–Long Short-Term Memory based Channel Estimation w...
 
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
 
🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...
🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...
🔥LiploCk Call Girls Pune 💯Call Us 🔝 7014168258 🔝💃Independent Pune Escorts Ser...
 
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
 
Technological Innovation Management And Entrepreneurship-1.pdf
Technological Innovation Management And Entrepreneurship-1.pdfTechnological Innovation Management And Entrepreneurship-1.pdf
Technological Innovation Management And Entrepreneurship-1.pdf
 
🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...
🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...
🔥Young College Call Girls Chandigarh 💯Call Us 🔝 7737669865 🔝💃Independent Chan...
 
Online train ticket booking system project.pdf
Online train ticket booking system project.pdfOnline train ticket booking system project.pdf
Online train ticket booking system project.pdf
 
Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdfSri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
 
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort ServiceCuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
 
DELTA V MES EMERSON EDUARDO RODRIGUES ENGINEER
DELTA V MES EMERSON EDUARDO RODRIGUES ENGINEERDELTA V MES EMERSON EDUARDO RODRIGUES ENGINEER
DELTA V MES EMERSON EDUARDO RODRIGUES ENGINEER
 
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASICINTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
 
Data Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdfData Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdf
 
My Airframe Metallic Design Capability Studies..pdf
My Airframe Metallic Design Capability Studies..pdfMy Airframe Metallic Design Capability Studies..pdf
My Airframe Metallic Design Capability Studies..pdf
 
Covid Management System Project Report.pdf
Covid Management System Project Report.pdfCovid Management System Project Report.pdf
Covid Management System Project Report.pdf
 
TENDERS and Contracts basic syllabus for engineering
TENDERS and Contracts basic syllabus for engineeringTENDERS and Contracts basic syllabus for engineering
TENDERS and Contracts basic syllabus for engineering
 
Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...
Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...
Sachpazis_Consolidation Settlement Calculation Program-The Python Code and th...
 
Cricket management system ptoject report.pdf
Cricket management system ptoject report.pdfCricket management system ptoject report.pdf
Cricket management system ptoject report.pdf
 
Butterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdfButterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdf
 
paper relate Chozhavendhan et al. 2020.pdf
paper relate Chozhavendhan et al. 2020.pdfpaper relate Chozhavendhan et al. 2020.pdf
paper relate Chozhavendhan et al. 2020.pdf
 

Literature review for prompt engineering of ChatGPT.pptx

  • 1. Extracting accurate materials data from research papers with conversational language models and prompt engineering
  • 2. Background • Growing effort to replace manual extraction with automated extraction. • It still requires large effort, expertise and coding. • ChatExtract can fully automate very accurate data extraction with minimal initial effort and background. • Close to 90% of precision. • Databases for critical cooling rates of metallic glasses and yield strength are developed. • Prompt engineering has now become a standard practice in the field image generation.
  • 3. The data extraction – two main stages • 1. Initial classification with simple prompt, clear out all the sentences that do not contain data. • 2. A series of prompts that control the data extraction are categorized. • 2.1. Split data into single and multi valued (single entry are more likely to be extracted properly) • 2.2. Include the possibility that some data may be missing from the text. • 2.3. Use uncertainty-inducing prompts to let model re-analyze the text. • 2.4. Embed all the questions in a single conversation. • 2.5. Enforce a strict Yes or No to reduce uncertainty.
  • 4. Single or multiple values • Texts with only a single value are much simpler. • Texts with multiple values need a careful analysis of the relations between words to determine the correspondance. • Material properties: Material, Value, Unit
  • 5. • Then the text is analyzed. • For a single-valued text, can directly ask questions about the data and ask for separations. • If nagative answer is given, the text is discarded and no data is extracted. • For multi-valued sentence, asking the model to provide structured data in a form of a table. • Repetitively provide the text with each prompt. • The repetition helps in maintaining all the details about the text that is being analyzed. • Enforcing Yes or No will enable the automation of the data extraction process.
  • 6. 1. Blue boxes represent prompts given to the model. 2. Gray boxes are instructions to the user, ‘Yes’, ‘No’, ‘None’. 3. The bold text in ‘[ ]’ are to be replaced with appropriate values of the named item: material, value or unit.
  • 7. • Investigated the performance of ChatExtract approach on multiple property examples: bulk modulus, metallic glass critical cooling rate, high entropy alloy yield stress • The reason of choosing bulk modulus: very often report other elastic properties, such as Young’s modulus or shear modulus with similar name and range of values. • Other measurements have similar units as bulk modulus.
  • 8. 1. Single-value sentences have higher precisions and recalls than multi-valued sentences for the same models. 2. Two core features of ChatGPT that can be in ChatExtract 2.1. Use of redundant prompts that introduce the possibility of uncertainty about the previous extracted data. 2.2. The information about previous prompts and answers is kept. This will allow the follow-up questions to relate to the entire conversation. 3. Removing the follow-up questions will decrease the overall precision to just 42.7% and 26.5% for ChatGPT-4 and ChatGPT- 3.5, from values of 90.8% and 70.1%. 4. Other models: LLaMA2-chat, ChemDataExtractor2(CDE2) have been tested as well.
  • 9. Application to tables and figures • Data can be found in tables and figures. • Analyzing figures is still an ongoing challenge. • For table, the same method as in the general ChatExtract workflow. • For figure, only the figure caption is used in the classification. If positive, figure can be downloaded for later manual data extraction. • The precision is very high for extracting tables (98%) • Assessment of accuracy for figure classification is more difficult • For example, 436 figures, manually classified 45 containing bulk modulus data. (80% precision) • Some figures requires expertise to extract data
  • 10. Results of real-life data extraction • Critical cooling rates of metallic glasses • Databases presented in raw, cleaned, and standardized forms via manual post-processing and machine learning. • Despite challenges, ChatExtract demonstrated reasonable precision and recall in data extraction. • Final standardized database contained 557 datapoints, with duplicates retained. • A separate database focusing on metallic materials generated, containing 298 unique datapoints. • ChatExtract showed efficiency compared to manual methods, with consistent performance.
  • 11. Results of real-life data extraction • Yield strength of high entropy alloys • Extracted 10269 raw data points, resulting in 8900 cleaned datapoints. • Data ranged from 12 MPa to 19.16 GPa, with a peak around 400 MPa. • Extracted 2456 datapoints from tables and classified 1848 figures as relevant. • ChatExtract's general and transferable approach proves effective for diverse data extraction tasks. • Proposed expansions to ChatExtract workflow to handle additional constraints or data types. • Assessing accuracy of expanded ChatExtract functionalities would require further refinement and testing.
  • 12. Conclusions • ChatGPT extracts high-quality materials data from research texts with engineered prompts. • Achieved over 90% precision and 87.7% recall on bulk modulus data, and 91.6% precision and 83.6% recall on critical cooling rates. • Success attributed to purposeful redundancy, uncertainty, and information retention within conversation. • Developed two databases: critical cooling rates for metallic glasses and yield strengths for high entropy alloys. • Quality of data and simplicity suggest potential to replace labor-intensive methods. • ChatExtract's independence suggests potential for improvement with newer LLMs.
  翻译: