• Home
  • AI News
  • AI Startups
  • Deep Learning
  • Interviews
  • Machine-Learning
  • Robotics

Subscribe to Updates

Get the latest creative news from FooBar about art, design and business.

What's Hot

Analysis at Stanford Introduces PointOdyssey: A Massive-Scale Artificial Dataset for Lengthy-Time period Level Monitoring

September 23, 2023

Google DeepMind Introduces a New AI Software that Classifies the Results of 71 Million ‘Missense’ Mutations 

September 23, 2023

Researchers from Seoul Nationwide College Introduces Locomotion-Motion-Manipulation (LAMA): A Breakthrough AI Methodology for Environment friendly and Adaptable Robotic Management

September 23, 2023
Facebook Twitter Instagram
The AI Today
Facebook Twitter Instagram Pinterest YouTube LinkedIn TikTok
SUBSCRIBE
  • Home
  • AI News
  • AI Startups
  • Deep Learning
  • Interviews
  • Machine-Learning
  • Robotics
The AI Today
Home»Machine-Learning»Meta AI Releases Nougat: A Visible Transformer Mannequin that Performs OCR for Processing Scientific Paperwork right into a Markup Language
Machine-Learning

Meta AI Releases Nougat: A Visible Transformer Mannequin that Performs OCR for Processing Scientific Paperwork right into a Markup Language

By September 5, 2023Updated:September 5, 2023No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Reddit WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email


With the rising developments within the discipline of Synthetic Intelligence, its sub-fields, together with Pure Language Processing, Pure Language Technology, Pc Imaginative and prescient, and many others., have quickly gained a number of reputation on account of their intensive use circumstances. Optical Character Recognition (OCR) is a well-established and closely investigated space of pc imaginative and prescient. It has quite a few makes use of, reminiscent of doc digitization, handwriting recognition, and scene textual content identification. The popularity of mathematical expressions is one space of OCR that has obtained a number of curiosity in educational research.

The Moveable Doc Format (PDF) is among the most generally used codecs for scientific data, which is usually preserved in books or printed in scholarly journals. The second most used information format on the web, accounting for two.4% of the knowledge, PDFs are steadily used for doc supply. Regardless of their widespread use, extracting data from PDF recordsdata might be troublesome, notably when coping with extremely specialised supplies like scientific analysis articles. Specifically, when these papers are transformed to PDF format, the semantic data of mathematical expressions is steadily misplaced.

To deal with the challenges, a crew of researchers from Meta AI has launched an answer known as Nougat, which stands for “Neural Optical Understanding for Tutorial Paperwork.” So as to do Optical Character Recognition (OCR) on scientific texts, Nougat is a Visible Transformer mannequin. Its objective is to remodel these recordsdata right into a markup language in order that they could be extra simply accessed and machine-readable.

To indicate the efficacy of the methodology, the crew has additionally produced a contemporary dataset of educational papers. This methodology provides a viable reply for enhancing scientific data accessibility within the digital age. It fills the hole between written supplies which are easy for individuals to learn and textual content that computer systems can course of and analyze. Researchers, educators, and anybody occupied with scientific literature can entry and cope with scientific papers extra successfully utilizing Nougat. Nougat is principally a transformer-based mannequin designed to transform photographs of doc pages, notably these from PDFs, into formatted markup textual content.

The crew has summarized their key contributions as follows –

  1. Publication of a Pre-trained Mannequin: The crew has created a pre-trained mannequin that may rework PDFs right into a easy markup language. This pre-trained mannequin is made public on GitHub, the place the analysis group and anybody can entry it, together with the associated code.
  1. Pipeline for Dataset Creation: A technique for constructing datasets that pair PDF paperwork with their related supply code is described within the examine. This dataset improvement methodology is essential for testing and refining the Nougat mannequin and could also be helpful for future doc evaluation analysis and purposes.
  1. Dependency on the Web page’s Picture Solely: Considered one of Nougat’s standout options is its capability to function solely on the Web page’s Picture. This makes it a versatile device for extracting content material from a wide range of sources, even when the unique paperwork will not be out there in digital textual content codecs. It may well course of scanned papers and books.

Try the Paper and Github. All Credit score For This Analysis Goes To the Researchers on This Mission. Additionally, don’t neglect to hitch our 29k+ ML SubReddit, 40k+ Fb Neighborhood, Discord Channel, and E mail E-newsletter, the place we share the newest AI analysis information, cool AI initiatives, and extra.

In case you like our work, you’ll love our publication..



Tanya Malhotra is a last 12 months undergrad from the College of Petroleum & Vitality Research, Dehradun, pursuing BTech in Pc Science Engineering with a specialization in Synthetic Intelligence and Machine Studying.
She is a Information Science fanatic with good analytical and important pondering, together with an ardent curiosity in buying new abilities, main teams, and managing work in an organized method.


🚀 CodiumAI permits busy builders to generate significant assessments (Sponsored)

Related Posts

Researchers from Seoul Nationwide College Introduces Locomotion-Motion-Manipulation (LAMA): A Breakthrough AI Methodology for Environment friendly and Adaptable Robotic Management

September 23, 2023

Unlocking Battery Optimization: How Machine Studying and Nanoscale X-Ray Microscopy May Revolutionize Lithium Batteries

September 23, 2023

This AI Analysis by Microsoft and Tsinghua College Introduces EvoPrompt: A Novel AI Framework for Automated Discrete Immediate Optimization Connecting LLMs and Evolutionary Algorithms

September 23, 2023

Leave A Reply Cancel Reply

Misa
Trending
Deep Learning

Analysis at Stanford Introduces PointOdyssey: A Massive-Scale Artificial Dataset for Lengthy-Time period Level Monitoring

By September 23, 20230

Massive-scale annotated datasets have served as a freeway for creating exact fashions in numerous pc…

Google DeepMind Introduces a New AI Software that Classifies the Results of 71 Million ‘Missense’ Mutations 

September 23, 2023

Researchers from Seoul Nationwide College Introduces Locomotion-Motion-Manipulation (LAMA): A Breakthrough AI Methodology for Environment friendly and Adaptable Robotic Management

September 23, 2023

Unlocking Battery Optimization: How Machine Studying and Nanoscale X-Ray Microscopy May Revolutionize Lithium Batteries

September 23, 2023
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Our Picks

Analysis at Stanford Introduces PointOdyssey: A Massive-Scale Artificial Dataset for Lengthy-Time period Level Monitoring

September 23, 2023

Google DeepMind Introduces a New AI Software that Classifies the Results of 71 Million ‘Missense’ Mutations 

September 23, 2023

Researchers from Seoul Nationwide College Introduces Locomotion-Motion-Manipulation (LAMA): A Breakthrough AI Methodology for Environment friendly and Adaptable Robotic Management

September 23, 2023

Unlocking Battery Optimization: How Machine Studying and Nanoscale X-Ray Microscopy May Revolutionize Lithium Batteries

September 23, 2023

Subscribe to Updates

Get the latest creative news from SmartMag about art & design.

The Ai Today™ Magazine is the first in the middle east that gives the latest developments and innovations in the field of AI. We provide in-depth articles and analysis on the latest research and technologies in AI, as well as interviews with experts and thought leaders in the field. In addition, The Ai Today™ Magazine provides a platform for researchers and practitioners to share their work and ideas with a wider audience, help readers stay informed and engaged with the latest developments in the field, and provide valuable insights and perspectives on the future of AI.

Our Picks

Analysis at Stanford Introduces PointOdyssey: A Massive-Scale Artificial Dataset for Lengthy-Time period Level Monitoring

September 23, 2023

Google DeepMind Introduces a New AI Software that Classifies the Results of 71 Million ‘Missense’ Mutations 

September 23, 2023

Researchers from Seoul Nationwide College Introduces Locomotion-Motion-Manipulation (LAMA): A Breakthrough AI Methodology for Environment friendly and Adaptable Robotic Management

September 23, 2023
Trending

Unlocking Battery Optimization: How Machine Studying and Nanoscale X-Ray Microscopy May Revolutionize Lithium Batteries

September 23, 2023

This AI Analysis by Microsoft and Tsinghua College Introduces EvoPrompt: A Novel AI Framework for Automated Discrete Immediate Optimization Connecting LLMs and Evolutionary Algorithms

September 23, 2023

Researchers from the College of Oregon and Adobe Introduce CulturaX: A Multilingual Dataset with 6.3T Tokens in 167 Languages Tailor-made for Giant Language Mannequin (LLM) Growth

September 23, 2023
Facebook Twitter Instagram YouTube LinkedIn TikTok
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms
  • Advertise
  • Shop
Copyright © MetaMedia™ Capital Inc, All right reserved

Type above and press Enter to search. Press Esc to cancel.