Our Airflow Pipeline & Automated Data Extraction for a Smart Assistant

Our Airflow Pipeline & Automated Data Extraction for a Smart Assistant

Our Airflow Pipeline & Automated Data Extraction for a Smart Assistant

About the Client

Our customer is an energy technology startup based in Texas, USA. They created a device-independent smart assistant that runs on any browser and has been custom built just for upstream Oil and Gas. Any company can connect to limitless internal and public data sources, extract high-value insights, and allow for a natural conversation with the data. The smart assistant helps companies make brisk decisions, by organizing the collective knowledge within the company and making it accessible.

About the Client

Our customer is an energy technology startup based in Texas, USA. They created a device-independent smart assistant that runs on any browser and has been custom built just for upstream Oil and Gas. Any company can connect to limitless internal and public data sources, extract high-value insights, and allow for a natural conversation with the data. The smart assistant helps companies make brisk decisions, by organizing the collective knowledge within the company and making it accessible.

Siddhartha Gupta

Nesh CEO

Client goals

During the oil price downturn in November 2014, the company’s co-founder—an industry expert from Schlumberger and Shell—became deeply curious about data, analytics, and machine learning. His mission,

Problem

Manual and Slow Data Extraction The smart assistant was using Natural Language Processing to integrate with multiple systems used by a company to unstructured data from documents, emails, and much more. The then-current version of the search engine had manual data extraction - which was extremely time-consuming. They were using a third party API to extract the financial data.

" Since September 2020, JTC has powered the energy tech startup’s growth as a trusted technology partner through staff augmentation "

Building Airflow Pipelines and Automated Data Extraction

We developed an Airflow Pipeline to extract text from unstructured documents and upload it to an elastic search database and created an external API for users outside of the team network. The Airflow pipeline automated workflows and managed boring and redundant manual tasks.

We built airflow pipelines for

  • Extracting data from uploaded and emailed document

  • Extracting companies investor presentations and transcript

  • Extracting information from financial documents

  • Extracting news data from peer websites.

We added a new feature where anyone from the organization can compare financial data on a quarterly basis or look into the yearly growth.

We did this by designing elastic search mapping and queries for refined results. Now anyone can ask questions like: What was NRG’s thirdquarter revenue growth percentage and they will get the reply at the same moment in a conversational way.

The more questions you ask the assistant - the smarter it gets. Another feature added by our team is to develop a process where the assistant nudges you with questions that you didn t think of asking. This can provide relevant insights and, hopefully, uncover new knowledge from the data.

The team also reinvented the backend data structure with dependency injection and storage abstraction. - increase in speed, reduced latency, restructured the backend using service based architecture.

We also redesigned Data Explorer for viewing different connected data sources in an improved manner. We improved table extraction logic for extracting tables from unstructured PDFs and documents on top of existing libraries ( Airflow )

Technology Used

React Native

Node JS

Angular JS

AWS

Results – A Smarter, Faster, and More Efficient System

✅ Drastically reduced response times

✅ Improved UI/UX for a seamless experience

✅ More powerful financial insights & filtering options

✅ Scalable infrastructure ready for future growth

Other Casestudies

Your next breakthrough starts with the right technical foundation.

Better.

Your next breakthrough starts with the right technical foundation.

Better.