Find Jobs
Hire Freelancers

Developer Needed for OCR Data Extraction using Machine Learning

$15-25 USD / hour

Închis
Data postării: circa 3 ani în urmă

$15-25 USD / hour

Overview: The goal of our project is to create a software-based process for extracting text information from a variety of types of invoice documents (PDFs, scanned images (TIF, JPG, etc.)) that is based on an understanding of the document and the text on it instead of using fixed positions, etc. The end goal is to have this capability to be able to process any invoice that is presented to it. I see this effort being conducted in phases based on the success of each phase. The phases are as follows: 1. Upfront discovery call(s) with selected developer 2. Invoice Face Page Only proof of concept (see below) for US / English language invoices only 3. Invoice Face Page Only for other languages 4. Invoice Detail for US / English language invoices only 5. Invoice Detail for other languages Our business involves processing invoices so we require highly accurate invoice data. We have built a significant amount of invoice parsing technology covering a defined industry space but all dealing with actual data files, not OCR. This capability is needed to expand out to all industries as well as to automate internal processes involving managing invoice files. We believe we will implement this technology as part of a multi-step process within in our application to function as follows: 1. Invoices files will be transmitted to us in the following ways and will be placed into a file system a. SFTP b. AWS S3 c. Email attachments 2. This service will open each invoice file and perform its extraction process. The text information and other metadata (i.e. confidence levels, etc.) will be stored in a database table 3. Our application or this service will perform validation on the data extracted to determine its suitability to support our needs. Examples of validation will include things like the following: a. Match of extracted account number to our table b. Sum of various numeric fields (i.e. balance forward + new charges = total amount due) 4. For fields that cannot be validated through external means we will set some arbitrary threshold on the confidence level provided in the metadata and test actual data to determine how well this works 5. Any item failing validation will be set up in our application for a human user to review and fix. We will build this into our application UI Phase 1/2 Details: As part of the Phase 1/2 proof of concept we would like to start off with 1 or 2 discovery calls with the selected developer to better understand how the optimal technology (machine learning) for this task functions, the developer's experience / expertise in this area, etc. These will be paid discovery calls. Next, we want the selected developer to create software code to perform what is described below. - This proof of concept will be limited to a few invoices (5) to reduce the time to execute the project - Take scanned image files and do the following programmatically: ○ Extract defined fields based on learning what they are § Extract the following information (note not all fields may be provided): □ Vendor □ Account Number □ Invoice Number □ Invoice Date □ Bill Period From □ Bill Period Thru □ Due Date □ Customer Address □ Vendor Remit Address □ Balance Forward Amount □ Total New Charges □ Total Amount Due ○ For the sample images, I have defined where each of these field data are located § Again, we are looking for technology that understands the data and doesn't use position-based means of locating the data, but also realize that you may need information to train the model ○ Output data in a text file with following headers delimited by comma (i.e. .csv): § Vendor § Account Number § Invoice Number § Invoice Date § Bill Period From § Bill Period Thru § Due Date § Customer Address § Vendor Remit Address § Balance Forward Amount § Total New Charges § Total Amount Due
ID-ul proiectului: 29549081

Despre proiect

19 propuneri
Proiect la distanță
Activ: 3 ani în urmă

Vrei să câștigi bani?

Avantajele de a licita pe platforma Freelancer

Stabilește bugetul și intervalul temporal
Îți primești plata pentru serviciile prestate
Evidențiază-ți propunerea
Te înregistrezi și licitezi gratuit pentru proiecte
19 freelanceri plasează o ofertă medie de $22 USD/oră pentru proiect
Avatarul utilizatorului
Hi there. I am a senior data scientist and have sufficient experience in this field, so very interested in your task. As you can know from my profile and work history, I have done similar projects before, so can help you perfectly. Please contact me and discuss more details. Best regards, Armand.
$30 USD în 40 zile
5,0 (11 recenzii)
5,2
5,2
Avatarul utilizatorului
Hello,i checked your project description carefully so it is very interesting for me i can handle your project wonderfully because i have full experience with image processing for 10+ years. i am very familar with OCR,ALPR,detection object etc thanks
$30 USD în 40 zile
5,0 (1 recenzie)
4,6
4,6
Avatarul utilizatorului
Hello, I am good at Computer vision like OCR. Please visit my profile. I hope to discuss more via chat. Thank you. Nemanja.
$20 USD în 40 zile
5,0 (4 recenzii)
4,5
4,5
Avatarul utilizatorului
hello sir, i am highly interested in your project. i have gone through your requirements and i believe i can be a valuable asset for your project. i am an expert in machine learning, deep learning, natural language processing and computer vision in python. previously i have worked on various OCR models and using OCR to extract relevant information from documents such as invoices. i have more than 3 years of experience in this field. so, this is very familiar to me. i can assure you best quality work. hope to get in touch and work together.
$15 USD în 40 zile
4,9 (6 recenzii)
3,7
3,7
Avatarul utilizatorului
Thank you for your posting! Just I have read your job posting, and I’m very interested in your project. I am an computer vision expert with full experience using machine learning such as tensorflow, caffe, darknet, keras, pytorch, tesserat, etc and opencv, intel openvino technique. I am very familiar with OCR. I have developed a lot of OCR projects such as Image(PDF) to text(docx), MRZ & Barcode & OR & PDF417 recognition, ANPR, Captcha Recognition and etc... with with Python, Java, Android, C#, C++. And also, I develop all platforms including Desktop and Mobile/Web. If you response my bid , I will send you my image processing Demos and I can finish your task and deliver you perfectly, Also, you can know my ability enough. I sincerely hope this project would be first step in long term relationship with you. Please give me your detail. Looking forward to hear from you soon. Best Regard!
$18 USD în 40 zile
5,0 (4 recenzii)
3,3
3,3
Avatarul utilizatorului
How are you, Dear! I have read your job posting with great care and interest. I have great experience in Machine Learning (ML), Data Extraction, OCR, but also in relevant up-to-date technologies and I'm sure that I can complete this project perfectly. I'll share more details of my last projects while interviewing. I want to have long term partnership with you as best employer. Your project will always move forward on time and completed successfully in a high quality. I'm looking forward to hearing from you soon. Regards!
$22 USD în 1 zi
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
----- Pro OCR/ Image processing/Algorithm/ Machine Learning Expert! -------- Hi, Dear Your project is very attracting my mind because I have rich experiences and high skills on this project. I have been completed many similar projects before and am working in these branches for 8+ years. If given a chance, I am highly confident in my ability to deliver the highest quality. I look forward to hearing from you. Kind Regards
$25 USD în 40 zile
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
Hi I am Santhosh I am Experienced developer in the field of AI, Machine Learning , Deep Learning , NLP and Image Processing. I have done various data driven projects as follows: -- Optical Character Recognition -- Image Classification -- Object Detection -- Application Tracking System -- AI Bases Smart Accounting System. Familiar with Matplotlib, Sklearn, Keras, seaborn, scipy , Pandas, OpenCV etc... As I have gone through your requirement as I have implemented OCR using Deep Learning and Python, so I can able to finish the project in given time.. Ping me we will discuss more...
$22 USD în 20 zile
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
I solve problems in everything from computer vision to deep reinforcement learning(RL) applied to algorithmic trading. Finishing a paper on my revolutionary sports prediction architecture employing novel deep learning models in sync with classical ML techniques. Strong background in linear alg, probability, set theory, and more. Utilizing lie groups in the context of theoretical physics in my free time(gauge theory) while dissecting some of the titular problems of our time. My knowledge of ML algorithms spans from classic methods(random forests, SVMs), to cutting-edge approaches(capsule networks, ANNs learning to knowledge graphs, pointer networks). Ergo, I can reason about the optimality of a particular architecture in a given problem domain. While I consider myself primarily focused on reinforcement learning and top-down AGI, I have created many effective recommendation systems, developed novel CV and NLP models, including tabular problems, correctly predicted stress on unlabeled structured data in an unsupervised context and even fit Harr Wavelets and Radon Transforms to produce a lightweight image similarity framework. As such, I’ve situated myself as a top-flight engineer and researcher at the cutting edge of my field. I bring a new vision uninhibited by prior bureaucratic thinking. Hope to speak about the project at your earliest convenience. Thanks, Austin
$30 USD în 40 zile
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
I am a PhD student of software engineering with experience in Software development, Machine learning, OCR, Image processing. I am very interested in your project as I want to work in the OCR field. Kindly message me to discuss the project details.
$15 USD în 15 zile
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
Hi, We at Tecogno Solutions are a team of Passionate Data Science and Full Stack professionals having more than five years of combined experience in multiple areas including Backend, Frontend, Machine learning (ML) and Artificial Intelligence (AI). We have developed multiple similar projects for OCR, DATA EXTRACTION etc and we have a strong command over Python, Flask, Django, Beautiful Soup, Tensorflow, Spacy, Twilio, Node.js, RESTful API, NLP, CNN, Bert, Albert, AWS, GCP, Google API's etc. Thus we are capable of fulfilling your requirements. For further information please refer to our Profile or visit or website. Thanks!
$20 USD în 40 zile
0,0 (0 recenzii)
0,0
0,0

Despre client

Steagul UNITED STATES
Atlanta, United States
0,0
0
Membru din mar. 12, 2021

Verificarea clientului

Mulțumim! Ți-am trimis prin e-mail linkul pe care trebuie să-l accesezi pentru a revendica creditul gratuit.
A apărut o eroare la trimiterea e-mailului. Încearcă din nou.
Utilizatori înregistrați Totalul proiectelor postate
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Se încarcă previzualizarea
S-a oferit permisiunea de depistare a locației.
Ți-a expirat sesiunea pentru conectare sau te-ai deconectat. Conectează-te din nou.