Scrape data from 18 PDFs into Google Sheets

Closed. Posted 2 years ago. Paid on delivery.

Hello freelancers,

I need some help with data entry from PDFs into Google Sheets so I can access the data more easily.

If you do this small trial project well, there is a good chance we will work together on a larger project in exactly the same format but with 10+ times the work and budget.

Please do the following if you want to bid:

1) read the whole project description,

2) watch this short project description video (2 min): [login to view URL]

3) then here's the full detailed video if you need it (38 min): [login to view URL]

Any questions or clarifications, please feel free to ask.

Thank you

-Ivan

Project background:

I tutor high school physics students in the Cambridge International Examinations system. To help me better teach my students, I need to know what questions have been asked on the past years' exams, and the solutions for all those questions. Thankfully, the exam papers and their solutions ("mark schemes") for many previous years are all publicly available in PDF format on various websites.

This project is phase 1: we ONLY care about Physics (9702), May/June 2020, papers #1, #2 and #4. The PDFs are available from sites like these (the same PDFs, just uploaded to different sites; if you google "CIE AS Physics Past papers" you'll find others too):

[login to view URL]

[login to view URL](9702)/2020/

[login to view URL]

These don't have the 2020 ones we're after for this project, but they have 2019 and previous years:

[login to view URL]

[login to view URL](9702)/

File naming convention: the PDFs are named like this:

eg "[login to view URL]"

9702 means Physics (will always be 9702 for this project)

s20 means "summer 2020" (will always be s20 for this project)

"ms" means "mark scheme" and "qp" means "question paper" (we only care about the "ms" and "qp" files)

the "12" at the end means paper #1, version 2. We care about papers #1, #2 and #4 only (not 3 or 5) and ALL versions of these papers (usually 3 versions, so we want all the files ending in 11, 12, 13, 21, 22, 23, 41, 42, 43)
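
For anyone who wants to sort the downloaded files automatically, the naming convention above could be checked with a short script. This is only a sketch: the example filename is hidden in this post, so the pattern `9702_s20_ms_12.pdf` (fields separated by underscores) is an assumption, and the names `parse_name` and `expected_files` are hypothetical helpers, not part of any existing tool.

```python
import re
from itertools import product

# ASSUMPTION: filenames look like "<subject>_<session>_<type>_<paper><version>.pdf",
# e.g. "9702_s20_ms_12.pdf". Adjust the pattern if the real files differ.
NAME_RE = re.compile(r"^(\d{4})_([a-z]\d{2})_(ms|qp)_(\d)(\d)\.pdf$")

WANTED_SUBJECT = "9702"    # Physics
WANTED_SESSION = "s20"     # summer 2020
WANTED_PAPERS = {1, 2, 4}  # papers 3 and 5 are out of scope


def parse_name(filename):
    """Return the filename's fields as a dict, or None if the file is out of scope."""
    m = NAME_RE.match(filename)
    if m is None:
        return None  # wrong shape, or a file type other than "ms"/"qp"
    subject, session, kind, paper, version = m.groups()
    if subject != WANTED_SUBJECT or session != WANTED_SESSION:
        return None
    if int(paper) not in WANTED_PAPERS:
        return None
    return {
        "subject": subject,
        "session": session,
        "kind": kind,  # "qp" = question paper, "ms" = mark scheme
        "paper": int(paper),
        "version": int(version),
    }


def expected_files():
    """List the filenames in scope: 3 papers x 3 versions x 2 file types."""
    return [
        f"{WANTED_SUBJECT}_{WANTED_SESSION}_{kind}_{paper}{version}.pdf"
        for paper, version, kind in product((1, 2, 4), (1, 2, 3), ("qp", "ms"))
    ]
```

Comparing `expected_files()` against the contents of the download folder is a quick way to confirm that all 18 PDFs are present before starting the data entry.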

What I need - output to Google Sheets:

For each paper/version # (eg 11, 23, 41 etc) there is a "qp" (question paper) file and an "ms" (mark scheme) file. Each qp has multiple questions. Each question has multiple parts like 1a) 1b) 1c) and subparts like 1 a i) 1 a ii) 1 b i) etc. Each question/part/subpart is worth a certain # of points (an integer from 1 to 6 or more). For the lowest level of each question/part/subpart I need a separate row in the Google Sheet with the following data in separate columns:

1) subject

2) level

3) exam paper version

4) date/season

5) question number

6) question part/subpart

7) # of marks/points

8) question wording - full, exact wording of the question/part

9) solution wording & "mark type" - full wording of the solution/answers

bonus:

10) images/screenshots of any figures or tables related to question/part

11) image/screenshot of question/part itself

12) image/screenshot of solution itself
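
To make the required columns concrete, here is a minimal sketch of one row per question/part/subpart, written out as CSV text that Google Sheets can import. The field names, the `split_label` helper and the placeholder values are all assumptions for illustration; the bonus image columns (10 to 12) are left out, and the example sheet from the video remains the reference for the real layout.

```python
import csv
import io
import re
from dataclasses import astuple, dataclass, fields


@dataclass
class Row:
    """One spreadsheet row; fields follow columns 1) to 9) above (names are assumed)."""
    subject: str
    level: str
    paper_version: str  # e.g. "12" = paper 1, version 2
    season: str
    question: int
    part: str           # part/subpart, e.g. "a ii"; empty if the question has no parts
    marks: int
    question_text: str
    solution_text: str


def split_label(label):
    """Split a label such as '1 a ii)' into (question number, 'a ii')."""
    m = re.match(r"\s*(\d+)(.*)$", label)
    if m is None:
        raise ValueError(f"no question number in {label!r}")
    # Keep only the letter groups, dropping brackets and spacing.
    tokens = re.findall(r"[a-z]+", m.group(2).lower())
    return int(m.group(1)), " ".join(tokens)


def to_csv(rows):
    """Serialise rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow([f.name for f in fields(Row)])
    for row in rows:
        writer.writerow(astuple(row))
    return buf.getvalue()


# Placeholder values only; the real wording comes from the qp and ms PDFs.
number, part = split_label("1 a ii)")
example = Row("Physics", "AS", "12", "s20", number, part, 2,
              "<exact question wording>", "<exact solution wording and mark type>")
csv_text = to_csv([example])
```

Keeping the question number and the part/subpart in separate fields matches columns 5) and 6), so the sheet can be sorted or filtered by question without parsing the label again.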

Example output Google Sheets that I used in the video: [login to view URL]

If anything is unclear please ask.

If you can think of a better format or any improvements, feel free to let me know.

Happy bidding

Ivan

Data Entry, Data Processing, Data Scraping, Image Processing, Google Sheets

Project ID: #30319985

About the project

54 proposals. Remote project. Active 2 years ago.

54 freelancers are bidding an average of $132 for this project

AbbasSaeed143

Hello. I have seen the videos and attached files. I am sure to do this job perfectly. Please review my profile and message me. Thanks Abbas

$250 USD in 3 days
(166 reviews)
8.8
ha4401310

Hello! Thank you for getting in touch! I have the skills you need and have finished projects like this. ✅ My portfolio items: https://www.freelancer.com/u/ha4401310 I would love to be part of your project, and I can compl…

$50 USD in 1 day
(193 reviews)
6.4
ARTICLEYOUWANT

Hey there, I have watched your 2 videos. In the first video, you showed the paper along with the mark scheme and the answers. You also showed the Google spreadsheet with columns like Date/Season, Question Number, …

$150 USD in 7 days
(94 reviews)
6.2
sajibdigital

Hi, I pray and hope that you will give me a chance to work with you after checking my profile. Let's chat for more details.

$150 USD in 7 days
(72 reviews)
5.8
bktk

Hello Sir, I am offering my services to scrape the 18 PDFs of physics question papers and mark schemes into Google Sheets. I am offering a free sample. Let's discuss further.

$30 USD in 1 day
(161 reviews)
5.9
Badhan685

Hi, I want to start the task right now. I'm a professional in data entry, data entry into Google Sheets, Excel, typing, copy-paste and similar projects. Thank you very much for taking a look at my profile and I hope…

$30 USD in 1 day
(58 reviews)
5.2
brtodi

Hi, we are a team. We have completed several data entry projects. Highly accurate and responsible. Skills: Data Entry, Web Research, Word, Excel, PDF editing, finding data online and more. We are interested in this…

$200 USD in 5 days
(33 reviews)
5.7
AnuradhaVM

Hi, greetings! ✅ I have checked your project details. I can scrape data from 18 PDFs into Google Sheets. I have 4 years of experience with data entry, data analysis, and MIS management. I have strong analysis, and Excel/Google…

$200 USD in 3 days
(40 reviews)
4.8
abhisa21

Hi Ivan, I am Maria. I went through the explanation videos, the spreadsheets and the question papers/answer sheets. I understand the job and would be happy to work on this. I assure you of a quality job with this one.…

$150 USD in 5 days
(24 reviews)
4.8
pilotarif

Hello there, I have gone through your project "Scrape data from 18 PDFs into Google Sheets", and I can firmly assure you that I have understood the requirements and can do the project work perfectly with 100% accuracy and…

$225 USD in 7 days
(38 reviews)
5.0
designer504

Hello, I am ready to start right now. I can professionally work on your project according to your requirements. I am an expert in DATA ENTRY, CSV Editing, PDF, Digitization, WORD, Scan to Text with 10 years experienc…

$250 USD in 1 day
(7 reviews)
3.6
rojjalex

Hi there, I can start right now on converting the PDFs to Google Sheets. I would like to perform this job for you. I have proper experience in data entry, web search, virtual assistance and also MS Excel, and I’ve performe…

$50 USD in 7 days
(18 reviews)
3.7
pandaPython123

Hi, I have 4 years of experience in computer vision and image processing with Python. I have also worked on OCR projects. Although this is not exactly OCR, I can help you with putting and organizing the PDFs. Tha…

$250 USD in 7 days
(5 reviews)
2.8
sanjaykumarmond1

I can do this job. Please let me know your fixed budget and the deadline. Then I can start this task and complete it ASAP. Please send a message for a short discussion. Thanks, Sanjay

$30 USD in 1 day
(4 reviews)
2.7
Shivali1988

Hi, I am a certified data entry operator. I have gone through your proposal and I feel I am suitable for this role. I have done these types of projects many times before. I am very hardworking and devoted…

$30 USD in 1 day
(1 review)
2.4
MMO777

Hi, 1) I've checked the video instructions and files. 2) I've done many similar projects. 3) I'm available and ready to transfer all 18 PDFs into Google Sheets/Excel within your requirements very quickly and with…

$65 USD in 2 days
(4 reviews)
2.1
DinaSayedAhmed

Hi! I've watched the videos and the output sheet that you've provided. I'd like to work on your project and I can assure you a high percentage of accuracy. I have a STEM background as I'm a Computer Science graduated…

$100 USD in 4 days
(2 reviews)
2.0
Boniface123

Dear Ivan, I have read your description. I have experience in data entry and feel confident I can handle this project. I am hardworking, attentive to detail and motivated. I can ensure you to deliver you a best quality of resu…

$30 USD in 7 days
(3 reviews)
1.7
normanburtonfree

Hello, I've just checked your job description carefully. I'm a senior developer with 7+ years of Python. Using Python, I have made many web scraping tools with BeautifulSoup and Selenium. Also I have many experie…

$500 USD in 7 days
(1 review)
1.3
mohsinsana269

Individual freelancer who is honest, works hard, does not outsource projects, uses no middle man, communicates well, communicates often and always tries their best to meet (or beat) deadlines. I read your project carefully. And I…

$140 USD in 7 days
(0 reviews)
0.0