Scrape data from 18 PDFs into Google Sheets
$30-250 USD
S-au achitat serviciile după ce au fost prestate
Hello freelancers,
I need some help with data entry from PDFs to Google Sheets so I can access the data easier.
If you do this small trial project well, we have a good chance to work together on larger project that is exactly the same format but 10+ times more work/budget.
Please do the following if you want to bid:
1) read the whole project description,
2) watch this short project description video (2 min): [login to view URL]
3) then here's the full detailed video if you need (38 min): [login to view URL]
Any questions or clarifications, please feel free to ask.
Thank you
-Ivan
Project background:
I tutor high school physics students in the Cambridge International Examinations system. To help me better teach my students, I need to know what questions have been asked on the past years' exams, and the solutions for all those questions. Thankfully, the exam papers and their solutions ("mark schemes") for many previous years are all publicly available in PDF format on various websites.
This project is phase 1 - we ONLY care about Physics, 9702, May/June 2020, papers # (1, 2, 4). The PDFs are available from sites like these (same PDFs, just uploaded to different sites. If you google "CIE AS Physics Past papers" you'll find find others too):
[login to view URL]
[login to view URL](9702)/2020/
[login to view URL]
These don't have the 2020 ones we're after for this project, but they have 2019 and previous years:
[login to view URL]
[login to view URL](9702)/
File naming convention: the PDFs are named like this:
eg "[login to view URL]"
9702 means Physics (will always be 9702 for this project)
s20 means "summer 2020" (will always be s20 for this project)
"ms" means "mark scheme" and "qp" means "question paper" (we only care about the "ms" and "qp" files)
the "12" at the end means paper #1, version 2 - we care about papers #1, #2, #4 only (not 3 or 5) and ALL versions of these papers (usually 3 versions so we want all the files ending in 11, 12, 13, 21, 22, 23, 41, 42, 43)
What I need - output to Google Sheets:
Each paper/version # (eg 11, 23, 41 etc), there is a "qp" (question paper) and "ms" (mark scheme) file. Each qp has multiple questions. Each question has multiple parts like 1a) 1b) 1c) and subparts like 1 a i) 1 a ii) 1 b i) etc. Each question/part/sub-part is worth a certain # of points (integer from 1 to 6+ points). For the lowest level of each question/part/subpart I need a separate line entry in the Google Sheets with the following data in separate columns:
1) subject
2) level
3) exam paper version
4) date/season
5) question number
6) question part/subpart
7) # of marks/points
8) question wording - full, exact wording of the question/part
9) solution wording & "mark type" - full wording of the solution/answers
bonus:
10) images/screenshots of any figures or tables related to question/part
11) image/screenshot of question/part itself
12) image/screenshot of solution itself
Example output Google Sheets that I used in the video: [login to view URL]
If anything is unclear please ask.
If you can think of a better format or any improvements, feel free to let me know.
Happy bidding
Ivan
ID Proiect: #30319985
Detalii despre proiect
54 freelanceri plasează o ofertă medie de 132$ pentru proiect
Hello. I have seen the videos and attached files. I am sure to do this job perfectly. Please review my profile and message me. Thanks Abbas
Hey there, I have watched your 2 videos. In first video, you have showed the paper along-with the mark scheme and the answers. You have also showed the Google spreadsheet with columns like Data/Season, Question Number, Mai multe
Hi, I pray and hope that you will give me chance to work with you after checking my profile. Lets chat for more details.
Hello, Sir I am offering my service to do scrape 18 pdf into google sheet of question paper and mark scheme of physics Sir I am offering free sample Sir lets discuss further
Hi, Greetings! ✅checked your project details. I can Scrape data from 18 PDFs into Google Sheets I Have 4 years of experience with Data entry, Data analysis, and MIS Management. I have strong analysis, and Excel/google Mai multe
Hello, I am ready to start right now. I can professionally work on your project according to your requirements. I am an expert in DATA ENTRY, CSV Editing, PDF, Digitization, WORD, Scan to Text with 10 years experienc Mai multe
Hi, I have 4 years of experience in computer vision and image processing with python. I also have worked on OCR projects. Although this is not exactly OCR, but I can help you with putting and organizing the PDFs. Tha Mai multe
I can do this job. please let me know your fixed budget and the deadline. then I can start this task and can complete it asap. Please send a message for a short discussion. Thanks, Sanjay
hi I am a certified data entry operator. I have gone through your proposal and I feel myself suitable for this role. I have done these types of projects many times before also. I am very much hardworking and devoted Mai multe
Hi! I've watched the videos and the output sheet that you've provided. I'd like to work on your project and I can assure you a high percentage of accuracy. I have a STEM background as I'm a Computer Science graduated Mai multe
Dear, I have read your description, I have experience in data entry and feeling confident to handle this project. I am hardworking, close to details and motivated. I can ensure you to deliver you a best quality of resu Mai multe
Hello, I've just checked your job description carefully. I'm senior developer with 7+ years of Python. By using Python, I used to make many Web scraping tools with beautifulsoup and selenium. Also i have many experie Mai multe
Individual Freelancer who is honest, work hard, do not outsource projects, no middle man,communicate well,communicate often and, who always try the best to meet (or beat) deadlines. I Read Your Project Carefully. And I Mai multe