!!! Please read attached [login to view URL] file for details. Text here is an abstract due to character limit !!!
Hello Python and PDF expoerts!
I have a pdf file of product summary and several pdf files for individual products. I'd like you to develop a Python tool to make a PDF catalog by concatenating all PDF files, making links to product page from summary page, and adding navigation menu images onto all pages to link back to summary page.
One of the technically challenging part of the project is No.3 below; search text in existing pdf and find its binding box so that link can be created. I already have a solution for it (based on PDFMiner) and I will provide the code so you can jump start.
https://www.freelancer.com/projects/eBooks-English-US/Add-link-text-existing-pdf.html
Required functionality
------------------------
1. Read a csv file ([login to view URL] attached as example) which lists product part number and corresponding product pdf file name.
2. Concatenate the summary pdf file ([login to view URL] or [login to view URL]) and product pdf files in the order listed in [login to view URL], while keeping record of page count.
3. Search product part numbers in concatenated pdf and set link to the 1st page of corresponding product pdf
4. Read another csv file ([login to view URL]) which lists group name and corresponding group's summary page number.
5. Make navigation menu as an array of link buttons from [login to view URL] and overlaying group name text (refer [login to view URL] for layout).
6. Apply the menu to all pages (except page 1)
UI:
Simple command line interface is ok. For example
> python [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] -p cutsaw
(p option is password for [login to view URL] if encrypted)
Special note:
* Paper size and orientation is not constant. Overlaying link and navigation menu must be adjusted to each page size and orientation.
* [login to view URL] may or may not be password protected. When protected, take extra argument for password to decrypt and encrypt back the result pdf.
* The tool should preserve [login to view URL]'s security restriction meta data.
(if you use PyPDF2 or pyPdf, this patch may help [login to view URL])
* Group names are "Top" and arbitrary number of one or two digit numbers. I'd expect max 15.
Other requirements
------------------
* Use Python logging facility to output debug messages
* Add reasonable amount of comments in code so that I can maintain it later
* Try to follow PEP8 Style Guide for Python Code
* Frequent communication; One line status update once in two days; describing which part you are working on
* All the outcome of the project belong to the project owner
What I provide as the project owner
* Requirement (this document)
* Example files (attached)
* Reward as agreed at the start of project
* Feedback as the project progress
What you provide as a developer
* Technical advice on requirements
* Source code
Project time frame and milestone
-----------------------------------------------
I’d expect the project to be completed in about two weeks.
1st milestone: Requirements agreed and identified libraries to use (1/4 of reward)
2nd milestone: 1st version (proof of concept) submitted (1/4 of reward)
3rd milestone: 2nd version (functional) submitted (1/4 of reward)
4th milestone: Final version submitted and project completed (1/4 of reward)
The requirements are written in a way so bright and clear, as is rarely an opportunity to read similar.
The task is to create a CLI script that will be created from the components (pdf files, csv files, files JGP) a PDF document with internal links and navigation menu.
An important issue is the confidentiality of data. The input data may be secure password, and the meta data must be protected. Sheets with pdf file can have a different size and orientation.
The code should be in good style (PEP 8) well commented and easy to subsequent maintenance.
method:
[login to view URL] file will serve as the backbone of the output file to which will be added other elements on the basis of [login to view URL] and product_list.csv.
schedule:
each milestone - 2 days
Proposes a repository for [login to view URL] and there to coordinate the project
Due to the very accurate documentation, the only question when we start work?
Best Regards
Dawid Anioł
$388 USD în 6 zile
3,6 (1 recenzie)
3,0
3,0
12 freelanceri plasează o ofertă medie de $438 USD pentru proiect
I have the required expertise in python, excel w/ macro and pdf editor, I have developed many projects with python, check my portfolio, feedbacks and reputation, my works have warranty and quality.
I have experience of 5 year of creating web and CLI application in pyython. I can do the job for you in just 10 days. Give me the job and it will be done in time and with quality.
You need something like pdfium or poppler lib. I have worked with both :). It is not problem to use this libs from Python, but we can make it in C++ too.
Hi,
I have about 4 years of experience in python. Will deliver on time and of good quality. Kindly give go ahead to start
Thanks in advance
karthi palanisamy
Dear, hire manager, i am read your job description with carefully, i think all of your requirement can complete with carefully, because i have 5 years practical experience about this job
Hey, thanks for the invitation. I am indeed interested in the project. My bid reflects an estimated time to completion of 5 days, and I included a significative discount to my hourly price. Cheers.
We will create a python code for concatenation of pdf file. We will first create a pseudo code in three days which you can test with sample and then give you final code.
Thanks
Krishna