Login Form






Lost Password?
No account yet? Register
Freelancer Jobs arrow Freelancer Jobs - Archive 2007
JobId: #1400449

Automated PDF download and parsing(repost)

Categories: Database,Requirements,C#,SQL Server,ASP .NET,Software Related (Includes Websites)
Posted By: got_mike  (0 ratings)
Source: Rent-a-coder
Job viewed: 452 times
Bids Placed: 7
Start Date: 2010-04-29 23:08:54
End Date: 2010-05-14 23:08:54
Time Remaining:
Deadline: Please estimate in your bid
Current Phase: Bidding open
Max Accepted Bid: Open to fair suggestions
Project Type:
Bid Type: Open Auction
Brief Summary:

I need a program that will download an all-text PDF file and parse the text in that file, then upload the fields into a database.

This program must run on a shared hosted server and must be programmed in ASP.net and C#.  The C# files must be separate files.  The database is SQL Server 2008.  I also must receive the source code.

I use DiscountASP.net as my hosting provider and they allow me to schedule page requests, so while I need this program to run daily, it is not important to build the scheduling into the program.

Requirements Interview Answers:
To help you bid more accurately, the buyer was interviewed about the requirements for this bid request. Below are their answers. Untitled Page
Bid request Type: What kind of work do you need done?
Software related (Includes desktop applications and internet websites)
Bid request Parts: What do you want the worker to do on this bid request?
Requirements: The worker will analyze the problem and propose a software-based solution to the problem.
Programming: The worker will take the requirements and translate them into the language of the computer (and test it).
User installation: The installer will move the software from the place it was created (which is called the development or QC environment) to where you will use it (which is called the production environment). The installer then tests the software to make sure that the installation was done properly and completely.
Req. Doc. Type: What kind of documentation do you want for this bid request?
Informal documentation - As the buyer talks back and forth about the bid request with the worker, those conversations become the requirements.Remember to communicate ALL of the details of your bid request on the Rent A Coder site. If you don't, and there is a dispute, then important details of the contract will not be documented and cannot be taken into account in arbitration. If you feel you MUST go offsite (for example, using the phone or IM) then afterwards post everything onsite and get the other party to post that they agree to those contractual terms.
Program Type: What kind of software should the worker create (and/or install)?
  • An internet web-site: This software runs on a web server and users will access it using their internet browser.
Internet web-site info
Design and functionality: What does the programming of this bid request involve?
  • Program Functionality: Making the website "work".
Modeling another site: Do you wish to model another site? No
Size of website: How many pages need to be created/edited in this website?
Exactly 1.
Programming Language: What programming language(s) do you want your website written in?
I do know the language(s).
Languages(s):
  • ASP .NET
  • C#
Misc. details: This must be written in ASP.net with C# codebehind and the .cs files must be separate.
Database: Will this bid request include a database?
Yes, it does include a database.
Details:
  • SQL Server -- version(s): 2008
Browser Type(s)/Version(s): Which browser/version combinations must this website support?
  • IE 7.0
Server Hosting Environment: What is your server hosting environment?
I have a server at a 3rd party hosting company.
The name of the hosting company is: DiscountASP.net
The server's shared/dedicated status is: Shared with other people.

Component Compatibility Protection:
Sometimes a worker may create features on the website that rely on componentswhich work fine on their own server, but won't work with your 3rd party hosting company's server. There are many reasons for this including version incompatibility, restricted permissions, etc... Rent A Coder normally requires that the worker verify in advance that your 3rd party hosting provider will support the components.  If they do not, then they are responsible for any incompatibilities, rather than you. Do you want this protection on your bid request?
Yes, I require the worker to query my hosting company in advance, to ensure that they support all components which the worker uses in the final solution (including supporting the anticipated version #s, allowing components to run under their potentially restricted security model, etc.)
Server Hosting Environment: Will the worker develop "live" on your server?
No. The worker is responsible for creating their own development and/or qc environment.
Physical installation: Who will perform the physical installation?
The worker will perform the physical installation, and will be given all necessary permissions to do so.
Legal: 1) I require complete and fully-functional working program(s) in executable form as well as complete source code of all work done (so that I may modify it in the future).
2) Deliverables must be in ready-to-run condition as follows (depending on the nature of the deliverables):
2a) Any website server-side deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
2b) All other software (including but not limited to any desktop software or software the buyer intends to distribute) must include a software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request (unless specified elsewhere by the Buyer).
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased.
3b) No part of the deliverable may contain any copyright restricted 3rd party components (including GPL, GNU, Copyleft, etc.) unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the seller's Seller Legal Agreement.
Other Requirements:
The file that needs downloaded is located here: <a shape="rect" href="ftp://www.hillsclerk.com/civil/foreclosure_list/MortgageForeclosureSales.pdf" shape="rect">ftp://www.hillsclerk.com/civil/foreclosure_list/MortgageForeclosureSales.pdf

the program should do the following when the web page is accessed:
1) check the above file on the ftp server and see if the date/time has changed since last time the file was downloaded
2) if it has changed, then download the file (otherwise, do not download just stop)
3) after downloading the file, parse the text in the PDF file page by page
4) at the top of each page (line 6) is listed a scheduled date.  record that date for the page.
5) parse the rest of the page into the following fields: case number, plaintiff, defendant, comments
6) write to a database table for each record the fields recorded in step 5 plus the date & time collected in step 4
7) run a report that shows all the data collected in an HTML table
8) create a concatenated link from a querystring i will provide to turn the following fields into hyperlinks: case number, plaintiff, defendant.  the links should open in new windows.
9) email the same file to email addresses i provide.
10) provide me with an "admin" page that allows me to change program options like smtp servers, email logins, passwords, email addresses to send to, etc.

any other questions with regard to functionality, please ask.
  • All deliverables must be uploaded to Rent A Coder before the deadline(s) for this bid request...with no exceptions. If this contract makes it impossible for a competent person to do this, then do not start this bid request...but instead alert Rent A Coder of an un-arbitratable, illegal bid request.
  • Remember that contacting the other party outside of the site (by email, phone, etc.) on all business bid requests <$500 (before the buyer's money is escrowed) is a violation of both the software buyer and worker agreements. Rent A Coder monitors all site activity for such violations and can instantly expel transgressors on the spot, so we thank you in advance for your cooperation. If you notice a violation please help out the site and report it. Thanks for your help.
Categories:
(Note: Like everything else on this page, these categories are part of the original contract for this bid request.)
Web development, Database, Languages, Requirements, C#, Other (Technology), SQL Server, ASP .NET, Software related (includes websites)