
In progress
Published
Paid on delivery
Dear Freelancer,

As a new startup, we need a professional plan for a web scraping system that can handle high-volume product data extraction from e-commerce sites. We have tried PC-based methods but ran into limitations. We are looking for a cloud-based solution that is automated, stable, and low-cost.

Please provide a detailed plan only for now. The plan that fixes our current scenario and that we can implement to fulfill our requirements will be awarded this project. Our team will review the plans and discuss them internally for a few days. If your plan fits our requirements overall, we will hire you for a separate new project to build it.

Current Scenario:
1) We have already tried scraping with Python installed on a PC, and before that with Eclipse software on a PC.
2) Python on the PC worked, but it is slow: it processed 5,000 URLs, with 18 XPath locations per URL, in around 4 hours.
3) The sample input spreadsheet, sample output spreadsheet, and Python scraping code files for this PC-based system are attached.
4) Our earlier PC-based scraping cannot achieve our target. It needs a very high hardware configuration and depends on a person to turn on the PC and check the run from time to time for internet connection problems, power outages, and so on. We therefore want a hurdle-free process or system, for example an online or cloud-based one (such as GitHub Actions or anything else), that also resolves the obstacles encountered while scraping, keeps the workflow smooth and automated, and delivers final usable data with near-permanent stability.

Our Requirements:
1) A system that scrapes 1,00,000+ (100,000+) product URLs per run, extracting 30 fields per URL (located through identifiers such as XPath; e.g., rating, price, title, image URLs, categories, stock, delivery info, product tags, and other minor or major product details), in about 1 hour or less to begin with.
2) A hurdle-free process or system, for example an online or cloud-based one (such as GitHub Actions or anything else), that also resolves the obstacles encountered while scraping, keeps the workflow smooth and automated, and delivers final usable data with near-permanent stability.
3) Currently we scrape data from Amazon India and Flipkart, but later we will also scrape data from our own websites.
4) As a very new startup with further plans, we do not want to carry a high or medium fixed monthly expense. We need to run this program twice a day at a very low monthly cost.
5) Automation: scheduled (twice daily) or manual trigger, no local PC needed; also needed for binding some points of the workflow.
6) Scalable: easy to add new sites (config for XPaths per site).
7) Easy to use: beginner-friendly daily operation with minimal maintenance.

Please Bid with Answers to the Questions Below:
1) What monthly fixed cost would we need to spend on these resources?
2) How long will your system take each day to scrape 1,00,000 URLs (with 30 specified elements per URL, each written to a separate cell in the same row for the same product; see the sample output spreadsheet for more clarity)?
3) An overview of the techniques and approaches you will use to build this workflow system, plus anything else you want to convey, such as limitations or important points about your system.
4) Your plan PDF.

Next Step: Reply with your plan. Top plans will be selected for the development project.
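The figures in the posting imply a specific throughput target, which can be worked out directly. The short Python sketch below uses only numbers stated above (5,000 URLs in about 4 hours today; 100,000 URLs with 30 fields each in about 1 hour as the goal); the variable names are illustrative.

```python
# Figures taken from the posting: current PC-based run vs. the stated target.
current_urls = 5_000
current_hours = 4
target_urls = 100_000
target_hours = 1
fields_per_url = 30
runs_per_day = 2

current_rate = current_urls / (current_hours * 3600)  # URLs per second today
target_rate = target_urls / (target_hours * 3600)     # URLs per second required
speedup = target_rate / current_rate                  # how much faster the new system must be
cells_per_run = target_urls * fields_per_url          # spreadsheet cells filled per run
requests_per_day = target_urls * runs_per_day         # page fetches per day, excluding retries

print(f"current rate: {current_rate:.2f} URLs/s")
print(f"target rate:  {target_rate:.2f} URLs/s")
print(f"speedup needed: {speedup:.0f}x")
print(f"cells per run: {cells_per_run:,}")
print(f"page fetches per day: {requests_per_day:,}")
```

In other words, the target means going from roughly 0.35 URLs per second to about 27.8 URLs per second, an 80-fold increase, sustained for 200,000 page fetches a day.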
Project ID: 40103568
9 proposals
Remote project
Active 1 month ago
9 freelancers are bidding an average of ₹1,017 INR for this job

Hi there, I’m Abdul Rehman. I’ve carefully read your project details and I’m confident I can deliver exactly what you’ve described with high quality and on time. Let’s discuss your requirements and get started right away. Best regards, Abdul Rehman
₹1,150 INR in 7 days
2.8

Hi, my name is Muhammad Usama, and I believe my extensive experience in automation and web scraping makes me uniquely suited to tackle your high-volume e-commerce data extraction project. In the past, I have successfully automated similar workflows using tools such as n8n, Make, and Zapier, which are known for turning manual tasks into streamlined, hassle-free operations. Given your stringent performance requirements, I plan to use cloud-based solutions such as GitHub Actions so that extracting 1,00,000+ product URLs per run, with 30 specified elements per URL, can be completed within an hour or less. Let's have a chat. Warm regards, Usama Ansari
₹1,050 INR in 7 days
0.0

I have designed a solution specifically to address your requirement of 100,000+ URLs per hour while maintaining the "low monthly expense" priority essential for a new startup. I am unable to attach a PDF here; please initiate a chat so that the attachment option opens for me.

I will transition your local Python scripts into a serverless distributed system using Python (Scrapy/Playwright) and GitHub Actions. This eliminates the need for PCs, electricity, or manual monitoring.

Addressing Your Requirements:
Speed: I use parallel processing. By splitting 100,000 URLs across 50+ cloud "workers," we reduce your 4-hour runtime to under 60 minutes.
Stability: GitHub Actions provides a 99.9% uptime environment. I include stealth headers and rotating User-Agents to bypass Amazon/Flipkart bot detection.
Cost-Efficiency: As a startup, you pay $0 in fixed server fees. You only pay for pay-as-you-go proxies (est. $20/month).

Key Answers:
Monthly Cost: ~$15–$30 (variable proxy costs only; zero fixed server fees).
Daily Time: ~45–55 minutes per 100k run.
Techniques: Python async requests, GitHub Actions for scheduling, and a centralized config file for easy XPath updates.

Next Step: Would you like me to share the specific Python cloud architecture diagram we have designed for your workflow?
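The bid does not say how the URL list would be divided among the workers. As a minimal sketch, assuming a simple round-robin split (the function name `shard_urls` and the placeholder URLs are hypothetical, not part of the bid), the idea of spreading 100,000 URLs over 50 workers could look like this in Python:

```python
def shard_urls(urls, num_workers):
    """Split urls into num_workers round-robin shards of near-equal size."""
    if num_workers < 1:
        raise ValueError("num_workers must be at least 1")
    # Worker i takes items i, i + num_workers, i + 2 * num_workers, ...
    return [urls[i::num_workers] for i in range(num_workers)]


# Hypothetical placeholder URLs standing in for the real input spreadsheet.
urls = [f"https://example.com/product/{n}" for n in range(100_000)]
shards = shard_urls(urls, 50)

urls_per_worker = len(shards[0])
# Rate each worker must sustain to finish its shard within one hour.
per_worker_rate = urls_per_worker / 3600

print(f"{len(shards)} shards of {urls_per_worker} URLs each")
print(f"each worker needs about {per_worker_rate:.2f} URLs/s")
```

With 50 workers, each shard holds 2,000 URLs, so every worker has to average a little over one page every two seconds, including any retries, to stay within the hour.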
₹1,500 INR in 7 days
0.0

Hello, I am interested in your data entry project. I have experience handling data entry, web research, and basic data processing using Excel. I focus on accuracy, clear communication, and timely delivery. I am confident I can complete this task according to your requirements and I am open to discussion. Thank you.
₹1,050.05 INR in 70 days
0.0

Hello, my name is Srujan, and I am a student freelancer with strong skills in data scraping, AI-assisted content creation, assignments, resumes, PPTs, and written work, and I can work on your content as well. I have carefully read your project and understand that you need a plan for building a stable, ultra-fast e-commerce data scraping system. I can complete this efficiently using AI tools combined with manual review to ensure accuracy, clarity, and originality. Although my profile is new, I am highly committed to delivering quality work, meeting deadlines, and building long-term client relationships. I focus more on client satisfaction than ratings at this stage, which means you will get my full attention on your project. What you can expect: ✔ Clean and well-structured output ✔ On-time delivery (even urgent tasks) ✔ Revisions if required. Estimated delivery: [today]. Budget: 700. I'd be happy to get started immediately. Looking forward to your response. Best regards, Srujan
₹700 INR in 1 day
0.0

Hello, I have carefully reviewed your current scraping scenario and requirements for building a stable, cloud-based, and low-cost e-commerce data scraping system. I can provide a clear and practical plan to move your existing PC-based Python scraping workflow to an automated cloud environment. The plan will focus on scalable scraping architecture, cloud execution (such as GitHub Actions or low-cost cloud servers), scheduling automation, and handling high-volume URLs efficiently. I will outline estimated monthly costs, expected scraping time for 100,000 URLs with multiple fields, and techniques to improve speed, stability, and maintainability while keeping expenses minimal. I will also include limitations, important considerations for Amazon and Flipkart scraping, and a step-by-step workflow explanation in a structured PDF plan that your team can review and discuss internally.
₹1,050 INR in 7 days
0.0

Hi, I’m a Python Backend Developer focused on designing reliable, stable, and maintainable backend systems. I can deliver a clear, practical technical plan for building or stabilizing a Python-based backend application, focused on long-term reliability rather than quick fixes.

What the plan will cover:
- High-level backend architecture (layers, responsibilities, data flow)
- Recommended Python stack (FastAPI / Flask) and rationale
- API design principles and versioning strategy
- Error handling, validation, and fault tolerance
- Logging, monitoring, and observability practices
- Database design, migrations, and data integrity
- Security fundamentals (auth, secrets, input validation)
- Testing strategy (unit, integration, critical paths)
- Deployment considerations and environment separation (dev/staging/prod)
- Scalability and maintainability recommendations

The plan will be structured, actionable, and easy to follow, so it can be used directly by a development team or as a roadmap for implementation. I focus on real-world backend practices that reduce downtime, improve stability, and simplify future development. Happy to tailor the plan to your specific system and goals. Best regards.
₹650 INR in 7 days
0.0

Hi, I propose a fully cloud-based, automated web scraping system to replace your PC-dependent setup and ensure speed, stability, and low operational effort. The system will use Selenium and Playwright running on cloud infrastructure to scrape 1,00,000+ product URLs per run, with 30 fields per URL, within 45–60 minutes. Scraping will be executed through parallel browser instances to achieve high throughput. The workflow will run twice daily via scheduled or manual triggers using GitHub Actions or a cloud scheduler, with no local PC required. Built-in retry handling, logging, and auto-restart will maintain smooth execution. Site-wise XPath selectors will be managed through configuration files for Amazon, Flipkart, and future websites. Input and output will remain spreadsheet-based for ease of use. The estimated monthly fixed cost will be approximately ₹1,000–1,400 INR, and a detailed plan PDF will be provided. Regards, Muzamil
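The bid mentions per-site XPath configuration but gives no format for it. The sketch below is one possible shape, assuming a mapping keyed by domain and looked up from each URL's hostname; `SITE_CONFIG`, `config_for`, and every XPath string are hypothetical placeholders, not selectors verified against Amazon or Flipkart.

```python
from urllib.parse import urlparse

# Hypothetical selectors for illustration only. Real XPaths have to be taken
# from the live pages and would normally sit in a separate config file.
SITE_CONFIG = {
    "amazon.in": {
        "title": "//h1//text()",
        "price": "//span[@class='price']/text()",
    },
    "flipkart.com": {
        "title": "//h1//text()",
        "price": "//div[@class='price']/text()",
    },
}


def config_for(url):
    """Return the field-to-XPath mapping for the site that url belongs to."""
    host = urlparse(url).hostname or ""
    for domain, fields in SITE_CONFIG.items():
        # Match the bare domain and any subdomain such as "www.".
        if host == domain or host.endswith("." + domain):
            return fields
    raise KeyError(f"no selector config for host {host!r}")


fields = config_for("https://www.amazon.in/dp/EXAMPLE")
print(sorted(fields))
```

Under this layout, adding a new website means adding one more entry to the mapping, with no change to the scraping code itself.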
₹750.02 INR in 11 days
0.0

New Delhi, India
Payment method verified
Member since March 1, 2019