
In Bearbeitung
Veröffentlicht
Bezahlt bei Lieferung
I need a dependable scraper that can crawl an online blog of roughly ten thousand posts and pull down every entry, complete with any comments attached to each post. The final dataset must be delivered in a clean, well-structured XML file because that is my preferred working format, but feel free to include additional JSON or raw HTML copies if they come out of your workflow naturally—extra formats are a bonus, not a requirement. Scope of work • Crawl every live post on the site, following all pagination and in-site links that surface original articles. • Capture full article content and pair the corresponding comments so they stay linked to the right post. • Preserve each post’s core details (title, body, URL slug, and whatever standard metadata your tool picks up) inside the XML structure. • Package the finished XML so it can be imported in a single run without manual tweaking. Deliverables 1. The complete XML dataset of ~10 k posts, each nested with its comments. 2. The scraping script or notebook, documented well enough that I can rerun it later if the blog updates. 3. A short run-through of the steps taken and any dependencies required. I’m happy with mainstream libraries such as Python-requests, BeautifulSoup, Scrapy or similar—use what lets you move fastest while keeping the output tidy. Let me know your estimated turnaround and any questions you have about the target domain, and we can get moving right away.
Projekt-ID: 40054622
114 Vorschläge
Remote Projekt
Aktiv vor 2 Monaten
Legen Sie Ihr Budget und Ihren Zeitrahmen fest
Für Ihre Arbeit bezahlt werden
Skizzieren Sie Ihren Vorschlag
Sie können sich kostenlos anmelden und auf Aufträge bieten

As an AI Engineer and Full-Stack Developer, I've scraped and processed large volumes of data like your blogging dataset in my previous projects. I've worked extensively with Python, particularly libraries like BeautifulSoup, Scrapy that would serve your project perfectly. My expertise in Machine Learning and Natural Language Processing will ensure that the scraped XML file is clean, well-structured, and meets your precise requirements. What sets me apart is my ability to document my work thoroughly, empowering you to re-run the script when needed. In previous projects lies the foundation of my expertise for this one: from developing Chatbots with knowledge retrieval functionality using OpenAI & vector search, to building Classification systems integrated with FastAPI backends deployed on GCP. These projects involved extensive webscraping, structuring data and utilizing mainstream libraries like Python-requests, Beautiful Soup in a manner similar to the one required here. I can leverage my knowhow in NLP and Full-stack Development along with mainstream libraries you've specified (Python-requests, BeautifulSoup, Scrapy) for a timely delivery of your project while maintaining utter accuracy. By choosing me for this task, you're not only gaining quick solutions powered by my substantial experience but also a collaborative partner always on your side throughout the project deployment. Could we discuss further steps?
€140 EUR in 3 Tagen
0,0
0,0
114 Freelancer bieten im Durchschnitt €132 EUR für diesen Auftrag

I have extensive experience in PHP, XML, Python, Data Entry, and Web Scraping, making me a great match for the "Scrape 10K-Post Blog Dataset" project. I am confident in my ability to deliver the complete XML dataset with comments efficiently. The budget can be adjusted after discussing the full scope, and I am committed to working within your budget. Let's start this project and discuss the details. Please review my 15-year-old profile to see my work history. Looking forward to hearing from you.
€175 EUR in 7 Tagen
8,7
8,7

Hello there, I am experienced in web scraping and building scripts or a Windows desktop application using Python. I am also experienced in large data scraping from a given website, bypassing IP, Captcha, and anti-bot or cloud flair protection. Please message me to discuss this project in detail. Best Regards Enamul
€100 EUR in 3 Tagen
8,3
8,3

Hi I have expertise in Web Automation and can develop you a reliable Python script to crawl nearly 10k blog posts and extract all the post-related data as well as comments into an organized XML format I will provide you the complete script with instructions to setup and run the program as well as sample XML dataset of nearly 10k blog posts. I'm available to discuss further details in chat and can start right away. Abdul H.
€100 EUR in 2 Tagen
7,8
7,8

Hi, I can handle the full crawl of your ~10k-post blog and deliver a clean, import-ready XML file with all posts and their attached comments correctly mapped. I’ll build a dependable scraper using Python (Scrapy/Requests + BeautifulSoup), document it clearly, and provide both the final XML and the runnable script/notebook. I’m careful with pagination, metadata, and structural consistency. Turnaround is fast, and I can start immediately once you share the target domain. Regards sujon
€150 EUR in 7 Tagen
7,5
7,5

I can build a robust crawler to fetch all ~10k blog posts and their comments, linking each thread correctly and exporting everything in a clean, import-ready XML file (JSON/HTML optional). You’ll get documented Python code and a clear rerun guide. Fast, reliable turnaround and full pagination coverage.
€110 EUR in 3 Tagen
7,5
7,5

Hello I am Python/Web Scraping specialist with several years of experience and I have completed a lot of similar projects here. I am able to start working, could you share URL of site to scrap data from? Also, about XML structure - does it exist, or I have to create? Thanks.
€52,40 EUR in 1 Tag
7,5
7,5

⭐>>-- Scraping Boss is here--<<⭐ I totally understand what you want. I have S_T_R_O_N_G experience in web/data scraping and crawling. I can scrape any website and overcome all anti-bot policies like Recaptcha, IP detection, etc... Just click the "Chat" button for further discussion. Thank you.
€100 EUR in 1 Tag
7,0
7,0

Hello, I’ll scrape all ~10k posts and comments, follow full pagination, and deliver a clean import-ready XML plus the documented script for future runs. Fast, accurate, reliable. Best regards, Siddiqur Rahman.
€170 EUR in 1 Tag
6,7
6,7

Hello, I am a PHP Developer with 15+ years of experience, specializing in building dynamic, secure, and high-performance websites and applications. I have worked on simple to complex websites, e-commerce stores, membership portals, and custom PHP-based solutions, always ensuring top-quality results for my clients. My expertise includes custom PHP development, Laravel/CodeIgniter frameworks, API integration, database management (MySQL), and performance optimization. Recently, I also worked on OpenAI API integration for auto-generated content, images, and social sharing, showing my ability to adopt the latest technologies. If you are looking for a dedicated PHP expert who guarantees quality, innovation, and timely delivery, I’d be happy to bring your project to life.
€100 EUR in 7 Tagen
6,5
6,5

Hello Ismael O. Hope you are doing well! This is Efan , I checked your project detail carefully. I am pretty much experienced with BeautifulSoup, Data Entry, Data Extraction, XML, Scrapy, Python, PHP and Web Scraping for over 8 years, I can update you shortly. Cheers Efan
€250 EUR in 10 Tagen
6,7
6,7

Hello, I will create a PHP script to automate your task. Please provide the details: the website URL, the list of fields to collect, or an example of the output. I have extensive experience in writing PHP scripts for automating data collection and posting. Please see my reviews for reference.
€250 EUR in 2 Tagen
6,7
6,7

⭐⭐⭐⭐⭐ Hi According to the job details that you need Scrape 10K-Post Blog Dataset I have some questions regarding your project details. let me know when you're available to chat so we can discuss PORTFOLIO LINK:- https://www.freelancer.com/u/suritaverma I'd be more than happy to answer any questions or discuss further project requirements if needed. Please feel free to reach out if you're interested in working together! Thanks & Regards Surita (Freelancer) P.S. - If you'd like me to send over some of my work please don't hesitate to let me know!
€50 EUR in 1 Tag
6,6
6,6

As a tenured Full Stack Software Engineer with over a decade of experience, I bring an array of highly relevant skills to the table. Your blog scraping project aligns perfectly with my expertise in Python, from using mainstream libraries like Scrapy to leveraging popular tools such as BeautifulSoup. My focus on maintaining high-quality, creative, and clean code aligns seamlessly with the scope of your project, ensuring you end up with a well-structured XML file accurately reflecting the original content. Over the years, I've honed my skills in data extraction, web crawling, and creating intricate datasets that stay linked even across nested entries like comments. Not only can I deliver you the complete XML dataset of the 10K post blog but also provide you with an easy-to-follow, well-documented script that can be re-run whenever required. Additionally, my extensive experience with various CMS including WordPress and Joomla gives me a unique edge in data structuring and management. Lastly, my commitment to strict deadline adherence combined with my passion for making complex processes user-friendly makes me an ideal fit for your project. I'm excited to move forward and demonstrate how I can leverage my skills to not just meet but exceed your expectations within your defined timeframe. Let's connect soon and get started on building your clean and valuable dataset!
€100 EUR in 7 Tagen
6,9
6,9

With my comprehensive skill set in Python and PHP and a solid understanding of web scraping, I am confident in providing you with exactly what you need for this project. I will utilize mainstream libraries such as BeautifulSoup and Scrapy to crawl and extract every post and comment from the blog ensuring that they stay linked together as intended. Being adept in both frontend and backend technologies, I am proficient at handling raw data, structuring it well, and preparing clean, well-structured XML files - just as you require. In addition to the dataset, I will also provide you with a well-documented scraping script/notebook to empower you to rerun it whenever necessary or the blog updates. My approach guarantees efficiency without compromising data quality. Moreover, my GitHub expertise ensures not just a clean code but also an impeccable version control history including all dependencies required. What truly sets me apart is my zeal for client satisfaction. I understand that your time is important, so you can rely on me to deliver within the agreed time frame without sacrificing output quality.
€140 EUR in 8 Tagen
6,2
6,2

As a seasoned web developer and highly proficient in Python, I can guarantee you a swift and effective data scraping experience. With my expertise in popular scraping tools such as BeautifulSoup and Scrapy, the project will be completed with the least technical hitches, ensuring your desired XML format is seamlessly produced. Having worked on various projects that required retrieving large-scale data, such as this job involving 10K blog posts, I have built the necessary skills to complete the task at hand efficiently. More specifically, my proficiency in web scraping paired with versatile programming languages including Python, PHP and XML make me a well-rounded candidate for this project. Trust me to deliver not only on the project's core requirements but also go the extra mile by providing you a well-documented script or notebook for future use, as well as comprehensive run-through of the steps taken. You can expect dedication and precision in all facets of this project, from retaining article content alongside their pertinent comments to maintaining post's core details stored within the XML structure. Let's get started promptly; I'm ready to dive into this challenge head-on and deliver exceptional quality results within a reasonable turnaround time!
€140 EUR in 7 Tagen
6,0
6,0

Hi, I can help you scrape the all the blog posts and associated comments in XML format efficiently and accurately, I can handle bot detection techniques well. Just share the blog link. Thanks!
€200 EUR in 3 Tagen
5,8
5,8

Dear Hiring Manager, I am a seasoned web scraping expert with proficiency in PHP, Python, and data entry. With a track record of successfully delivering similar projects, I am confident in my ability to scrape the 10K-post blog dataset you require. I have extensive experience in utilizing tools like Scrapy, BeautifulSoup, and Python-requests to efficiently extract data and deliver it in XML format, ensuring a well-structured dataset for your analysis. I am committed to providing you with a comprehensive XML dataset of the blog posts along with their associated comments, a detailed scraping script for future use, and a clear overview of the process undertaken. Let's discuss your project requirements further and initiate the collaboration to bring your vision to fruition. Looking forward to the opportunity to work together. Best regards, Ali Zahid
€30 EUR in 7 Tagen
5,6
5,6

Hi there, I'm confident that I can handle your project efficiently. With expertise in PHP, BeautifulSoup, and web scraping, I will crawl the blog, capturing posts and comments in a clean XML format as requested. My experience ensures a well-structured dataset ready for import. Looking forward to discussing the specifics further.
€100 EUR in 2 Tagen
5,5
5,5

As a seasoned professional with an extensive background in data scraping and conversion, I am confident in my ability to deliver the high-quality dataset you require. Over the past 3 years, I have honed my skills in web scraping using tools such as Python-requests, BeautifulSoup, and Scrapy; these will no doubt assist me in efficiently crawling your blog and identifying every single post - regardless of pagination or the complexity of within-site links. Creating clean, organized structures is one of my core competencies so you can trust that your final XML file will be perfectly tailored to your needs. In addition to the dataset, I promise to provide you with a well-documented script that not only explains how it operates but enables you - if need be in the future - to replicate the process and update it as per your requirements. My promise of quality work within timeline applies not just to delivering the complete XML dataset on time, but also extends to producing any additional formats that may prove useful to you without any manual tweaking. Lastly, I'm happy to offer you sample projects similar to yours where I have employed my skills effectively. I believe that actions speak louder than grandiose claims so let me exemplify my abilities through my work. Let's get started on this project and turn your raw unstructured data into valuable insights!
€50 EUR in 1 Tag
6,3
6,3

I can do it
€150 EUR in 7 Tagen
5,5
5,5

Madrid, Morocco
Zahlungsmethode verifiziert
Mitglied seit Juli 5, 2025
€30-250 EUR
€30-250 EUR
€30-250 EUR
€30-250 EUR
€30-250 EUR
₹750-1250 INR / Stunde
₹600-1500 INR
$35-36 USD
$30-250 USD
$250-750 USD
₹750-1250 INR / Stunde
₹400-750 INR / Stunde
$30-250 USD
₹1500-12500 INR
$5000-10000 USD
₹100-400 INR / Stunde
₹600-1500 INR
$250-750 USD
₹750-1250 INR / Stunde
$100-250 NZD
₹1500-12500 INR
₹750-1250 INR / Stunde
€100-150 EUR
$30-250 USD
$30-250 USD