
Offen
Veröffentlicht
•
Endet in 2 Tagen
Bezahlt bei Lieferung
Build a cross-platform desktop bot that uses computer vision (template matching) to automate web form filling and data scraping — controlled through a web-based dashboard, with no Selenium or browser drivers. We need a vision-based UI automation bot capable of navigating real browsers (Chrome, Firefox, Brave, Opera) installed on the user's machine. The bot will fill web forms and scrape report data using image-based template matching — no browser drivers or Selenium. A local web UI will serve as the control dashboard for operators to input data, monitor progress, and handle errors manually when needed. Core requirements 1. Cross-platform support Must run natively on Windows, Linux, and macOS without platform-specific hacks. 2. Native browser control Open and control the locally installed browser (Chrome, Firefox, Brave, or Opera) at the UI level. Selenium and WebDriver-based solutions will not be accepted. 3. Template-based visual navigation Navigate target websites and perform predefined operations using template/image matching. Reference images for each action step will be provided by us. 4. Web-based operator dashboard A local web UI for operators to enter form input data. The dashboard must display a live count of inputs submitted and automations completed successfully. 5. Step-level error handling with manual override On error, the bot pauses at the failed step and sends a real-time notification in the web UI. The operator completes that step manually, then the bot continues automatically from the next step. 6. Structured CSV output Export results as a structured CSV where input data occupies the first columns and scraped output data follows in subsequent columns. The core requirments are: 1. The system should be able run in any os enviromnment windoes, linux, mac 2. it should open local browser installed on the machine whethere it is crome, firefox, brave, opera. 3. The it should navigate tho targe websites website do the predefined operation by template matching (reference image will be provided) 4. There must be we based UI to insert data that will be used to fill up the forms 5. The UI must track how many inputs are given are and how many automation is done sucessfully 6. If there any error occurred during the automation it should pause the at that setp and the notifies in the web Ui with notification and so the operaton can do that mannualy and after that step the bot should continue with the next steps 7. The output data should have starutured formated csv that will have have imput data at dirst coulmns and the output data at the later columns We are looking for ✓Hands-on experience with computer vision libraries (e.g. OpenCV, PyAutoGUI,Playwright) ✓Ability to build a local web dashboard to drive and monitor bot activity ✓Solid understanding of OS-level UI automation across Windows, Linux, and Mac ✓Experience with real-time notifications and step-level error recovery logic ✗Do not apply if your solution relies on Selenium, WebDriver, or any headless browser API When applying, please describe your proposed technical approach — especially how you plan to handle template matching across different screen resolutions and OS environments. Include any relevant demos or past work.
Projekt-ID: 40352213
40 Vorschläge
Offen für Angebote
Remote Projekt
Aktiv vor 2 Tagen
Legen Sie Ihr Budget und Ihren Zeitrahmen fest
Für Ihre Arbeit bezahlt werden
Skizzieren Sie Ihren Vorschlag
Sie können sich kostenlos anmelden und auf Aufträge bieten
40 Freelancer bieten im Durchschnitt $185 USD für diesen Auftrag

⭐⭐⭐⭐⭐ Create a Cross-Platform Bot for Web Form Automation & Data Scraping ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project requirements and see you're looking for a desktop bot that uses computer vision for web form automation. Look no further; Zohaib is here to help you! My team has successfully completed 50+ similar projects for automation solutions. I will create a bot that navigates browsers using image-based matching, ensuring efficiency and accuracy. ➡️ Why Me? I can easily build your cross-platform bot as I have 5 years of experience in UI automation, computer vision, and web technologies. My expertise includes using libraries like OpenCV and developing user-friendly dashboards. Additionally, I have a strong grip on real-time notifications and error handling, ensuring seamless operation across all platforms. ➡️ Let's have a quick chat to discuss your project in detail. I can showcase samples of my previous work and explain my approach to handling template matching across different environments. Looking forward to chatting with you! ➡️ Skills & Experience: ✅ Computer Vision ✅ UI Automation ✅ Cross-Platform Development ✅ OpenCV ✅ PyAutoGUI ✅ Playwright ✅ JavaScript ✅ HTML/CSS ✅ Real-Time Notifications ✅ Error Handling ✅ Dashboard Development ✅ CSV Data Export Waiting for your response! Best Regards, Zohaib
$68 USD in 2 Tagen
8,0
8,0

Hi, I have carefully reviewed your project requirements, and I am writing this bid by myself with full confidence that I can deliver a robust, cross-platform vision-based UI automation bot as you described. Your need for a native browser controller using image template matching without Selenium or drivers perfectly matches my expertise. With strong experience in Python, OpenCV, PyAutoGUI, and building local web dashboards, I will create a seamless bot that operates natively on Windows, Linux, and macOS. The bot will handle browser navigation through visual cues, provide real-time notifications on your web dashboard, and gracefully pause for manual operator input on errors while exporting structured CSV reports. I propose starting with a proof of concept for template matching across different resolutions and OS environments, followed by dashboard integration and step-level error handling. This will keep you updated and involved over a 30-day timeline. Could you provide examples of the types of reference images or UI elements that the bot will need to match to help tailor the template matching approach? Thanks,
$250 USD in 30 Tagen
6,0
6,0

As a highly skilled freelancer with a significant depth of knowledge in Automation and Computer Vision, I am perfectly aligned with your project requirements for building a vision-based UI bot that scrapes data and performs actions using template matching. Adhering strictly to your specifications, I ensure that my solution completely bypasses any reliance on Selenium or WebDriver, leveraging technologies like OpenCV and PyAutoGUI instead. I have hands-on experience crafting cross-platform solutions that align perfectly with the core features you outlined. My approach emphasizes building a local web dashboard to drive and monitor bot activities effectively across Windows, Linux, and macOS environments. This ensures operators can seamlessly input data, track progress, and handle errors manually via a user-friendly interface. Importantly, my solution ensures accurate template matching across different screen resolutions and OS environments - a key concern you mentioned. Pausing the bot on encountering errors and sending real-time notifications for manual intervention is a critical feature I offer. As a result, our structured CSV outputs will provide consistent accuracy with input data at the front of your datasets and scraped output data following chronologically, making it easy to track details of each operation. Let's discuss further; I’m excited about the potential we have to make this project a great success together!
$220 USD in 4 Tagen
5,5
5,5

With over 6 years of experience in full-stack development, I have built and shipped numerous web applications that have required highly precise automation. This is where my expertise in JavaScript, Python, SQL and an understanding of computer vision libraries such as OpenCV, PyAutoGUI,Playwright come together to make projects like yours not only possible but efficient. I understand the core requirements of your project: cross-platform compatibility, native browser control, image/template-based navigation and a web-based operator dashboard for input data. My experience aligns perfectly with these essentials and more. My approach is to build robust systems using clean architecture that are easy to understand, maintainable, and performant. I rely on real-time notifications and step-level error recovery logic - precisely what you require. My problem-solving capabilities extend beyond just the technical aspects; I thrive on making your life easier by automating tedious tasks to reduce manual labor and errors. Hence, implementing structured CSV outputs for efficient data management is an area I excel in. Together we can develop a vision-based UI automation bot that meticulously navigates the browser just as needed with precision and accuracy on any OS environment. Let's solve this together!
$155 USD in 7 Tagen
4,1
4,1

Hello,I can build your cross-platform vision-based automation bot using OpenCV and PyAutoGUI, without Selenium or WebDrivers. It will control local browsers (Chrome, Firefox, Brave, Opera), navigate sites via template matching (with multi-scale/resolution handling), and output structured CSVs.A local web dashboard (React + FastAPI) will allow operators to input data, track live progress, and handle step-level errors with manual override. The bot will resume automatically after interventions. I’ll ensure full cross-platform support, real-time notifications, and modular, well-documented code ready for deployment.I’m ready to start immediately and deliver a reliable, production-ready system. BEST REGARDS SHABAHAT HABIB*
$155 USD in 7 Tagen
4,7
4,7

Hello There!!! ★★★★ ( Cross-platform vision-based bot with dashboard & smart error recovery ) ★★★★ I understand you need a vision-driven automation bot (no Selenium) that controls real browsers via UI, uses template matching to fill forms, scrape data, and runs across Windows, Linux, mac with a web dashboard and manual override system. ⚜ Cross-platform desktop bot (Win/Linux/mac) ⚜ OpenCV + PyAutoGUI based UI automation ⚜ Template matching with adaptive scaling ⚜ Local web dashboard for input & monitoring ⚜ Step-level error handling + manual resume ⚜ Real-time progress tracking & notifications ⚜ Structured CSV output generation I have 6+ years experince in automation, CV and RPA tools, built similar bots using image recognition and OCR with stable workflows. My approach: Python core (OpenCV + MSS), resolution-independent matching using scaling + anchor regions, FastAPI dashboard, and event-based state control for pause/resume. Ensures stability across enviroments. Would love to discuss your templates and flow to start quickly. Warm Regards, Farhin B.
$300 USD in 7 Tagen
3,8
3,8

Hi, This is Jagrati. I reviewed your project description and understand that you’re looking to build a cross-platform, vision-based desktop automation system that controls real browsers via UI-level interactions (without Selenium or WebDriver), driven by a local web dashboard for operators. My approach would be to design a hybrid architecture consisting of a lightweight local backend service, a real-time web dashboard, and an OS-level automation layer powered by computer vision and input simulation. Key deliverables: • Cross-platform desktop automation system (Windows, Linux, macOS) • Vision-based automation using template matching (OpenCV-based approach) • Local web dashboard for input, monitoring, and control • Real-time progress tracking and notifications • Step-level error handling with manual override capability • Structured CSV export with mapped input/output columns • Modular and extensible architecture for adding new workflows I would be happy to walk through a proposed architecture diagram and implementation plan, and share relevant examples of similar automation systems I’ve worked on. Looking forward to your response. Best regards, JP
$250 USD in 7 Tagen
3,6
3,6

Hello! I'm a computer vision and desktop automation specialist with strong experience building vision-based UI bots using OpenCV + PyAutoGUI for real browser control without Selenium or WebDriver. Understanding your needs: A cross-platform desktop bot (Windows, Linux, macOS) that uses template matching to navigate real installed browsers (Chrome, Firefox, Brave, Opera), fill web forms, scrape report data, and is controlled via a local web dashboard with step-level error handling and manual override. My plan: Build the core bot using OpenCV for template matching and PyAutoGUI for native OS-level browser control. Create a lightweight local web dashboard (Flask/FastAPI + HTML/JS) where operators input data, monitor progress in real-time, and receive notifications on errors. Implement step-by-step automation with reference images (provided by you) and robust error recovery: pause on failure, allow manual intervention, then auto-resume. Ensure compatibility across Windows, Linux and macOS with resolution-aware template matching. Generate structured CSV output (input columns + scraped data). Add clean logging and configuration file for easy maintenance. I focus on reliable, maintainable vision-based automation. Please share the reference template images and target websites so I can start building the bot accurately. Looking forward to delivering a solid solution. Ready when you are.
$155 USD in 5 Tagen
3,3
3,3

Hello, I understand this is a vision-based UI automation system where reliability across OS, resolution variance, and real-time control are critical, and my approach would be: input data via local web dashboard (Flask/FastAPI + JS UI) → task queue → OS-level browser launch (Chrome/Firefox/Brave/Opera) → screen capture pipeline → OpenCV-based template matching with multi-scale and resolution normalization (dynamic resizing + feature-based fallback) → PyAutoGUI/OS hooks for interaction → step execution with state tracking → real-time dashboard updates (progress, success count) → error detection triggers pause + UI notification → manual override → resume pipeline → structured CSV export combining input and scraped output; the system will be cross-platform, modular, and robust to UI variations without relying on Selenium, and I can also share similar CV/automation work for reference, so if you want a stable, production-ready bot, let’s connect.
$155 USD in 7 Tagen
2,9
2,9

Develop a vision-based UI bot for cross-platform form processing, using template matching for browser automation without Selenium or drivers. Cross-platform compatibility: Utilize Python and OpenCV to ensure the bot runs natively on Windows, Linux, and macOS, handling OS-specific UI automation with PyAutoGUI. Native browser control: Implement direct interaction with Chrome, Firefox, Brave, and Opera using image-based navigation, ensuring compatibility with different screen resolutions. Template-based navigation: Use provided reference images for precise template matching, allowing the bot to perform predefined operations accurately. Web-based dashboard: Build a local web UI with Flask for data input and monitoring, displaying real-time progress and error notifications. Error handling: Implement step-level error detection with manual override, pausing the bot and notifying the operator via the web UI for manual intervention. CSV output: Export results in structured CSV format, with input data in initial columns and scraped data following. Timeline: 3 days. Could you provide more details on the expected range of screen resolutions the bot should handle?
$250 USD in 3 Tagen
1,8
1,8

I've built similar vision-based automation bots before using OpenCV, PyAutoGUI, and OCR to interact with web interfaces purely through on-screen recognition. I can create a reliable loop-based system with smart error handling, data extraction, and clean logging all without relying on Selenium.
$300 USD in 1 Tag
1,8
1,8

Hey — saw your post about building a vision-based UI bot for form processing. Biggest pain point with these is getting template matching stable when the UI shifts slightly (zoom levels, fonts, element positions) so the bot doesn’t constantly “miss” targets. Quick question before I suggest an approach: Do you already have a preferred stack in mind (e.g. Python + OpenCV + Playwright), or are you open on tech as long as it runs reliably cross-platform? I’ve worked on vision-driven RPA bots before using template matching and DOM fallback, so I’m familiar with the common issues around robustness and cross-platform behavior. If you share a short spec, screen recording, or target URLs, I can review and tell you what’s realistic for the first version.
$155 USD in 7 Tagen
2,0
2,0

Hi, It looks like you're looking for a cross-platform bot that can automate web form filling and data scraping using computer vision, without relying on traditional browser drivers. I get how important it is to have a solution that runs smoothly across Windows, Linux, and macOS. My approach would involve leveraging libraries like OpenCV for template matching to effectively navigate various browser interfaces based on the provided reference images. I have hands-on experience building similar automation solutions, including local dashboards that provide real-time monitoring and error handling features. By implementing a clear step-level error recovery mechanism, operators can easily intervene when necessary, ensuring the process remains efficient. In past projects, I’ve created robust systems that streamline data processing and provided structured outputs, such as CSV files, tailored to user needs. My focus is on delivering clean, scalable solutions while maintaining open communication throughout the project. Best regards, Novalitz Tech
$155 USD in 7 Tagen
1,9
1,9

Being highly skilled in Computer Vision, Python, and Web Development, I have a solid background in providing end-to-end technology solutions - exactly what your vision-based UI bot project needs. More specifically, I have hands-on experience with powerful libraries such as OpenCV, PyAutoGUI, and Playwright that are essential for template matching as per your project requirements. My expertise in these libraries will ensure effective navigation of target websites and predefined operations without Selenium or WebDriver. Moreover, I have a proven track record in building local web dashboards that integrates seamlessly with automation systems to drive and monitor functionalities - the exact requirement for your operator dashboard. Additionally, my proficiency extends into handling errors and implementing manual override systems ensuring successful completion of automations. As an added value, I possess strong real-time notification skills crucial for the automated process to be efficient. On the subject of OS compatibility, my experience spans across Windows, Linux, and macOS making me adept at handling platform specifics without resorting to any unethical hacks. In essence, with my all-encompassing understanding of your project requirements and proven technical aptitude — your project is in capable hands! Let us embark on this journey to transform your workflow with cutting-edge computer vision and automation!
$100 USD in 9 Tagen
1,9
1,9

Hi, This is a great use case for vision-based automation—I can build a robust, cross-platform bot without Selenium or drivers. Proposed Approach: - Core Engine (Python): OpenCV (template matching) + PyAutoGUI for mouse/keyboard control - Browser Control: Launch and control native browsers at OS level (Chrome, Firefox, Brave, Opera) - Resolution Handling: Use multi-scale template matching + dynamic region detection + confidence thresholds to stay stable across different screen sizes/DPI. Optional calibration step per machine for accuracy. - Workflow Engine: Step-based execution system (JSON/YAML-defined flows) so actions are modular and reusable Web Dashboard: - Flask/FastAPI backend + lightweight frontend - Input form submission + live counters (processed/success/fail) - Real-time status via WebSockets - Error alerts with step pause + “Resume from step” control after manual fix Error Handling: - Screenshot capture on failure - Step-level retry + manual override trigger - Logs for each action Output: - Clean structured CSV (input columns → output data) - Deduplication + run history tracking Experience: I’ve built similar CV-based automation tools handling dynamic UIs and cross-OS behavior. I can deliver this system for $300, including setup guide and demo walkthrough. Happy to share architecture diagram or a quick prototype if needed. Best regards
$100 USD in 7 Tagen
1,0
1,0

Hello, This is a well-structured and challenging system, and I’m confident I can build it end-to-end with reliability and clear architecture. I understand the core requirement: a vision-based automation bot (no DOM interaction) tightly integrated with a full-stack system. My approach will focus on modular, event-driven design to ensure stability and traceability. Architecture plan: • React frontend for operator input + real-time status tracking • FastAPI backend as central controller (API + orchestration) • Redis for async messaging (triggers, retries, alerts, CAPTCHA events) • PostgreSQL for structured input/output tracking • Python automation service using OpenCV + OCR + PyAutoGUI For the bot logic, I will strictly implement your 5-step verification cycle (capture → verify → act → confirm → validate) with retry mechanisms and state rollback to ensure robustness. Key features: Real-time status updates via Redis channels CAPTCHA detection → pause + alert + manual resume Clean logging and full traceability of each run Excel export from PostgreSQL via backend Windows-compatible deployment I prioritize clean code, documentation, and version control throughout development. Timeline: 2–3 weeks (complex system, better done properly) I’m ready to start and can discuss architecture in detail. Best regards, Rajaoul Haque M.
$38 USD in 3 Tagen
0,6
0,6

Hi, I have relevant experience. I have read the details you shared and I understand the outcome you want. I am a reliable freelancer with 20+ years of experience in JavaScript, Python and I helped many clients in the past. Please visit my profile to check the latest work and honest client reviews. If you like my approach, please connect in chat. Regards, Shyam M
$160 USD in 7 Tagen
0,0
0,0

Hi, I’ve reviewed your project and understand that you need a **vision-based bot to automate form filling**, likely using computer vision and AI to interact with dynamic interfaces. I can help you build a **smart automation system** that uses computer vision + AI to detect form elements and complete them accurately. Unlike traditional automation (like Selenium), vision-based systems analyze the screen visually and adapt to changes, making them more reliable for real-world use ([Skyvern][1]) What I will deliver: * Vision-based bot to detect input fields, buttons, and UI elements * Automated form filling using intelligent data mapping * OCR + object detection for dynamic interfaces * Human-like interaction (click, type, scroll automation) * Scalable system that works across different websites My approach: I will combine **computer vision (UI detection)** with **AI logic (decision making)** to create a bot that understands forms visually and fills them intelligently. Computer vision enables systems to interpret images/screens and take actions based on that understanding ([Wikipedia][2]) Tech stack: * Python (OpenCV, PyAutoGUI / Selenium hybrid if needed) * AI models for detection (CNN / OCR tools) * Optional: GPT-based logic for smart input handling I can start immediately and build a reliable, adaptive automation solution tailored to your requirements. Looking forward to working with you. Best regards, Aryadittya
$10 USD in 1 Tag
0,0
0,0

What you’re really trying to avoid here is building a bot that looks good in a demo, then falls apart the moment it meets a different screen size, browser skin, or OS behavior. A job like this usually gets posted when the usual automation routes have already started to feel like a dead end. Selenium is too brittle for what you want, browser drivers are off the table, and what you actually need is something that can operate in the real world like a careful human would, while still giving your team visibility and control when something goes sideways. The way I’d approach it is as two tightly connected layers: a vision-based desktop runner that handles browser interaction through template matching and step logic, and a local web dashboard that acts like mission control for input, live status, pause states, manual recovery, and CSV output. The important part is making the vision layer adaptive, not naive, so template matching is backed by resolution-aware scaling, confidence thresholds, anchor-based positioning, and step validation before each action is taken. What you want underneath all of this is not just automation, but a system that stays usable when reality gets messy. Something operators can trust, step in on, and hand back to the bot without breaking the flow. I already have a clean way to structure the recovery logic and make the matching more stable across Windows, Linux, and macOS… want me to show you how I’d map it out?
$155 USD in 7 Tagen
0,0
0,0

Hi, I can build this cross-platform vision-based automation bot using Python + OpenCV with OS-level UI control and a local web dashboard. Approach: • Template matching with OpenCV + resolution scaling • UI automation via PyAutoGUI (no Selenium/WebDriver) • Local web dashboard (Flask/FastAPI) for inputs & live status • Step-level pause + manual resume logic • Multi-browser support (Chrome, Firefox, Brave, Opera) • Structured CSV export with input + scraped data Runs on Windows, Linux, and macOS. I can start immediately. Do you want this packaged as installable desktop app or run via local server?
$170 USD in 7 Tagen
0,0
0,0

Dhaka, Bangladesh
Zahlungsmethode verifiziert
Mitglied seit Apr. 5, 2026
$10-3500 USD
$250-750 USD
$30-250 USD
$30-250 USD
₹400-750 INR / Stunde
₹12500-37500 INR
$10-30 USD
₹100-101 INR / Stunde
$3000-5000 USD
₹600-1500 INR
₹150000-250000 INR
£20-250 GBP
₹1500-12500 INR
₹600-1000 INR
$10-30 USD
€30-250 EUR
$10-300 USD
$8-15 USD / Stunde
$30-250 USD
£250-750 GBP
$8-15 USD / Stunde
₹100-400 INR / Stunde