What Is the Scraping Camel Extension?#

Scraping Camel is an extension that automatically and continuously crawls websites, downloads information from them, and stores this data in a machine-readable data feed (CSV) or provides it via API to other extensions (e.g. Mergado Marketing Buddy). It works as a server-side scraper that runs continuously, meaning data is constantly updated without needing to run any software on a local computer.

The extension is ideal for collecting data that is not available in standard product feeds, such as information from category pages, blogs, static pages, or websites that have no feeds at all.

Who it is for:

  • PPC specialists — For building automated campaigns (e.g. DSA or Performance Max) based on data from an entire website.
  • SEO specialists — For continuous page monitoring, SEO audits, and keyword analysis in real time.
  • Online store owners — Who need to create product feeds where their platform does not provide them, or who want to supplement existing feeds with missing data (e.g. parameters, stock levels, images).
  • Sites without a shopping cart — Such as service catalogues, magazines, corporate websites, or wholesale catalogues, for which Scraping Camel enables entry into the world of feed marketing.
  • Content specialists and management — For reviewing the results of their work and gaining an overview of website content and structure.

Pricing and billing#

The extension is priced at:

  • EUR 37 per month with monthly billing,
  • EUR 29.60 per month with annual billing.

The price is set for one online store (in Mergado) and is fixed, regardless of the number of projects (websites) or keyword analyses created.

Due to the technical dependency, the extension is billed in line with the billing frequency selected in Mergado Editor. Different billing frequencies cannot be chosen for individual services.

The Scraping Camel extension has a 30-day free trial.

Can Scraping Camel be used without Mergado?#

Scraping Camel can be used without a paid project in Mergado. However, you need to create a Mergado account for user authentication and billing, and an online store for user access management. Both can be created in Mergado for free, without a paid project.

  • Scraper (Crawler) — An automated robot that runs continuously on a server, processes the HTML code of pages, and extracts data from it into a machine-readable format.
  • Keyword — A word or phrase automatically generated by Scraping Camel directly from website text. Keywords characterise the page content and its main topic.
  • Element — A specific data field carrying information that Scraping Camel downloads from a website. Elements are divided into:
    • System elements — Predefined common SEO attributes (Title, H1, Meta Description, etc.).
    • Custom elements — User-defined fields (e.g. price, stock level).
    • AI elements — Fields generated by artificial intelligence without manual configuration (keywords, language detection, word count).
  • Bulk elements — A special type of element for which Scraping Camel does not stop at the first occurrence but stores all occurrences on a given page.
  • Validator — A condition or rule that automatically and continuously monitors the state of a website and alerts to technical or SEO errors.
  • Reverse export — A special type of output CSV file for keywords, where the primary key is the URL, to which all found keywords are assigned (suitable for PPC campaigns).
  • Score — A numeric value expressing the degree of significance (relevance) of a keyword for a specific web page.
  • Phrase — A multi-word expression that the user defines so that it is processed as a single unit (one keyword) during analysis.
  • Stop words — A list of words to be ignored when generating keywords (e.g. conjunctions, prepositions, or your own brand), to keep the results clean.
  • Labels — A tool for organising and clustering keywords into thematic groups, for example by product category.

What you need to start using the Scraping Camel extension#

  • A Mergado Editor account with an online store created in it.
  • Proving your relationship to the domain — Scraping Camel is not intended for scraping third-party websites or competitor sites. You therefore need to verify domain ownership to prevent unauthorised scraping of other websites.
  • Adding and verifying sitemap.xml — A sitemap file is essential for the extension to work, as Scraping Camel uses it as the source of URLs to download. If the extension does not find the sitemap automatically, its address must be entered manually.

Main features#

1. Collecting any data from a website’s HTML#

Scraping Camel can retrieve any information from a website that standard feeds do not contain. It continuously crawls the site and downloads information directly from the source code of pages, which it then stores in a machine-readable CSV feed. Collected data is stored in elements:

  • System elements — Predefined attributes that can be activated with a single click. These include dozens of SEO parameters such as TITLE, H1, META_DESCRIPTION, HTTP_STATUS, microformat information (Open Graph, Twitter Cards), or measurement code IDs (GTM, GA).
  • Custom elements — Allow you to retrieve specific data that system elements do not cover (e.g. price, stock level, parameters, article author, or breadcrumb navigation). You define them yourself either using regular expressions or by specifying text “before and after” the desired value.
  • Scraping Camel AI — Elements generated by artificial intelligence that automatically analyse content without any manual configuration. Unlike standard elements, you do not define their content yourself — the extension retrieves it automatically using AI. Scraping Camel thus independently detects the language (SC_DETECTED_LANGUAGE), counts the words on a page (SC_NUMBER_OF_WORDS), generates a main page title (SC_MAIN_TITLE), and also generates keywords for the page (SC_WORDS_COUNT, SC_WORDS_TUPLES_COUNT, SC_WORDS_AGG_MIN_FREQ_3). Find out more about AI elements.

⚠️ Scraping Camel does not render JavaScript — it works on HTML only. If any content is hidden behind scripts, the extension cannot extract it.

2. Bulk elements#

Unlike standard elements, which store only the first occurrence in the code, bulk elements store all occurrences on a given page. This allows you to retrieve:

  • Backlinks — You get a list of all links on the page, including their link text and the HTTP status code of the target URL. This makes it easy to identify broken links, for example.
  • Images — You get the URLs of all images on the page, their alternative texts (alt), file size, and content type. This feature is key for identifying broken links (404 error) or oversized images that slow down a website.

3. Continuous SEO validation and monitoring#

Scraping Camel works as an SEO audit running continuously on a server. It includes 35 system validators that constantly monitor the technical state of your website (e.g. missing H1, overly long titles, URLs blocked in robots.txt) and alert you to errors. You can also create custom validators for specific checks, such as verifying the presence of measurement codes (GTM, GA) or checking the uniqueness of content.

4. Automated keyword analysis#

Scraping Camel allows you to create comprehensive keyword analyses from website content, which update themselves with every change to pages.

5. Advanced crawl management#

In Scraping Camel, you have full control over how and what is downloaded from your website.

  • URL filtering — You can exclude certain parts of a website (e.g. the blog) and scrape only selected sections (e.g. categories), saving server performance and making the output feed cleaner.
  • Request throttling — You set the frequency and speed of downloads to prevent overloading the website or the scraper being blocked by Anti-DDoS protection.
  • Update scheduling — You choose how often Scraping Camel should check for changes on already-downloaded pages (once, daily, every three days, weekly, monthly).

6. Universal outputs and API#

All collected data is available in a universal CSV file, which can be uploaded to Mergado Editor as a standalone feed or connected to an existing project using the Data file import rule. Data is also accessible via API, enabling its use in AI tools such as Mergado Marketing Buddy.

Real-world use cases#

  • Automating PPC campaigns (DSA and PMax) — Scraping Camel creates ideal inputs for Dynamic Search Ads (DSA) and Performance Max campaigns. It provides lists of URLs with their content (known as page feeds) and “reverse exports” with the most relevant keywords for each page.
  • Creating feeds for sites without a shopping cart — Enables catalogues, magazines, or property portals to participate in feed marketing, even though they normally have no product feeds. Scraping Camel creates a data file directly from their HTML code.
  • Supplementing missing data in existing feeds — If your e-commerce platform does not export important parameters to the feed (e.g. colour, material, detailed stock levels, or product codes), Scraping Camel pulls them directly from the website, and you can then add them to the feed in Mergado Editor.
  • Preparing category feeds — Helps PPC specialists create specific campaigns targeting website categories, for which online stores do not normally generate feeds.
  • Continuous on-page SEO audit — Instead of one-off checks, Scraping Camel automatically and continuously monitors the state of titles, H1 headings, and meta descriptions, for example. It alerts you to their absence, duplicates, or inappropriate length.
  • Data-driven SEO and business reports — Allows specialists to combine technical SEO parameters (e.g. keyword scores) with business information (price, margin, sales) for strategic decision-making.
  • Content quality control — Using AI, it identifies pages with too few words, automatically detects page language (e.g. untranslated products from a foreign supplier), and generates keywords describing the topic of a page.
  • Link and image audit — Through bulk elements, it identifies broken links and images (404 error), missing alternative texts (alt), or oversized image files that slow down a website.
  • Monitoring technical elements and measurement codes — Verifies the presence of Google Analytics or Google Tag Manager IDs on all sub-pages, to prevent gaps in data measurement.
  • Tracking changes and new pages — The DISCOVERED element enables you to filter pages that have appeared on the website in the last week or month, which is useful for monitoring the growth of large projects.

Why it is necessary to verify domain ownership#

Domain verification is an essential security step that you must complete before Scraping Camel starts downloading data from your website. The main reason is that this extension is intended exclusively for processing your own websites, or those of your clients or partners.

Why is this measure in place?#

  • Preventing competitor scraping — The extension is not intended to enable extracting third-party databases or monitoring competitor websites.
  • Legal and ethical considerations — Extracting data from other people’s websites without the owner’s consent may not be legal, and the website owner may not agree to such data mining. However, if they do agree to scraping (e.g. your wholesale supplier), they can verify the domain and thereby grant you access.
  • Privacy and data protection — Scraping Camel only processes data from domains for which the user has proved a relationship, ensuring that an authorised person from your team or agency is always working with the extension settings.
  • Access management — Thanks to the link to an online store in Mergado, user permissions can be easily managed, and if a working relationship ends, the settings and scraped data can be handed over to the client.

How to verify a domain#

In the new website creation wizard, you can choose from four methods, whichever suits you best:

  1. Google Search Console (GSC) — The simplest and fastest method, especially for marketing specialists and agencies. If you already have access to the domain in GSC, Scraping Camel will connect to Google via API and verify that you have permission to work with the domain.
  2. DNS TXT record — You insert a specific text string, generated by Scraping Camel, into your DNS settings with your domain provider. This method is permanent and independent of any changes to the website code, but you need to be careful about the correct subdomain configuration (e.g. www vs. without www). More details are available in the article How to Verify Domain Ownership in Scraping Camel via DNS.
  3. Meta tag — You insert a short code into the header of the website’s source code (between the <head> and </head> tags). This method is ideal for users of the Shoptet and WordPress platforms.
  4. HTML file — You upload a generated HTML file to your web server (to the root directory). Scraping Camel then verifies the existence of this file at your URL.

Working in Scraping Camel#

Activate the extension in Mergado Store: I want to activateselect the online store for which you want to enable the extension → Enable.

In the Scraping Camel extension interface, you will see two main sections in the menu:

  • Websites
  • Keywords

Websites#

This section is the primary place for managing your projects (websites). You will see a list of all created websites, where each item in the list corresponds to data from one specific domain. The number of domains you can process in a single instance (one activation) of the extension is not limited. For agencies, however, we recommend using the extension separately — always one Scraping Camel instance per client.

At the top of this section, you will find the New website button, which launches the wizard for adding a new website. All settings you make in the wizard can subsequently be edited in the individual tabs within the specific created project.

For efficient management — especially if you manage dozens of domains — you can use filtering, for example by creation date (useful for monitoring changes in the last week) or by page processing status. Filters can also be used on individual tabs within a specific project.

Clicking on a specific project (website) opens a clear interface with several tabs — Overview, File Exports, Pages, Elements, Bulk Elements, Validation, Settings.

1. Overview#

This page serves as the project’s home screen with the most important information. Here you will see output feed URLs (CSV exports) and the option to download them.

You can also monitor the download status here. Since Scraping Camel crawls websites carefully to avoid overloading them or being blocked by Anti-DDoS protection, downloading thousands of pages takes some time depending on your throttling settings.

2. File Exports#

Here you manage the output data that the extension retrieved from the website. You can create any number of exports (by clicking the Create new export file button) with different names for different purposes (e.g. a feed for DSA campaigns or an SEO analysis). In the export settings, you choose yourself which elements the resulting CSV should contain and define their exact order.

3. Pages#

A list of all specific URLs that Scraping Camel has found on your website. Clicking on a specific URL shows you an overview of all elements with their specific values that were successfully downloaded from that page.

4. Elements#

On this page you will find the list of elements you defined in the wizard when creating a new website. These are the elements that Scraping Camel is set to look for on the website. Clicking on each individual element displays the values of that element for individual pages.

Clicking Edit elements allows you to modify them. A list of elements will appear on the left and on the right you will see a preview of the HTML code of the page. You determine whether elements will be downloaded or not by checking them in the list on the left.

Elements are divided into categories here:

  • Found elements — System elements that Scraping Camel was able to find in the HTML code.
  • Not found elements — System elements that Scraping Camel was unable to find in the HTML code.
  • AI elements — Elements generated by artificial intelligence. More information in the article What Is Scraping Camel AI?
  • Custom elements — In this section you create custom elements, whose values you define either using regular expressions or as “text before/after”.

5. Bulk Elements#

Special elements designed for downloading data that appears multiple times on a single page. Unlike standard elements, Scraping Camel does not stop at the first occurrence but stores all occurrences on the given page. These elements are:

  • IMAGES (Images) — Contains information about the image URL, its alternative text, file size, and type (e.g. image, jpg). 💡 In the overview on the Bulk Elements page, image sizes are displayed in kilobytes (KB), but in the validation rule settings, bytes (B) are used.
  • LINKS (Backlinks) — Contains information about the target URL, the link text, and the HTTP status code of the target address.

Extracting this data is resource-intensive, so their download must be triggered manually using the Run extraction button.

System and custom validators can be run on bulk elements. This makes it easy to get lists of broken images (404 error), images without alt text, or oversized files that slow down page loading.

6. Validation#

This page serves as a continuous and comprehensive SEO audit of your website, running directly on the server. Unlike classic desktop tools that you need to run manually, Scraping Camel checks the state of the website automatically in line with your data update schedule. Since it operates online, multiple team members can access the current validation results simultaneously.

Here you activate up to 35 system validators covering key SEO areas (e.g. missing H1, overly long titles, missing meta description, broken pages, etc.). These validators can be enabled/disabled and adjusted according to your needs (the Edit default validators button).

You can also create custom validators for specific needs (the New custom validator button). These make it easy to monitor the presence of measurement codes (GTM, GA), verify content uniqueness, or check specific parameters such as article author or category.

Validation results are displayed in a clear table, sorted by severity and marked with colour icons. Clicking on a specific validator shows a list of all URLs affected by that error/warning/notice.

  • 🔴 Red icon (Critical error): Critical issues that require immediate attention (e.g. broken URL 404 or missing TITLE).
  • 🟠 Orange icon (Warning): Errors that should be fixed but the website remains functional (e.g. overly long Meta Description).
  • 🔘 Grey icon (Notice/Information): Less severe states or recommendations for improvement.
  • 🟢 Green icon (OK): Displayed for validators where none of the checked items meets the error condition.

For data validation to be successful, the elements that the validator is to check must be activated (checked) on the Elements tab (e.g. if you want to validate H1, the H1 element must be active for downloading). The speed of validation result updates depends on the page download frequency set on the Settings tab.

7. Settings#

On the individual tabs here, you can adjust the technical scraping parameters.

a. Page settings

Change the project name or domain.

b. Request throttling

You can change the frequency and speed at which Scraping Camel accesses your website. These parameters are key to ensuring the website is processed in a reasonable time without being overloaded or the scraper being blocked by Anti-DDoS protection.

  • Page download frequency — How often Scraping Camel should check for changes on pages it has already successfully downloaded. While new pages (appearing in the sitemap for the first time) are downloaded as soon as possible (typically daily), for known pages you can choose from intervals — once only (and never update again), daily, every three days, weekly, or monthly.
  • Number of page downloads per interval — Defines the batch size, i.e. how many pages should be processed at once within a given time period. If you enter, for example, the value 5, the extension will send a request to download 5 URLs in each defined time period.
  • Page download interval — Determines the time delay between individual download batches. If you set the interval to, for example, 500 ms, the extension will attempt to download the set number of pages (defined in the Number of page downloads per interval field) every half second.

c. AI settings

Management of parameters for keyword generation and stop word lists. Here you can define, for example, the minimum keyword length, the score threshold (the minimum percentage relevance a word must have to be included in the keyword selection), rules for processing numbers, define stop words (terms the AI should completely ignore when generating keywords), or define phrases (multi-word expressions the algorithm should process together as a single keyword). More information in the article What Is Scraping Camel AI?.

d. Page processing rules

Using rules, you determine which pages should be downloaded (everything else will be ignored) or conversely excluded from downloading (everything except them will be downloaded). You can, for example, download only categories and exclude blog pages. This significantly saves your server performance, shortens the data update time, and cleans unnecessary data out of the output CSV feed.

You can define conditions for URL selection in two ways:

  • By URL string — Enter part of an address (e.g. /blog/ or /product/).
  • By regular expression — For more advanced and precise filtering (e.g. for URLs ending with a specific number).

Pages that do not match your rules will still appear in the list on the Pages tab (to provide a complete overview of sitemap content), but will be marked with a red symbol. These pages will not be scraped or exported to output files.

Keywords#

This section is used for automatically generating and managing data feeds with keywords found directly on your websites. Unlike the Websites section, where the primary key is the URL, in this module the starting point is the specific keyword, to which relevant pages and metrics are assigned.

On this page you will see a list of all created keyword analyses. Create a new analysis using the New keyword analysis button, where you select the domain (one or more) from which data should be drawn.

Clicking on a specific analysis opens a clear interface with several tabs — Overview, File Exports, Labels, Variants, Diagnostics, Validation, and Settings.

1. Overview#

Here you will find a list of all found keywords. For each keyword in the table you will see:

  • its text (element KEYWORD),
  • the URL of the landing page that has the highest measured relevance, i.e. the highest score, for the given word (element URL),
  • a numeric value expressing the highest relevance score for the landing page for that word (element TOP_SCORE),
  • the number of all pages on the website where this word reached the top keywords, i.e. exceeded the set significance threshold (element PAGES_COUNT),
  • search volume and CPC (cost per click) data, if available for the given word,
  • status, i.e. whether the keyword is active or not,
  • whether labels are assigned to it,
  • the date and time of first occurrence.

From the list, you can click directly through to Google SERPs to verify the actual search results for a given term. Individual words can be activated or deactivated in the list, which will then be reflected in the export. Clicking on a keyword displays a list of all pages relevant to that keyword.

2. File Exports#

In this section you can manage and export CSV files for keyword analysis. For the export you select the elements to be included in the file and choose their order. Two types of export are available:

  • Classic export (the Create classic export file button) — A table where each keyword has one row with the selected supplementary information.
  • Reverse export (the Create reverse export file button) — A special format that “flips” the data. The primary key is the URL, to which all relevant keywords are assigned in the next column. This format is ideal for creating DSA or Performance Max campaigns in Google Ads.

3. Labels#

Used for organising keywords into thematic groups. For online stores, they are most commonly used to divide words by product category (e.g. the label “refrigerators”, “washing machines”, etc.). Labels can be assigned to words in bulk or individually in the keyword detail.

Clicking on a label displays a list of keywords assigned to it.

4. Variants#

This tab is used for manually unifying different forms of the same word. While Scraping Camel automatically tries to recognise that words such as “washing machine”, “washing machines”, or “washing machine’s” belong together, it occasionally misses a more complex form or inflection.

In this section you can manually guide the system and merge these expressions under one main word. For example, you can specify that the word “agencies” should be counted as a variant of the word “agency”. The result is a much cleaner and clearer analysis where data is not fragmented across many similar rows.

5. Diagnostics#

Diagnostics helps you identify whether the texts on your pages are actually talking about what you sell, or whether they are cluttered with irrelevant words that distort the analysis results.

Scraping Camel takes all found keywords (including multi-word phrases), breaks them down into individual words, and counts how many times each one appears across the entire website. This immediately shows you which words dominate your website.

If you find that the most common word on your online store is, for example, “VAT”, “cookies”, or “in stock”, that is an important signal. It means these technical or generic words are “drowning out” the important keywords that truly describe your products in the analysis. Once you identify such terms in diagnostics, you can add them to the stop words list on the Validation tab. This cleans up the analysis, leaving only the words that have real marketing value.

6. Validation#

The Validation tab is used for automatic cleaning, organising, and improving the quality of your entire keyword dataset. Here you can set validation rules (validators) to ensure that the resulting keyword export is maximally relevant to your marketing and does not contain unnecessary data. Validators clean the dataset continuously and automatically in the background, so it stays high-quality without constant manual intervention.

Using validation rules you can, for example, set:

  • Blocking words (stop words) — If you identify words in the analysis that are not useful to you (e.g. generic terms like “VAT”, “in stock”, “cart”, or your own brand that drowns out unique keywords), you can set rules here for their permanent exclusion from the analysis.
  • Merging into phrases — You can define rules that merge two or more words into a single unit (phrase), e.g. “Bidding Fox” or “Google Analytics”. Scraping Camel then treats these phrases as a single keyword, which increases the accuracy of scoring and the relevance of results.
  • Automatic labelling — Using validators you can also automatically assign labels to keywords based on defined conditions. This is essential for so-called clustering — grouping words into logical units, for example by product group (refrigerators, washing machines, mobile phones).

7. Settings#

In Settings you adjust the basic parameters of the analysis, such as its name or list of ignored characters. This feature is key for removing unwanted elements from the texts from which keywords are generated. It allows you to define a list of specific symbols and characters that Scraping Camel should completely ignore during analysis. This ensures greater data cleanliness and prevents the creation of irrelevant keywords.

All parameters listed here are set in the wizard when creating a new analysis. At that point you also choose one or more websites from which data should be drawn. This setting is permanent and cannot be changed later on the Settings tab. If you need to analyse a different domain, you must create a completely new analysis.

Privacy and data handling#

Scraping Camel processes exclusively data from publicly accessible websites for which the user has verifiably confirmed ownership or a relationship to the domain. All data obtained is considered private, is not provided to any third parties, and the user has full control over its scope directly in the extension admin. If a project is deleted, all related data is permanently removed within 14 days, including from backup systems that serve solely for restoring the extension in the event of technical failures.

FAQ#

What is Scraping Camel and what does it do?#

Scraping Camel is an extension that automatically and continuously crawls websites, downloads data from them, and stores it in a CSV feed or provides it via API. It runs continuously on a server, so data is constantly updated without needing to manually run software on your computer. It is useful wherever you need to collect data that is not available in standard product feeds.

Who is Scraping Camel for?#

It is primarily used by PPC and SEO specialists, online store owners, managers of sites without a shopping cart (catalogues, magazines, corporate websites), and content managers. The tool is versatile — it helps with automated campaign creation, SEO audits, keyword analysis, and content quality checks.

How much does Scraping Camel cost?#

The price is 986 CZK per month with monthly billing, or 788.80 CZK per month with annual billing. The price is fixed for one online store regardless of the number of projects (websites) or keyword analyses created. A 30-day free trial is available.

Do I need a paid project in Mergado for Scraping Camel to work?#

No. Scraping Camel can be used without a paid project in Mergado. You just need to create a free Mergado account and an online store in it for login, billing, and access management.

What are the requirements for getting started with Scraping Camel?#

You need three things — a Mergado account with an online store, verified domain ownership for the website you want to scrape, and a working sitemap.xml file from which the extension draws the list of URLs.

Why do I need to verify domain ownership?#

Scraping Camel is intended exclusively for processing your own websites or those of your clients. Domain verification prevents misuse of the tool for scraping third-party or competitor websites and ensures that only an authorised person works with the data.

How can I verify a domain?#

Four methods are available: via Google Search Console (the fastest option for agencies), via a DNS TXT record (a permanent method independent of website code), by inserting a meta tag into the website header (suitable for Shoptet and WordPress), or by uploading an HTML file to the server.

Does Scraping Camel work on JavaScript-based websites?#

No. The extension works only with the HTML code of pages and does not render JavaScript. If any page content depends on JavaScript, Scraping Camel cannot extract it.

What data can be retrieved using Scraping Camel?#

Virtually anything contained in the HTML code of a page. Data is then stored either in predefined system elements (Title, H1, Meta Description, HTTP status, GTM/GA codes, etc.), custom elements defined by a regular expression or “text before/after”, or automatically generated AI elements (keywords, page language, word count, page title).

What are bulk elements and how do they differ from standard ones?#

A standard element stores the first occurrence of a given value on a page. Bulk elements store all occurrences. Specifically, they store a complete list of all images (including alt texts and file sizes) or all links (including HTTP status codes) on a given page. Downloading information into these elements is triggered manually because it is more resource-intensive.

How does SEO validation work in Scraping Camel?#

Scraping Camel includes 35 system validators that continuously monitor the technical state of the website and alert to errors such as missing H1, overly long titles, or broken pages. Results are colour-coded by severity and accessible online. Alongside system validators, you can also create custom ones.

How does Scraping Camel help with PPC campaigns?#

It creates page feeds with URLs and their content, which are ideal for DSA and Performance Max campaigns. Reverse keyword exports are also available — CSV files where the primary key is the URL and all relevant keywords are assigned to it.

Can I scrape just part of a website, such as only categories?#

Yes. In the page processing rules settings, you specify which parts of the website should be scraped and which should be ignored — for example, you exclude the blog and process only categories. Conditions can be defined by URL string or regular expression.

How quickly does Scraping Camel update data?#

New pages appearing in the sitemap for the first time are downloaded typically every day. For already known pages, you choose the frequency yourself — once, daily, every three days, weekly, or monthly. Speed can be further influenced by setting the batch download count and the interval between batches.

In what format is the output data available?#

Data is exported as a CSV file, which can be uploaded to Mergado Editor as a standalone feed or connected to an existing project using the Data file import rule. Data is also accessible via API, for example for the Mergado Marketing Buddy extension.

Can I scrape multiple websites in a single instance of the extension?#

Yes, the number of domains within a single instance is not limited. For agencies managing websites for multiple clients, however, it is recommended to run a separate instance of the extension for each client — this makes access management and data handover easier when a working relationship ends.

What happens to my data if I delete a project?#

After deleting a project, all related data is permanently removed within 14 days, including from backup systems. Scraping Camel processes exclusively data from publicly accessible pages with verified ownership and does not provide it to any third parties.

How does keyword analysis work in Scraping Camel?#

In the Keywords section, you create an analysis for one or more domains. Scraping Camel automatically generates keywords from page content, assigns them relevance scores, and continuously updates them. Data can be organised using labels, different word forms can be unified via variants, cleaned using stop words, and exported as a classic or reverse CSV file.

How do I know my keyword analysis contains irrelevant words?#

The Diagnostics tab is for this purpose. Scraping Camel displays there which words appear most frequently on your website. If generic terms such as “VAT” or “cookies” dominate among them, it is a signal that these words are drowning out the relevant keywords and should be added to the stop words list.

Was this article helpful?