- MySitemapGenerator
- Help Desk
- Creating Product Feeds
Creating Product Feeds
To ensure that MySitemapGenerator can import information from a product page, the following requirements must be met:
- The product page must be publicly accessible on the internet.
- The page should contain recognizable HTML source code.
- The page must include comprehensive details about the product being offered (the minimum required data includes the product name, price, and image link).
MySitemapGenerator prioritizes structured markup when importing product data - specifically, Schema.org Product markup and Open Graph Product metadata.
Fallback and additional data extraction mechanisms:
If structured data is absent, or if additional attributes are required that are not marked up, our crawler attempts to automatically detect product details using web page analysis.
We strongly recommend using product markup for accurate importing results. If your website does not use product markup and is not compatible with our on-page extraction algorithm, you can import products from a prepared CSV file instead.
If you would like to learn more about Schema.org markup, you can explore the following resources:
- https://schema.org/Product
- https://developers.google.com/structured-data/rich-snippets/products
- https://yandex.ru/support/webmaster/supported-schemas/goods-prices.xml
For more information on Open Graph:
You can verify the correctness of your Schema.org structured data using the Schema Markup Validator.
To improve crawling speed, try limiting the crawler’s access to informational pages such as articles, reviews, and similar content. This prevents the crawler from spending time processing unnecessary pages and can significantly speed up the overall process, especially for websites with a large number of pages.
You can also use permanent filters or configure specific restrictions in your robots.txt file to exclude these informational pages.
The number of crawled pages refers to the total number of pages on a website that our crawler has analyzed. This includes all processed pages, such as the homepage, pagination pages, and potentially additional sections depending on the website structure, such as blog posts, news articles, reviews, and other informational pages (e.g., contact details, shipping information, etc.).
Can’t find the answers you’re looking for? We’re here to help.
Contact support