Creating RSS Feeds
- What are the requirements for the specified tracking page?
- What social networks are supported by MySitemapGenerator?
- How does automatic feed creation work?
- What types of structured data are retrieved by MySitemapGenerator?
- How is full-text RSS generated for export to Google Publisher Center?
- How is RSS generated for export to Yandex News?
- How do you let visitors know you have an RSS feed?
What are the requirements for the specified tracking page?
The tracking page you specify must be publicly accessible, contain recognizable HTML source code, and contain links to internal publication pages. If a URL Search Pattern is specified, the page must contain links that match it.
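As a rough way to check the last requirement yourself, the sketch below lists the internal links found in a page's HTML. It is only an illustration: the names LinkCollector and internal_links are hypothetical, and treating the URL Search Pattern as a regular expression is an assumption, not a description of how the service interprets it.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def internal_links(html, page_url, pattern=None):
    """Return absolute same-host links, optionally filtered by a regex.

    Assumption: `pattern` is a regular expression; the real URL Search
    Pattern syntax may differ.
    """
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    found = []
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)
        if urlparse(absolute).netloc != host:
            continue  # external link, not a page of this site
        if pattern and not re.search(pattern, absolute):
            continue  # does not match the requested pattern
        if absolute not in found:
            found.append(absolute)
    return found


page = """
<a href="/news/first-post">First</a>
<a href="/about">About</a>
<a href="https://other.example/news/x">Elsewhere</a>
"""
print(internal_links(page, "https://example.com/news/", pattern=r"/news/"))
# → ['https://example.com/news/first-post']
```

An empty result for your own tracking page would suggest that it has no links the crawler could follow to publications.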
How does automatic feed creation work?
The feed is generated from content retrieved from the website. Data sources are Schema.org markup (types Article, NewsArticle, BlogPosting, and LiveBlogPosting), Open Graph Article metadata, and standard HTML document tags.
When no structured data is detected on the website, MySitemapGenerator tries to determine the content of publications automatically using an AI-based extraction algorithm. For better results, we recommend using Schema.org markup on your website.
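One way to picture combining several data sources is a simple fallback chain, sketched below. The precedence order (Schema.org, then Open Graph, then plain HTML) and the function name pick_field are assumptions made for illustration; the actual order used by the service is not documented here.

```python
def pick_field(field, *sources):
    """Return the first non-empty value for `field` among the sources."""
    for source in sources:
        value = source.get(field)
        if value and value.strip():
            return value.strip()
    return None


# Hypothetical values already extracted from one page, keyed by field name.
schema_org = {"headline": "Article title", "description": ""}
open_graph = {"headline": "OG title", "description": "OG description"}
plain_html = {"headline": "Page title"}

title = pick_field("headline", schema_org, open_graph, plain_html)
summary = pick_field("description", schema_org, open_graph, plain_html)
body = pick_field("articleBody", schema_org, open_graph, plain_html)

print(title)    # → Article title
print(summary)  # → OG description
print(body)     # → None (no source supplied the article text)
```

A field that comes back as None is the kind of gap that content-based extraction would have to fill, which is why complete markup tends to give more predictable feeds.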
What types of structured data are retrieved by MySitemapGenerator?
Supported Schema.org types are Article, NewsArticle, BlogPosting, and LiveBlogPosting. The following properties are read:
- headline - Article title.
- description - Article short description.
- articleBody - Article text.
- image - The URL of an image resource.
- category - Article category.
- datePublished - Publication date.
Schema.org markup sample for a news article (RDFa syntax):
<div vocab="http://schema.org/" typeof="NewsArticle">
  ...
  <h1 property="headline">Article title</h1>
  <span property="description">Article short description</span>
  <img property="image" src="http://youwebsiteurl/pictures/thumbnail.jpg" />
  <span property="datePublished" content="1998-09-04T19:25:33+00:00">Today, 19:25</span>
  ...
  <div property="articleBody">
    <p>Article full contents</p>
    ...
  </div>
  ...
</div>
Schema.org markup sample for any article:
<div itemscope itemtype="http://schema.org/Article">
  <h1 itemprop="headline">Article title</h1>
  <div itemprop="description">Article short description</div>
  <div itemprop="articleBody">
    <p>Article full contents</p>
    ...
  </div>
</div>
Open Graph metadata (Supported type: Article):
- og:title - Article title.
- og:description - Article short description.
- og:image - The URL of an image resource.
- article:published_time - Publication date.
Open Graph markup example:
<meta property="og:type" content="article" />
<meta property="og:title" content="Article title" />
<meta property="og:description" content="Article short description" />
<meta property="og:image" content="http://youwebsiteurl/pictures/thumbnail.jpg" />
<meta property="article:published_time" content="1998-09-04T19:25:33+00:00" />
Plain HTML markup example:
...
<head>
  ...
  <title>Article title</title>
  <meta name="description" content="Article short description" />
  ...
</head>
<body>
  ...
  <article>
    <p>Article full contents</p>
    ...
  </article>
  ...
</body>
...
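To see which of the properties above a parser can find in your own pages, a small collector like the following may help. It is a simplified sketch and not the crawler's implementation: the class name PropertyCollector is hypothetical, it treats the itemprop and property attributes alike, keeps only the first value of each property, and assumes reasonably well-formed HTML.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag.
VOID_TAGS = {"img", "meta", "br", "hr", "link", "input"}


class PropertyCollector(HTMLParser):
    """Collects itemprop/property values (microdata, RDFa, Open Graph)."""

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._capture = []  # stack of (name, depth, text parts)
        self._depth = 0

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        name = a.get("itemprop") or a.get("property")
        if tag in VOID_TAGS:
            # <meta content="..."> or <img src="..."> carry the value inline.
            value = a.get("content") or a.get("src")
            if name and value:
                self.fields.setdefault(name, value)
            return
        self._depth += 1
        if name:
            if a.get("content"):
                # A content attribute overrides the visible text.
                self.fields.setdefault(name, a["content"])
            else:
                self._capture.append((name, self._depth, []))

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self._capture and self._capture[-1][1] == self._depth:
            name, _, parts = self._capture.pop()
            text = " ".join("".join(parts).split())  # collapse whitespace
            self.fields.setdefault(name, text)
        self._depth -= 1

    def handle_data(self, data):
        for _, _, parts in self._capture:
            parts.append(data)


sample = """
<meta property="og:title" content="Article title" />
<div itemscope itemtype="http://schema.org/Article">
  <h1 itemprop="headline">Article title</h1>
  <img itemprop="image" src="http://example.com/thumbnail.jpg" />
  <span itemprop="datePublished" content="1998-09-04T19:25:33+00:00">Today, 19:25</span>
  <div itemprop="articleBody"><p>Article full contents</p></div>
</div>
"""
collector = PropertyCollector()
collector.feed(sample)
for name, value in collector.fields.items():
    print(name, "=", value)
```

Note that datePublished resolves to the machine-readable content attribute rather than the visible text "Today, 19:25", which is why the samples above supply the date that way.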
How is full-text RSS generated for export to Google Publisher Center?
In addition to standard RSS 2.0 elements, export to Google Publisher Center requires the special element "content:encoded", which contains the full contents of the publication:
<content:encoded>Full contents of the publication</content:encoded>
To include the full content of publications, the feed must use the additional namespace:
xmlns:content="http://purl.org/rss/1.0/modules/content/"
Our crawler extracts the full text of publications using an AI algorithm, and also from pages marked up with Schema.org (supported types: Article, NewsArticle, BlogPosting, LiveBlogPosting).
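For readers who want to see how the element and the namespace declaration fit together, here is a minimal sketch that builds such a feed with Python's standard library. The function name build_feed and the sample values are illustrative assumptions; the feed produced by the service may contain additional elements.

```python
import xml.etree.ElementTree as ET

CONTENT_NS = "http://purl.org/rss/1.0/modules/content/"
# Make the serializer use the conventional "content" prefix.
ET.register_namespace("content", CONTENT_NS)


def build_feed(channel_title, channel_link, items):
    """Build an RSS 2.0 document whose items carry content:encoded."""
    rss = ET.Element("rss", {"version": "2.0"})
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = channel_title
    ET.SubElement(channel, "link").text = channel_link
    ET.SubElement(channel, "description").text = channel_title
    for entry in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = entry["title"]
        ET.SubElement(item, "link").text = entry["link"]
        ET.SubElement(item, "description").text = entry["description"]
        # Full text goes into the namespaced element; markup is escaped.
        encoded = ET.SubElement(item, f"{{{CONTENT_NS}}}encoded")
        encoded.text = entry["body_html"]
    return ET.tostring(rss, encoding="unicode")


feed_xml = build_feed(
    "Example feed",
    "https://example.com/",
    [{
        "title": "Article title",
        "link": "https://example.com/news/first-post",
        "description": "Article short description",
        "body_html": "<p>Full contents of the publication</p>",
    }],
)
print(feed_xml)
```

The serializer places the xmlns:content declaration on the root rss element and escapes the HTML inside content:encoded, so the output stays well-formed XML.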
How is RSS generated for export to Yandex News?
Along with standard RSS 2.0 elements, export to Yandex News requires the special element "yandex:full-text", which must contain the full contents of the publication:
<yandex:full-text>Full contents of the publication</yandex:full-text>
To include the full content of publications, the feed must use the additional namespace:
xmlns:yandex="http://news.yandex.ru"
The full text should not contain the name of the source, the date and time of publication, contact information, or any other information that is repeated in every publication.
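If your templates add the same lines to every article, one possible way to spot them before export is to compare several publications and drop the paragraphs they all share. The sketch below is a heuristic under that assumption, with a hypothetical function name; it is not how the service cleans text, and it will not catch publication dates, which differ from article to article.

```python
def strip_repeated(articles):
    """Drop paragraphs that occur in every article of the batch.

    `articles` is a list of articles, each a list of paragraph strings.
    """
    if len(articles) < 2:
        # Nothing to compare against, so leave the text untouched.
        return [list(paragraphs) for paragraphs in articles]
    common = set(articles[0])
    for paragraphs in articles[1:]:
        common &= set(paragraphs)
    return [[p for p in paragraphs if p not in common]
            for paragraphs in articles]


batch = [
    ["Example News", "First story text.", "Contact: editor@example.com"],
    ["Example News", "Second story text.", "Contact: editor@example.com"],
]
cleaned = strip_repeated(batch)
print(cleaned)
# → [['First story text.'], ['Second story text.']]
```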
How do you let visitors know you have an RSS feed?
The easiest way is to create a link to your RSS feed and place it on your website. By clicking on it, visitors will be able to subscribe to updates through their RSS reader.
<a href="URL**">Subscribe via RSS</a>
Another way is to add the following link tag to the <head> section of the HTML page:
<head>
  …
  <link rel="alternate" type="application/rss+xml" title="Feed Title*" href="URL**">
  …
</head>
* "Feed Title" - RSS feed title,
** "URL" - Destination feed URL.
When a page contains this tag, some browsers show a special icon in the right corner of the address bar that links to your RSS feed.
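Feed readers and browsers find the feed by scanning the page for a link element with rel="alternate" and type="application/rss+xml". The sketch below mimics that lookup so you can confirm your page advertises the feed correctly; the class name FeedLinkFinder and the sample URLs are illustrative assumptions.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FeedLinkFinder(HTMLParser):
    """Finds RSS feeds advertised through <link rel="alternate">."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.feeds = []  # list of (title, absolute URL)

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        a = dict(attrs)
        rel = (a.get("rel") or "").lower().split()
        mime = (a.get("type") or "").lower()
        href = a.get("href")
        if "alternate" in rel and mime == "application/rss+xml" and href:
            # Resolve relative hrefs against the page address.
            self.feeds.append((a.get("title") or "",
                               urljoin(self.base_url, href)))


html = """
<head>
  <link rel="stylesheet" href="/site.css">
  <link rel="alternate" type="application/rss+xml" title="Example feed" href="/feed.xml">
</head>
"""
finder = FeedLinkFinder("https://example.com/blog/")
finder.feed(html)
print(finder.feeds)
# → [('Example feed', 'https://example.com/feed.xml')]
```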
What social networks are supported by MySitemapGenerator?
MySitemapGenerator supports YouTube, VK, Reddit, and Pinterest. Please note that only public content is available for crawling and importing.