RSS Feeds FAQs

What are the requirements for the specified tracking page?
The specified tracking page must be publicly accessible, contain recognizable HTML source code, and contain links to internal publication pages (if the URL Search Pattern is specified – corresponding links).
What social networks are supported by Mysitemapgenerator?
Mysitemapgenerator supports Youtube, VK, Reddit, and Pinterest. Please note that only public content is available for crawling and importing.
How does automatic feed creation work?
The feed is generated based on the content received from the website. The data source can be Schema.org markup: Article, NewsArticle, BlogPosting, LiveBlogPosting and Open Graph Article metadata, as well as standard HTML document tags.
When no structured data is detected on the website, Mysitemapgenerator tries to automatically determine the content of publications using an AI-based extraction algorithm. But to achieve a better results, we recommend that you use Schema.org markup on your website.

How MySitemapGenerator finds and understands articles published on the website

How we find and understand articles published on your website

What types of structured data are retrieved by MySitemapGenerator
Supported Schema.org markup types: Article, NewsArticle, BlogPosting, LiveBlogPosting:
  • headline - Article title.
  • description - Article short description.
  • articleBody - Article text.
  • image - The URL of an image resource .
  • category - Article category.
  • datePublished - Date.

Schema.org markup sample for news article:
    <div vocab="http://schema.org/" typeof="NewsArticle">
        ...
        <h1 property="headline">Article title</h1>
        <span property="description">Article short description</span>
	<img itemprop="image" src="http://youwebsiteurl/pictures/thumbnail.jpg" />
	<span itemprop="datePublished" content="1998-09-04T19:25:33+00:00">Today, 19:25</span>
	...
        <div property="articleBody">
	    <p>Article full contents</p>
	    ...
	</div>
        ...
    </div>

Schema.org markup sample for any article:
<div itemscope itemtype="http://schema.org/Article">
  <h1 itemprop="headline">Article title</h1>
  <div itemprop="description">Article short description</div>
  <div itemprop="articleBody">
	    <p>Article full contents</p>
	    ...
  </div>
</div>

Open Graph metadata (Supported type: Article):
  • og:title — Article title.
  • og:description — Article short description.
  • og:image — The URL of an image resource.
  • article:published_time - Date.

Open Graph markup example:
    <meta property="og:type" content="article" /> 
    <meta property="og:title" content="Article title" /> 
    <meta property="og:description" content="Article short description" />
    <meta property="og:image" content="http://youwebsiteurl/pictures/thumbnail.jpg" />
    <meta property="article:published_time" content="1998-09-04T19:25:33+00:00" />

An example of pure HTML markup:
    ...
    <head>
	...
	<title>Article title</title> 
	<meta name="description" content="Article short description" />
	...
    </head>
    <body>
	...
	<article>
	    <p>Article full contents</p>
	    ...
	</article>
	...
    </body>
    ...
How is the formation of full-text RSS in the format for export to Google Publisher Center?
In addition to standard RSS 2.0 elements, the special element "content:encoded" is required to export data to Google Publisher Center, which contains the full contents of the publication:
    <content:encoded>Full contents of the publication</content:encoded>

To include the full content of publications, the feed must use the additional namespace:
    xmlns:content="http://purl.org/rss/1.0/modules/content/"
HTML-formatted text should be wrapped by CDATA section.
Our crawler extracts the full text of publications on web pages using an AI algorithm, as well as on pages marked up with Schema.org markup (supported types: Article, NewsArticle, BlogPosting, LiveBlogPosting).
How occurs RSS forming in the format of data export to Yandex News?
Along with standard RSS 2.0 elements, in order to export data into Yandex a special elemnt "yandex-fulltext" is needed, which should contain full contents of the publication:
    <yandex-fulltext>Full contents of the publication</yandex-fulltext>

To include the full content of publications, the feed must use the additional namespace:
    xmlns:yandex="http://news.yandex.ru"
HTML-formatted text should be wrapped by CDATA section.
Full text should not contain the name of the source and date/time of the publication, and contact information or any other information, which is repeated in every publication.
How do you let visitors know you have an RSS feed?
The easiest way is to create a link to your RSS feed and place it on your website. By clicking on it, visitors will be able to subscribe to updates through their RSS reader.
  <a href="URL**">Subscribe via RSS</a>

Another way is to add a following meta tag to the <head> section of the HTML-page:
<head>
  …
  <link rel="alternate" type="application/rss+xml" title="Feed Title*" href="URL**">
  …
</head>

* "Feed Title" - RSS feed title,
** "URL" - Destination feed URL.
When your site contains this meta tag, some browsers add a special icon, which is a link to your RSS feed, located in the right corner of the address bar.