Image courtesy of Patrick Hochstenbach.
A Signmap showing an entry for a single scholarly object managed by a repository. It shows the URL of the
object's landing page in the <loc> element and several Signposting links
associated with the landing page in consecutive <rs:ln> elements. As per Signposting conventions,
the describedby link points at a metadata resource, the item link at a content resource,
and the cite-as link at the object's persistent identifier. Note that the first two links also provide information on the media
type of the linked resources, i.e. JSON and PDF, respectively. The first link additionally expresses the profile of the media type. |
<?xml version="1.0" encoding="UTF-8"?> <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:rs="http://www.openarchives.org/rs/terms/"> <url> <loc>https://example.com/res1</loc> <rs:ln rel="describedby" href="https://example.com/metadata/res1.json" type="application/ld+json" profile="https://w3id.org/ro/crate"/> <rs:ln rel="item" href="https://example.com/content/res1.pdf" type="application/pdf"/> <rs:ln rel="cite-as" href="https://doi.org/123.457643"/> </url> </urlset> |
robots.txt
, which is used by the Sitemaps Protocol to support discovery of
Sitemaps.<url>
element per scholarly object managed by the repository. The URL of the object's landing page
must be provided in the <loc>
element. Other elements may be provided as intended by the Sitemaps Protocol.robots.txt
to make
a Sitemap (or a Sitemap Index, if applicable) discoverable. The Signmap approach uses the Robots Exclusion
Protocol in the same way, with the following implementation guidelines for repositories:
robots.txt
file must be provided at the repository-entry-URL of the repository.
What the repository-entry-URL is depends on how/where the repository was installed.
For example, it could be <https://myuniversity.edu/repository/home>, or
<https://repository.myuniversity.edu/home>, or <https://repo.org/>.
Generally speaking it is the de-facto entry page to a repository.robots.txt
file by means of a Sitemap:
line,
e.g. by adding the line Sitemap: https://myuniversity.edu/sitemap.xml
. Other lines may be added to the robots.txt
file,
as described in the Robots Exclusion Protocol.<loc>
element.
The Signmaps approach uses this extensibility mechanism to provide typed links pertaining to the landing page URLs provided in a Sitemap's <loc>
elements. This is achieved by:
<urlset>
element of a Sitemap as xmlns:rs="http://www.openarchives.org/rs/terms/"
<rs:ln>
child element of the <loc>
element that contains the landing page's URL.<rs:ln>
element to provide link information:
rel
: conveys the link relation type;href
: conveys the URL (absolute, not relative URL) of the resource that is the target of the link;type
: conveys the media type of the resource that is the target of the link.profile
: conveys a profile of the media type by means of a Profile URI.describedby
to link to metadata resources and item
to link to content resources.