100% free · No login · No AI · Crawls your public pages

Free llms.txt Generator

Create an AI-optimized llms.txt file from your website. Paste a sitemap URL, enter your domain, or list specific pages — no AI, no credits, no login.

✨ Free generator

Build your llms.txt file

Choose one input method below. We crawl your public pages and assemble a valid llms.txt from titles and meta descriptions.

Sitemap URL

Best for large sites: point to your sitemap.xml and we'll crawl all listed pages.

Website URL

We'll scan your homepage and follow internal links automatically.

Paste specific URLs

One URL per line, useful when you only want specific pages included.

Preview

Generated Content Preview

Your llms.txt preview will appear here after you click Generate.
Guide

What is an llms.txt file?

An llms.txt file is a plain-text Markdown index at your site root (yoursite.com/llms.txt) that lists your most important pages with short descriptions for AI systems and developers.

The format was proposed by Jeremy Howard in 2024 and is documented at llmstxt.org. It is a community convention — not an IETF or W3C standard — but documentation sites, SaaS products, and blogs increasingly publish one.

Think of it as a curated reading list for language models: an H1 site title, a blockquote summary, and H2 sections of markdown links with one-line notes. It complements sitemap.xml and robots.txt rather than replacing them.

Format

llms.txt file format reference

A valid llms.txt follows a simple, deterministic Markdown layout defined at llmstxt.org:

  1. H1 — site or project name (required; exactly one per file)
  2. Blockquote — one- or two-sentence summary of the site (recommended)
  3. Free-form body — optional paragraphs or lists before the first H2 (no extra headings)
  4. H2 sections — groups of related links (e.g. Documentation, API, Blog)
  5. Link items — - [Page title](https://absolute-url): short description per line
  6. Optional section — an H2 titled Optional for secondary links LLMs may skip in short-context mode

Absolute URLs are strongly recommended. Serve the file as text/plain or text/markdown at a well-known path. Most sites only need the index file — not a separate llms-full.txt unless you want full page text inlined.

Compare

llms.txt vs robots.txt vs sitemap.xml

These three root-level files serve different audiences. Publishing all three is common; they do not conflict.

File
Purpose
Primary audience
llms.txt
Curated Markdown index of key pages with descriptions for AI context
LLM crawlers, AI coding assistants, developers
robots.txt
Crawl rules — allow or disallow paths for web crawlers
Search-engine bots and compliant crawlers
sitemap.xml
Machine-readable list of URLs with optional lastmod and priority
Search engines (Google, Bing, etc.)
Expectations

What llms.txt does — and what it doesn't

Set realistic expectations before you ship. llms.txt is low-cost to publish but adoption by major AI products is still evolving.

What it can do

  • Give AI coding assistants (Cursor, Copilot, etc.) a quick map of your docs and key pages
  • Help developers and technical writers discover your information architecture
  • Complement structured data and sitemaps with human-readable, LLM-friendly context
  • Make your best content easy to find without dumping every URL into one file

What it does not do

  • Boost Google search rankings — Google has stated it is not a ranking factor
  • Guarantee that ChatGPT, Claude, Gemini, or Perplexity will read or weight your file
  • Block or allow AI crawlers (use robots.txt and provider opt-out tools for that)
  • Replace a sitemap, proper meta tags, or high-quality on-page content
How-to

How to use this generator

Enter your sitemap URL for blogs and large sites, your homepage URL for smaller sites, or paste specific URLs. Review the preview, download llms.txt, and upload it to your domain root.

  1. Pick an input method — sitemap is fastest for sites with many posts
  2. Click Generate llms.txt and wait while we fetch page titles and descriptions
  3. Review the Generated Content Preview and edit locally if needed
  4. Download the file and upload it so it is live at https://yourdomain.com/llms.txt
  5. Re-run the generator when you add major new sections or restructure your site
FAQ

Frequently asked questions

Straight answers for common llms.txt questions — useful for SEO long-tail queries and AI-citable Q&A.

What is an llms.txt file?

An llms.txt file is a Markdown text file at your domain root (/llms.txt) that lists your site's most important pages with short descriptions for AI systems. It gives language models a curated map of your content without requiring them to crawl your entire site. The convention is documented at llmstxt.org.

Is this llms.txt generator free?

Yes — this generator is 100% free with no signup, no credits, and no login required. We crawl your public pages and build the file from HTML titles and meta descriptions without using AI. You can copy or download the result instantly.

Where do I put the llms.txt file?

Upload it to the root of your domain so it is reachable at https://yourdomain.com/llms.txt. Some documentation sites use a subpath like /docs/llms.txt, but the root is the most common location crawlers check first.

What's the difference between llms.txt and llms-full.txt?

llms.txt is a curated link index with short descriptions — what most sites need. llms-full.txt inlines the full text of those pages for deeper context and can be very large. Start with llms.txt; add llms-full.txt only if you have a specific use case that needs full page content.

Does llms.txt help SEO or Google rankings?

No — llms.txt is not a Google ranking factor, and Google has said it does not use llms.txt for search indexing. Its real value is in AI tooling, developer discovery, and giving coding assistants a structured overview of your site. Keep investing in sitemaps, meta tags, and content quality for SEO.

Do ChatGPT, Claude, Gemini, and Perplexity actually read llms.txt?

No major AI provider has officially committed to using llms.txt in production inference. Adoption is uncertain, though some crawlers occasionally fetch /llms.txt. The strongest confirmed use today is AI coding assistants and developer tooling that consume the file when pointed at your repo or docs.

Is llms.txt the same as robots.txt? Does it block AI crawlers?

No — llms.txt is an inclusion and curation file, not a blocking mechanism. It does not tell crawlers what they may or may not access. To restrict AI crawlers like GPTBot or ClaudeBot, use robots.txt rules and each provider's opt-out documentation.

How do I check if AI crawlers are fetching my llms.txt?

Filter your server access logs for requests to /llms.txt and inspect the User-Agent header. Look for identifiers such as GPTBot, ClaudeBot, PerplexityBot, and Google-Extended. A single fetch does not mean ongoing use — track patterns over weeks.

How big should the file be?

Curate your best pages rather than listing every URL on your site. Aim well under typical model context limits — roughly 150K words or ~700KB as a practical ceiling — and split into llms-full.txt or section-specific files if you need more. Quality and focus beat exhaustive dumps.

Should I create a Markdown copy of every page?

Generally no — publishing indexable .md mirrors of every HTML page can create duplicate-content issues if search engines crawl them. llms.txt links to your canonical URLs with descriptions; that is usually enough. Use llms-full.txt only when you deliberately want inlined full text for LLM context.

How often should I update llms.txt?

Update when you add, remove, or significantly change key pages — new product areas, docs sections, or landing pages. For active sites, regenerating from your sitemap monthly or on each deploy (via CI) keeps the file current without manual edits.

Does this tool use AI or Gemini?

No. The generator crawls your public HTML and extracts page titles, meta descriptions, and URL structure programmatically. Nothing is sent to Gemini or any other AI model to write your file.

Sitemap URL vs website URL — which should I use?

Use sitemap URL for blogs and large sites — it discovers every listed page quickly. Use website URL for smaller sites where we start at the homepage and follow internal links. Paste specific URLs when you only want a hand-picked subset.

Why can't my site be crawled?

Some sites block automated requests through bot protection, Cloudflare, or WAF rules. Try pasting specific URLs manually, or use your sitemap URL if it is publicly accessible without authentication.