Research Quick

ArXiv

Search and retrieve arXiv academic papers by topic, category, or paper ID — with AlphaXiv-enriched AI-generated overviews.

03
Workflows
00
References
10
Triggers
low
Effort

The Problem

Ask a generic AI to find you the latest research on a topic and you get a vague summary that might be months out of date, with no paper IDs, no actual abstracts, and a real chance the citations are hallucinated. Academic research moves fast. If you're trying to track what's happening in LLMs, AI safety, or security, you need the actual papers — real IDs, real dates, real authors — not a prose summary that sounds confident but can't be verified.

How This Skill Approaches It

Hit the arXiv Atom API directly, no API key needed, with category-specific queries across cs.AI, cs.LG, cs.CL, cs.CR, cs.MA, cs.SE, and cs.IR. Three workflows cover the main use cases: Latest pulls the most recently updated papers in a category sorted by lastUpdatedDate; Search runs fielded queries against title, abstract, and author fields with boolean operators; Paper fetches a specific paper by ID and enriches it with an AlphaXiv AI-generated overview when one exists. The AlphaXiv enrichment layer gives you a readable markdown summary of the paper without needing to parse the PDF — useful for quick triage before you commit to reading the full thing.

  • Uses arXiv Atom API across cs.AI/cs.LG/cs.CL/cs.CR/cs.MA/cs.SE/cs.IR
Not for general research (Research), URL parsing (_PARSER), or annual reports (_ANNUALREPORTS)

In Action

What you say to your DA, and what the ArXiv skill actually does.

  • You say "what are the newest papers on multi-agent systems this week"
    Runs the Latest workflow against cs.MA sorted by lastUpdatedDate, returns real paper IDs, titles, authors, and publish dates — no hallucinated citations.
  • You say "find arxiv papers about LLM prompt injection vulnerabilities"
    Runs the Search workflow with a fielded query across cs.CR and cs.AI abstracts, then enriches top results with AlphaXiv overviews where available.
  • You say "explain this paper to me — arxiv.org/abs/2401.12345"
    Runs the Paper workflow: fetches the Atom entry for the ID, pulls the AlphaXiv overview markdown if it exists, and returns a readable summary with the original abstract as a fallback.

Inside the Skill

The thinking, frameworks, and architecture that distinguish this skill from a generic version of the same task.

What It Does

Searches and retrieves arXiv academic papers by topic, category, or paper ID, and pulls AlphaXiv's AI-generated overviews when a paper has one. Covers the cs.AI / cs.LG / cs.CL / cs.CR / cs.MA / cs.SE / cs.IR categories. Three workflows: Latest, Search, Paper. No API keys needed.

The Problem

arXiv ships thousands of papers a day and its native search is clunky — Atom XML, three-second rate limits, fields you have to know by name, and a lastUpdatedDate that quietly resurfaces old papers as if they were new. Reading a raw paper to decide whether it's worth your time is slow. This skill wraps the query mechanics, handles the XML, and layers AlphaXiv overviews on top so you can triage a paper in seconds instead of reading the whole PDF first.

How It Works

Uses arXiv's Atom API for search and discovery, and AlphaXiv's markdown endpoint for enriched paper overviews. Search fields, boolean operators, sort order, and pagination are all handled for you; overviews are fetched per paper ID when available (a 404 just means no overview exists yet).

Quick Reference

arXiv API (no auth):

  • Base: https://export.arxiv.org/api/query
  • Search fields: ti: (title), au: (author), abs: (abstract), cat: (category), all: (everything)
  • Booleans: AND, OR, ANDNOT
  • Sort: sortBy=lastUpdatedDate&sortOrder=descending for latest
  • Pagination: start=0&max_results=10 (max 2000 per call)
  • Rate limit: 3s between calls

AlphaXiv enrichment (no auth):

  • Overview: curl -s "https://alphaxiv.org/overview/{PAPER_ID}.md"
  • Full text: curl -s "https://alphaxiv.org/abs/{PAPER_ID}.md" (fallback)
  • Not all papers have overviews — 404 means analysis not yet generated

Key categories for our work:

  • cs.AI — Artificial Intelligence
  • cs.LG — Machine Learning
  • cs.CL — Computation and Language (NLP/LLMs)
  • cs.CR — Cryptography and Security
  • cs.SE — Software Engineering
  • cs.MA — Multi-Agent Systems
  • cs.IR — Information Retrieval

Gotchas

  • arXiv API requires HTTPS and -L (follows redirects). HTTP 301s to HTTPS silently.
  • arXiv API returns Atom XML, not JSON. Parse with text processing, not jq.
  • lastUpdatedDate includes edits to old papers. For truly new submissions, check <published> dates.
  • AlphaXiv overviews are AI-generated summaries. Great for quick understanding, but verify claims against the actual paper for anything you'd cite.
  • arXiv API rate limit is 3 seconds between calls. Batch your queries.
  • max_results caps at 2000. For broader sweeps, paginate with start.
  • Category search (cat:cs.AI) returns papers with that as primary OR cross-listed category.

Workflows · 3

  1. 01
    `Workflows/Latest.md` Workflows/`Workflows/Latest.md`.md

    latest papers in X, new papers on X, whats new in AI research

  2. 02
    `Workflows/Search.md` Workflows/`Workflows/Search.md`.md

    search arxiv for X, find papers about X, arxiv papers on X

  3. 03
    `Workflows/Paper.md` Workflows/`Workflows/Paper.md`.md

    arxiv URL, paper ID like 2401.12345, explain this paper

How to Invoke

Say any of these to your DA and PAI activates the ArXiv skill automatically:

  • "arxiv"
  • "papers"
  • "latest papers"
  • "research papers"
  • "recent ML papers"
  • "paper lookup"
  • "summarize paper"
  • "latest LLM papers"
  • "AI safety papers"
  • "cs.AI latest"

Or invoke explicitly:

Skill("ArXiv")

References & Credits

The thinkers, books, frameworks, and research this skill is built on. The ideas belong to them — the integration belongs to PAI.

Want PAI to do this for you?

Install PAI on your machine — your DA gets the ArXiv skill plus 44 others, all hooked into one Life OS.