.docx output with proper heading
hierarchy, line spacing, tables, and (optionally) images. The agent auto-
accepts the inline preview diff on the backend, so edits apply immediately
— no user-in-the-loop required.
What the Word agent can do
Ordered roughly from “used in almost every document” to specialized:- Paragraph and run styling — heading levels, body styles, custom paragraph styles; bold, italic, underline, strikethrough, double strikethrough, superscript, subscript, all-caps, small-caps, hidden, highlight color, shading color, custom font + size + color; per-run hyperlinks; line spacing, space-before / space-after, indents, alignment (left / center / right / justify).
- Lists — bulleted and numbered, with custom markers and nesting
levels; per-paragraph indent + marker control for designs like
• …/– …/1. …. - Tables — headered tables with shaded header row, per-cell borders and shading, cell merge / split, column-width control, alignment per cell, text formatting inside cells.
- Document structure — headers, footers (with left/center/right sections), section breaks (continuous, next-page, even/odd), column breaks, multi-column page layouts, drop caps, watermarks, page numbering fields.
- Media — images, charts (same chart types as Excel), shapes, text boxes (linked or standalone).
- Citations as hyperlinks — inline links to sources; pass
context_filesto give the agent specific references to cite. - Footnotes and endnotes — full per-note content, with automatic renumbering.
- Review / collaboration — comments (with threaded replies and resolve), tracked-changes (accept / reject, individually or all at once).
- References — cross-references to headings, bookmarks, footnotes, tables, figures; merge fields for mail-merge flows; form controls (rich text, plain text, drop-down, checkbox); sub-documents and master-doc structure; document compare.
- Equations / math — full Equation schema for fractions, radicals,
limits, integrals, sums, matrices, Greek letters. Renders natively
in the embedded viewer and in any downloaded
.docx. - Styles catalog — create new paragraph / character styles, modify or delete existing ones, set a document-wide default style.
- Page fields — date, time, page number, total pages, file name, author; auto-updated on open.
Quickstart
Common patterns
Reports
Letters and memos
Editing an existing document
- Inspects the document to find the existing payment-terms section
- Generates a
PreviewActionPlanwith the replacement - Auto-accepts the diff (no user clicking needed)
- Returns the updated
.docx
Working with images
Upload images first, then reference them byimage_id:
Writing quality
The underlying agent is tuned for professional prose. It avoids:- Banned words: delve, leverage, utilize, robust, seamless, cutting-edge, etc.
- Banned transitions: Furthermore, Moreover, Additionally, In summary…
- Banned phrases: “in today’s fast-paced world”, “at the end of the day”
- Em dashes as a stylistic tic (max one per page)
Things to watch for
- Table of contents is not generated today. If your prompt asks for
a TOC, the agent skips it and notes the omission in
summary. - Long documents (>5 pages) are built page-by-page. Specify “first page preview only” if you want a quick iteration.
