wsyummy

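Converts the TeX format commonly used for academic papers to Word format. The core capability is converting math formulas (as editable equations, not images), tables, figures, and more.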

by huangchuzhi · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 29
Stars: 0
Active Installs: 0
Versions: 1
Install in OpenClaw
/install tex2docx
Description
Convert LaTeX (.tex) academic papers to Word (.docx) with editable OMML equations, native Word tables, embedded figures, IEEE two-column format, and bibliogr...
README (SKILL.md)

tex2docx — LaTeX to Word Converter

Requirements

  • pandoc (system install): winget install pandoc or pandoc.org
  • Python packages: pip install python-docx lxml pypandoc_binary

Usage

python scripts/tex2docx.py input.tex [output.docx]

If output.docx is omitted, output is input.docx in the same directory.
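
The default output rule above can be sketched as follows. This is only an illustration of the stated behavior; the helper name `resolve_output_path` is hypothetical and is not taken from scripts/tex2docx.py.

```python
from pathlib import Path


def resolve_output_path(input_tex, output_docx=None):
    """Return the explicit output path, or the input name with a .docx suffix."""
    if output_docx:
        return Path(output_docx)
    # with_suffix keeps the directory, so the result sits next to the input file.
    return Path(input_tex).with_suffix(".docx")
```

For example, `resolve_output_path("paper/main.tex")` gives `paper/main.docx`, while an explicit second argument is returned unchanged.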

How It Works (Three Phases)

.tex ──→ [pandoc] ──→ OMML equations (13+ Word-editable formulas)
  │
  └──→ [Custom parser] ──→ Native Word tables ├──→ Final .docx
                           Embedded figures     │   (merged)
                           Formatted refs       │
                           IEEE layout & font  ┘

Phase 1 — Pandoc

Runs pandoc via pypandoc. Input file must be in its own directory (with figures/ subfolder if images exist). The script chdirs to the tex directory before running pandoc so image paths resolve correctly.
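
A minimal sketch of this phase, assuming the script does something along these lines. The helper names `working_directory` and `run_pandoc` are hypothetical; `pypandoc.convert_file` is the real pypandoc entry point, and it is imported lazily so the directory helper can be used without it.

```python
import os
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def working_directory(path):
    """Temporarily switch the current directory and restore it afterwards."""
    previous = Path.cwd()
    os.chdir(path)
    try:
        yield
    finally:
        os.chdir(previous)


def run_pandoc(tex_path, out_path):
    """Convert tex_path to .docx, running pandoc from the .tex directory."""
    import pypandoc  # provided by the pypandoc_binary package

    tex_path = Path(tex_path).resolve()
    out_path = Path(out_path).resolve()
    # Relative image paths such as figures/arch.png resolve from here.
    with working_directory(tex_path.parent):
        pypandoc.convert_file(
            tex_path.name, "docx", format="latex", outputfile=str(out_path)
        )
    return out_path
```

Restoring the previous directory in a `finally` block keeps a failed conversion from leaving the process in the paper directory.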

Phase 2 — Custom LaTeX Parser

RegEx-based extraction of:

  • Tables: \begin{table} → Word Table objects (full borders, centered, 8pt TNR)
  • Figures: \includegraphics{} + \caption{} → PNG/PDF embeds with italic captions
  • References: \thebibliography → formatted entries with hanging indent
  • Sections: \section{}, \subsection{} → bold headings
  • Metadata: \title, \author, \abstract, \IEEEkeywords
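
The kind of regex-based extraction listed above might look like the sketch below. The patterns and the function names `extract_sections` and `extract_figures` are illustrative assumptions, not the script's actual code, and they do not handle nested braces.

```python
import re

SECTION_RE = re.compile(r"\\(sub)?section\*?\{([^{}]*)\}")
FIGURE_RE = re.compile(r"\\begin\{figure\*?\}(.*?)\\end\{figure\*?\}", re.DOTALL)
GRAPHICS_RE = re.compile(r"\\includegraphics(?:\[[^\]]*\])?\{([^{}]+)\}")
CAPTION_RE = re.compile(r"\\caption\{([^{}]*)\}")


def extract_sections(tex):
    """Return (level, title) pairs; level 1 is a section, 2 a subsection."""
    return [
        (2 if m.group(1) else 1, m.group(2).strip())
        for m in SECTION_RE.finditer(tex)
    ]


def extract_figures(tex):
    """Return one dict per figure environment that contains an image."""
    figures = []
    for block in FIGURE_RE.finditer(tex):
        body = block.group(1)
        image = GRAPHICS_RE.search(body)
        caption = CAPTION_RE.search(body)
        if image:
            figures.append({
                "path": image.group(1),
                "caption": caption.group(1).strip() if caption else "",
            })
    return figures
```

Regular expressions are enough for flat arguments like these; a caption that itself contains braces would need a real brace matcher.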

Phase 3 — Merge

OMML equation paragraphs from pandoc are inserted into the cleanly-built document. Body paragraphs get a 0.25in first-line indent. All LaTeX commands (\textbf, \toprule, \ref, \cite, \begin{itemize}, etc.) are stripped from text content.
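
The command stripping described here could be approximated as below. This is a rough sketch under the assumption that stripping is regex-based, as in Phase 2; the name `strip_latex` is hypothetical and the script's own clean() may behave differently.

```python
import re


def strip_latex(text):
    """Reduce a LaTeX fragment to plain text (rough sketch, not a parser)."""
    # Drop environment markers such as \begin{itemize} and \end{itemize}.
    text = re.sub(r"\\(?:begin|end)\{[^{}]*\}", "", text)
    # Drop cross-references, citations and labels together with their keys.
    text = re.sub(r"\\(?:ref|cite|label)\{[^{}]*\}", "", text)
    # Unwrap formatting commands from the inside out: \textbf{x} -> x.
    previous = None
    while previous != text:
        previous = text
        text = re.sub(r"\\[a-zA-Z]+\*?\{([^{}]*)\}", r"\1", text)
    # Remove remaining bare commands such as \toprule or \item.
    text = re.sub(r"\\[a-zA-Z]+\*?", "", text)
    # Collapse the whitespace left behind.
    return re.sub(r"\s+", " ", text).strip()
```

The loop repeats the unwrapping step because nested commands only become flat after their inner argument has been unwrapped.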

Output Format

| Feature | Detail |
| --- | --- |
| Font | Times New Roman (10pt body, 9pt table/figure, 8pt refs) |
| Layout | A4, two-column IEEE conference style |
| Equations | OMML (double-click to edit in Word) |
| Tables | Native Word tables, all borders |
| Figures | PNG/PDF embedded with "Fig." captions |
| References | Hanging indent, [bN] format |
| First indent | 0.25in on body paragraphs |

Verification

python scripts/verify.py output.docx

Reports paragraph/table/image/equation counts and checks for LaTeX residue.
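
For readers who want to see what such a check involves, here is a standard-library sketch that counts the same kinds of elements by reading the .docx archive directly. It is not the code of scripts/verify.py; the function name `summarize_docx` and the residue pattern are assumptions.

```python
import re
import zipfile

# Common LaTeX leftovers: a few command names and $...$ spans.
LATEX_RESIDUE_RE = re.compile(
    r"\\(?:begin|end|textbf|cite|ref|section|title|author)\b|\$[^$]+\$"
)


def summarize_docx(path):
    """Count basic elements in a .docx and list LaTeX-looking leftovers."""
    with zipfile.ZipFile(path) as archive:
        xml = archive.read("word/document.xml").decode("utf-8")
        images = [n for n in archive.namelist() if n.startswith("word/media/")]
    # Visible text lives in <w:t> elements; join it to scan for residue.
    text = " ".join(re.findall(r"<w:t(?:\s[^>]*)?>([^<]*)</w:t>", xml))
    return {
        "paragraphs": len(re.findall(r"<w:p[\s/>]", xml)),
        "tables": len(re.findall(r"<w:tbl[\s/>]", xml)),
        "images": len(images),
        "equations": len(re.findall(r"<m:oMath[\s/>]", xml)),
        "residue": LATEX_RESIDUE_RE.findall(text),
    }
```

An empty `residue` list together with a non-zero `equations` count is a quick sign that the equations survived as OMML rather than as raw LaTeX text.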

Chinese (ctex) Support

Fully supports Chinese LaTeX documents using the ctex package:

  • Chinese section titles (引言, 方法, 实验, 结论等) are recognized
  • \section*{} (star variant) is supported
  • Chinese table headers preserved
  • Chinese text in titles rendered via w:eastAsia font fallback
  • \title{...} and \author{...} residue paragraphs are filtered
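
The w:eastAsia fallback mentioned above refers to an attribute on a run's w:rFonts element in WordprocessingML. The sketch below builds such a run with the standard library only, to show the resulting structure. The helper name `make_run` and the choice of SimSun as the East Asian font are assumptions; the script may use python-docx and a different font.

```python
import xml.etree.ElementTree as ET

W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
ET.register_namespace("w", W_NS)


def w(tag):
    """Qualify a tag or attribute name with the WordprocessingML namespace."""
    return f"{{{W_NS}}}{tag}"


def make_run(text, latin_font="Times New Roman", east_asia_font="SimSun"):
    """Build a w:r element whose Latin and East Asian fonts are set separately."""
    run = ET.Element(w("r"))
    rpr = ET.SubElement(run, w("rPr"))
    ET.SubElement(rpr, w("rFonts"), {
        w("ascii"): latin_font,
        w("hAnsi"): latin_font,
        w("eastAsia"): east_asia_font,  # used by Word for CJK characters
    })
    t = ET.SubElement(run, w("t"))
    t.text = text
    return run
```

Calling `ET.tostring(make_run("引言"), encoding="unicode")` shows the serialized run, with Times New Roman kept for Latin text and the East Asian font applied to the Chinese characters.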

Limitations

  • Inline math ($...$) becomes plain text (italic), not OMML — only \begin{equation}, \begin{align}, and \[...\] become editable equations
  • No .bib support: references must be in a \thebibliography{} environment
  • PNG images preferred: script tries PNG then PDF fallback
  • Pandoc path: the system pandoc binary must be discoverable by pypandoc
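
Given these limitations, a small pre-flight check on the .tex source can flag problems before conversion. The sketch below is an assumption about what such a check could look like; the function `preflight` is hypothetical and is not part of the skill.

```python
import re


def preflight(tex):
    """Return warnings for constructs the converter is documented not to handle."""
    warnings = []
    # Remove escaped dollar signs so \$5 is not mistaken for math.
    unescaped = tex.replace(r"\$", "")
    # Single-dollar spans on one line; $$...$$ display math is skipped.
    inline = re.findall(r"(?<!\$)\$(?!\$)[^$\n]+?\$(?!\$)", unescaped)
    if inline:
        warnings.append(
            f"{len(inline)} inline math span(s) will become italic plain text"
        )
    if re.search(r"\\(?:bibliography|addbibresource)\{", tex):
        warnings.append(
            ".bib references detected; convert them to a thebibliography environment"
        )
    if not re.search(r"\\begin\{thebibliography\}", tex):
        warnings.append("no thebibliography environment found")
    return warnings
```

An empty list means none of the documented limitations were detected by these patterns, not that the conversion is guaranteed to be clean.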

Script: scripts/tex2docx.py

Self-contained (660+ lines). Key internal functions:

| Function | Role |
| --- | --- |
| extract_tex() | Parse all structural elements from .tex |
| extract_omml() | Pull OMML XML from pandoc output |
| build_docx() | Construct final document with all components |
| clean() | Strip LaTeX commands to plain text |
| add_table() | Build Word table with borders |
| add_figure() | Embed image + caption |
Usage Guidance
Before installing or using this skill, edit or remove the hard-coded company/email block in scripts/tex2docx.py, install dependencies only from trusted sources, run it in a working copy of your paper directory, and carefully review the generated DOCX before sharing or submitting it.
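
One way to follow the "working copy" advice is to copy the paper directory to a temporary location and run the converter there. The sketch below uses only the standard library; the function name `make_working_copy` and the `tex2docx_` prefix are illustrative assumptions.

```python
import shutil
import tempfile
from pathlib import Path


def make_working_copy(paper_dir):
    """Copy the paper directory (including figures/) into a fresh temp folder."""
    source = Path(paper_dir).resolve()
    target = Path(tempfile.mkdtemp(prefix="tex2docx_")) / source.name
    # copytree requires that target does not exist yet, which mkdtemp guarantees.
    shutil.copytree(source, target)
    return target
```

Running `python scripts/tex2docx.py` against the returned directory leaves the original paper folder untouched.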
Capability Analysis
Type: OpenClaw Skill
Name: tex2docx
Version: 1.0.0

The skill bundle provides a functional utility for converting LaTeX academic papers to Word (.docx) format using the 'pypandoc' library and 'python-docx'. The scripts (tex2docx.py and verify.py) perform standard file operations, regex-based parsing of LaTeX elements, and document generation without any indicators of data exfiltration, malicious command execution, or prompt injection. The hardcoded template information in the header generation appears to be a specific user's default configuration rather than a malicious indicator.
Capability Assessment
Purpose & Capability
The LaTeX-to-DOCX workflow is coherent, but the conversion script adds hard-coded affiliation/contact text that is not disclosed and is not derived from the user's LaTeX file.
Instruction Scope
SKILL.md scopes use to user-requested conversion of a provided .tex file and does not contain prompt-injection-style role changes or autonomous high-impact instructions.
Install Mechanism
The registry has no install spec or declared required binaries, while SKILL.md asks users to install pandoc and unpinned Python packages manually. This is purpose-aligned but should be done from trusted sources.
Credentials
The script reads the input .tex file and figures folder, invokes pandoc, and writes temporary/output .docx files. This is expected for the converter, though users should run it on copies and inspect outputs.
Persistence & Privilege
No credentials, account access, browser/session stores, background persistence, privileged paths, or network exfiltration endpoints are shown in the provided artifacts.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install tex2docx
  3. After installation, invoke the skill by name or use /tex2docx
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of tex2docx, a LaTeX to Word converter.

  • Converts academic LaTeX (.tex) files to Word (.docx) with IEEE-style formatting
  • Preserves editable OMML equations, native Word tables, embedded figures, and bibliography
  • Supports two-column layout, Times New Roman font, and paragraph/table formatting
  • Includes Chinese (ctex) document support for section titles, headers, and text
  • Requires Python, pandoc, and a few Python packages (python-docx, lxml, pypandoc_binary)
  • Provides a verification script to check output quality and LaTeX residue
Metadata
Slug: tex2docx
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1
Frequently Asked Questions

What is tex2docx?

Convert LaTeX (.tex) academic papers to Word (.docx) with editable OMML equations, native Word tables, embedded figures, IEEE two-column format, and bibliogr... It is an AI Agent Skill for Claude Code / OpenClaw, with 29 downloads so far.

How do I install tex2docx?

Run "/install tex2docx" in the OpenClaw or Claude Code chat to install the skill in one step. The converter itself still needs pandoc and the Python packages listed under Requirements.

Is tex2docx free?

Yes, tex2docx is free and licensed under MIT-0. You can download, install, and use it at no cost.

Which platforms does tex2docx support?

tex2docx is listed as cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created tex2docx?

It is built and maintained by huangchuzhi (@wsyummy); the current version is v1.0.0.
