LaTeX/Export To Other Formats

转自
Strictly speaking, LaTeX source can be used to directly generate two formats:

  • DVI using latex, the first one to be supported;
  • PDF using pdflatex, more recent.

Using other software freely available on Internet, you can easily convert DVI and PDF to other document formats. In particular, you can obtain the PostScript version using software which is included in your LaTeX distribution. Some LaTeX IDE will give you the possibility to generate the PostScript version directly (even if it uses internally a DVI mid-step, e.g. LaTeX → DVI → PS). It is also possible to create PDF from DVI and vice versa. It doesn’t seem logical to create a file with two steps when you can create it straight away, but some users might need it because, as you remember from the first chapters, the format you can generate depends upon the formats of the images you want to include (EPS for DVI, PNG and JPG for PDF). Here you will find sections about different formats with description about how to get it.

Other formats can be produced, such as RTF (which can be used in Microsoft Word) and HTML. However, these documents are produced from software that parses and interprets the LaTeX files, and do not implement all the features available for the primary DVI and PDF outputs. Nonetheless, they do work, and can be crucial tools for collaboration with colleagues who do not edit documents with LaTeX.

Tools installation

This chapter features a lot of third-party tools; most of them are installed independently of your TeX distribution.

Some tools are Unix-specific (*BSD, GNU/Linux and Mac OS X), but it may be possible to make them work on Windows. If you have the choice, it is often easier with Unix systems for command line tools.

Some tools may already be installed. For instance, you can check if dvipng is installed and ready to use (Unix only):

type dvipng

Most of these tools are installable using your package manager or portage tree (Unix only).

Preview mode

This section describes how to generate a screenshot of a LaTeX page or of a specific part of the page using the LaTeX package preview. Screenshots are useful, for example, if you want to include a LaTeX generated formula on a presentation using you favorite slideware like Powerpoint, Keynote or LibreOffice Impress. First, start by making sure you have preview. See Installing Extra Packages.

Say you want to take a screenshot of

这里写图片描述
Write this formula in the preview environment:

\documentclass{article}
\usepackage[active]{preview}
\begin{document}
\begin{preview}
\[
\pi = \sqrt{12}\sum^\infty_{k=0} \frac{ (-3)^{-k} }{ 2k+1 }
\]
\end{preview}
\end{document}

Note the active option in the package declaration and the preview environment around the equation’s code. Without any of these two, you won’t get any output.

This package is also very useful to export specific parts to other format, or to produce graphics (e.g. using PGF/TikZ) and then including them in other documents. You can also automate the previewing of specific environments:

\usepackage[active,tightpage]{preview}
\PreviewEnvironment{lstlisting}
\setlength{\PreviewBorder}{10pt}%

% ...

\begin{lstlisting}
int main()
{
        /* ... */
}
\end{lstlisting}

This will produce a PDF containing only the listing content, the page layout will depend on the shape of the source code.

Convert to PDF

Directly

pdflatex my_file

DVI to PDF

dvipdfm my_file.dvi

will create my_file.pdf. Another way is to pass through PS generation:

dvi2ps myfile.dvi
ps2pdf myfile.ps

you will get also a file called my_file.ps that you can delete.

Merging PDF

If you have created different PDF documents and you want to merge them into one single PDF file you can use the following command-line command. You need to have Ghostscript installed:

Using Windows

gswin32 -dNOPAUSE -sDEVICE=pdfwrite -sOUTPUTFILE=Merged.pdf -dBATCH 1.pdf 2.pdf 3.pdf

Using Linux

gs -dNOPAUSE -sDEVICE=pdfwrite -sOUTPUTFILE=Merged.pdf -dBATCH 1.pdf 2.pdf 3.pdf

Alternatively, PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. This program may be available in your Linux distribution’s repository.

Another option to check out is pdftk (or PDF toolkit), which is a command-line tool that can manipulate PDFs in many ways. To merge one or more files, use:

pdftk 1.pdf 2.pdf 3.pdf cat output 123.pdf

Using pdfLaTeX

Note: If you are merging external PDF documents into a LaTeX document which is compiled with pdflatex, a much simpler option is to use the pdfpages package, e.g.:

\usepackage{pdfpages}
...
\includepdf[pages=-]{Document1.pdf}
\includepdf[pages=-]{Document2.pdf}
...

Three simple shell scripts using the pdfpages package are provided in the pdfjam bundle by D. Firth. They include options to merge several pdf files (pdfjoin), put several pages in one physical sheet (pdfnup) and rotate pages (pdf90).

See also Modular Documents

XeTeX

You can also use XeTeX (or, more precisely, XeLaTeX), which works in the same way as pdflatex: it creates a PDF file directly from LaTeX source. One advantage of XeTeX over standard LaTeX is support for Unicode and modern typography. See its Wikipedia entry for more details.

Customization of PDF output in XeTeX (setting document title, author, keywords etc.) is done using the configuration of hyperref package.

Convert to PostScript

from PDF

pdf2ps my_file.pdf

from DVI

dvi2ps my_file.dvi

Convert to RTF

LaTeX can be converted into an RTF file, which in turn can be opened by a word processor such as LibreOffice Writer or Microsoft Word. This conversion is done through latex2rtf, which may run on any computer platform, however is only actively supported on Windows, Linux and BSD, with the last mac update being from 2001 (a recent version for OSX is available via MacPorts). The program operates by reading the LaTeX source, and mimicking the behaviour of the LaTeX program. latex2rtf supports most of the standard implementations of LaTeX, such as standard formatting, some math typesetting, inclusion of EPS, PNG or JPG graphics, and tables. As well, it has some limited support for packages, such as varioref, and natbib. However, many other packages are not supported.

latex2rtf is simple to use. The Windows version has a GUI (l2rshell.exe), which is straightforward to use. The command-line version is offered for all platforms, and can be used on an example mypaper.tex file:

latex mypaper
bibtex mypaper # if you use bibtex
latex2rtf mypaper

Both latex and (if needed) bibtex commands need to be run before latex2rtf, because the .aux and .bbl files are needed to produce the proper output. The result of this conversion will create myfile.rtf, which you may open in many word processors such as Microsoft Word or LibreOffice.

Convert to HTML

There are many converters to HTML. Some of them use an intermediate file which then will be converted to the destination format.

HEVEA

hevea mylatexfile

latex2html

latex2html -html_version 4.0,latin1,unicode -split 1 -nonavigation -noinfo -title “MyDocument” MyDocument.tex

LaTeXML

latexmlc paper.tex –destination=paper.html

pdf2htmlEX

pdf2htmlEX [options]

TeX4ht

TeX4ht has many options and possible configurations, but for a basic conversion,

 htlatex myfile.tex

will usually result in a reasonable HTML approximation. An introduction by the original author was published in TUGboat [1].

bibtex2html

For exporting the BibTeX file only.

bibtex2html mybibtexfile

Convert to image formats

It is sometimes useful to convert LaTeX output to image formats for use in systems that do not support DVI nor PDF files, such as Wikipedia.

There are two families of graphics:

  • Vector graphics can be scaled to any size, thus do not suffer from
    quality loss. SVG is a vector format.
  • Raster graphics define every pixel explicitly. PNG is a raster
    format.

So vector graphics are usually preferred. There is still some cases where raster graphics are used:

  • The target system does not handle vector graphics, only raster
    graphics are supported.
  • SVG can not embed fonts. So either the font will be rendered using a
    local .ttf or .otf font (which will mostly change the output), or all
    characters must be turned to vector graphics. This last method makes
    the SVG big and slow. If the input LaTeX file contains a lot of text
    which formatting must be preserved, SVG is not that great.

So SVG is great for drawings and a small amount of text. JPG is a well known raster formats, however it is usually not as good as PNG for text.

In some cases it may be sufficient to simply copy a region of a PDF (or PS) file using the tools available in a PDF viewer (for example using LaTeX to typeset a formula for pasting into a presentation). This however will not generally have sufficient resolution for whole pages or large areas.

Multiple formats

pdftocairo

There is pdftocairo featured in the poppler toolset.

pdftocairo -svg latexdoc.pdf output.svg

pdftocairo also supports various raster graphic formats.

Vector graphics

pdf2svg

Direct conversion from PDF to SVG can be done using the command line tool pdf2svg.

pdf2svg file.pdf file.svg

ps2svg

Alternatively DVI or PDF can be converted to PS as described before, then the bash script ps2svg.sh can be used (as all the software used by this script is multiplatform, this is also possible in Windows, a step-by-step guide could be written).

dvisvgm

One can also use dvisvgm, an open source utility that converts from DVI to SVG.

dvisvgm -n file.dvi

Inkscape

Inkscape is able to convert to SVG, PDF, EPS, and other vector graphic formats.

inkscape --export-area-drawing --export-ps=OUTPUT INPUT
inkscape --export-area-page --export-plain-svg=OUTPUT INPUT

Raster graphics

JPEG

Run ghostscript on the PostScript file created by pdf2ps as follows:

echo "quit" | gs -sDEVICE=jpeg -sOutputFile=document.jpg -r300 document.ps

GIMP

Open your file with GIMP. It will ask you which page you want to convert, whether you want to use anti-aliasing (choose strong if you want to get something similar to what you see on the screen). Try different resolutions to fit your needs, but 100 dpi should be enough. Once you have the image within GIMP, you can post-process it as you like and save it to any format supported by GIMP, as PNG for example.

dvipng

A method for DVI files is dvipng. Usage is the same as dvipdfm.

Run latex as usual to generate the dvi file. Now, we want an X font size formula, where X is measure in pixels. You need to convert this, to dots per inch (dpi). The formula is: = *72.27/10. If you want, for instance, X = 32, then the size in dpi corresponds to 231.26. This value will be passed to dvipng using the flag -D. To generate the desired png file run the command as follows:

dvipng -T tight -D 231.26 -o foo.png foo.dvi

The flag -T sets the size of the image. The option tight will only include all ink put on the page. The option -o sends the output to the file name foo.png.

ImageMagick

The convert command from the ImageMagick suite can convert both DVI and PDF files to PNG.

convert input.pdf output.png

optipng

You can optimize the resulting image using optipng so that it will take up less space.

Convert to plain text

If you are thinking of converting to plain text for spell-checking or to count words, there may be an easier way — read Tips and Tricks first.

Most LaTeX distributions come with detex program, which strips LaTeX commands. It can handle multi-file projects, so all you need is to give one command:

detex yourfile

(note the omission of .tex extension). This will output result to standard output. If you want the plain text go to a file, use

detex yourfile > yourfile.txt

If the output from detex does not satisfy you, you can try a newer version available on Google Code, or use HTML conversion first and then copy text from your browser.

If you want to keep the formating, you can use a DVI-to-plain text converter, like catdvi. Example:

catdvi yourfile.dvi | fmt -u

The use of fmt -u (available on most Unices) will remove the justification.

猜你喜欢

转载自blog.csdn.net/cnhome/article/details/80394043