For more information or for assistance, please contact Data Services at datalib@uark.edu.
While gathering your data, you may not have a choice as to the formats you are using to retain your data. However, when saving or sharing your data it is a BEST PRACTICE to follow open data guidelines. Open guidelines help your work to tolerate software and operating system changes. You or someone else may need to use your data in the future!
Keep data files in their original raw format AND
Save data to share in a non-proprietary (open) file format. If conversion to an open data format will result in some data loss from your files, you might consider saving the data in both the proprietary format and an open format. When it is necessary to save files in a proprietary format, consider including a readme.txt file in your directory that documents the name and version of the software used to generate the file, as well as the company who made the software.
When selecting file formats for longer storage, the formats should ideally be:
The Library of Congress has published a Recommended Formats Statement that discusses this topic in great depth.
Data Type |
Proprietary (AVOID) |
Open (PREFERRED) |
---|---|---|
Containers |
|
TAR, GZIP, ZIP |
Databases |
|
XML, CSV |
Text |
Word (DOC, DOCX) |
ODF (Open Document Format), TXT, PDF/A, XML, HTML |
Tabular |
Excel (XLS, XLSX) |
CSV, TSV, TAB |
images |
Photoshop (PSD), Illustrator (AI) |
TIFF, JPG/JPEG 2000, PNG, BMP, GIF |
Audio |
Windows Media Audio (WMA) |
FLAC, WAV, AIFF, MXF |
Video |
Windows Media Video (WMV) |
MOV, MPEG-4, AVI, MXF |
Presentations |
Windows PowerPoint (PPT, PPTX) |
PDF/A, EPUB |
Geospatial |
CAD (DXF), MapInfo (MIF) |
GeoTIFF, GeoPDF, Shapefile (SHP, SHX, PRJ, DBF), NetCDF |
See the Library of Congress' Sustainability of Digital Formats web site for more complete listings and discussions of formats.