Supported Formats

ExploreMyData supports the most common tabular data formats.

FormatExtensionsNotes
CSV.csvComma-separated. Auto-detects delimiter, encoding, and headers.
TSV.tsvTab-separated. Treated as CSV with tab delimiter.
Excel.xlsx, .xlsSupports multi-sheet workbooks. Choose which sheet to import during upload. Parsed via the xlsx library.
Parquet.parquetColumnar format. Native DuckDB support - fastest to load.
JSON.jsonArray of objects or newline-delimited JSON (NDJSON).
PDF.pdfExtracts tables from text-based PDFs. Scanned PDFs supported via browser-based OCR. Password-protected PDFs supported.
XML.xmlNested elements are flattened into columns via a JSON intermediate step.
Text.txtAuto-detects delimiters (tab, comma, pipe). Plain text imported as single column.
Word.doc, .docxExtracts tables from Word documents. Multiple tables loaded as separate files.
DuckDB.duckdb, .dbNative DuckDB database files. Tables inside are listed for selection on import.

Export Formats

After transforming your data, export in any of these formats:

FormatExtensionNotes
CSV.csvComma-separated values.
TSV.tsvTab-separated values.
Excel.xlsxExcel spreadsheet via the xlsx library.
Parquet.parquetColumnar binary format. Smallest file size for large datasets.
JSON.jsonNewline-delimited JSON (NDJSON).
PDF.pdfFormatted table layout. Limited to 10,000 rows.
HTML.htmlSelf-contained HTML table with dark mode support. No external dependencies.
DuckDB.duckdbNative DuckDB database file. Preserves types and is fast to re-import.

File Size

Files up to ~100 MB work well in most browsers. Larger files may work but will use more memory. Parquet files are the most efficient format for large datasets.

Encoding

DuckDB auto-detects file encoding. If you see garbled characters, try saving your file as UTF-8 before uploading.