Supported Formats

ExploreMyData supports the most common tabular data formats.

Format	Extensions	Notes
CSV	.csv	Comma-separated. Auto-detects delimiter, encoding, and headers.
TSV	.tsv	Tab-separated. Treated as CSV with tab delimiter.
Excel	.xlsx, .xls	Supports multi-sheet workbooks. Choose which sheet to import during upload. Parsed via the xlsx library.
Parquet	.parquet	Columnar format. Native DuckDB support - fastest to load.
JSON	.json	Array of objects or newline-delimited JSON (NDJSON).
PDF	.pdf	Extracts tables from text-based PDFs. Scanned PDFs supported via browser-based OCR. Password-protected PDFs supported.
XML	.xml	Nested elements are flattened into columns via a JSON intermediate step.
Text	.txt	Auto-detects delimiters (tab, comma, pipe). Plain text imported as single column.
Word	.doc, .docx	Extracts tables from Word documents. Multiple tables loaded as separate files.
DuckDB	.duckdb, .db	Native DuckDB database files. Tables inside are listed for selection on import.

Export Formats

After transforming your data, export in any of these formats:

Format	Extension	Notes
CSV	.csv	Comma-separated values.
TSV	.tsv	Tab-separated values.
Excel	.xlsx	Excel spreadsheet via the xlsx library.
Parquet	.parquet	Columnar binary format. Smallest file size for large datasets.
JSON	.json	Newline-delimited JSON (NDJSON).
PDF	.pdf	Formatted table layout. Limited to 10,000 rows.
HTML	.html	Self-contained HTML table with dark mode support. No external dependencies.
DuckDB	.duckdb	Native DuckDB database file. Preserves types and is fast to re-import.

File Size

Files up to ~100 MB work well in most browsers. Larger files may work but will use more memory. Parquet files are the most efficient format for large datasets.

Encoding

DuckDB auto-detects file encoding. If you see garbled characters, try saving your file as UTF-8 before uploading.