Supported Formats
ExploreMyData supports the most common tabular data formats.
| Format | Extensions | Notes |
|---|---|---|
| CSV | .csv | Comma-separated. Auto-detects delimiter, encoding, and headers. |
| TSV | .tsv | Tab-separated. Treated as CSV with tab delimiter. |
| Excel | .xlsx, .xls | Supports multi-sheet workbooks. Choose which sheet to import during upload. Parsed via the xlsx library. |
| Parquet | .parquet | Columnar format. Native DuckDB support - fastest to load. |
| JSON | .json | Array of objects or newline-delimited JSON (NDJSON). |
| Extracts tables from text-based PDFs. Scanned PDFs supported via browser-based OCR. Password-protected PDFs supported. | ||
| XML | .xml | Nested elements are flattened into columns via a JSON intermediate step. |
| Text | .txt | Auto-detects delimiters (tab, comma, pipe). Plain text imported as single column. |
| Word | .doc, .docx | Extracts tables from Word documents. Multiple tables loaded as separate files. |
| DuckDB | .duckdb, .db | Native DuckDB database files. Tables inside are listed for selection on import. |
Export Formats
After transforming your data, export in any of these formats:
| Format | Extension | Notes |
|---|---|---|
| CSV | .csv | Comma-separated values. |
| TSV | .tsv | Tab-separated values. |
| Excel | .xlsx | Excel spreadsheet via the xlsx library. |
| Parquet | .parquet | Columnar binary format. Smallest file size for large datasets. |
| JSON | .json | Newline-delimited JSON (NDJSON). |
| Formatted table layout. Limited to 10,000 rows. | ||
| HTML | .html | Self-contained HTML table with dark mode support. No external dependencies. |
| DuckDB | .duckdb | Native DuckDB database file. Preserves types and is fast to re-import. |
File Size
Files up to ~100 MB work well in most browsers. Larger files may work but will use more memory. Parquet files are the most efficient format for large datasets.
Encoding
DuckDB auto-detects file encoding. If you see garbled characters, try saving your file as UTF-8 before uploading.