Parquet Power Tools - MergeSplit
por AB Advisory & Consulting
Merge multiple Apache Parquet files or split large Parquet files into manageable chunks.
Parquet Merge & Split is a professional Excel add-in that enables users to merge multiple Apache Parquet files or split large Parquet files into manageable chunks—all directly within Microsoft Excel. The add-in leverages client-side processing for maximum security and performance, requires no external servers, and seamlessly integrates with Excel's native worksheet functionality.
Key Value Proposition: Simplify complex Parquet file operations without leaving Excel, while maintaining data security through 100% client-side processing.
Core Features
1. Merge Multiple Parquet Files
- Combine 2 or more .parquet files into a single unified dataset
- Two Merge Strategies:
- Append: Stack files vertically (rows from all files combined)
- Interleave: Alternate rows from each file for mixed datasets
- Smart column alignment and schema matching
- Preview data before importing to Excel
- Support for files with 1M+ rows (Premium)
2. Split Large Parquet Files
- Divide large .parquet files using three powerful strategies:
- Chunk by Rows: Split into equal-sized chunks (e.g., 1,000 rows each)
- Split by Column Values: Create separate datasets for each unique value (e.g., by Region, Category)
- Custom Filters: Define SQL-style WHERE clauses for targeted splits
- Each split becomes a separate Excel worksheet with smart naming
- Preview splits before processing
- Export to worksheets or CSV files
3. Advanced Data Operations (Premium)
- SQL-style Filtering: Apply WHERE clauses to filter data before import
- Column Selection: Choose specific fields to import (reduce data volume)
- Custom Split Criteria: Complex multi-condition splits
- Unlimited Rows: No restrictions on file size (up to Excel's 1M row limit)
4. Professional Excel Integration
- Automatic worksheet creation with descriptive names
- Professional formatting: styled headers, auto-fit columns, colored rows
- Chunked loading for large datasets (prevents Excel freezing)
- Real-time progress tracking with detailed status messages
- Seamless integration with Excel's data analysis tools
5. Client-Side Security
- 100% browser-based processing using HyParquet library
- No user data uploaded to external servers
- All processing happens locally on user's machine
- GDPR compliant with comprehensive privacy policy
- OAuth 2.0 authentication via Auth0
6. Subscription Tiers:
Free Tier:
- Up to 1,000 rows per import
- Up to 2 files per Merge
- Basic authentication
- Standard import features
- Community support
Premium Tier:
- Up to Excel sheet row limit per import
- Full SQL filtering capabilities
- Advanced formatting options
- Priority support
- Access to all features
7. Privacy & Security:
Your data security is our priority. All file processing happens locally in your browser - we never upload or store your Parquet files on our servers. Authentication is handled through industry-standard Auth0.
8. Support & Resources:
- Comprehensive documentation at parquetpowertools.com
- Email support: support@parquetpowertools.com
- Regular updates with new features
- Video tutorials and guides
9. System Requirements:
- Microsoft Excel 2016 or later (Windows/Mac)
- Excel Online (web version)
- Modern web browser (Chrome, Edge, Safari, Firefox)
- Internet connection for authentication
Join thousands of data professionals who trust Parquet Data Merge&Split for their daily data analysis workflows. Transform how you work with big data in Excel today!
Funcionalidades de la aplicación
- Puede leer el documento y hacer cambios
- Puede enviar datos por Internet