Use Routine Vacuuming to Reclaim Unused Space December 06, 2021 by Abe Dearmer Amazon Redshift databases require periodic maintenance known as vacuuming. Amazon Redshift is based on PostgreSQL, but unlike PostgreSQL, Redshift doesn’t offer autovacuum. So when a row is deleted from a...
Use Column Encoding December 06, 2021 by Donal Tobin Adding compression to large, uncompressed columns will have a big impact on cluster performance. Compression accomplishes two things: Reduce storage utilization . Because file compression reduces the size footprint of...
Use Amazon Redshift Spectrum for infrequently used data December 06, 2021 by Abe Dearmer Amazon Redshift launched with disruptive pricing. To compare the cost, we’re looking at the price for storing 1TB of data for one year ($ / TB / Year). With a...
Query Optimization: How to efficiently compare two rows in a SQL query December 06, 2021 by Donal Tobin Table of Contents The Simplified Problem A Better Solution A Real-World Example Lessons Learned Query optimization that dramatically reduces runtime for queries which use window functions . The Simplified Problem...
How Wish Built Their Data Pipeline with Amazon Redshift December 06, 2021 by Abe Dearmer Wish Wish is a mobile commerce platform. It provides online services that include media sharing and communication tools, personalized and other content, as well as e-commerce. During the last few...
Using Opsworks and HAProxy for Routing December 06, 2021 by Mark Smallcombe At Integrate.io, with much of our infrastructure on AWS, we try to make use of the various AWS services available to us. One of these is Amazon Opsworks.
Benchmarking the Performance of Amazon Redshift ra3.16xlarge versus ds2.8xlarge instances December 06, 2021 by Abe Dearmer A first look at the new RA3 Amazon Redshift node type Table of Contents Introduction Specs Copy Performance I/O Performance Real-world performance Separation of Storage and Compute Conclusion Introduction Today...
Handling Column Characters in MySQL vs Amazon Redshift December 06, 2021 by Abe Dearmer When you build an application that works with different kinds of databases, you have to be aware of and deal with many subtle differences between databases.
Top 14 ETL Tools (Updated November 2024) June 04, 2024 by Abe Dearmer Integrate.io lists the 14 best ETL software tools for 2024 based on features, user review scores, and more. Which ETL tool should you choose?
MuleSoft vs. Integrate.io: Comparison and Review May 06, 2024 by Abe Dearmer When it comes to ETL solutions, Mulesoft ETL is like a Swiss Army Knife. But if all you need to do is open boxes, do you really need a corkscrew?
17 Best Data Integration Platforms May 13, 2024 by Abe Dearmer You have access to a wide range of data – make sure your organization can use it. Explore these seventeen data integration tools and platforms.