December 22, 2015How Data Is Transforming Our Design1Earlier this year, I joined TD as the team’s first UX Designer. I’m here to craft delightful experiences for data scientists, analysts, and fellow data lovers......
November 17, 2015Getting Direct, SQL Access to Your Mixpanel Data in 6 Easy Steps0I have a love hate relationship with Mixpanel. The love part is fairly obvious: Mixpanel allows users to create funnels with minimal technical integrations, and its visualization and reporting functionality have improved tremendously over the years. Interested in which campaign is performing best...
November 16, 2015Postgres + Presto = Prestogres, My Nifty Hack for Putting Them Together0When I co-founded TD four years ago, our primary database was MySQL. Since then, I have grown to love “the other” open source RDBMS: PostgreSQL. There are many reasons to love PostgreSQL, and I plan to share my thoughts on Postgres’ internals this......
November 6, 2015Spark Community By the Numbers: A Few Surprises0Without a doubt, Apache Spark is taking the data science world by storm. An open-source, in-memory distributed data processing engine that can run atop Hadoop, Cassandra, and others, Spark has become a welcomed addition to the data scientist toolbox. Growing in popularity, Spark is not without it...
November 5, 2015We Treasure PyData. See you in NYC!0TD has quite a bit of history with Python and PyData. Yuu, one of our devops engineers, is the creator of pyenv, a popular Python environment manager, and one of our backend engineers, Keisuke, has written pandas-td and luigi-td to make the......
October 26, 2015Announcing Data Tanks: Faster Reporting and Unlimited Connectivity0Data Tanks provide easy access to your aggregated metrics through convenient, fully hosted data marts on TD's core platform. They can be used to drive a variety of external business intelligence and visualization applications without the hassle of......
October 8, 2015Redshift is 400x Bigger than MySQL Yet MySQL is More Popular1The Amazon Redshift COPY Command Guide is now available! There are good reasons for the hype around Amazon Redshift. Redshift is blazing fast and not that much more expensive than MySQL or PostgreSQL, the traditional mainstay of data engineers. But is Amazon Redshift really becoming predominant i...
October 7, 2015Treasure Data brings you: The Amazon Redshift COPY command cheatsheet0Although it’s getting easier, ramping up on the COPY command to import tables into Redshift can become very tricky and error-prone. Following the accordion-like hyperlinked Redshift documentation to get a complete command isn’t exactly straighforward, either......
October 6, 2015AWS, SFDC and Marketo Data Connectors Released, Data Tanks in Beta for Broader BI Connectivity0This year’s AWS re:Invent brings many new announcements and features from TD. Specifically, we released AWS Data Connectors (S3 and Redshift) and SaaS Data Connectors (Marketo and Salesforce) for general availability and Data Tanks for private beta......
September 24, 2015Data loading into Amazon Redshift simplified: The Podcast, part 20You can hear the whole podcast at this link. As we saw in Part 1 of this series, there are at least two sides to the development of any software feature; one is the perspective of the business person who requires the feature and then other is that of the developer who must create and maintain i...