Designing ETL Pipelines with Structured Streaming and Delta Lake: How to Architect Things Right (YouTube)

"I want my dashboard updated every second".

Could that be a mistake!? :)


How to Calculate the Cost of Data Downtime

"One CDO I spoke with recently told me that his 500-person team spends 1,200 cumulative hours per week tackling data quality issues, time otherwise spent on activities that drive innovation and generate revenue (...)"


Why is AstraZeneca ahead in developing the coronavirus vaccine?


Government formalizes public cloud adoption in contract with Embratel

"(...) The project with Embratel is part of the procurement of the federal government's first public cloud, which was put out to tender back in 2018 and began to be implemented last year. This first contract brings together 23 public agencies and has a projected cost of R$ 55 million."


COVID-19 didn’t break your business. Data did.

(...) Not every enterprise stumbled, nor did every government. The defining factor wasn't how digital these public and private entities were. These surviving enterprises and governments embraced the data both pre-COVID-19 and during COVID-19 (...) 


How Amazon is solving big-data challenges with data lakes

"(...) A major reason companies choose to create data lakes is to break down data silos. Having pockets of data in different places, controlled by different groups, inherently obscures data. This often happens when a company grows fast and/or acquires new businesses. In the case of Amazon, it's been both (...)" (Werner Vogels)


Optimize Your Amazon S3 Data Lake with S3 Storage Classes and Management Tools (YouTube)

"As your data lake grows, it becomes increasingly important to manage objects at scale and optimize storage costs and resources. In this tech talk, AWS experts provide an overview of S3's capabilities that allow you to manage data at the object, bucket, and account levels. Learn about and watch demos for S3 Batch Operations. Also learn cost-optimization best practices by storing objects across the S3 Storage Classes."


Recommended Reading: "Factfulness: Ten Reasons We're Wrong about the World"

"I don’t love numbers. I am a huge, huge fan of data, but I don’t love it. It has its limits. I love data only when it helps me to understand the reality behind the numbers, i.e., people’s lives. In my research, I have needed the data to test my hypotheses, but the hypotheses themselves often emerged from talking to, listening to, and observing people. Though we absolutely need numbers to understand the world, we should be highly skeptical about conclusions derived purely from number crunching."


Australia-wide AWS deal

The Australian government's attitude towards cloud has been very positive, and according to Amazon Web Services (AWS) Worldwide Public Sector Asia Pacific regional managing director Peter Moore, what's prevented an all-in approach has been legacy arrangements and a traditional approach to procurement (...)


Machine Learning for Everyone

(...) Without all the AI-bullshit, the only goal of machine learning is to predict results based on incoming data. That's it. All ML tasks can be represented this way, or it's not an ML problem from the beginning. The greater variety in the samples you have, the easier it is to find relevant patterns and predict the result (...)
