ADFv2 – Data Integration in Azure Just Got Awesome

Azure Data Factory version 2 (ADFv2) arrived in Sept17 with a bunch of new concepts and features to support our Azure data integration pipelines. In this session, we’ll update your ADFv1 knowledge and start to understand the true nature of scale out control flows and data flows. What’s the integration runtime? Can we easily lift…

Advanced Data Visualisations for Your Reports with R and Python libraries

One picture tells more than thousand words. Effective report needs informative and good looking visuals. Standard Reporting Services capabilities are huge and strong but sometimes you feel you need more. And you can have more, much more with graphic libraries used in R and Python. Come and see what can you do with ggplot2, seaborn…

Are You Optimistic About Concurrency?

What exactly does it mean to have optimistic concurrency? What is the alternative? Is SQL Server 2012’s SNAPSHOT Isolation optimistic? How can SQL Server 2016’s In-Memory OLTP provide truly optimistic concurrency? In this session, we’ll look at what guarantees the various isolation levels provide, the difference between pessimistic and optimistic concurrency, and the new technology…

Azure CosmosDB – how to use it, when and where! Cases from the field

During this short session I will show you how some customers in Poland are using CosmosDB and what are prefered scenarios for it. As the NoSQL databases have a lot tempting features, they should not be used for all the scenarios. During the session we will review couple of them.

Azure Databricks: big data made easy

Databricks is to Spark as PaaS is to IaaS. If you’ve ever wanted to explore big data world but shied away because of its complexity, worry no more! During this session we will not only focus on Spark basics, but also cover practical examples of batch, interactive and streaming processing.

Azure IoT Solution Architecture Best Practices

The Internet of Things is becoming reality at a rapid pace. Many objects in the real world can be connected to the Internet. Cloud computing is the best option for the IoT backend. Microsoft Azure offers the offers a wide range of services, used in the Internet of Things solutions This presentation provides a deeper…

Azure Machine Learning Workbench

Microsoft has added a new Azure Machine Learning Workbench to a rich offer of solutions for building machine learning models. Let’s get to know the possibilities of this solution during the session, going through the typical machine learning path. We will see a variety of scenarios ranging from local work, containers, as well as working…

Azure SQL Database – yesterday, today and tommorow

Od dłuższego czasu obserwujemy dynamiczny rozwój usługi SQL Database – coraz więcej korzyści wynikających z przenoszenia rozwiązania do modelu PaaS, nowe możliwości w Data Platform, dostępne najpierw w chmurze. Podczas sesji przedstawię najnowsze zmiany i kierunki w których następuje ewolucja SQL Database w zakresie wydajności, bezpieczeństwa czy zarządzania usługą.

Azure SQL Datawarehouse – The Complete Introduction

Azure SQL Datawarehouse is not really the standard SQL Server that you could point your existing SSIS packages and reporting queries at and expect the same behaviour and performance. So why is that? And what should you do different to get the most out of Azure SQL DB? In this session I will explain what…

Brief introduction into SQL Server Execution Plans

The goal for this session is to present what Query Execution Plans are and how they are created. Attendees will learn how to get them, how to read them, and what is the most valuable information they deliver us. In addition, Marek Maśko will discuss few most popular operators and their important properties. This knowledge…

Building efficient and reliable Enterprise Reporting Platform with Microsoft Power BI

How to create and deploy Power BI reports that are fast and reliable? Join us and learn how to build serious, secure, and scalable reporting solutions with Power BI. We’ll cover the common patterns for Power BI usage, performance implications and best practices in deployment, monitoring, data source configurations, reports publishing and security aspects. You…

Challenges on GDPR from a database perspective

GDPR is the European Regulation that gives directives about data protection from an individual level. Every person in Europe is entitled to the some data rights which, at the organizational side, cause some major changes in how the information is managed and, consequently, affects the design and implementation of any applications that utilizes data. DBAs,…

Choosing the Right Database for Microservices Solutions

The past years have seen the explosive growth of large-scale distributed systems. This trend has changed the database world that is arguably without precedent in the history of the software industry. The result is a ompetitive database marketplace with a huge variety of platforms for us to choose from. But how do we go about…

Classifying support tickets using text analysis and machine learning

Machine learning is a very rapidly evolving branch of IT and you can find examples of companies using it almost in every industry. In my presentation I would like to talk about one particular company I collaborated with to build a solution for automatic classification of support tickets. I will discuss different tools and technics…

Cloud Data Wars: The Force Awakens

Each cloud has its own army of data services. Each one claims they are the best and unique. Let’s see if this is really the force awakens and who followed the light and who followed the dark side of the Force. May the Force of data services be with you…

Columnstore Clustered vs Nonclustered, when to choose what

The incredible Columnstore Indexes can increase your analytical query processing speed multiple times, they are updatable (Clustered from SQL Server 2014 and Nonclustered from SQL Server 2016 respectively), but they keep on supporting different sets of the functionalities – such as CDC (Nonclustered Columnstore) and LOBs (Clustered Columnstore), and this brings a great confusion. This…

Comparing Predictive Mining Models from R, Python, SSAS, and Azure ML

There is a lot of overlapping in Microsoft BI suite. For advanced analytics, like data mining, you can use SQL Server Analysis Services (SSAS), or R, or Python, or Azure ML. The question arises which tool to use. The answer is simple, through another question. Why not all of them? In data mining, you typically…

CosmosDB for (scary) developers

TBD

Data Lake praktycznie, czyli nie ważne jak zaczynasz, ważne jak kończysz

Wraz z upływem czasu rośnie wolumen danych w systemach sprzedażowych, a nie każda hurtownia danych jest gotowa dziesiątek terabajtów danych. Z pomocą przychodzi Data Lake, który jest w stanie przetworzyć takie wolumeny dzięki “nieograniczonej” możliwości skalowania. Jak podejść do tematu od A do Z, od źródeł danych po pierwsze analizy? Pokażemy jak w łatwy sposób…

Data Science in practice

Always wondering: What is data science? How can you apply its concepts, methods and tools to particular business cases? How does the implementation process look in practice? Our experts will try to answer all those questions during this session. Łukasz Grala Krzysztof Odasz Jakub Wawrzyniak

dbachecks – The PowerShell Module to validate your SQL Estate

As a SQL DBA you want to know that your SQL Server Estate is compliant with the rules that you have set up. Now there is a simple method to set this up using PowerShell and you can get the results in PowerBi or a report emailed to you on a schedule. Details such as…

Design Myths

Prepare for a surprising, intriguing, and even controversial session. 45 years after Codd defined the Relational Model, there are still so many parts misunderstood. For sure there are points where you will be surprised and change your view, even though you might have been convinced for years that your view is the correct one, and…

Dont’t Use Cursors – or Why You Maybe Should Use a Cursor After All

You might have heard “don’t use cursors, they are slow!”. In this presentation, you will learn that normally you should write set-based statements instead, and I will explain why they generally are faster. But I will also look at situations where using a loop for one reason or another is preferrable, and you will learn…

Examples of using Spark in Azure for data processing

Spark is a great technology for parallel computing and processing data. It was designed to be fast! And Azure could be a great way to try it. In this session we will look through the capabilities of Spark using a couple of real-world examples to ensure, that it is a great technology and really worth…

Get data insights faster with Data Wrangling

Self-Service Data Preparation aka Data Wrangling is becoming more and more popular in BigData and Enterprise world. Data Lake stores tons of different data and it is not easy to get value from it. Traditional approach doesn’t work here, data analysts and data scientists have to wait until IT guys gather requirements and prepare a…

Identity, Sequence i ROW_NUMBER

TBD

Image classification with Microsoft Cognitive Toolkit

Artificial intelligence is called electricity of the 21st century. One hundred years ago the popularization of electricity changed the ways in which we work, travel and rest – invention of electricity changed our whole world. Today, thanks to artificial intelligence, we are facing a revolution on a similar scale. The breakthrough was possible due to…

Integration of multi-source heterogeneous data using Python

Analyst often needs information from multiple sources with structured and unstructured data, often facing the data inconsistency and contradiction. The goal of the presentation is to give an overview of tools provided by Python environment to access, transform and combine data from multiple sources to obtain consistent information.

Kubernetes for data scientist

TBD

Laying the Foundations for GDPR Compliance

GDPR, no matter where you are if you are handling data on European data subjects. Laying a solid foundation of data security practices is vital to avoid the potential fines and damage to reputation that being non-compliant can bring. Practicing good data hygiene is vital to meeting compliance requirements, whether it is GDPR, PCI-DSS, HIPAA…

Level Up Your Biml: Best Practices and Coding Techniques

Is your Biml solution starting to remind you of a bowl of tangled spaghetti code? Good! That means you are solving real problems while saving a lot of time. The next step is to make sure that your solution does not grow too complex and confusing – you do not want to waste all that…

Lift and shift SQL, .Net, Python, R code into Big Data solutions

Big Data processing is not only limited to data search, but is also strongly connected with use of text analysis or image processing algorithms, where machine learning models can be applied. Azure Data Lake Analytics solves many of the problems related to the efficiency and scalability of the data analysis process, but also gives the…

Microsoft Azure for the DBA

Although Microsoft Azure and the concept of Cloud Computing has been around for a number of years it is still a mystery to many. This talk takes offers an introduction to Microsoft Azure and the many services it has to offer. We will then go onto look in depth at SQL Azure Database; creating, configuring,…

Microsoft R-Evolution – Operationalization with Microsoft R distribution

After Microsoft took over Revolution Analytics in 2015, they created a lot of R implementations, usage applications and extensions of available tools. The amount of them means that users may struggle to select and adapt the most appropriate of R distributions and make entire solution in production usage. During the session, you will get to…

Move part of your body to Azure Data Warehouse

Azure is cheaper, azure is faster, azure is more secure. Azure… everywhere is azure. Everywhere is data. Even if not today, certainly in the future (yes, believe me) you will face a case: how to move my data from premise Data Warehouse to Azure. This session will reveal the ideas how to do that and…

Non-trivial approach to table partitioning

Do you have very large tables in your database or do you want to effectively back up older data? Partitioning is the answer. At the beginning of the session, I will present a theory on why partitioning can be helpful for your database. Then, I will show practical methods concerning the application of partitioning in…

Odloty w T-SQL vol. 2

TBD

Opening, Keynote

Place of Data Vault 2.0 Architecture in data warehouses

Another look or addition to the existing architecture of our data warehouses? I will present the concept of Data Vault 2.0 architecture and its place in cooperation with other architectures. We will talk about the pros and cons of existing solutions and we will answer the question whether the new architecture is the answer to…

Power BI Report Server (PBIRS) – What / Why & How?

powerbi.com – as a cloud-based SaaS solution for self-service busines intelligence (BI) – is available since two years. Succeeding the Microsoft reporting roadmap announcements and it’s proposed cloud and on-premises feature parity the community had to wait for more than 1 year for news. With the release of the Power BI Report Server (PBIRS) in…

Power BI w praktyce

TBA

Query Store without SQL 2016 = Open Query Store

When SQL Server 2016 was released, it offered a fantastic new feature with the Query Store. Long term, statistics based, query tuning became a reality. But what about the thousands of servers that aren’t upgrading to SQL 2016 or newer? The open source project Open Query Store is designed to fulfill that need. This session…

Security Intelligence in Azure Sql PaaS

How to design secure solutions that use Azure Sql Database and Datawarehouse. Session covers security assessment and risk management of public cloud and specifically focuses on Azure Sql PaaS. That includes common network topologies with their strengths and weaknesses, blueprints and recommendations for hybrid and cloud-only solutions. In addition it talks about cloud-first, and in…

Solve daily indexing problems

Have you ever been in the situation that an application performed well the last day and – suddenly – you run into problems? Although you have Index Maintenance jobs and statistics up to date you will monitor a big degrade of performance! This session will demonstrate in a over 80 percent demo based presentation the…

Speed up your OLTP database with In-Memory

The decision to use memory-optimized tables puts SQL Server engine in a slightly different way of consuming resources. Our session will show you how to approach such issues as durability, natively compiled procedures, their limitations (along with workarounds), differences between versions of SQL Server, comparison with traditional tables and procedures, etc. After this session you…

SQL Server & Containers

This session aims to take attendees through what containers are and what benefits they bring. After this demo based session, attendees will be able to go back to their respective companies and be able to evaluate container technology to determine whether or not it will be of benefit. Attendees will be taken through the following:…

SQL Server vs. Windows Server – how to configure Windows for better SQL Server performance

Microsoft SQL Server is running on Windows Server Operating system, however, most of the OS Administrators treat SQL Server just like another basic application. On the session, I will show how steps taken before actual installation of SQL Server can impact performance. The session is a walkthrough on all of the layers starting on hardware,…

SSAS Multidimensional – optimize for large data warehouse

Working with a multidimensional model in Analysis Services that was built on the top of the large data warehouse I’ve encounter many problems that I’ve never had when working on the small projects. During this session I will share with the techniques that you can use to optimize even the very large cube, but we…

Start testing your SSIS packages

On this session I will show you how you can start automating testing of the packages and the whole projects

Stream Analytics i Machine Learning – czy to dobrze dobrana para na współczesne wyzwania?

Współczesne rozwiązania w obszarze zaawansowanej analityki danych muszą być gotowe na nowego typu wyzwanie: analizę danych pochodzących z różnych źródeł w czasie bliskim rzeczywistemu. Jednym z celów może być reagowanie na zdarzenia w social media czy dostarczanie spersonalizowanych ofert klientom dokładnie wtedy, kiedy tego rzeczywiście potrzebują. W trakcie prezentacji przedstawimy przykład implementacji takiego rozwiązania w…

The dark side of the force, namely the antipatterns

We can sometimes come across solutions to the problems which lead to some more problems. If too many people come up with the same idea or start to copy not the best solutions available, an antipattern appears. Some of them have already been given names. Let’s have a look at some common and interesting ones.

The Smart Way to Troubleshoot SQL Server Cluster and Availability Group Outages

SQL Server Failover Clustered Instances (FCI) and Availability Groups (AG) depend a lot on Windows Server Failover Clustering (WSFC). But when mission-critical databases go offline, bringing them back online could be a nerve-wracking experience. This session will look at understanding how the underlying WSFC platform works to help us maintain availability of our databases. This…

Transform your data with Power BI

For most business users and developers, Power BI is a great tool for data visualization with efficient tabular engine under the hood. For even more people, this software is a crucial part for every implementation of Data Warehouse and Analysis Services. However, that’s not all! Power BI gives us great opportunity to process data and…

What are Azure SQL Database Managed Instances?

The range of options for storing data in Microsoft Azure keeps growing, the most notable recent addition is the Managed Instance. But what is it, and why is it there? Join John as he walks through what they are and how you might start using them. Managed Instances add a new option for running workloads…

What Happened? Exploring the Plan Cache

Being pro-active, and using one of the various tracing capabilities of SQL Server, is one of the best ways to keep track of what is going on in your SQL Server and what might be causing performance problems. But does that mean if you haven’t set up any tracing, there is nothing you can do…

What is (NOT) possible with Microsoft SQL Server 2017 on Linux (on Azure)

A demo session on what is possible running Microsoft SQL Server on Linux (on Azure) and overview slides on what is NOT possible when deciding to use it.

Why should I care about SQL, if I have ORM?

Ok, we are starting a new project. Platform – .Net. Programming language – C#. Database – SQL Server. How will we work with the database? ORM. Most likely Entity Framework. Time to start! Unfortunately, this set is already enough to start the project 🙂 but not enough for its efficient and painless work in the…

Window Functions

TBD