databricks.com Open in urlscan Pro
2a04:4e42:e00::740  Public Scan

Submitted URL: https://databricks.com/dataaisummit/agenda?q_mailing_2NPTiYK1umGFyHWUExvVrEbAjdLX2NxH1V8Y=dWC1vxqYbC5tZeyssSPyxacPZozuH...
Effective URL: https://databricks.com/dataaisummit/agenda/?q_mailing_2NPTiYK1umGFyHWUExvVrEbAjdLX2NxH1V8Y=dWC1vxqYbC5tZeyssSPyxacPZozu...
Submission: On June 28 via manual from US — Scanned from DE

Form analysis 1 forms found in the DOM

<form class="Search"><label for="Search-items" class="css-1oktodv"><input name="Search-items" placeholder="Search" type="text" class="css-w48mtq"><button class="css-z96uao">Search</button></label></form>

Text Content

THIS WEBSITE USES COOKIES

We use cookies to optimize site functionality and give you the best possible
experience. If you click accept or continue to use the site after you are
presented this notice, you consent to this use. Learn More

[#OOI_PERSONAL_INFORMATION#]
Reject Cookies I ACCEPT Show details
I ACCEPT
Reject Cookies Allow selection Allow all cookies
Necessary
Preferences
Statistics
Marketing
Show details
Cookie declaration [#IABV2SETTINGS#] About
 Necessary (52)  Preferences (27)  Statistics (78)  Marketing (93)  Unclassified
(50)
Necessary cookies help make a website usable by enabling basic functions like
page navigation and access to secure areas of the website. The website cannot
function properly without these cookies.

NameProviderPurposeExpiryTypeCookieConsent [x11]CookiebotStores the user's
cookie consent state for the current domain1
yearHTTP__Host-airtable-sessionAirtableContains a specific ID for the current
session. This is necessary for running the website correctly. 1
yearHTTP__Host-airtable-session.sigAirtableContains a specific ID for the
current session. This is necessary for running the website correctly. 1
yearHTTPAWSELBCORS [x2]Airtable
SiteimproveRegisters which server-cluster is serving the visitor. This is used
in context with load balancing, in order to optimize user experience.
SessionHTTPbrwAirtableDetects and logs potential errors on third-party provided
functions on the website.1
yearHTTPlightstep/clock_state/lightstep.airtable.comAirtableNecessary for the
website's booking functionality. SessionHTMLmvAirtableMaintains website settings
across multiple visits. 1 dayHTTP__VASTUtil__aka.msNecessary for the
implementation of video-content on the website.PersistentHTML__cf_bm [x4]Marketo
Databricks.com
VimeoThis cookie is used to distinguish between humans and bots. This is
beneficial for the website, in order to make valid reports on the use of their
website.1 dayHTTPBIGipServer# [x2]Marketo
Databricks.comUsed to distribute traffic to the website on several servers in
order to optimise response times.SessionHTTPDEFAULTLOCALEDatabricks.comLanguage
setting as per the default locationSessionHTTPARRAffinitySameSite [x3]Azure
Microsoft
Databricks.comUsed to distribute traffic to the website on several servers in
order to optimise response
times.SessionHTTPCookieConsentPolicyDatabricks.comStores the user's cookie
consent state for the current domain1
yearHTTPLSKey-c$CookieConsentPolicyDatabricks.comDetermines whether the user has
accepted the cookie consent box. 1 yearHTTPrenderCtxDatabricks.comNecessary to
facilitate client-side-rendering, which allows the website to place website
scripts in the client’s browser.SessionHTTPsfdc-streamDatabricks.comUsed to
maintain the overall functionality of the website: Assigns the user to a server
and detects potential errors on specific servers, allowing the website to
reassign the users to another server. 1 dayHTTPAWSALBdataaisummit.comRegisters
which server-cluster is serving the visitor. This is used in context with load
balancing, in order to optimize user experience. 6
daysHTTPAWSALBCORS [x2]dataaisummit.com
AirtableRegisters which server-cluster is serving the visitor. This is used in
context with load balancing, in order to optimize user experience. 6
daysHTTPconnect.siddatabricks.searchunify.comThe cookie is necessary for secure
log-in and the detection of any spam or abuse of the
website.SessionHTTPai_session [x2]Microsoft
Databricks.comPreserves users states across page requests.1
dayHTTPARRAffinity [x2]Microsoft
Databricks.comUsed to distribute traffic to the website on several servers in
order to optimise response times.SessionHTTPuc_sessionDropboxDetermines whether
the user has accepted the cookie consent box. SessionHTTPAWSELBSiteimproveUsed
to distribute traffic to the website on several servers in order to optimise
response times.SessionHTTPCONSENT [x2]Google
YouTubeUsed to detect if the visitor has accepted the marketing category in the
cookie banner. This cookie is necessary for GDPR-compliance of the website. 2
yearsHTTPrc::aGoogleThis cookie is used to distinguish between humans and bots.
This is beneficial for the website, in order to make valid reports on the use of
their website.PersistentHTMLrc::cGoogleThis cookie is used to distinguish
between humans and bots. SessionHTMLcuAdelphicUsed to detect if the visitor has
accepted the marketing category in the cookie banner. This cookie is necessary
for GDPR-compliance of the website. 1 yearHTTPAI_bufferDatabricks.comUsed in
context with the "AI_sentBuffer" in order to limit the number of
data-server-updates (Azure). This synergy also allows the website to detect any
duplicate data-server-updates. SessionHTMLAI_sentBufferDatabricks.comUsed in
context with the "AI_buffer" in order to limit the number of data-server-updates
(Azure). This synergy also allows the website to detect any duplicate
data-server-updates. SessionHTMLli_gcLinkedInStores the user's cookie consent
state for the current domain2 yearsHTTPJSESSIONIDNew RelicPreserves users states
across page requests.SessionHTTP

Preference cookies enable a website to remember information that changes the way
the website behaves or looks, like your preferred language or the region that
you are in.

NameProviderPurposeExpiryTypelang [x2]LinkedInRemembers the user's selected
language version of a
websiteSessionHTTPbynderDatabricks.comPendingSessionHTTPCookieConsentBulkSetting-#CookiebotEnables
cookie consent across multiple websitesPersistentHTMLcftokendataaisummit.comThis
cookie is used to determine which type of device the visitor is using, so the
website can be properly formatted - This information is stored in the "CFID"
cookie.20 daysHTTP@@scroll# [x2]Databricks.com
delta.ioPendingSessionHTML_omappvs [x4]a.omappapi.com
Databricks.comThis cookie is used to determine if the visitor has visited the
website before, or if it is a new visitor on the website.1
dayHTTPbrowserTabId [x2]Databricks.comPendingSessionHTMLipify.ipapi.ipify.orgPendingSessionHTMLsummit_attendee_cookieDatabricks.comPending1
yearHTTPtheme-ui-color-modeDatabricks.comRemembers the user's preferences in
terms of font size and colours on the
website.PersistentHTML_gz_siddatabricks.searchunify.comCaptures and indexes user
search information to help improve search result relevance.1
dayHTTP_gz_taiddatabricks.searchunify.comCaptures and indexes user search
information to help improve search result relevance.1
yearHTTPi18nextLngdatabricks.searchunify.comDetermines the preferred language of
the visitor. Allows the website to set the preferred language upon the visitor's
re-entry. PersistentHTMLLTK__ec4ab66b13bb4a77b4e72932d2090ad2Adobe
Inc.PendingPersistentHTML__Host-logged-out-sessionDropboxUsed to implement or
transfer content through Dropbox. SessionHTTPgvcDropboxUsed to implement or
transfer content through Dropbox. 5 yearsHTTPlocaleDropboxThe cookie determines
the preferred language and country-setting of the visitor - This allows the
website to show content most relevant to that region and language.5
yearsHTTPlangDatabricks.comThe cookie determines the preferred language and
country-setting of the visitor - This allows the website to show content most
relevant to that region and
language.PersistentHTMLlanguageDatabricks.comDetermines the preferred language
of the visitor. Allows the website to set the preferred language upon the
visitor's re-entry. PersistentHTMLplayerVimeoSaves the user's preferences when
playing embedded videos from Vimeo.1 yearHTTPsync_activeVimeoContains data on
visitor's video-content preferences - This allows the website to remember
parameters such as preferred volume or video quality. The service is provided by
Vimeo.com.PersistentHTML

Statistic cookies help website owners to understand how visitors interact with
websites by collecting and reporting information anonymously.

NameProviderPurposeExpiryType_cltk [x3]The Trade Desk
MicrosoftRegisters statistical data on users' behaviour on the website. Used for
internal analytics by the website operator. SessionHTMLomVisits [x4]The Trade
Desk
Databricks.com
a.omappapi.comThis cookie is used to identify the frequency of visits and how
long the visitor is on the website. The cookie is also used to determine how
many and which subpages the visitor visits on a website – this information can
be used by the website to optimize the domain and its
subpages.PersistentHTMLomVisitsFirst [x4]The Trade Desk
a.omappapi.comThis cookie is used to count how many times a website has been
visited by different visitors - this is done by assigning the visitor an ID, so
the visitor does not get registered twice.PersistentHTMLvwoSn [x2]The Trade Desk
VWOThis cookie is set to make split-tests on the website, which optimizes the
website's relevance towards the visitor – the cookie can also be set to improve
the visitor's experience on a website.PersistentHTMLevents/1/#New RelicUsed to
monitor website performance for statistical purposes.SessionPixeljserrors/1/#New
RelicPendingSessionPixelHistory.storeAmazonContains an visitor ID - This is used
to track visitors' navigation and interaction on the website for internal
website-optimization. SessionHTMLc.gifMicrosoftCollects data on the user’s
navigation and behavior on the website. This is used to compile statistical
reports and heatmaps for the website owner.SessionPixelCLIDMicrosoftCollects
data on the user’s navigation and behavior on the website. This is used to
compile statistical reports and heatmaps for the website owner.1
yearHTTP1Databricks.comRegisters data on visitors' website-behaviour. This is
used for internal analysis and website optimization.
PersistentHTML_ga [x3]Google
Databricks.comRegisters a unique ID that is used to generate statistical data on
how the visitor uses the website.2 yearsHTTP_ga_# [x2]GoogleUsed by Google
Analytics to collect data on the number of times a user has visited the website
as well as dates for the first and most recent visit. 2
yearsHTTPcfiddataaisummit.comThis cookie is used in context with the "Cftoken"
cookie. The cookie stores a specific ID for the visitor and the visitor's device
and browser. 20 daysHTTP_clckMicrosoftCollects data on the user’s navigation and
behavior on the website. This is used to compile statistical reports and
heatmaps for the website owner.1 yearHTTP_clskMicrosoftRegisters statistical
data on users' behaviour on the website. Used for internal analytics by the
website operator. 1 dayHTTP_gat [x2]Databricks.com
GoogleUsed by Google Analytics to throttle request rate1
dayHTTP_gid [x2]Databricks.com
GoogleRegisters a unique ID that is used to generate statistical data on how the
visitor uses the website.1 dayHTTP_hjAbsoluteSessionInProgressHotjarThis cookie
is used to count how many times a website has been visited by different visitors
- this is done by assigning the visitor an ID, so the visitor does not get
registered twice.1 dayHTTP_hjFirstSeenHotjarThis cookie is used to determine if
the visitor has visited the website before, or if it is a new visitor on the
website.1 dayHTTP_hjIncludedInPageviewSample [x3]Hotjar
Databricks.comDetermines if the user's navigation should be registered in a
certain statistical place holder.1 dayHTTP_hjIncludedInSessionSample [x3]Hotjar
Databricks.comRegisters data on visitors' website-behaviour. This is used for
internal analysis and website optimization. 1
dayHTTP_hjRecordingLastActivityHotjarSets a unique ID for the session. This
allows the website to obtain data on visitor behaviour for statistical
purposes.SessionHTML_hjSession_#HotjarCollects statistics on the visitor's
visits to the website, such as the number of visits, average time spent on the
website and what pages have been read.1 dayHTTP_hjSessionUser_#HotjarCollects
statistics on the visitor's visits to the website, such as the number of visits,
average time spent on the website and what pages have been read.1
yearHTTP_hjTLDTestHotjarDetects the SEO-ranking for the current website. This
service is part of a third-party statistics and analysis service.
SessionHTTP_hp2_idDatabricks.comstores user_id, identity, other ids2
yearsHTTP_omappvp [x4]a.omappapi.com
Databricks.comThis cookie is used to determine if the visitor has visited the
website before, or if it is a new visitor on the website.11
yearsHTTP_vis_opt_exp_#_combiDatabricks.comUsed by Visual Website Optimizer to
ensure that the same user interface variant is displayed for each visit, if the
user is participating in a design experiment.99 daysHTTP_vis_opt_sVWOUsed by
Visual Website Optimizer to determine if the visitor is participating in a
design experiment.99 daysHTTP_vis_opt_test_cookieVWOUsed to check if the user's
browser supports cookies.SessionHTTP_vwo_dsVWOCollects data on the user's visits
to the website, such as the number of visits, average time spent on the website
and what pages have been loaded with the purpose of generating reports for
optimising the website content.3 monthsHTTP_vwo_snVWOCollects statistics on the
visitor's visits to the website, such as the number of visits, average time
spent on the website and what pages have been read.1 dayHTTP_vwo_uuidVWOUsed by
Visual Website Optimizer to ensure that the same user interface variant is
displayed for each visit, if the user is participating in a design experiment.10
yearsHTTP_vwo_uuid_v2VWOThis cookie is set to make split-tests on the website,
which optimizes the website's relevance towards the visitor – the cookie can
also be set to improve the visitor's experience on a website.1
yearHTTPitm_data [x3]Google
Databricks.comPendingSessionHTTPnmstatSiteimproveThis cookie contains an ID
string on the current session. This contains non-personal information on what
subpages the visitor enters – this information is used to optimize the visitor's
experience.999 daysHTTPpdfjs.historyDatabricks.comRemembers which and how many
PDF-documents have been downloaded or read by the user. This is used for
internal statistics. PersistentHTMLvidcdn-app.pathfactory.comCollects data on
visitor interaction with the website's video-content - This data is used to make
the website's video-content more relevant towards the visitor. 2
yearsHTTPSGoogleSets a unique ID for the session. This allows the website to
obtain data on visitor behaviour for statistical purposes.1
dayHTTP__Host-js_csrfDropboxUsed to implement or transfer content through
Dropbox. 3 yearsHTTP__Host-ssDropboxUsed to implement or transfer content
through Dropbox. 3 yearsHTTPtDropboxStores data on which websites the user has
visited.3 yearsHTTPBrowserId_secSalesforceRegisters statistical data on users'
behaviour on the website. Used for internal analytics by the website operator. 1
yearHTTPheat.aspxSiteimproveCollects data on the user’s navigation and behavior
on the website. This is used to compile statistical reports and heatmaps for the
website owner.SessionPixelimage.aspxSiteimproveRegisters statistical data on
users' behaviour on the website. Used for internal analytics by the website
operator. SessionPixelcollectGoogleUsed to send data to Google Analytics about
the visitor's device and behavior. Tracks the visitor across devices and
marketing channels.SessionPixelai_userDatabricks.comUsed by Microsoft
Application Insights software to collect statistical usage and telemetry
information. The cookie stores a unique identifier to recognize users on
returning visits over time.1 yearHTTPprevPageVisitDataDatabricks.comCollects
statistics on the user's visits to the website, such as the number of visits,
average time spent on the website and what pages have been
read.SessionHTMLAnalyticsSyncHistoryLinkedInUsed in connection with
data-synchronization with third-party analysis service. 29
daysHTTPpersonalization_idTwitter Inc.This cookie is set by Twitter - The cookie
allows the visitor to share content from the website onto their Twitter profile.
2 yearsHTTPp.gifAdobe Inc.Keeps track of special fonts used on the website for
internal analysis. The cookie does not register any visitor data.
SessionPixelvuidVimeoCollects data on the user's visits to the website, such as
which pages have been read.2 yearsHTTPanalyzeVWOThis cookie is used by the
website’s operator in context with multi-variate testing. This is a tool used to
combine or change content on the website. This allows the website to find the
best variation/edition of the site.SessionPixelv.gifVWOThis cookie is set to
make split-tests on the website, which optimizes the website's relevance towards
the visitor – the cookie can also be set to improve the visitor's experience on
a website.SessionPixelyt-player-headers-readableYouTubeUsed to determine the
optimal video quality based on the visitor's device and network settings.
PersistentHTML

Marketing cookies are used to track visitors across websites. The intention is
to display ads that are relevant and engaging for the individual user and
thereby more valuable for publishers and third party advertisers.

NameProviderPurposeExpiryType6suuidj.6sc.coRegisters user behaviour and
navigation on the website, and any interaction with active campaigns. This is
used for optimizing advertisement and for efficient retargeting. 2
yearsHTTP_6senseCompanyDetails [x3]The Trade Desk
GoogleUsed in context with Account-Based-Marketing (ABM). The cookie registers
data such as IP-addresses, time spent on the website and page requests for the
visit. This is used for retargeting of multiple users rooting from the same
IP-addresses. ABM usually facilitates B2B marketing
purposes.PersistentHTML_uetsid [x2]The Trade Desk
MicrosoftUsed to track visitors on multiple websites, in order to present
relevant advertisement based on the visitor's preferences.
PersistentHTML_uetsid_exp [x3]The Trade Desk
MicrosoftContains the expiry-date for the cookie with corresponding name.
PersistentHTML_uetvid [x2]The Trade Desk
MicrosoftUsed to track visitors on multiple websites, in order to present
relevant advertisement based on the visitor's preferences.
PersistentHTML_uetvid_exp [x3]The Trade Desk
MicrosoftContains the expiry-date for the cookie with corresponding name.
PersistentHTMLtotalCallsaka.msUsed in context with video-advertisement. The
cookie limits the number of times a user is shown the same advertisement. The
cookie is also used to ensure relevance of the video-advertisement to the
specific user. PersistentHTMLtotalCallsTimeoutaka.msUsed in context with
video-advertisement. The cookie limits the number of times a user is shown the
same advertisement. The cookie is also used to ensure relevance of the
video-advertisement to the specific user. PersistentHTMLrp.gifRedditNecessary
for the implementation of the Reddit.com's share-button
function.SessionPixelv1/beacon/img.gifb.6sc.coUsed in context with
Account-Based-Marketing (ABM). The cookie registers data such as IP-addresses,
time spent on the website and page requests for the visit. This is used for
retargeting of multiple users rooting from the same IP-addresses. ABM usually
facilitates B2B marketing purposes.SessionPixelMUID [x2]MicrosoftUsed widely by
Microsoft as a unique user ID. The cookie enables user tracking by synchronising
the ID across many Microsoft domains.1 yearHTTPSRM_BMicrosoftTracks the user’s
interaction with the website’s search-bar-function. This data can be used to
present the user with relevant products or services. 1
yearHTTPANONCHKMicrosoftRegisters data on visitors from multiple visits and on
multiple websites. This information is used to measure the efficiency of
advertisement on websites. 1 dayHTTPSMMicrosoftRegisters a unique ID that
identifies the user's device during return visits across websites that use the
same ad network. The ID is used to allow targeted
ads.SessionHTTPComponentDefStorage__MUTEX_XDatabricks.comUsed to track visitors
on multiple websites, in order to present relevant advertisement based on the
visitor's preferences.
PersistentHTMLGlobalValueProviders__MUTEX_XDatabricks.comUsed to track visitors
on multiple websites, in order to present relevant advertisement based on the
visitor's preferences.
PersistentHTMLGlobalValueProviders__MUTEX_YDatabricks.comCollects data on
visitor behaviour from multiple websites, in order to present more relevant
advertisement - This also allows the website to limit the number of times that
they are shown the same advertisement. PersistentHTMLpctrkDatabricks.comTracks
the individual sessions on the website, allowing the website to compile
statistical data from multiple visits. This data can also be used to create
leads for marketing purposes.1 yearHTTP__pdst [x2]cdn.pdst.fmUsed to track
visitors on multiple websites, in order to present relevant advertisement based
on the visitor's preferences. 1 yearHTML_an_uid [x3]j.6sc.co
Databricks.comPresents the user with relevant content and advertisement. The
service is provided by third-party advertisement hubs, which facilitate
real-time bidding for advertisers.6 daysHTTP_fbp Meta Platforms, Inc.Used by
Facebook to deliver a series of advertisement products such as real time bidding
from third party advertisers.3 monthsHTTP_gac_UA-#GoogleStores information about
ad campaigns from Google Adwords to show targeted ads to the visitor.3
monthsHTTP_gcl_auGoogleUsed by Google AdSense for experimenting with
advertisement efficiency across websites using their services. 3
monthsHTTP_gcl_awGoogleUsed to measure the efficiency of the website’s
advertisement efforts, by collecting data on the conversion rate of the
website’s ads across multiple websites.3 monthsHTTP_gd_session [x3]j.6sc.co
Databricks.comCollects visitor data related to the user's visits to the website,
such as the number of visits, average time spent on the website and what pages
have been loaded, with the purpose of displaying targeted ads.1
dayHTTP_gd_svisitor [x3]j.6sc.co
Databricks.comCollects visitor data related to the user's visits to the website,
such as the number of visits, average time spent on the website and what pages
have been loaded, with the purpose of displaying targeted ads.2
yearsHTTP_gd_visitor [x3]j.6sc.co
Databricks.comCollects visitor data related to the user's visits to the website,
such as the number of visits, average time spent on the website and what pages
have been loaded, with the purpose of displaying targeted ads.2
yearsHTTP_hjRecordingEnabledHotjarThis cookie is used to identify the visitor
and optimize ad-relevance by collecting visitor data from multiple websites –
this exchange of visitor data is normally provided by a third-party data-center
or ad-exchange.SessionHTML_mkto_trkMarketoContains data on visitor behaviour and
website interaction. This is used in context with the email marketing service
Marketo.com, which allows the website to target visitors via email. 2
yearsHTTP_rdt_uuidRedditUsed to track visitors on multiple websites, in order to
present relevant advertisement based on the visitor's preferences. 3
monthsHTTP_session_id [x3]Databricks.com
databricks.pathfactory.com
jukebox.pathfactory.comStores visitors' navigation by registering landing pages
- This allows the website to present relevant products and/or measure their
advertisement efficiency on other websites. 2 yearsHTTP_uetsidMicrosoftCollects
data on visitor behaviour from multiple websites, in order to present more
relevant advertisement - This also allows the website to limit the number of
times that they are shown the same advertisement. 1 dayHTTP_uetvidMicrosoftUsed
to track visitors on multiple websites, in order to present relevant
advertisement based on the visitor's preferences. 1
yearHTTPsnowplowOutQueue_#_post2CloudflareCollects statistical data related to
the user's website visits, such as the number of visits, average time spent on
the website and what pages have been loaded. The purpose is to segment the
website's users according to factors such as demographics and geographical
location, in order to enable media and marketing agencies to structure and
understand their target groups to enable customised online
advertising.PersistentHTMLsnowplowOutQueue_#_post2.expiresCloudflareCollects
statistical data related to the user's website visits, such as the number of
visits, average time spent on the website and what pages have been loaded. The
purpose is to segment the website's users according to factors such as
demographics and geographical location, in order to enable media and marketing
agencies to structure and understand their target groups to enable customised
online advertising.PersistentHTMLCOMPASSGooglePending1
dayHTTPMicrosoftApplicationsTelemetryDeviceIdMicrosoftSets a specific device ID.
This ID is used by Microsoft to track users' website behaviour and the
interaction with Microsoft application on the specific device. 1
yearHTTPMSFPCMicrosoftUsed widely by Microsoft as a unique user ID. The cookie
enables user tracking by synchronising the ID across many Microsoft domains.1
yearHTTPIDEGoogleUsed by Google DoubleClick to register and report the website
user's actions after viewing or clicking one of the advertiser's ads with the
purpose of measuring the efficacy of an ad and to present targeted ads to the
user.1 yearHTTPpagead/landing [x2]GoogleCollects data on visitor behaviour from
multiple websites, in order to present more relevant advertisement - This also
allows the website to limit the number of times that they are shown the same
advertisement.
SessionPixelpagead/viewthroughconversion/299401056GooglePendingSessionPixeltest_cookieGoogleUsed
to check if the user's browser supports cookies.1 dayHTTPtrMeta Platforms,
Inc.Used by Facebook to deliver a series of advertisement products such as real
time bidding from third party advertisers.SessionPixelads/ga-audiencesGoogleUsed
by Google AdWords to re-engage visitors that are likely to convert to customers
based on the visitor's online behaviour across
websites.SessionPixelNIDGoogleRegisters a unique ID that identifies a returning
user's device. The ID is used for targeted ads.6
monthsHTTPpagead/1p-user-list/#GoogleTracks if the user has shown interest in
specific products or events across multiple websites and detects how the user
navigates between sites. This is used for measurement of advertisement efforts
and facilitates payment of referral-fees between websites.SessionPixelhHeap
AnalyticsCollects data on user behaviour and interaction in order to optimize
the website and make advertisement on the website more relevant.
SessionPixelci_rtcAdelphicPending2 monthsHTTPbcookieLinkedInUsed by the social
networking service, LinkedIn, for tracking the use of embedded services.2
yearsHTTPbscookieLinkedInUsed by the social networking service, LinkedIn, for
tracking the use of embedded services.2 yearsHTTPlidcLinkedInUsed by the social
networking service, LinkedIn, for tracking the use of embedded services.1
dayHTTPUserMatchHistoryLinkedInUsed to track visitors on multiple websites, in
order to present relevant advertisement based on the visitor's preferences. 29
daysHTTPMC1MicrosoftUsed by Microsoft to keep statistics on what pages the user
has visited and how often an ad click leads to a purchase or other actions on
the advertiser's website.1 yearHTTPMS0MicrosoftUsed by Microsoft to keep
statistics on what pages the user has visited and how often an ad click leads to
a purchase or other actions on the advertiser's website.1
dayHTTPi/adsct [x2]Twitter Inc.The cookie is used by Twitter.com in order to
determine the number of visitors accessing the website through Twitter
advertisement content. SessionPixelmuc_adsTwitter Inc.Collects data on user
behaviour and interaction in order to optimize the website and make
advertisement on the website more relevant. 2 yearsHTTPi/jotTwitter Inc.Sets a
unique ID for the visitor, that allows third party advertisers to target the
visitor with relevant advertisement. This pairing service is provided by third
party advertisement hubs, which facilitates real-time bidding for advertisers.
SessionPixelRichHistoryTwitter Inc.Collects data on visitors' preferences and
behaviour on the website - This information is used make content and
advertisement more relevant to the specific visitor.
SessionHTMLs.gifVWORegisters user behaviour and navigation on the website, and
any interaction with active campaigns. This is used for optimizing advertisement
and for efficient retargeting. SessionPixelVISITOR_INFO1_LIVEYouTubeTries to
estimate the users' bandwidth on pages with integrated YouTube videos.179
daysHTTPYSCYouTubeRegisters a unique ID to keep statistics of what videos from
YouTube the user has seen.SessionHTTPyt.innertube::nextIdYouTubeRegisters a
unique ID to keep statistics of what videos from YouTube the user has
seen.PersistentHTMLyt.innertube::requestsYouTubeRegisters a unique ID to keep
statistics of what videos from YouTube the user has
seen.PersistentHTMLytidb::LAST_RESULT_ENTRY_KEYYouTubeStores the user's video
player preferences using embedded YouTube
videoPersistentHTMLyt-remote-cast-availableYouTubeStores the user's video player
preferences using embedded YouTube
videoSessionHTMLyt-remote-cast-installedYouTubeStores the user's video player
preferences using embedded YouTube
videoSessionHTMLyt-remote-connected-devicesYouTubeStores the user's video player
preferences using embedded YouTube
videoPersistentHTMLyt-remote-device-idYouTubeStores the user's video player
preferences using embedded YouTube
videoPersistentHTMLyt-remote-fast-check-periodYouTubeStores the user's video
player preferences using embedded YouTube
videoSessionHTMLyt-remote-session-appYouTubeStores the user's video player
preferences using embedded YouTube
videoSessionHTMLyt-remote-session-nameYouTubeStores the user's video player
preferences using embedded YouTube videoSessionHTML

Unclassified cookies are cookies that we are in the process of classifying,
together with the providers of individual cookies.

NameProviderPurposeExpiryTypebrandguidelines_collections_download:0FD7ECF9-0605-4D3A-A2D4-7EAD3CDEA1E2Databricks.comPendingSessionHTMLbrandguidelines_new_guide_creation_flow:0FD7ECF9-0605-4D3A-A2D4-7EAD3CDEA1E2Databricks.comPendingSessionHTMLbrandguidelines_translations:0FD7ECF9-0605-4D3A-A2D4-7EAD3CDEA1E2Databricks.comPendingSessionHTMLjwtDatabricks.comPendingPersistentHTMLjwt_expiryDatabricks.comPendingPersistentHTMLredirectTokenDatabricks.comPending1
dayHTTP##communityApp ~ #/s ~
#Databricks.comPendingPersistentIDB4Databricks.comPendingPersistentHTMLComponentDefStorage__MUTEX_YDatabricks.comPendingPersistentHTMLldsObjectInfo#loginApp2
~ /s ~
0DB3f000000KylMDatabricks.comPendingPersistentIDBLSKey-c$_gz_sidDatabricks.comPending1
dayHTTPLSKey-c$_gz_taidDatabricks.comPending1
yearHTTPLSKey-c$smartFacetsDatabricks.comPending1
dayHTTPLSSIndex:LOCAL{"namespace":"c"}Databricks.comPendingPersistentHTMLLSSNextSynthtic:LOCALDatabricks.comPendingPersistentHTMLrecordGVP55.0#loginApp2
~ /s ~
0DB3f000000KylMDatabricks.comPendingPersistentIDB__DBLCLK_REF_IDdataaisummit.comPending1
dayHTTPPERSISTENT_VIDEOdataaisummit.comPending1
dayHTTP__q_domainTestjs.qualified.comPendingSessionHTTP__q_state_KbmrrC8pEQRX5uYqjs.qualified.comPending10
yearsHTTP_dataaisummit_itm_dataDatabricks.comPendingSessionHTTP_hp2_#Databricks.comPending1
dayHTTP_hp2_hld130480384604.3428506230Databricks.comPending1
dayHTTP_hp2_hld433244471648.3428506230Databricks.comPending1
dayHTTP_hp2_hld43362648152.3428506230Databricks.comPending1
dayHTTP_hp2_hld553311712010.3428506230Databricks.comPending1
dayHTTP_hp2_hld637970992383.3428506230Databricks.comPending1
dayHTTP_hp2_hld684549727594.3428506230Databricks.comPending1
dayHTTP_hp2_hld724775637542.3428506230Databricks.comPending1
dayHTTP_hp2_hld844882896250.3428506230Databricks.comPending1
dayHTTP_hp2_id.#Databricks.comPending1
yearHTTP_hp2_ses_props.#Databricks.comPending1
dayHTTP_lbhq_eventscdn-app.pathfactory.comPendingPersistentHTML_lbvisitedcdn-app.pathfactory.comPendingPersistentHTML_lbvisitedcountcdn-app.pathfactory.comPendingPersistentHTML_pf_id.9afbCloudflarePending2
yearsHTTP_pf_ses.9afbCloudflarePending1
dayHTTPdb_country [x2]Databricks.comPending29
daysHTTPslTranslations-de-DEDatabricks.comPendingPersistentHTMLslTranslations-it-ITpinchjs-cdn.pinch.dev.smartling.netPendingPersistentHTMLsmartling_redirectDatabricks.comPending6
daysHTTPspBeaconPreflight_jukeboxTracker_jukeboxTrackerGooglePendingSessionHTMLspBeaconPreflight_jukeboxTracker_railsTrackerGooglePendingSessionHTMLwp-utm_allGooglePendingSessionHTTPanalytics/track.pngdatabricks.searchunify.comPendingSessionPixelAutoLearningdatabricks.searchunify.comPendingPersistentHTMLsmartFacetsdatabricks.searchunify.comPending1
dayHTTPcurrent_user_languageDatabricks.comPendingSessionHTTPsettingDatabricks.comPendingPersistentHTML

 [#IABV2_LABEL_PURPOSES#]  [#IABV2_LABEL_FEATURES#]  [#IABV2_LABEL_PARTNERS#]
[#IABV2_BODY_PURPOSES#]
[#IABV2_BODY_FEATURES#]
[#IABV2_BODY_PARTNERS#]

We use cookies to provide certain functionality of our services ('necessary' or
'essential' cookies) and to personalize content and ads, provide social media
features and analyze our traffic (optional cookies). We also may share
information about your use of our site with our social media, advertising and
analytics partners who may combine it with other information that you have
provided to them or that they have collected from your use of their services.
Please see our Cookie Policy and Privacy Policy for more details and if you
change your mind regarding cookies and would later like to opt out of
non-essential cookies.



Your consent applies to the following domains:
System.Collections.Generic.List`1[Cookiebot.Consent.Service.Domain.Domains.DomainListModel]


Cookie declaration last updated on 25.06.22 by Cookiebot
This site works best with JavaScript enabled.
Homepage
JUNE 27-30, 2022
SAN FRANCISCO + VIRTUAL
RegisterEnter virtual event

Agenda
Speakers
Trainings
Pricing
Special Events
Sponsors
Health
FAQ



AGENDA

View the agenda at a glance
Hide search filters
reset filters
Search
Difficulty Level

Session Type

Format

Category

Presentation Track

Industry

Monday
Tuesday
Wednesday
Thursday
Virtual


MONDAY

8:00am


Training
8:00am-12:00pm
Security Threat Detection with Databricks

Training
Virtual
See Details
Training
8:00am-12:00pm
Managing Machine Learning Models

Training
In-Person
See Details
Training
8:00am-12:00pm
Managing Machine Learning Models

Training
Virtual
See Details
Training
8:00am-12:00pm
Lakehouse with Delta Lake Deep Dive

Training
Virtual
See Details
Training
8:00am-12:00pm
Lakehouse with Delta Lake Deep Dive

Training
In-Person
See Details
Training
8:00am-12:00pm
Introduction to Databricks SQL

Training
In-Person
See Details
Training
8:00am-12:00pm
Introduction to Databricks SQL

Training
Virtual
See Details
Training
8:00am-12:00pm
End-to-End ETL with Databricks (V)

Training
Virtual
See Details
Training
8:00am-12:00pm
End-to-End ETL with Databricks

Training
In-Person
See Details
Training
8:00am-12:00pm
Databricks Platform Administration with Unity Catalog

Training
In-Person
See Details
Training
8:00am-12:00pm
Databricks Platform Administration with Unity Catalog

Training
Virtual
See Details
Training
8:00am-12:00pm
Databricks Lakehouse Overview

Training
In-Person
See Details
Training
8:00am-12:00pm
Databricks Lakehouse Overview

Training
Virtual
See Details
Training
8:00am-12:00pm
Build a Modern Data Stack With dbt and Databricks

Training
In-Person
Amy Chen
dbt Labs
See Details
Training
8:00am-5:00pm
Performance Tuning on Apache Spark

Training
In-Person
See Details
Training
8:00am-5:00pm
Performance Tuning on Apache Spark

Training
Virtual
See Details
Training
8:00am-5:00pm
Data Engineering with Databricks — Bundle: Day 1

Training
Virtual
See Details
Training
8:00am-5:00pm
Data Engineering with Databricks — Bundle: Day 1

Training
In-Person
See Details
Training
8:00am-5:00pm
Data Analysis with Databricks SQL

Training
In-Person
See Details
Training
8:00am-5:00pm
Data Analysis with Databricks SQL

Training
Virtual
See Details
Training
8:00am-5:00pm
Apache Spark Programming with Databricks - Bundle: Day 1

Training
Virtual
See Details
Training
8:00am-5:00pm
Apache Spark Programming with Databricks - Bundle: Day 1

Training
In-Person
See Details
Training
8:00am-5:00pm
Advanced Machine Learning with Databricks — Bundle: Day 1

Training
Virtual
See Details
Training
8:00am-5:00pm
Advanced Machine Learning with Databricks — Bundle: Day 1

Training
In-Person
See Details
Training
8:00am-5:00pm
Advanced Data Engineering with Databricks — Bundle: Day 1

Training
Virtual
See Details
Training
8:00am-5:00pm
Advanced Data Engineering with Databricks — Bundle: Day 1

Training
In-Person
See Details
Training
8:00am-6:00pm
Certification Exam Day 1

Certification Exam
In-Person
See Details
Nothing Left

1:00pm


Training
1:00pm-5:00pm
Lakehouse with Delta Lake Deep Dive

In-Person
See Details
Training
1:00pm-5:00pm
Lakehouse with Delta Lake Deep Dive

Training
Virtual
See Details
Training
1:00pm-5:00pm
Introduction to Databricks SQL

Training
Virtual
See Details
Training
1:00pm-5:00pm
Introduction to Databricks SQL

Training
In-Person
See Details
Training
1:00pm-5:00pm
End-to-End ETL with Databricks (V)

Training
Virtual
See Details
Training
1:00pm-5:00pm
End-to-End ETL with Databricks

Training
In-Person
See Details
Training
1:00pm-5:00pm
Deploying Machine Learning Models

Training
Virtual
See Details
Training
1:00pm-5:00pm
Deploying Machine Learning Models

Training
In-Person
See Details
Training
1:00pm-5:00pm
Databricks Platform Administration with Unity Catalog

Training
Virtual
See Details
Training
1:00pm-5:00pm
Databricks Platform Administration with Unity Catalog

Training
In-Person
See Details
Training
1:00pm-5:00pm
Databricks Lakehouse Overview

Training
Virtual
See Details
Training
1:00pm-5:00pm
Databricks Lakehouse Overview

Training
In-Person
See Details
Nothing Left

2:00pm

Partner Events
2:00pm-4:00pm
Partner Summit | Executive Keynote

Keynote
In-Person
Ali Ghodsi
Databricks
Reynold Xin
Databricks
+5
And five more
See Details

4:00pm

Partner Events
4:00pm-4:20pm
Partner Summit | Networking

Partner Events
In-Person
See Details

4:20pm

Partner Events
4:20pm-5:00pm
Partner Summit | Technology and Data Provider Keynote

Partner Events
In-Person
Adam Conway
Databricks
Jonathan Keller
Databricks
+3
And three more
See Details
Partner Events
4:20pm-5:00pm
Partner Summit | Consulting and System Integrator Keynote

Partner Events
In-Person
Kori O'Brien
Databricks
Ajay Singh
Databricks
+2
And two more
See Details

5:00pm

Partner Events
5:00pm-9:00pm
Partner Reception

Partner Events
In-Person
See Details

6:00pm

Meetup
6:00pm-9:00pm
The War in Ukraine: Challenges in Documenting War Crimes and Russian False Flags

Meetup
In-Person
Shawn Walker
Arizona State University
Michael Simeone
Arizona State University
+1
And one more
See Details


TUESDAY

8:30am

Keynote
8:30am-10:30am
Day 1 Opening Keynote
Join the Day 1 keynote to hear from Databricks co-founders - and original
creators of Apache Spark and Delta Lake - Ali Ghodsi, Matei Zaharia, and Reynold
Xin on how Databricks and the open source community is taking on the biggest
challenges in data. The talks will address the latest updates on the Apache
Spark and Delta Lake projects, the evolution of data lakehouse architecture, and
how companies like Ad...
Keynote
Hybrid
Ali Ghodsi
Databricks
Matei Zaharia
Databricks
Tristan Handy
dbt Labs
Reynold Xin
Databricks
Dave Weinstein
Adobe
Michael Armbrust
Databricks
Karthik Ramasamy
Databricks
Kerby Johnson
Amgen
Shant Hovsepian
Databricks
See Details

10:30am

Training
10:30am-6:00pm
Certification Exam Day 2

Certification Exam
In-Person
See Details
Expo Theater
10:30am-8:30pm
Summit Theater Programming | Tuesday

Lightning Talk
Expo Theater
In-Person
Srini Kadamati
Preset
Sheel Choksi
Ascend.io
+23
And twenty-three more
See Details

10:45am


Session
10:45am-11:20am
Unity Catalog: Journey to unified governance for your Data and AI assets on
Lakehouse

Databricks Experience (DBX)
Data Security and Governance
Hybrid
Todd Greenstein
Databricks
Yuyuan Tang
Databricks
See Details
Sponsored Session
10:45am-11:20am
Turning Fan Data Into an Asset

Intermediate
Delta Sharing
Sponsored Session
Hybrid
Itai Weiss
Databricks
Steve Touw
Immuta
+1
And one more
See Details
Session
10:45am-11:20am
Turning Big Biology Data into Insights on Disease – The Power of Circulating
Biomarkers

Intermediate
Healthcare and Life Sciences
Databricks, Machine Learning
Research
In-Person
Tao Long
Sapient
See Details
Sponsored Session
10:45am-11:20am
The Future of Data - What’s Next with Google Cloud

Beginner
Machine Learning
Sponsored Session
Hybrid
Bruno Aziza
Google
See Details
Session
10:45am-11:20am
Streaming ML Enrichment Framework Using Advanced Delta Table Features

Intermediate
MLOps
MLOps and DataOps
Hybrid
Peter Vasko
Emplifi
See Details
Session
10:45am-11:20am
Scaling ML at CashApp with Tecton

Machine Learning
MLOps and DataOps
Hybrid
Michael Barnathan
Square Up
Mike Del Balso
Tecton
See Details
Session
10:45am-11:20am
Journey to Solving Healthcare Price Transparency with Databricks and Delta Lake

Intermediate
Healthcare and Life Sciences
Databricks
Industry and Business Use Cases
In-Person
Narayanan Hariharasubramanian
Cigna
Ross Silberquit
Cigna
See Details
Session
10:45am-11:20am
Interactive Analytics on a Massive Scale Using Delta Lake

Advanced
Data lakehouse
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Hagai Attias
Akamai Technologies
See Details
Session
10:45am-11:20am
FutureMetrics: Using Deep Learning to Create a Multivariate Time Series
Forecasting Platform for Economic Strategic Planning

Intermediate
Financial Services
Data lake, Machine Learning
Industry and Business Use Cases
Hybrid
Matthew Wander
TD Bank
See Details
Session
10:45am-11:20am
Eliminating AI Risk—One Model Failure at a Time

Data Science, Machine Learning and MLOps
Hybrid
Yaron Singer
Robust Intelligence
See Details
Session
10:45am-11:20am
Destination Lakehouse: All Your Data, Analytics and AI on One Platform

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Erika Ehrli
Databricks
Bharath Gowda
Databricks
+1
And one more
See Details
Session
10:45am-11:20am
Defending Against Adversarial Model Attacks

Beginner
Machine Learning
Data Security and Governance
In-Person
Animesh Singh
IBM
Tommy Li
IBM
See Details
Session
10:45am-11:20am
Data Boards: A Collaborative and Interactive Space for Data Science

Intermediate
Apache Spark
Data Analytics, BI and Visualization
Hybrid
Paul Yang
Einblick
See Details
Session
10:45am-11:20am
Agile Data Engineering: Reliability and Continuous Delivery at Scale

Intermediate
Databricks
Data Engineering
Hybrid
Richa Singhal
Atlassian
esha shah
Atlassian
See Details
Nothing Left

11:30am


Session
11:30am-12:05pm
Towards Dynamic Microstructure: The Role of Machine Learning in the Next
Generation of Exchanges

Advanced
Financial Services
Machine Learning
Industry and Business Use Cases
Hybrid
Douglas Hamilton
Nasdaq
See Details
Session
11:30am-12:05pm
The Semantics of Biology—Vaccine and Drug Research with Knowledge Graphs and
Logical Inferencing on Apache Spark

Intermediate
Healthcare and Life Sciences
Apache Spark
Research
In-Person
John Hunter
GSK
See Details
Session
11:30am-12:05pm
The Road to a Robust Data Lake: Utilizing Delta Lake and Databricks to Map 150
Million Miles of Roads a Month

Intermediate
Public Sector
Databricks, Data Pipelines, Delta Lake
Data Engineering
Hybrid
Ofir Kerker
Nexar
Itai Yaffe
Databricks
See Details
Session
11:30am-12:05pm
The Modern Metadata Platform: What, Why, and How?

Data Security and Governance
In-Person
Mars Lan
Metaphor Data
See Details
Session
11:30am-12:05pm
Scaling Your Workloads with Databricks Serverless

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Nikhil Jethava
Databricks
Aaron Davidson
Databricks
+1
And one more
See Details
Session
11:30am-12:05pm
Modern Architecture of a Cloud-Enabled Data and Analytics Platform

Intermediate
Healthcare and Life Sciences
Security
Industry and Business Use Cases
In-Person
Dieter Krug
Bayer
Naghman Waheed
Bayer
See Details
Session
11:30am-12:05pm
Lessons Learned from Deidentifying 700 Million Patient Notes

Intermediate
Healthcare and Life Sciences
Databricks
Data Science, Machine Learning and MLOps
Hybrid
Nadaa Taiyab
Tegria
Lindsay Mico
Providence Health
See Details
Sponsored Session
11:30am-12:05pm
Driving Real-Time Data Capture and Transformation in Delta Lake with Change Data
Capture

Intermediate
Migration
Sponsored Session
Hybrid
Paul Lacey
Matillion
Paul Johnson
Matillion
See Details
Session
11:30am-12:05pm
Democratizing Metrics at Airbnb

Intermediate
Data Pipelines
Data Analytics, BI and Visualization
Hybrid
Toby Mao
Airbnb Inc
Shao Xie
Airbnb Inc
See Details
Session
11:30am-12:05pm
Data Warehousing on the Lakehouse

SQL and ecosystem, Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Franco Patano
Databricks
Jonathan Keller
Databricks
+1
And one more
See Details
Sponsored Session
11:30am-12:05pm
Data Mesh Implementation Patterns

Intermediate
Delta Sharing
Sponsored Session
Hybrid
Ken Gravenor
Mckesson
Sankalan Bhattacharjee
ACN
See Details
Session
11:30am-12:05pm
Beyond Monitoring: The Rise of Data Observability

Beginner
Data Engineering
Hybrid
Barr Moses
Monte Carlo Data
See Details
Session
11:30am-12:05pm
Batches, Streams, and Everything in between: Unifying Batch and Stream Storage
with Apache Pulsar and Lakehouse Architectures

Intermediate
Data lakehouse
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Addison Higham
StreamNative
See Details
Nothing Left

1:00pm

Keynote
1:00pm-2:00pm
Day 1 Afternoon Keynote
Details coming soon!
Keynote
Hybrid
Zhamak Dehghani
Thoughtworks
George Fraser
Fivetran
Eric Sun
Coinbase Inc.
Arsalan Tavakoli-Shiraji
Databricks
Francois Ajenstat
Tableau
Zaheera Valani
Databricks
See Details

2:05pm


Session
2:05pm-2:40pm
Towards a Modular Future: Reimagining and Rebuilding Kedro-viz for Visualizing
Modular Pipelines

Intermediate
Visualization
Data Analytics, BI and Visualization
Hybrid
Susanna Wong
QuantumBlack
See Details
Session
2:05pm-2:40pm
Securing Databricks on AWS Using Private Link

Advanced
Databricks, Security, MLOps, Databricks Experience (DBX)
Data Security and Governance
In-Person
Ioannis Papadopoulos
Databricks
Hemal Khatri
Databricks
See Details
Session
2:05pm-2:40pm
Protecting Personally Identifiable Information (PII)/PHI Data in Data Lake via
Column Level Encryption

Intermediate
Financial Services
Governance, Security
Industry and Business Use Cases
Hybrid
Chandiprasad Chintalapati
Northwestern Mutual
Keyuri Shah
Northwesternmutual Insurance
See Details
Session
2:05pm-2:40pm
Nixtla: Deep Learning for Time Series Forecasting

Beginner
Financial Services
Deep Learning
Data Science, Machine Learning and MLOps
Hybrid
Max Mergenthaler
Nixtla
See Details
Session
2:05pm-2:40pm
How Robinhood Built a Streaming Lakehouse to Bring Data Freshness from 24h to
Less Than 15 Mins

Intermediate
Financial Services
Data Pipelines
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Vikrant Goel
Robinhood
Balaji Varadarajan
Robinhood Markets
See Details
Sponsored Session
2:05pm-2:40pm
How McAfee Leverages Databricks on AWS at Scale

Advanced
Migration, Delta Lake, Streaming APIs and infrastructure
Sponsored Session
Hybrid
Hashem Raslan
McAfee
Kanishk Mahajan
AWS
See Details
Session
2:05pm-2:40pm
Delta Lake, the Foundation of Your Lakehouse

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Hagai Attias
Akamai Technologies
Himanshu Raja
Databricks
See Details
Session
2:05pm-2:40pm
Comprehensive Patient Data Self-Serve Environment and Executive Dashboards
Leveraging Databricks and Elasticsearch Processes

Beginner
Healthcare and Life Sciences
Data Pipelines
Industry and Business Use Cases
In-Person
DMITRIY BOYUK
OncoHealth
See Details
Session
2:05pm-2:40pm
Automating Model Lifecycle Orchestration with Jenkins

Intermediate
Machine Learning
MLOps and DataOps
Hybrid
Conrado Miranda
Verta
Liam Newman
Verta
See Details
Session
2:05pm-2:40pm
Automate Your Delta Lake or Practical Insights on Building Distributed Data Mesh

Intermediate
Data Pipelines
Data Engineering
Hybrid
Serge Smertin
Databricks
See Details
Session
2:05pm-2:40pm
Accelerating the Pace of Autism Diagnosis with Machine Learning Models

Advanced
Healthcare and Life Sciences
Deep Learning, Machine Learning, Python and Ecosystem
Research
In-Person
Anish Lakkapragada
Lynbrook High School / Stanford University
See Details
Sponsored Session
2:05pm-2:40pm
Accelerating Hybrid Data Mesh Implementation

Intermediate
Databricks
Sponsored Session
Hybrid
Timur Mehmedbasic
Avanade
See Details
Nothing Left

2:50pm


Session
2:50pm-3:25pm
Sink Framework Evolution in Apache Flink

Intermediate
Data lake
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Fabian Paul
Databricks
See Details
Session
2:50pm-3:25pm
Scaling Deep Learning on Databricks

Advanced
Deep Learning
MLOps and DataOps
Hybrid
Brian Law
Databricks
See Details
Session
2:50pm-3:25pm
Scaling AI Workloads with the Ray Ecosystem

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Jules Damji
Anyscale, Inc
See Details
Session
2:50pm-3:25pm
Power to the (SQL) People: Python UDFs in DBSQL

Intermediate
Databricks, Data lakehouse, SQL and ecosystem, Python and Ecosystem
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Stefania Leone
Databricks
Martin Grund
Databricks
See Details
Session
2:50pm-3:25pm
Neural Architecture Search for Inversion

Intermediate
Machine Learning
Research
In-Person
Licheng Zhang

See Details
Session
2:50pm-3:25pm
Multimodal Deep Learning Applied to E-commerce Big Data

Intermediate
Deep Learning
Data Science, Machine Learning and MLOps
Hybrid
Arthur Delaitre
Mirakl
Sang-hoon YOON
Mirakl
See Details
Session
2:50pm-3:25pm
Hassle-Free Data Ingestion into the Lakehouse

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Benyue Liu
Databricks
Burak Yavuz
Databricks
See Details
Session
2:50pm-3:25pm
Delta Lake Overview

Beginner
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Denny Lee
Databricks
Tathagata Das
Databricks
See Details
Sponsored Session
2:50pm-3:25pm
dbt and Databricks: Analytics Engineering on the Lakehouse

Beginner
Analytics and BI
Sponsored Session
Hybrid
Aaron Steichen
dbt Labs
See Details
Session
2:50pm-3:25pm
Databricks SQL Under the Hood: What's New with Live Demos

Databricks Experience (DBX)
Data Analytics, BI and Visualization
Hybrid
Can Efeoglu
Databricks
Miranda Luna
Databricks
See Details
Sponsored Session
2:50pm-3:25pm
Constraints, Democratization, and the Modern Data Stack - Building a Data
Platform At Red Ventures with Fivetran and Databricks

Intermediate
Data Pipelines
Sponsored Session
Hybrid
Brandon Beidel
Red Ventures
See Details
Session
2:50pm-3:25pm
Building Enterprise Scale Data and Analytics Platforms at Amgen

Intermediate
Healthcare and Life Sciences
Data lakehouse
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Deepak Abburi
Amgen
Vickye Jain
ZS Associates
See Details
Session
2:50pm-3:25pm
Building an Operational Machine Learning Organization from Zero and Leveraging
ML for Crypto Security

Intermediate
Financial Services
Apache Arrow, Security, Machine Learning
Industry and Business Use Cases
Hybrid
Anthony Tellez
BlockFi
See Details
Session
2:50pm-3:25pm
Automating Business Decisions Using Event Streams

Intermediate
Kafka
Data Analytics, BI and Visualization
Hybrid
Rohit Bose
Swim
See Details
Nothing Left

3:30pm

Industry Forum
3:30pm-5:00pm
Retail Industry Forum

Retail and Consumer Goods
Industry Forum
In-Person
Nick Hamilton
84.51
Barry Ralston
Shipt
+5
And five more
See Details
Industry Forum
3:30pm-5:45pm
Financial Services Industry Forum: The Future of Financial Services is Open with
Data and AI at Its Core

Financial Services
Industry Forum
In-Person
Jack Berkowitz
ADP
Junta Nakai
Databricks
+9
And nine more
See Details

4:00pm


Session
4:00pm-4:35pm
Why a Data Lakehouse is Critical During the Manufacturing Apocalypse

Intermediate
Manufacturing
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Brad Nicholas
Corning Incorporated
Heather Urbanek
Corning Inc.
See Details
Session
4:00pm-4:35pm
Serverless Kafka and Apache Spark in a Multi-Cloud Data Lakehouse Architecture

Intermediate
Kafka
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Kai Waehner
Confluent
See Details
Session
4:00pm-4:35pm
Secure Data Distribution and Insights with Databricks on AWS

Intermediate
Public Sector
Security
Data Security and Governance
In-Person
Nicole Murray
AWS
Kayla Grieme
Databricks
See Details
Sponsored Session
4:00pm-4:35pm
Pushing the limits of scale and performance for enterprise-wide analytics: A
fire-side chat with Akamai

Intermediate
Migration
Sponsored Session
Hybrid
Hagai Attias
Akamai Technologies
Arindam Chatterjee
Microsoft
See Details
Session
4:00pm-4:35pm
Productionizing Ethical Credit Scoring Systems with Delta Lake, Feature Store
and MLFlow

Intermediate
Financial Services
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Jeanne Choo
Databricks
See Details
Sponsored Session
4:00pm-4:35pm
Live Analytics: The next user engagement frontier

Beginner
ThoughtSpot
Sponsored Session
Hybrid
Martin Stangeland
Flyr
See Details
Session
4:00pm-4:35pm
Learn to Efficiently Test ETL Pipelines

Beginner
Data Pipelines, Apache Spark , Python and Ecosystem
Data Engineering
Hybrid
Jacqueline Bilston
Yelp
See Details
Session
4:00pm-4:35pm
Gazelle-Jni: A Middle Layer to Offload Spark SQL to Native Engines for Execution
Acceleration

Intermediate
SQL and ecosystem
Data Analytics, BI and Visualization
Hybrid
Zhichao Zhang
Kyligence
Weiting Chen
Intel
See Details
Session
4:00pm-4:35pm
Ensuring Correct Distributed Writes to Delta Lake in Rust with Formal
Verification

Advanced
Media and Entertainment
Internals
Data Engineering
Hybrid
QP Hou
Neuralink
See Details
Session
4:00pm-4:35pm
Efficient and Multi-Tenant Scheduling of Big Data and AI Workloads

Intermediate
Machine Learning
MLOps and DataOps
Hybrid
Chenya Zhang
Apple
Chaoran Yu
Apple
See Details
Session
4:00pm-4:35pm
Distributed Machine Learning at Lyft

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Anindya Saha
Lyft Inc.
Han Wang
Lyft Inc.
See Details
Session
4:00pm-4:35pm
Delta Sharing - A New Paradigm for Secure Data Sharing and Data Collaboration on
Lakehouse

Databricks Experience (DBX)
Data Security and Governance
Hybrid
Jay Bhankaria
Databricks
Celia Kung
Databricks
+1
And one more
See Details
Session
4:00pm-4:35pm
Amgen’s Journey To Building a Global 360 View of its Customers with the
Lakehouse

Healthcare and Life Sciences
Industry and Business Use Cases
In-Person
Bin Yuan
Amgen
Scott Hirayama
Amgen
See Details
Session
4:00pm-5:20pm
So Fresh and So Clean: Learn How to Build Real-Time Warehouses on Lakehouse

Intermediate
Data Pipelines
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Franco Patano
Databricks
Dillon Bostwick
Databricks
+1
And one more
See Details
Nothing Left

4:45pm


Session
4:45pm-5:20pm
Tackling Challenges of Distributed Deep Learning with Open Source Solutions

Intermediate
Deep Learning
Data Science, Machine Learning and MLOps
Hybrid
Amog Kamsetty
Anyscale
Antoni Baum
Anyscale
See Details
Session
4:45pm-5:20pm
Spark Inception: Exploiting the Apache Spark REPL to Build Streaming Notebooks

Intermediate
Internals
Data Engineering
Hybrid
Scott Haines
Nike
See Details
Session
4:45pm-5:20pm
Spark Data Source V2 Performance Improvement: Aggregate Push Down

Intermediate
Databricks, Iceberg, Scala and ecosystem, Internals, DNU - Open Source, Apache
Spark , Delta Lake
Data Engineering
Hybrid
DB Tsai
Apple
Huaxin Gao
Apple
See Details
Sponsored Session
4:45pm-5:20pm
Operational Analytics: Expanding the Reach of Data in the Lakehouse Era

Intermediate
Machine Learning
Sponsored Session
Hybrid
Boris Jabes
Census
See Details
Session
4:45pm-5:20pm
Mosaic: A Framework for Geospatial Analytics at Scale

Intermediate
Industry and Business Use Cases
Hybrid
Stuart Lynn
Databricks
Milos Colic
Databricks
See Details
Session
4:45pm-5:20pm
Implementing Data Governance 3.0 for the Lakehouse Era: Community-Led and
Bottom-Up

Intermediate
Governance
Data Security and Governance
In-Person
Prukalpa Sankar
Atlan
See Details
Session
4:45pm-5:20pm
How AT&T Data Science Team Solved an Insurmountable Big Data Challenge on
Databricks with Two Different Approaches using Photon and RAPIDS Accelerator for
Apache Spark

Intermediate
Media and Entertainment
Machine Learning
Industry and Business Use Cases
Hybrid
Chris Vo
AT&T
Hao Zhu
NVIDIA
See Details
Session
4:45pm-5:20pm
Data Lake for State Health Exchange Analytics using Databricks

Intermediate
Public Sector
Industry and Business Use Cases
In-Person
Deven Dharm
Deloitte
Perminder Bagri
Office of System Integration, State of CA
See Details
Session
4:45pm-5:20pm
Build an Enterprise Lakehouse for Free with Trino and Delta Lake

Data Engineering
Hybrid
Claudius Li
Starburst Data
Tom Nats
Starburst Data
See Details
Session
4:45pm-5:20pm
Auto Encoder Decoder-Based Anomaly Detection with the Lakehouse Paradigm

Intermediate
Deep Learning
Data Science, Machine Learning and MLOps
In-Person
Yinxi Zhang
Databricks
See Details
Session
4:45pm-5:20pm
Apache Arrow Flight SQL: High Performance, Simplicity, and Interoperability for
Data Transfers

Intermediate
Internals
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Jason Hughes
Dremio
See Details
Session
4:45pm-5:20pm
A Practitioner's Guide to Unity Catalog—A Technical Deep Dive

Advanced
Databricks Experience (DBX)
Data Security and Governance
Hybrid
Ifi Derekli
Databricks
Zeashan Pappa
Databricks
+1
And one more
See Details
Sponsored Session
4:45pm-5:20pm
A Low-Code Approach to 10x Data Engineering

Intermediate
Governance
Sponsored Session
Hybrid
Maciej Szpakowski
Prophecy
Raj Bains
Prophecy
See Details
Nothing Left

5:00pm

Industry Forum
5:00pm-6:30pm
Retail Industry Reception

Retail and Consumer Goods
Industry and Business Use Cases
In-Person
See Details

5:30pm


Session
5:30pm-6:05pm
Technical and Tactical Football Analysis Through Data

Beginner
Media and Entertainment
Analytics and BI
Industry and Business Use Cases
In-Person
Rafael Zambrano
LaLiga Tech
See Details
Session
5:30pm-6:05pm
Simon Whiteley + Denny Lee Live Ask Me Anything

Beginner
Data Pipelines
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Simon Whiteley
Advancing Analytics
Denny Lee
Databricks
See Details
Session
5:30pm-6:05pm
Search and Aggregations Made Easy with OpenSearch and NodeJS

Beginner
Data Analytics, BI and Visualization
Hybrid
Olena Kutsenko
Aiven
See Details
Session
5:30pm-6:05pm
Scaling Salesforce In-Memory Streaming Analytics Platform for Trillion Events
Per Day

Intermediate
Financial Services
Kafka, Data Pipelines, Apache Spark , Streaming APIs and infrastructure
Data Engineering
Hybrid
Dyno Fu
Salesforce
Kishore Reddipalli
Salesforce
See Details
Session
5:30pm-6:05pm
MLOps at DoorDash

Intermediate
Retail and Consumer Goods
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Hien Luu
DoorDash
See Details
Session
5:30pm-6:05pm
Meshing About with Databricks

Intermediate
Databricks, Data lake, Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Som Natarajan
Databricks
Jason Pohl
Databricks
See Details
Sponsored Session
5:30pm-6:05pm
Leveraging ML-Powered Analytics for Rapid Insights and Action (a demonstration)

Beginner
Machine Learning
Sponsored Session
Hybrid
Joel McKelvey
Sisu Data
See Details
Session
5:30pm-6:05pm
Improving Apache Spark Structured Streaming Application Processing Time by
Configurations, Code Optimizations, and Custom Data Source

Intermediate
Kafka
Data Engineering
Hybrid
Nir Dror
Akamai
Kineret Raviv
Akamai
See Details
Sponsored Session
5:30pm-6:05pm
How AARP Services, Inc. automated SAS transformation to Databricks using
LeapLogic—A cloud accelerator for transformation of legacy analytics, ETL, DW &
Hadoop

Advanced
Data Quality
Sponsored Session
Hybrid
Sanjay Sharma
Impetus Technologies
Junjun (Robert) Yue
AARP Services, Inc.
See Details
Session
5:30pm-6:05pm
Designing Better MLOps Systems

Beginner
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Chengyin Eng
Databricks
See Details
Session
5:30pm-6:05pm
DataFusion and Arrow: Supercharge Your Data Analytical Tool with a Rusty Query
Engine

Intermediate
Apache Arrow, Rust, Internals
Data Engineering
Hybrid
Daniël Heres
GoDataDriven
Andrew Lamb
InfluxData
See Details
Session
5:30pm-6:05pm
Coral and Transport: Portable SQL and UDFs for the Interoperability of Spark and
Other Engines

Intermediate
PrestoDB, SQL and ecosystem, Apache Spark
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Wenye Zhang
LinkedIn
Walaa Eldin Moustafa
LinkedIn
See Details
Session
5:30pm-6:05pm
Administrator Best Practices and Tips for Future-proofing your Databricks
Account

Databricks Experience (DBX)
Data Security and Governance
Hybrid
Vicky Avison
Plexure
Gaurav Bhatnagar
Databricks
+1
And one more
See Details
Session
5:30pm-6:10pm
Building an Analytics Lakehouse at Grab

Data Science, Machine Learning and MLOps
In-Person
Zulfikar Lazuardi Maulana
Grab
See Details
Nothing Left

5:45pm

Industry Forum
5:45pm-6:45pm
Financial Services Industry Reception

Financial Services
Industry Forum
In-Person
See Details

6:00pm

Evening Event
6:00pm-9:00pm
Opening Night Reception

Evening Event
In-Person
See Details


WEDNESDAY

8:30am

Keynote
8:30am-10:30am
Day 2 Opening Keynote
The Day 2 keynote focuses on the intersection of data and AI. AI visionaries
Andrew Ng, Peter Norvig, and Hilary Mason will dive deep into the latest
practices and applications of machine learning at scale. This session will also
provide a look into the latest advancements in MLflow, which is now the
second-most popular open-source machine learning project in the world. Finally,
AI leaders from John Deere a...
Keynote
Hybrid
Andrew Ng
DeepLearning.AI, Landing AI
Ali Ghodsi
Databricks
Hilary Mason
Hidden Door
Peter Norvig
Stanford's Human-Centered AI Institute and Google Inc
Ganesh Jayaram
John Deere
Kasey Uhlenhuth
Databricks
Patrick Wendell
Databricks
Alon Amit
Intuit
Michael Armbrust
Databricks
Manish Amde
Intuit
Stacy Kerkela
Databricks
See Details

10:30am

Expo Theater
10:30am-10:55am
Summit Theater Programming | Wednesday

Lightning Talk
Expo Theater
In-Person
Hyukjin Kwon
Databricks
Xiao Li
Databricks
+27
And twenty-seven more
See Details
Training
10:30am-6:00pm
Certification Exam Day 3

Certification Exam
In-Person
See Details

10:45am


Session
10:45am-11:20am
Security Best Practices for Lakehouse

Intermediate
Databricks, Security
Data Security and Governance
In-Person
David Veuve
Databricks
Arun Pamulapati
Databricks
See Details
Session
10:45am-11:20am
Predicting and Preventing Machine Downtime with AI and Expert Alerts

Intermediate
Manufacturing
Data lakehouse
Industry and Business Use Cases
Hybrid
Jayashree Karnam
John Deere
Jeremy Goebel
john Deere
See Details
Session
10:45am-11:20am
Opening the Floodgates: Enabling Fast, Unmediated End User Access to
Trillion-Row Datasets with SQL Data Warehouses

Intermediate
SQL and ecosystem, Dashboards
Data Analytics, BI and Visualization
Hybrid
Robert Hodges
Altinity
See Details
Session
10:45am-11:20am
How to Implement a Semantic Layer for Your Lakehouse

Intermediate
Data warehouse, SQL and ecosystem
Data Analytics, BI and Visualization
Hybrid
David Mariani
AtScale, Inc.
See Details
Sponsored Session
10:45am-11:20am
How Databricks is driving disruptive digital transformation in the airline
industry

Intermediate
Migration
Sponsored Session
Hybrid
Alan Grogan
Avanade
Zsolt Nadas
Wizz Air
See Details
Session
10:45am-11:20am
Fugue Tune: Distributed Hybrid Hyperparameter Tuning

Beginner
Dask, Machine Learning, DNU - Open Source, Apache Spark , Python and Ecosystem
Data Science, Machine Learning and MLOps
Hybrid
Jun Liu
Lyft
See Details
Session
10:45am-11:20am
Enabling BI in a Lakehouse Environment: How Spark and Delta Can Help With
Automating a DWH Development

Intermediate
Financial Services
Data lakehouse, SQL and ecosystem, Apache Spark , Delta Lake
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Ivana Pejeva
Microsoft
Yoshi Coppens
element61
See Details
Session
10:45am-11:20am
Enable Production ML with Databricks Feature Store

Databricks Experience (DBX)
Data Science, Machine Learning and MLOps
Hybrid
Avesh Singh
Databricks
Aakrati Talati
Databricks
See Details
Session
10:45am-11:20am
Dive Deeper into Data Engineering on Databricks

Databricks Experience (DBX)
Data Engineering
Hybrid
Frank Munz
Databricks
Paul Lappas
Databricks
+1
And one more
See Details
Session
10:45am-11:20am
DELETE, UPDATE, MERGE Operations in Data Source V2

Intermediate
Iceberg, Internals, Apache Spark
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Anton Okolnychyi
Apple
See Details
Session
10:45am-11:20am
dbt and Python—Better Together

Intermediate
Databricks, SQL and ecosystem, dbt, Python and Ecosystem
Data Engineering
In-Person
Drew Banin
dbt Labs
See Details
Sponsored Session
10:45am-11:20am
Building a Lakehouse on AWS for Less with AWS Graviton and Photon

Intermediate
Databricks, Data lakehouse, Delta Lake
Sponsored Session
Hybrid
Igor Alekseev
Amazon Web Services
Piyush Singh
Databricks
See Details
Session
10:45am-11:20am
Beyond Daily Batch Processing: Operational Trade-Offs of Microbatch,
Incremental, and Real-Time Processing for Your ETLs (and Your Team's Sanity)

Intermediate
Media and Entertainment
Data Pipelines
Data Engineering
Hybrid
Valerie Burchby
Netflix
See Details
Session
10:45am-11:20am
Achieve Machine Learning Hyper-Productivity with Transformers and Hugging Face

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Julien Simon
Hugging Face
See Details
Nothing Left

11:30am


Sponsored Session
11:30am-12:05pm
Your fastest path to Lakehouse and beyond

Intermediate
Analytics and BI
Sponsored Session
Hybrid
Nate Shea-han
Microsoft
See Details
Session
11:30am-12:05pm
Streaming Data into Delta Lake with Rust and Kafka

Intermediate
Media and Entertainment
Rust
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Christian Williams
Scribd
See Details
Session
11:30am-12:05pm
State-of-the-Art Natural Language Processing with Apache Spark NLP

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
David Talby
John Snow Labs
See Details
Session
11:30am-12:05pm
Scaling Privacy: Practical Architectures and Experiences

Intermediate
Financial Services
Security
Data Security and Governance
In-Person
Aaron Colcord
Privacera
Mei Gui
Databricks
See Details
Session
11:30am-12:05pm
Monitoring and Quality Assurance of Complex ML Deployments via Assertions

Advanced
Data Pipelines, Data Quality, Governance, Feature Engineering, Machine Learning,
Python and Ecosystem
Data Science, Machine Learning and MLOps
Hybrid
Daniel Kang
Stanford University
See Details
Session
11:30am-12:05pm
MLflow Pipelines: Accelerating MLOps from Development to Production

MLOps and DataOps
Hybrid
Xiangrui Meng
Databricks
Jin Zhang
Databricks
See Details
Session
11:30am-12:05pm
ML on the Lakehouse: Bringing Data and ML Together to Accelerate AI Use Cases

Databricks Experience (DBX)
Data Science, Machine Learning and MLOps
Hybrid
Kasey Uhlenhuth
Databricks
Prem Prakash
Databricks
+1
And one more
See Details
Sponsored Session
11:30am-12:05pm
How to get your data catalog implementation right the first time

Intermediate
Governance
Sponsored Session
Hybrid
Prukalpa Sankar
Atlan
See Details
Session
11:30am-12:05pm
FugueSQL—The Enhanced SQL Interface for Pandas and Spark DataFrames

Beginner
SQL and ecosystem, Pandas, Apache Spark
Data Engineering
Hybrid
Kevin Kho
Prefect
Han Wang
Lyft Inc.
See Details
Session
11:30am-12:05pm
Delta Live Tables: Modern software engineering and management for ETL

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Michael Armbrust
Databricks
See Details
Session
11:30am-12:05pm
Building Spatial Applications with Apache Spark and CARTO

Beginner
Databricks, Geospatial, Apache Spark
Data Analytics, BI and Visualization
Hybrid
Isabel Pozuelo
CARTO
Kent Marten
Databricks
See Details
Session
11:30am-12:05pm
Backfill Streaming Data Pipelines in Kappa Architecture

Intermediate
Kafka
Data Engineering
Hybrid
Sundaram Ananthanarayanan
Netflix Inc
Xinran Waibel
Netflix
See Details
Session
11:30am-12:05pm
Applied Predictive Maintenance in Aviation: Without Sensor Data

Intermediate
Manufacturing
Industry and Business Use Cases
Hybrid
Randy Provence
FedEx Express
David Taylor
FedEx Express
See Details
Session
11:30am-12:05pm
A Case Study in Rearchitecting an On-Premises Pipeline in the Cloud

Intermediate
Public Sector
Databricks, Apache Spark , Python and Ecosystem
Data Engineering
In-Person
Mary Clair Thompson
Duke University
See Details
Meetup
11:30am-1:00pm
Meetup | Women in Data and AI

Meetup
In-Person
Vini Jaiswal
Databricks
Aishwarya Srinivasan
Google
+2
And two more
See Details
Industry Forum
11:30am-1:30pm
Public Sector Industry Forum Lunch and Program

Public Sector
Industry Forum
In-Person
Rishi Tarar
CDC
Howard Levenson
Databricks
+6
And six more
See Details
Nothing Left

1:00pm

Keynote
1:00pm-2:00pm
Day 2 Afternoon Keynote
Details coming soon!
Keynote
Hybrid
Christopher Manning
Stanford University
Daphne Koller
insitro
Tarika Barrett
Girls Who Code
See Details

1:15pm

Industry Forum
1:15pm-2:15pm
Government Industry Coffee and Dessert Reception

Public Sector
Industry Forum
In-Person
See Details

2:05pm


Sponsored Session
2:05pm-2:40pm
The Future is Open - a Look at Google Cloud’s Open Data Ecosystem

Intermediate
Governance
Sponsored Session
Hybrid
Anagha Khanolkar
Google Cloud
See Details
Session
2:05pm-2:40pm
Smart Manufacturing: Real-time Process Optimization with Databricks

Intermediate
Manufacturing
Databricks
Industry and Business Use Cases
Hybrid
Ashwin Voorkarra
Tredence
Vamsi Krishna Bhupasamudram
Tredence
See Details
Session
2:05pm-2:40pm
Orchestration Made Easy with Databricks Workflows

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Robert Saxby
Databricks
Roland Fäustlin
Databricks
See Details
Session
2:05pm-2:40pm
Managing Straggler Executors at Apache Spark 3.3

Beginner
Internals, Apache Spark
Data Engineering
Hybrid
Alex Holmes
Apple Inc.
Dongjoon Hyun
Apple Inc.
See Details
Session
2:05pm-2:40pm
dbt + Machine Learning: What Makes a Great Baton Pass?

Intermediate
SQL and ecosystem, Data Pipelines, Feature Engineering, Machine Learning, dbt
Data Science, Machine Learning and MLOps
Hybrid
Sung Won Chung
dbt Labs
See Details
Session
2:05pm-2:40pm
Data-Centric Principles for AI Engineering

Data Science, Machine Learning and MLOps
In-Person
Vincent Chen
Snorkel AI
See Details
Sponsored Session
2:05pm-2:40pm
Complete Data Security and Governance Powered by Unity Catalog and Immuta

Intermediate
Governance
Sponsored Session
Hybrid
Jonathan Keller
Databricks
Steve Touw
Immuta
See Details
Sponsored Session
2:05pm-2:40pm
Competitive advantage hinges on predictive insights generated from AI! Build
powerful data-driven applications on the Databricks Data Lake with AI-ready
behavioral data from Snowplow

Intermediate
Machine Learning
Sponsored Session
In-Person
Nick King
Snowplow
See Details
Session
2:05pm-2:40pm
Chaos Engineering in the World of Large-Scale Complex Data Flow

Intermediate
LakeFS, Data Pipelines
Data Engineering
Hybrid
Adi Polak
Treeverse
See Details
Session
2:05pm-2:40pm
Building a Lakehouse for Data Science at DoorDash

Beginner
Retail and Consumer Goods
Data lakehouse
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Hien Luu
DoorDash
Brian Dirking
Databricks
See Details
Session
2:05pm-2:40pm
Accidentally Building a Petabyte-Scale Cybersecurity Data Mesh in Azure With
Delta Lake at HSBC

Data Engineering
Hybrid
Ryan Harris
HSBC
See Details
Session
2:05pm-2:40pm
A Modern Approach to Big Data for Finance

Intermediate
Financial Services
Delta Lake
Data Engineering
Hybrid
Leonid Rosenfeld
Nasdaq
Bill Dague
Nasdaq
See Details
Session
2:05pm-3:25pm
Deep Dive into the New Features of Apache Spark 3.2 and 3.3

Intermediate
SQL and ecosystem, Pandas, Databricks Experience (DBX), Apache Spark
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Xiao Li
Databricks
Wenchen Fan
Databricks
+1
And one more
See Details
Session
2:05pm-3:30pm
MLOps on Databricks: A How-To Guide

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Joseph Bradley
Databricks
Niall Turbitt
Databricks
+1
And one more
See Details
Nothing Left

2:50pm


Session
2:50pm-3:25pm
What's New with Delta Lake

Databricks
Data Engineering
Hybrid
Himanshu Raja
Databricks
See Details
Session
2:50pm-3:25pm
Scaling Up Machine Learning in Instacart Search for the 2020 Surge in Online
Shopping

Intermediate
Machine Learning
Industry and Business Use Cases
Hybrid
Tejaswi Tenneti
Instacart
See Details
Session
2:50pm-3:25pm
Rethinking Orchestration as Reconciliation: Software-Defined Assets in Dagster

Intermediate
Rust
Data Engineering
Hybrid
Sandy Ryza
Elementl
See Details
Session
2:50pm-3:25pm
Realize the Promise of Streaming with the Databricks Lakehouse Platform

Databricks Experience (DBX)
Data Engineering
Hybrid
Ray Zhu
Databricks
Steven Yu
Databricks
+1
And one more
See Details
Session
2:50pm-3:25pm
Moving from Apache Spark 2 to Apache Spark 3: Spark Version Upgrade at Scale in
Pinterest

Intermediate
Migration
Data Engineering
Hybrid
Zirui Li
Pinterest
Zaheen Aziz
Pinterest
See Details
Session
2:50pm-3:25pm
Migrating Complex SAS Processes to Databricks - Case Study

Intermediate
Public Sector
Industry and Business Use Cases
Hybrid
Jesse Beaumont
Tensile AI LLC
Uday Kumar
Akira Technologies
See Details
Sponsored Session
2:50pm-3:25pm
How EPRI Uses Computer Vision to Mitigate Wildfire Risks for Electric Utilities

Intermediate
Machine Learning
Sponsored Session
Hybrid
Nick Lee
Labelbox
Dexter Lewis
Electric Power Research Institute
See Details
Session
2:50pm-3:25pm
Enabling Learning on Confidential Data

Intermediate
Financial Services
Governance
Data Security and Governance
In-Person
Rishabh Poddar
Opaque Systems
See Details
Sponsored Session
2:50pm-3:25pm
Enabling Business Users to Perform Interactive Ad-Hoc Analysis over Delta Lake
with No Code

Beginner
Analytics and BI
Sponsored Session
In-Person
Prashant Soral
Sigma Computing
See Details
Session
2:50pm-3:25pm
Connecting the Dots with DataHub: Lakehouse and Beyond

Data Engineering
Hybrid
Shirshanka Das
Acryl Data
See Details
Session
2:50pm-3:25pm
Challenges in Time Series Forecasting

Beginner
Forecasting, Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Inbal Tadeski
Anodot
See Details
Session
2:50pm-3:25pm
An Advanced S3 Connector for Spark to Hunt for Cyber Attacks

Intermediate
Data Engineering
Hybrid
Wojciech Indyk
Hunters
Ada Sharoni
Hunters
See Details
Nothing Left

3:30pm

Industry Forum
3:30pm-5:00pm
The Future of Communications, Media & Entertainment Is Open With Data+AI at its
Core

Media and Entertainment
Industry Forum
In-Person
Rafael Zambrano
LaLiga Tech
Duan Peng
Warner Bros. Discovery
+5
And five more
See Details
Industry Forum
3:30pm-5:00pm
Manufacturing Industry Forum

Manufacturing
Industry Forum
In-Person
Aimee DeGrauwe
John Deere
Rob Saker
Databricks
+2
And two more
See Details
Industry Forum
3:30pm-5:00pm
Healthcare and Life Sciences Industry Forum

Healthcare and Life Sciences
Industry Forum
In-Person
Lindsay Mico
Providence Health
Jeffrey Reid
Regeneron Genetics Center
+6
And six more
See Details

4:00pm


Session
4:00pm-4:35pm
Unifying Data Science and Business: Artificial Intelligence Augmentation and
Integration into Production Business Applications

Beginner
Databricks, Dashboards, Governance, Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Ian Sotnek
AI Squared, inc
Jacob Renn
AI Squared, inc.
See Details
Session
4:00pm-4:35pm
Running a Low Cost, Versatile Data Management Ecosystem with Apache Spark at
Core

Intermediate
Financial Services
Data Pipelines
Data Engineering
Hybrid
Shariff Mohammed
Capital One
See Details
Session
4:00pm-4:35pm
Radical Speed on the Lakehouses: Photon under the hood

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Justin Breese
Databricks
Sriram Krishnamurthy
Databricks
See Details
Session
4:00pm-4:35pm
PySpark in Apache Spark 3.3 and Beyond

Intermediate
Python and Ecosystem
Data Engineering
Hybrid
Hyukjin Kwon
Databricks
Xinrong Meng
Databricks, Inc.
See Details
Sponsored Session
4:00pm-4:35pm
Practical Data Governance in a Large Scale Databricks Environment

Beginner
Governance
Sponsored Session
In-Person
Brad Nicholas
Corning Incorporated
Aaron Colcord
Privacera
See Details
Session
4:00pm-4:35pm
Migrating SAS to a Lakehouse on Databricks and S3

Beginner
Industry and Business Use Cases
Hybrid
Sri Ghattamaneni
Databricks
Rahul Shaw
Deloitte Consulting LLP
See Details
Sponsored Session
4:00pm-4:35pm
Migrate and Modernize your Data Platform with Confluent and Databricks

Beginner
Migration
Sponsored Session
In-Person
Peter Kennedy
Confluent
See Details
Session
4:00pm-4:35pm
Improving Interactive Querying Experience on Spark SQL

Intermediate
SQL and ecosystem
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Ashish Singh
Pinterest
See Details
Session
4:00pm-4:35pm
How to Build a Complete Security and Governance Solution Using Unity Catalog

Intermediate
Public Sector
Governance
Data Security and Governance
In-Person
Don Bosco Durai
Privacera
Zeashan Pappa
Databricks
See Details
Session
4:00pm-4:35pm
Correlation Over Causation: Cracking the Relationship Between User Engagement
and User Happiness

Intermediate
Analytics and BI
Data Analytics, BI and Visualization
Hybrid
Natalia Baryshnikova
Atlassian
Rameil Sarkis
Atlassian
See Details
Session
4:00pm-4:35pm
Cloud and Data Science Modernization of Veterans Affairs Financial Service
Center with Azure Databricks

Intermediate
Public Sector
Machine Learning
Industry and Business Use Cases
Hybrid
Cary Moore
Databricks
Scott Meier
Dept. of Veterans Affairs, Financial Services Center
See Details
Session
4:00pm-4:35pm
Cleanlab: AI to Find and Fix Errors in ML Datasets

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Curtis Northcutt
Cleanlab
See Details
Session
4:00pm-4:35pm
A Vision for the Future with Edge ML-Powered Devices

Intermediate
Retail and Consumer Goods
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Filipa Peleja
Levi Strauss & Co
See Details
Nothing Left

4:45pm


Sponsored Session
4:45pm-5:20pm
Take Databricks Lakehouse to the Max with Informatica

Intermediate
Machine Learning
Sponsored Session
In-Person
Rik Tamm-Daniels
Informatica
See Details
Session
4:45pm-5:20pm
Survey of Production ML Tech Stacks

Intermediate
Databricks, Governance, MLOps
Data Science, Machine Learning and MLOps
Hybrid
Marygrace Moesta
Databricks
Conor Murphy
Databricks
See Details
Session
4:45pm-5:20pm
Serving Near Real-Time Features at Scale

Intermediate
Data Engineering
Hybrid
Feng Xu
Uber
See Details
Session
4:45pm-5:20pm
Quick to Production with the Best of Both Apache Spark and Tensorflow on
Databricks

Intermediate
Retail and Consumer Goods
Deep Learning, MLOps, Machine Learning, Apache Spark
Data Science, Machine Learning and MLOps
In-Person
Ronny Mathew
Rue Gilt Groupe
See Details
Sponsored Session
4:45pm-5:20pm
Open source powers the modern data stack

Beginner
Data Pipelines
Sponsored Session
In-Person
Michel Tricot
Airbyte
See Details
Session
4:45pm-5:20pm
Low-Code Machine Learning on Databricks with AutoML

Databricks Experience (DBX)
Data Science, Machine Learning and MLOps
Hybrid
Nicolas Pelaez
Databricks
Stephanie Rivera
Databricks
See Details
Session
4:45pm-5:20pm
Fastest Speed to Market with Open-Source Retail Analytics Platform

Intermediate
Retail and Consumer Goods
Data lakehouse, Streaming APIs and infrastructure
Data Analytics, BI and Visualization
Hybrid
Sudhir Kulkarni
Lowes Inc.
See Details
Session
4:45pm-5:20pm
Enabling Advanced Analytics at The Department of State using Databricks

Intermediate
Public Sector
Industry and Business Use Cases
Hybrid
Mark Lopez
Deloitte
Alan Gersch
Deloitte
See Details
Session
4:45pm-5:20pm
Embedding Privacy by Design Into Data Infrastructure Through Open-Source,
Extensible Tooling

Intermediate
Privacy, Governance
Data Security and Governance
In-Person
Cillian Kieran
Ethyca
See Details
Session
4:45pm-5:20pm
Doubling the Capacity of the Data Platform Without Doubling the Cost

Data Engineering
Hybrid
Gavin Edgley
Databricks
R Tyler Croy
Scribd
See Details
Sponsored Session
4:45pm-5:20pm
Cloud Native Geospatial Analytics at JLL

Intermediate
Machine Learning
Sponsored Session
In-Person
Yanqing Zeng
JLL
Luis Sanz
CARTO
See Details
Sponsored Session
4:45pm-5:20pm
Building Scalable & Advanced AI based Language Solutions for R&D using
Databricks

Advanced
Sponsored Session
In-Person
Subadhra Parthasarathy
Deloitte
See Details
Session
4:45pm-5:20pm
Advanced Migrations: From Hive to SparkSQL

Intermediate
Migration, Apache Spark
Data Engineering
Hybrid
Zaheen Aziz
Pinterest
See Details
Nothing Left

5:00pm

Industry Forum
5:00pm-6:30pm
Manufacturing Industry Reception

Manufacturing
Industry Forum
In-Person
See Details
Industry Forum
5:00pm-6:30pm
Healthcare and Life Sciences Industry Forum Reception

Healthcare and Life Sciences
Industry Forum
In-Person
See Details
Industry Forum
5:00pm-6:30pm
Communications, Media, Entertainment Industry Reception

Media and Entertainment
Industry and Business Use Cases
In-Person
See Details

5:30pm


Sponsored Session
5:30pm-6:05pm
You have BI. Now what? Activate your data!

Beginner
Data Pipelines
Sponsored Session
In-Person
Ernest Prabhakar
Nauto, Inc.
Kashish Gupta
Hightouch
See Details
Session
5:30pm-6:05pm
What to Do When Your Job Goes OOM in the Night (Flowcharts!)

Data Engineering
Hybrid
Holden Karau
Netflix
Anya Bida
prophecy.io
See Details
Session
5:30pm-6:05pm
UIMeta: A 10X Faster Cloud-Native Apache Spark History Server

Intermediate
Apache Spark
Data Engineering
Hybrid
Lantao Jin
ByteDance
See Details
Sponsored Session
5:30pm-6:05pm
Turbocharge your AI/ML Databricks workflows with Precisely

Intermediate
Geospatial
Sponsored Session
In-Person
Diana Smith
Precisely
Mayank Kasturia
Precisely
See Details
Session
5:30pm-6:05pm
The Databricks Notebook: Front Door of the Lakehouse

Databricks Experience (DBX)
Data Science, Machine Learning and MLOps
Hybrid
Rafi Kurlansik
Databricks
Austin Ford
Databricks
See Details
Session
5:30pm-6:05pm
Mapping Data Quality Concerns to Data Lake Zones

Intermediate
Data lakehouse, Data lake
Data Security and Governance
In-Person
Stewart Bryson
Qualytics
See Details
Session
5:30pm-6:05pm
Implementing a Framework for Data Security and Policy at a Large Public Sector
Agency

Intermediate
Public Sector
Data Quality, Security
Industry and Business Use Cases
Hybrid
Dave Thomas
Deloitte
Danny Holloway
Immuta
See Details
Sponsored Session
5:30pm-6:05pm
Deliver Faster Decision Intelligence From Your Lakehouse

Beginner
Dashboards
Sponsored Session
In-Person
Ajay Khanna
Tellius
See Details
Session
5:30pm-6:05pm
Data Lakehouse and Data Mesh—Two Sides of the Same Coin

Intermediate
Retail and Consumer Goods
Data lakehouse, Data lake
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Max Schultze
Zalando
Arif Wider
Thoughtworks & HTW Berlin
See Details
Session
5:30pm-6:05pm
Building and Scaling Machine Learning-Based Products in the World's Largest
Brewery

Intermediate
Retail and Consumer Goods
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Dr. Renata Castanha
Anheuser-Busch InBev
See Details
Session
5:30pm-6:05pm
Apache Spark on Kubernetes—Lessons Learned from Launching Millions of Spark
Executors

Intermediate
Kubernetes, Apache Spark
Data Engineering
Hybrid
Zhou JIANG
Apple
Aaruna Godthi
Apple
See Details
Sponsored Session
5:30pm-6:05pm
AI powered Assortment Planning Solution

Intermediate
Machine Learning
Sponsored Session
In-Person
Shankar Radhakrishnan
Mindtree
See Details
Nothing Left

6:30pm

Meetup
6:30pm-8:30pm
Delta Lake Contributors and Committers Meet and Greet .. and Birthday Party

Meetup
In-Person
Michael Armbrust
Databricks
Dominique Brezinski
Apple
See Details

7:00pm

Evening Event
7:00pm-9:00pm
Datatorium Party

Evening Event
In-Person
See Details


THURSDAY

8:00am


Training
8:00am-12:00pm
Lakehouse with Delta Lake Deep Dive

Training
Virtual
See Details
Training
8:00am-12:00pm
Databricks Lakehouse Overview

Training
Virtual
See Details
Training
8:00am-5:00pm
Performance Tuning on Apache Spark

Training
Virtual
See Details
Training
8:00am-5:00pm
Performance Tuning on Apache Spark

Training
In-Person
See Details
Training
8:00am-5:00pm
Data Engineering with Databricks — Bundle: Day 2

Training
In-Person
See Details
Training
8:00am-5:00pm
Data Engineering with Databricks — Bundle: Day 2

Training
Virtual
See Details
Training
8:00am-5:00pm
Apache Spark Programming with Databricks - Bundle: Day 2

Training
Virtual
See Details
Training
8:00am-5:00pm
Apache Spark Programming with Databricks - Bundle: Day 2

Training
In-Person
See Details
Training
8:00am-5:00pm
Advanced Machine Learning with Databricks — Bundle: Day 2

Training
In-Person
See Details
Training
8:00am-5:00pm
Advanced Machine Learning with Databricks — Bundle: Day 2

Training
Virtual
See Details
Training
8:00am-5:00pm
Advanced Data Engineering with Databricks — Bundle: Day 2

Training
Virtual
See Details
Training
8:00am-5:00pm
Advanced Data Engineering with Databricks — Bundle: Day 2

Training
In-Person
See Details
Training
8:00am-6:00pm
Certification Exam Day 4

Certification Exam
In-Person
See Details
Nothing Left

8:30am


Sponsored Session
8:30am-9:05am
Supercharge your SaaS applications with a modern, cloud-native database

Intermediate
Analytics and BI
Sponsored Session
Hybrid
Shireesh Thota
SingleStore
See Details
Session
8:30am-9:05am
Presto On Spark: A Unified SQL Experience

Intermediate
SQL and ecosystem
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Shradha Ambekar
Intuit
See Details
Session
8:30am-9:05am
Optimizing Speed and Scale of User-Facing Analytics Using Apache Kafka and Pinot

Beginner
Kafka, Analytics and BI
Data Engineering
Hybrid
Karin Wolok
StarTree
Neha Pawar
StarTree
See Details
Session
8:30am-9:05am
Migrate Your Existing DAGs to Databricks Workflows

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Robert Saxby
Databricks
Jan van der Vegt
Databricks
See Details
Session
8:30am-9:05am
Implementing an End-to-End Demand Forecasting Solution Through Databricks and
MLflow

Intermediate
Retail and Consumer Goods
Feature Engineering
Data Science, Machine Learning and MLOps
Hybrid
Ivana Pejeva
Microsoft
Yoshi Coppens
element61
See Details
Session
8:30am-9:05am
Evolution of Data Architectures and How to Build a Lakehouse

Beginner
Data lakehouse, Data lake, Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Vini Jaiswal
Databricks
See Details
Sponsored Session
8:30am-9:05am
Data Mesh in Action – Building Data Mesh Architecture Pattern with LTI Canvas
Alcazar

Intermediate
Databricks
Sponsored Session
In-Person
Abhishek Patel
LTI
See Details
Sponsored Session
8:30am-9:05am
Customer-centric Innovation to Scale Data & AI Everywhere

Beginner
Machine Learning
Sponsored Session
In-Person
Lakshman Chari
Intel Corporation
See Details
Nothing Left

9:15am


Session
9:15am-9:50am
Tools for Assisted Apache Spark Version Migrations, From 2.1 to 3.2+

Intermediate
Media and Entertainment
Apache Spark
Data Engineering
Hybrid
Holden Karau
Netflix
See Details
Session
9:15am-9:50am
The Future of Data Partnerships: Improving Business Outcomes Through Multi-Cloud
Query Execution

Intermediate
Machine Learning
Data Security and Governance
In-Person
Nick Elledge
LiveRamp
Scott Baker
LiveRamp
See Details
Session
9:15am-9:50am
Simplifying Migrations to Lakehouse—the Databricks Way

Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Ron Guerrero
Databricks
Ramachandran Venkat
Databricks
See Details
Session
9:15am-9:50am
Predicting Repeat Admissions to Substance Abuse Treatment with Machine Learning

Intermediate
Healthcare and Life Sciences
Microsoft Power BI, Model Interpretability, Deep Learning, DNU - MLflow
Data Science, Machine Learning and MLOps
Hybrid
Jennifer Morizzo
Maritz
Kelsey Emnett
Kimberly Clark
See Details
Session
9:15am-9:50am
Powering Up the Business with a Lakehouse

Intermediate
Retail and Consumer Goods
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Ricardo Simon Moreira Wagenmaker
Wehkamp
See Details
Sponsored Session
9:15am-9:50am
OvalEdge: End-To-End Data Governance

Beginner
Machine Learning
Sponsored Session
In-Person
Rachel Lutz
OvalEdge
See Details
Session
9:15am-9:50am
Diving into Delta Lake Integrations, Features, and Roadmap

Beginner
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Denny Lee
Databricks
Tathagata Das
Databricks
See Details
Session
9:15am-9:50am
DBA Perspective—Optimizing Performance Table-by-Table

Advanced
Data lakehouse, Internals, Databricks Experience (DBX), Delta Lake
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Douglas Moore
Databricks
See Details
Session
9:15am-9:50am
Adversarial Drifts, Model Monitoring, and Feedback Loops: Building
Human-in-the-Loop Machine Learning Systems for Content Moderation

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Nihit Desai
Refuel.AI
See Details
Nothing Left

10:00am


Session
10:00am-10:35am
Sound Data Engineering in Rust—From Bits to DataFrames

Advanced
Rust
Data Engineering
Hybrid
Jorge Leitao
Munin Data
See Details
Session
10:00am-10:35am
Real-Time Search and Recommendation at Scale Using Embeddings and Hopsworks

Intermediate
Machine Learning, Apache Spark
Data Science, Machine Learning and MLOps
Hybrid
Jim Dowling
Hopsworks
See Details
Session
10:00am-10:35am
Git for Data Lakes—How lakeFS Scales Data Versioning to Billions of Objects

Intermediate
Data lake
Data Lakes, Data Warehouses and Data Lakehouses
In-Person
Oz Katz
Treeverse LTD
See Details
Session
10:00am-10:35am
GIS Pipeline Acceleration with Apache Sedona

Intermediate
Databricks, Geospatial, Apache Spark
Data Engineering
In-Person
Alihan Zihna
CKDelta
Fernando Ayuso Palacios
CKDelta (Hutchison Group)
See Details
Sponsored Session
10:00am-10:35am
Emerging Data Architectures & Approaches for Real-Time AI using Redis

Sponsored Session
In-Person
Sam Partee
Redis
See Details
Session
10:00am-10:35am
Discover Data Lakehouse With End-to-End Lineage

Databricks, Apache Spark , Delta Lake
Data Engineering
In-Person
Tao Feng
Databricks
See Details
Session
10:00am-10:35am
Deep Dive: How to Build Your Modern Data Stack on Databricks to Solve Modern
Problems

Intermediate
Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Amit Kara
Databricks
Tahir Fayyaz
Databricks
See Details
Session
10:00am-10:35am
Building Metadata and Lineage Driven Pipelines on Kubernetes

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
YI-HONG WANG
IBM
Tommy Li
IBM
See Details
Nothing Left

10:45am


Session
10:45am-11:20am
X-FIPE: eXtended Feature Impact for Prediction Explanation

Intermediate
Healthcare and Life Sciences
Machine Learning
Data Science, Machine Learning and MLOps
In-Person
Xingde Jiang
Humana
Steve Brunner
Humana
See Details
Session
10:45am-11:20am
Setting up On Shelf Availability Alerts at Scale with Databricks and Azure

Intermediate
Retail and Consumer Goods
Machine Learning
Industry and Business Use Cases
In-Person
Ann Sterle
Tredence
Sunil Ranganathan
Tredence
See Details
Session
10:45am-11:20am
Near Real-Time Analytics with Event Streaming, Live Tables, and Delta Sharing

Intermediate
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Christina Taylor
Carvana
See Details
Sponsored Session
10:45am-11:20am
Introduction to Flux and OSS Replication

Intermediate
Sponsored Session
In-Person
Zoe Steinkamp
InfluxData
See Details
Sponsored Session
10:45am-11:20am
Improving patient care with Databricks

Intermediate
Databricks
Sponsored Session
In-Person
Gurpreet Kaur
Integra Life Sciences
Shadab Hussain
Wipro
See Details
Session
10:45am-11:20am
How To Make Apache Spark on Kubernetes Run Reliably on Spot Instances

Intermediate
Kubernetes, Apache Spark , Streaming APIs and infrastructure
Data Engineering
Hybrid
Hudson Buzby
Spot.io
Jean-Yves Stephan
Spot by NetApp
See Details
Session
10:45am-11:20am
Building Production-Ready Recommender Systems with Feature Stores

Intermediate
Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Danny Chiao
Tecton
See Details
Session
10:45am-11:20am
Apache Spark AQE SkewedJoin Optimization and Practice in ByteDance

Intermediate
SQL and ecosystem
Data Engineering
Hybrid
Liu Thomas
字节跳动
See Details
Session
10:45am-12:05pm
How To Use Databricks SQL for Analytics on Your Lakehouse

Intermediate
Databricks Experience (DBX)
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Pearl Ubaru
Databricks
See Details
Nothing Left

11:30am


Session
11:30am-12:05pm
Recent Parquet Improvements in Apache Spark

Advanced
Internals
Data Engineering
Hybrid
Chao Sun
Apple
See Details
Session
11:30am-12:05pm
Real-Time Cost Reduction Monitoring and Alerting

Intermediate
Media and Entertainment
Industry and Business Use Cases
In-Person
Ofer Ohana
HuuugeGames
David Sellam
HuuugeGames
See Details
Sponsored Session
11:30am-12:05pm
Log Processing at Scale

Intermediate
Analytics and BI
Sponsored Session
In-Person
Greg McNutt
Pure Storage, Inc.
See Details
Session
11:30am-12:05pm
Introducing Zipline: An Open Source Feature Engineering Platform

Intermediate
MLOps, Feature Engineering, Machine Learning
Data Science, Machine Learning and MLOps
Hybrid
Nikhil Simha Raprolu
Airbnb
See Details
Sponsored Session
11:30am-12:05pm
How unsupervised machine learning can scale data quality monitoring in
Databricks

Intermediate
Machine Learning
Sponsored Session
In-Person
Jeremy Stanley
Anomalo
See Details
Session
11:30am-12:05pm
How socat and UNIX Pipes Can Help Data Integration

Intermediate
Kubernetes
Data Engineering
In-Person
Davin Chia
Airbyte
See Details
Session
11:30am-12:05pm
Cutting the Edge in Fighting Cybercrime: Reverse-Engineering a Search Language
to Cross-Compile it to PySpark

Intermediate
Financial Services
Data lakehouse
Data Lakes, Data Warehouses and Data Lakehouses
Hybrid
Serge Smertin
Databricks
Jude Ken-Kwofie
HSBC
+1
And one more
See Details
Session
11:30am-12:05pm
Apache Spark SQL Aggregate Improvement at Meta (Facebook)

Intermediate
Internals
Data Engineering
Hybrid
Cheng Su
Meta (Facebook)
Shipra Agrawal
Meta
See Details
Nothing Left

1:00pm

Training
1:00pm-5:00pm
Lakehouse with Delta Lake Deep Dive

Training
Virtual
See Details
Training
1:00pm-5:00pm
Databricks Lakehouse Overview

Training
Virtual
See Details


VIRTUAL SESSIONS

Virtual


Session
Vision AI—Animal Health Industry Use Cases Using Databricks on Azure

Intermediate
Healthcare and Life Sciences
Industry and Business Use Cases
Virtual
VirooPax Mirji
Microsoft
See Details
Session
Using Feast Feature Store with Apache Spark for Self-Served Data Sharing and
Analysis for Streaming Architectures

Intermediate
Data Security and Governance
Virtual
Sameer Mangalampalli
CoMatrix
See Details
Session
US Air Force: Safeguarding Personnel Data at Enterprise Scale

Beginner
Public Sector
Industry and Business Use Cases
Virtual
Chris Brown
Immuta
Derek Eichin
United States Air Force
See Details
Session
Time Series Forecasting with PyCaret

Intermediate
Machine Learning, Python and Ecosystem
Data Science, Machine Learning and MLOps
Virtual
Moez Ali
PyCaret
See Details
Session
Swedbank: Enterprise Analytics in Cloud

Intermediate
Financial Services
Governance, Machine Learning
Industry and Business Use Cases
Virtual
Vineeth Menon
Swedbank
See Details
Session
Spline: Central Data-Lineage Tracking, Not Only For Spark

Intermediate
MLOps and DataOps
Virtual
Danil Vagapov
Outreach
Oleksandr Vayda
ABSA
See Details
Session
Simplify Global DataOps and MLOps Using Okta’s FIG Automation Library

Intermediate
Data Science, Machine Learning and MLOps
Virtual
Gregory Fee
Okta
See Details
Session
Self-Serve, Automated and Robust CDC pipeline using AWS DMS, DynamoDB Streams
and Databricks Delta

Intermediate
Data Engineering
Virtual
Dibyendu Karmakar
Swiggy
See Details
Nothing Left

Virtual


Session
Scalable XGBoost on GPU Clusters

Intermediate
Feature Engineering, Machine Learning, DNU - Open Source, Apache Spark , Python
and Ecosystem
Data Science, Machine Learning and MLOps
Virtual
Bobby Wang
Nvidia
Jiaming Yuan
Nvidia
See Details
Session
ROAPI: Serve Not So Big Data Pipeline Outputs Online with Modern APIs

Beginner
Data Engineering
Virtual
QP Hou
Neuralink
See Details
Session
Privacy Preserving Machine Learning and Big Data Analytics Using Apache Spark

Intermediate
Data Security and Governance
Virtual
Qiyuan Gong
Intel
Chunyang Hui
Ant Group
See Details
Session
Presto 101: An Introduction to Open Source Presto

Beginner
SQL and ecosystem
Data Analytics, BI and Visualization
Virtual
Philip Bell
Meta
Rohan Pednekar
Ahana
See Details
Session
Powering Geospatial Data Science with Graph Machine Learning

Intermediate
Data Science, Machine Learning and MLOps
Virtual
Anirudh Shah
Iggy
See Details
Session
Polars: Blazingly Fast DataFrames in Rust and Python

Intermediate
Data Engineering
Virtual
Ritchie Vink
Xomnia BV
See Details
Session
Optimizing Incremental Ingestion in the Context of a Lakehouse

Intermediate
Data Engineering
Virtual
Ivana Pejeva
Microsoft
Yoshi Coppens
element61
See Details
Session
Obfuscating Sensitive Information from Spark UI and Logs

Intermediate
Data Security and Governance
Virtual
Yian Liou
Workday
See Details
Nothing Left

Virtual


Session
Moving to the Lakehouse: Fast & Efficient Ingestion with Auto Loader

Intermediate
Data Lakes, Data Warehouses and Data Lakehouses
Virtual
Eric Maynard
Databricks
Benyue Liu
Databricks
See Details
Session
Measuring the Success of Your Algorithm Using a Shadow System

Intermediate
Databricks, Data Pipelines, MLOps
MLOps and DataOps
Virtual
Florine Groenen
Gousto
See Details
Session
Lessons Learned Running RL Recommendation at Scale in Physical Retail Setting at
Starbucks

Intermediate
Industry and Business Use Cases
Virtual
Sulbha Jain
Starbucks
See Details
Session
Intermittent Demand Forecasting in Scale Using Meta-Modelling (Deep Auto
Regressive Linear Dynamic System)

Intermediate
Retail and Consumer Goods
Forecasting, Deep Learning, Machine Learning
Data Science, Machine Learning and MLOps
Virtual
Biswajit Pal
Walmart Global Tech
Abhishek Sengupta
Walmart Global Tech
See Details
Session
Integrating Apache Superset into a B2B Platform: Why and How

Intermediate
Superset
Data Analytics, BI and Visualization
Virtual
Eugene Bikkinin
Dodo Engineering
See Details
Session
Ingesting data into Lakehouse with COPY INTO

Beginner
Data Lakes, Data Warehouses and Data Lakehouses
Virtual
Yaohua Zhao
Databricks
Ruowang (Jackie) Zhang
Databricks
See Details
Session
How to Automate the Modernization and Migration of Your Data Warehousing
Workloads to Databricks Lakehouse

Intermediate
Data Lakes, Data Warehouses and Data Lakehouses
Virtual
Jared Hillam
BladeBridge
Simon Eligulashvilli
BladeBridge
See Details
Session
How the Largest County in the US is Transforming Hiring with a Modern Data
Lakehouse

Public Sector
Virtual
Roozan Zarifian
Los Angeles County’s Department of Human Resources
Majida Adnan
County of Los Angeles
See Details
Nothing Left

Virtual


Session
Graph-based stream processing

Intermediate
Data Analytics, BI and Visualization
Virtual
Ivan Despot
Memgraph
Dominik Tomicevic
Memgraph
See Details
Session
From PostGIS to Spark SQL: The History and Future of Spatial SQL

Advanced
Data Analytics, BI and Visualization
Virtual
Ernesto Martínez
CARTO
Matthew Forrest
CARTO
See Details
Session
Elixir: The Wickedly Awesome Batch and Stream Processing Language You Should
Have in Your Toolbox

Intermediate
Data Pipelines
Data Engineering
Virtual
Brian Femiano
Apple
See Details
Session
Disrupting the Prescription Drug Market with AI and Data

Intermediate
Healthcare and Life Sciences
Industry and Business Use Cases
Virtual
Luyuan Fang
Prescryptive Health
See Details
Session
Detecting Financial Crime Using an Azure Advanced Analytics Platform and MLOps
Approach

Intermediate
Data Science, Machine Learning and MLOps
Virtual
Lars Haringa
ABN AMRO N.V.
Saman Amini
ABN AMRO
See Details
Session
Deep-Dive into Delta Lake

Advanced
Data Lakes, Data Warehouses and Data Lakehouses
Virtual
Gerhard Brueckl
paiqo GmbH
See Details
Session
Databricks Meets Power BI

Intermediate
Databricks, Microsoft Power BI, SQL and ecosystem
Data Analytics, BI and Visualization
Virtual
Gerhard Brueckl
paiqo GmbH
See Details
Training
Databricks Certified Data Engineer Associate

Certification Exam
Virtual
See Details
Nothing Left

Virtual


Training
Databricks Certified Associate Developer for Apache Spark

Certification Exam
Virtual
See Details
Training
Databricks Certification Exam: Professional Data Engineer

Certification Exam
In-Person
See Details
Training
Databricks Certification Exam: Data Engineer Professional

Certification Exam
In-Person
See Details
Training
Databricks Certification Exam: Associate SQL Analyst

Certification Exam
Virtual
See Details
Training
Databricks Certification Exam: Associate SQL Analyst

Certification Exam
In-Person
See Details
Training
Databricks Certification Exam: Associate Developer for Apache Spark

Certification Exam
In-Person
See Details
Training
Databricks Certification Exam: Associate Data Engineer

Certification Exam
In-Person
See Details
Session
Databricks and Enterprise Observability with Overwatch

Beginner
In-Person, Virtual
Daniel Tomes
Databricks
Mohan Baabu
Databricks
See Details
Nothing Left

Virtual


Session
Data Policy in the Past, Present, and Future

Intermediate
Data Analytics, BI and Visualization
Virtual
Jacob Pasner
State Water Resources Control Board
See Details
Session
Data On-Board: The Aerospace Revolution

Intermediate
Manufacturing
Industry and Business Use Cases
Virtual
Miguel Martin Acosta
Airbus
See Details
Session
Computational Data Governance at Scale

Intermediate
Data Security and Governance
Virtual
Roman Storchak
Fozzy Group
See Details
Session
Cloud Fetch: High-bandwidth Connectivity With BI Tools

Intermediate
Data Lakes, Data Warehouses and Data Lakehouses
Virtual
Bogdan Ghit
Databricks
See Details
Session
Can ML Forecast Fashion Trends? What Should We Predict?

Intermediate
Industry and Business Use Cases
Virtual
Celine Xu
H&M group
See Details
Session
Building Recommendation Systems Using Graph Neural Networks

Intermediate
Media and Entertainment
Machine Learning
Data Science, Machine Learning and MLOps
Virtual
Swamy Sriharsha
Condé Nast
See Details
Session
Building a Data Science as a Service platform in Azure with Databricks.

Beginner
Data Science, Machine Learning and MLOps
Virtual
Terry McCann
Advancing Analytics
See Details
Session
Big Data in the Age of Moneyball

Beginner
Media and Entertainment
Industry and Business Use Cases
Virtual
Alexander Booth
Texas Rangers Baseball Club, LLC
Ryan Stoll
Texas Rangers Baseball Club
See Details
Nothing Left

Virtual


Session
Best Practices of Maintaining High-Quality Data

Intermediate
Data Quality, Governance, Machine Learning
Data Security and Governance
Virtual
Vidhi Chugh
NA
See Details
Session
Auditing Your Data and Answering the Lifelong Question—Is It the End of the Day
Yet?

Intermediate
MLOps
Virtual
Simona Meriam
Aidoc
See Details
Session
ÀLaSpark: Gousto's Recipe for Building Scalable PySpark Pipelines

Intermediate
Data Engineering
Virtual
Elena Martina
Gousto
Daniel Baron
Gousto
See Details
Session
AI-Fueled Forecasting: The Next Generation of Financial Planning

Intermediate
Financial Services
Industry and Business Use Cases
Virtual
Arunima Gupta
Deloitte
Eric Merrill
Deloitte
See Details
Session
Adversarial AI—The Nature of the Threat, Impacts, and Mitigation Strategies

Intermediate
Public Sector
Machine Learning
Data Security and Governance
Virtual
Edmon Begoli
Oak Ridge National Laboratory (ORNL)
See Details
Session
Accelerating MLOps Using Databricks and Vertex AI on GCP

Advanced
Data Science, Machine Learning and MLOps
Virtual
Deb Lee
Google Cloud
Ivan Nardini
Google Cloud
See Details
Session
Goodbye Hell of Unions in Spark SQL

Intermediate
Data Engineering
Virtual
Kazuaki Ishizaki
IBM
See Details
Nothing Left
Homepage


June 27-30, 2022
San Francisco + Virtual

Organized By

 * Agenda
 * Trainings
 * Speakers
 * Sponsors
 * Pricing
 * Health
 * FAQ
 * Awards

 * Event Policy
 * Code of Conduct
 * Privacy Policy

Apache, Apache Spark, Spark, and the Spark logo are trademarks of the Apache
Software Foundation. The Apache Software Foundation has no affiliation with and
does not endorse the materials provided at this event.