Submitted URL: https://www.graphiq.xyz/
Effective URL: https://lous.info/
Submission: On June 27 via api from US — Scanned from NL


TOM LOUS

Berkel & Rodenrijs · Netherlands · +31645528510 · info@graphiq.xyz · KVK
57477574

I'm a Data & ML Software Engineer
I develop Scala and Python software that runs on a Spark cluster, or dockerize
functional Scala microservices to run on a Kubernetes cluster.
I'm proficient with many tools for setting up big data ingestion & processing
pipelines in the cloud and deploying the results via a scalable API.
I'm also skilled in cleaning & analyzing huge amounts of data, followed by
training, validating & testing machine learning models and deploying them in
production.



Scala · ZIO · Cats · Apache Spark · Python · Java · Jenkins · Ansible ·
Snowflake · Dremio · Google Cloud Platform · Azure · AWS · MySQL · PostgreSQL ·
MongoDB · Neo4j · Elasticsearch · Cassandra · Parquet · Apache Avro ·
Apache Hadoop · Linux · Bash · Docker · Kubernetes · git · Apache Kafka ·
Scikit Learn · Apache Flink · Helm · Strimzi · Terraform





EXPERIENCE


SENIOR DATA ENGINEER

Eneco, Rotterdam

Building streaming data products on Azure

Python, Kafka, dbt, Snowflake, Rust.

April 2024 - Present


SENIOR SOFTWARE ENGINEER

DHL, Utrecht

Building streaming data products with ZIO 2, Scala 3, Kafka and Cassandra on
Azure Kubernetes

Scala, ZIO, Kafka, Azure, Kubernetes.

October 2022 - April 2024


LEAD DATA ENGINEER

Schiphol, Amsterdam

Leading the data factory team and implementing scalable data ingestion solutions
for a data mesh architecture using Spark, Scala, Databricks, Kafka and
Kubernetes (OpenShift)

Scala, Spark, ZIO, Azure, Kubernetes, Databricks.

November 2021 - October 2022


PRINCIPAL ENGINEER

Nike, Hilversum

Part of the Architecture Chapter

Spark, Python, AWS, Airflow, Ansible, Snowflake & Hadoop development.

June 2021 - November 2021


LEAD DATA ENGINEER

Shell, Rotterdam

Part of the Agile Hub

Spark, Scala, Azure, Kubernetes, Airflow, Terraform & Hadoop development.

June 2019 - June 2021


MACHINE LEARNING ENGINEER

VodafoneZiggo, Utrecht

Part of the Advanced Analytics Platform (AAP) and Technical Passport (TP)

(Py)Spark, Hive, Oozie & Hadoop development.

April 2019 - June 2019


BIG DATA ENGINEER

eBay Classifieds Group, Amsterdam

Data ingestion as a service (Kafka, Hadoop, Kubernetes) @ eBay's PE (Platform
Engineering) Team

Spark, Scala, Flink, Hadoop, Cassandra, Kafka & Machine Learning @ eBay's CDATA
(Central Data) Team

June 2018 - April 2019


BIG DATA & MACHINE LEARNING SOFTWARE ENGINEER

Datlinq Datalabs, Rotterdam

Building fully automatic data ingestion & processing pipelines in the cloud with
Scala, Spark and Airflow. Enrichment via machine learning and deployment via API
on top of Elasticsearch in Kubernetes

April 2016 - June 2018


« 15 YEARS OF DIVERSE TECH RELATED JOBS »

All over the place

In the past I've been a PHP web developer, MySQL database admin, Linux system
engineer, IT manager, team lead, frontend developer, iOS software developer,
etc. These roles are less relevant to the tools I'm using now, but they do make
me an experienced, well-rounded developer.

Check LinkedIn for the details, or read my story

Jan 2001 - April 2016


SPEAKER BIO


DEVOPS FOR DATA ENGINEERS

Young Maverics Training, Remote
Code & Slides · Organization
2020 - 2021


FUNCTIONAL PROGRAMMING IN SCALA

Young Maverics Training, Remote
Code & Slides · Organization
2020 - 2021


BUILDING & DEPLOYING SPARK APPLICATIONS

Young Maverics Training, Remote
Code & Slides · Organization
2020 - 2021


DEPLOYING APACHE SPARK JOBS ON KUBERNETES WITH HELM AND SPARK OPERATOR

Spark+AI Summit 2020, San Francisco
Video · Slides · Event
June 2020


APACHE AIRFLOW & APACHE SPARK DATA PIPELINES IN THE CLOUD

Data Driven Rijnmond Meetup
Slides · Event
January 2018


GOOGLING THE ERROR MESSAGE - 2

Days of Code
Slides · Event
July 2017


BUILDING A DATA INGESTION & PROCESSING PIPELINE WITH SPARK & AIRFLOW

Data Driven Rijnmond Meetup
Slides · Event
February 2017


BUILDING A DISTRIBUTED DATA PIPELINE

Days of Code
Slides · Event
July 2016


GOOGLING THE ERROR MESSAGE

Days of Code
Slides · Event
July 2016


PUBLISHED ARTICLES


BUILDING AN OPEN SOURCE SCALA GRPC/REST HTTP PROXY FOR KAFKA

Medium Part 1 - Part 2
March 2021


CI/CD FOR DATA ENGINEERS

Medium
February 2021


DEPLOYING APACHE SPARK JOBS ON KUBERNETES WITH HELM AND SPARK OPERATOR

Medium
January 2020


RECORD LINKING WITH APACHE SPARK’S MLLIB & GRAPHX

Medium
April 2017


RE-BECOMING A DEVELOPER

LinkedIn
April 2016


INSTALLING OCTAVE ON MAC OS X MOUNTAIN LION

BlogSpot
September 2012


SKILLS

Programming Languages & Tools


OPEN SOURCE CONTRIBUTIONS

 * GCP's Spark on Kubernetes Operator - Contributor - Spark on Kubernetes via
   Operator
 * Spark on Kubernetes Operator's Helm Chart - Contributor - Helm Chart of Spark
   on Kubernetes via Operator
 * Strimzi Kafka Operator - Contributor - Apache Kafka running on Kubernetes and
   OpenShift
 * Scalafiniti - Maintainer - Scala SDK wrapper around Datafiniti API
 * http4s-rho - Contributor - Self documenting (swagger) DSL for http4s web
   server
 * elasticsearch-client - Contributor - Elasticsearch client for Scala
 * solr-schema-dataimport-generator - Maintainer - Generate Solr config based on
   centralized template
 * « Many more OSS repos » - Mostly coursera related


INTERESTS

Apart from being a software engineer, I organize a regular meetup for data
enthusiasts in Rotterdam and surroundings, called Data Driven Rijnmond.

I also spend a lot of my free time at night learning new things using MOOCs like
Coursera, resulting in 34 certifications.

In my free time I enjoy spending time with my wife, daughter and son. I run,
swim, play tennis and golf and get in shape via a personal trainer. I also hold
1st dan (shodan) in Aikido.

I love puzzles and attempt to escape from about 4-5 escape rooms every year.

When forced indoors I like to read and watch YouTube (my favorite channels:
PBS Space Time, Mark Rober, Numberphile, SmarterEveryDay, 3Blue1Brown,
standupmaths, minutephysics, CGP Grey, Kurzgesagt, Veritasium, Critical Role,
Simone Giertz). I also play board games or catch up on some movies / series.


LINKS

STACK OVERFLOW



GITHUB

Tom Lous
Mostly harmless
⚲ Berkel en Rodenrijs, The Netherlands
275
Followers
102
Following
93
Repositories

--------------------------------------------------------------------------------

Top repositories
coursera-cryptography1
Python
★116
coursera-parallel-programming-scala
Scala
★15
coursera-exploratory-data-analysis-course-project-1
R
★7
Last active: 199 day(s) ago







AVAILABILITY

starting October 2024


GRAPHIQ



GraphIQ
Spinel 7
2651RV Berkel en Rodenrijs

info@graphiq.xyz

+31645528510

KvK (CoC): 57477574
BTW (VAT): NL001703498B23
Bank (IBAN): NL15INGB0007044957
Swift (BIC): INGBNL2A





GraphIQ: send invoices via Peppol NL:KVK 57477574