Author Image

Hi, I am Jan

Jan Sobus

Data Science Architect at Caterpillar

I am a passionate machine learning engineer and data architect with research background and over 5 years of industry experience. I built end-to-end computer vision solution for wildfire detection. Now I am working on several projects utilizing AI to maintain Caterpillar's lead in the fields of mine productivity, electrification, and automation. Those cover the fields of computer vision, natural language processing, and time series analysis. I love solving problems (while learning new things) and turning solutions into business value.

Machine Learning
Generative AI
Python Development
CI/CD
Problem Solving
Mentoring

Skills

Experience

1
Caterpillar

Dec 2021 - Present

Brisbane, Australia

Caterpillar is a global leader in the supply of construction and mining equipment, diesel and natural gas engines, industrial gas turbines, and diesel-electric locomotives. Key differentiator is the inclusion of managing software as a whole package (Minestar suite)

Data Science Architect

Sep 2023 - Present

Responsibilities:
  • Providing support to multiple Data Science teams, when it comes to solution design of multi-model inference systems
  • Preparing the Python developer covering the environment setup, dependencies, CI/CD pipelines, Python packaging and best code practices
  • Spearheading projects utilizing new AI technologies both in NLP space (LLMs for document generation, summarization, etc.) and CV space (multimodal segmentation models)
  • Designing end-to-end pipelines for the data ingestion, processing and storage
  • Designing API endpoints, Data Models and interfaces to be shared between the teams (Data Science, Engineering, Product, etc.)
  • Maintaining CI/CD pipelines and Dockerizing the applications
Data Science Team Lead

Dec 2021 - Sep 2023

Responsibilities:
  • Supervising a team of 5 Data Scientists
  • Implementing models (physics, machine learning and heuristic) for accurate machine tracking and production recording
  • Refactoring the legacy Python codebase and unification of versioning and deployment patterns
  • Establishing Python wheel repository and automated build+testing pipeline templates

Peregian Beach, Australia

exci is a company that provides AI-powered early wildfire detection and notification system.

Machine Learning Software Engineer

July 2020 - October 2021

Responsibilities:
  • Designing and implementing training and inference pipelines for the multi-model early wildfire inference system
  • Construction and maintenance of the local GPU infrastructure in order to minimize cloud costs
  • Design and implementation of the PTZ camera POV correction system and system pinning detected events on the map
  • Refactoring of the legacy codebase and moving it from TF1 to TF2 and later to PyTorch
2

3
University of Queensland

August 2016 - January 2021

Brisbane, Australia

One of the leading Universities in Australia (member of the Group of Eight), I worked at the School of Mathematics and Physics, closely collaborating with the School of Chemistry and Molecular Biosciences and Centre for Organic Photonics and Electronics.

Casual Research Fellow

February 2020 - January 2021

Responsibilities:
  • Writing and reviewing research papers
  • Co-supervision of PhD students
Postdoctoral Research Fellow

August 2016 - February 2020

Responsibilities:
  • Manufacturing of multiple classes of organic photonic and optoelectronic devices (photodetectors, solar cells, OLEDs, lasers, Light Emitting Field Effect Transistors)
  • Optoelectronic characterization of the devices
  • Design and assembly of measurement setups for characterization of the devices
  • Maintaining cleanroom facilities
  • Supervision of undergraduate and PhD students
  • Publishing and reviewing research papers in the top international journals

SlideWorx (acquired by mTab)

May 2016 - June 2016

Poznan, Poland

mTab is a company that provides a cloud-based platform for managing and automating business processes, including analytical and reporting tools.

Junior JavaScript Developer

May 2016 - June 2016

Responsibilities:
  • Developing analytical dashboards using JavaScript with D3.js
  • Consulting with clients to turn their business needs into technical requirements
4

5
Ecole Polytechnique de Lausanne (EPFL)

February 2015 - August 2015

Lausanne, Switzerland

EPFL is a leading Swiss university that focuses on science and technology.

Visiting Researcher

February 2015 - August 2015

Responsibilities:
  • Developement of Dye-Sensitized Solar Cells (DSSCs) with a new class of dyes - MK2
  • Photovoltaic characterization of the DSSCs
  • Spectroscopic characterization of the dyes and complete devices
  • Modelling of the internal charge transfer processes

Trondheim, Norway

NTNU is a leading Norwegian university that focuses on science and technology.

Exchange Student (Erasmus+ Programme)

August 2011 - June 2012

Responsibilities:
  • Working on the development of new composite material for electrodes in supercapacitors (graphene-based carbon foam covered with Manganese Oxide)
  • Performing measurements of specific surface area of the electrodes, volumetric and gravimetric capacitance
6

7

Warsaw, Poland

The Polish Academy of Sciences is a Polish national research institution that focuses on all the branches of science.

Research Intern

September 2010 - October 2010

Responsibilities:
  • Manufacture and examine superconducting wires made of MgB2 in the copper shielding
  • Perform measurements of the magnetic field and the critical current density

Education

Ph.D in Physics
Thesis:
Research:
My PhD was focused on the study of new generation of hybrid organic-inorganic solar cells - Dye-sensitized solar cells (DSSCs). It included both experimental (spectroscopy, device fabrication and evaluation) and theoretical (modeling of the charge transfer processes in the time ranges from femtoseconds to milliseconds) work. I also worked on the modelling of tandem (multi-junction) solar cells.
Supervisor:
M.Sc. in Applied Physics (Nanotechnology)
GPA: 5.2 out of 5.5
Extracurricular Activities:
  • Chairmen of the Material Engineering Student Committee
  • Active member of the Physics Student Committee
  • Multiple national and international conferences, workshops and poster presentations
Thesis:
MnO4 covered graphene based nanocomposite as electrode material for supercapacitors
Supervisor:
B.Sc. in Applied Physics (Nanotechnology)
GPA: 5.05 out of 5.5
Extracurricular Activities:
  • Active member of the Physics Student Committee
  • Co-organizer of the Baltic Festival of Science
Thesis:
Graphene films grown on the thin copper foils with the help of the chemical vapour deposition
Supervisor:

Projects

Named entity recognition demo
Named entity recognition demo
Author 2023

A demo repository showing the concept of Named Entity Recognition with the use of Transformers library and Flask front end.

Publications

Research focusing on triplet quenching mechanisms in organic semiconductors.

Digital Service Transformation: Pathways to human and economic wellbeing White Paper
UQ 2023

This White Paper, resulting from a Brisbane roundtable in November 2022, brings together insights from 64 participants including industry representatives and academics to address challenges and opportunities in digital service transformation across various sectors including financial services, health services, construction, consulting, tourism, and government.

Controlling triplet–triplet upconversion and singlet-triplet annihilation in organic light-emitting diodes for injection lasing

This research demonstrates how triplet excitons can positively contribute to electrically-driven organic lasing through triplet-triplet upconversion (TTU), while emphasizing the importance of minimizing singlet-triplet annihilation (STA) for optimal performance.

Probing polaron-induced exciton quenching in TADF based organic light-emitting diodes

This study investigates polaron-induced exciton quenching in TADF-based OLEDs, examining singlet-polaron annihilation (SPA) and triplet-polaron annihilation (TPA) under steady-state conditions and their contributions to efficiency roll-off, using experimentally obtained parameters.

Solid cyclooctatetraene-based triplet quencher demonstrating excellent suppression of singlet–triplet annihilation in optical and electrical excitation

This research presents the design and synthesis of a solid-state organic triplet quencher and its integration with a solution processable bis-stilbene-based laser dye, demonstrating complete suppression of singlet-triplet annihilation and improved photostability under continuous wave excitation.

Triplet-Triplet Upconversion in Organic Light-Emitting Diodes: Implications to Injection Lasing

This work demonstrates the positive contribution of triplet excitons for electrically driven organic lasers, studying a model fluorescence material and showing how triplet-triplet exciton upconversion processes can significantly reduce threshold current densities required for lasing emission, while emphasizing the importance of minimizing singlet-triplet exciton annihilation.

High EQE and high brightness solution‐processed TADF light‐emitting transistors and OLEDs

This work demonstrates highly efficient solution processed LEFETs using ACRXTN showing high external quantum efficiencies of ≈1% and on/off ratios at low operating voltages with negligible EQE roll-off. The same emitter achieved high peak EQEs (≈16%) and brightness in solution-processed OLEDs with a simple architecture.

Charge and exciton dynamics of OLEDs under high voltage nanosecond pulse: towards injection lasing

This research provides a comprehensive analysis of charge injection, transport, device on/off dynamics, and exciton processes in Super Yellow OLEDs under high voltage nanosecond pulses, demonstrating complete exciton and charge carrier dynamics from sub-ns to microsecond timescales.

This work develops a comprehensive exciton quenching model for TADF systems, studying singlet-singlet, singlet-triplet, and triplet-triplet annihilation rate constants using ACRXTN as a model compound under intensity-dependent optical and electrical pulse excitation.

This comprehensive review explores the fundamentals, working principles, materials, device physics, and architectures of light-emitting transistors (LETs), discussing their development from an optoelectronic curiosity to potential competitors in display technology and injection lasers.

Deep‐Red Lasing and Amplified Spontaneous Emission from Nature Inspired Bay‐Annulated Indigo Derivatives

This work presents a new family of solution-processable organic semiconductor laser dyes based on bay-annulated indigo derivatives, achieving excellent photoluminescence quantum yields and low ASE thresholds with deep-red emission when blended in a mixed host system.

This patent describes an organic light-emitting field-effect transistor containing a delayed fluorescent material, where excitons can be efficiently used for light emission to remarkably enhance the emission efficiency of the transistor while achieving high mobility along with high on/off ratios.

Mobility Evaluation of [1]Benzothieno[3,2-b][1]benzothiophene Derivatives: Limitation and Impact on Charge Transport

This study investigates BTBT derivatives with varying alkyl chain configurations, examining their impact on hole mobilities and charge transport properties, while addressing the debate on mobility overestimation in organic field-effect transistors.

High‐speed OLEDs and area‐emitting light‐emitting transistors from a tetracyclic lactim semiconducting polymer

This work reports on high-speed OLEDs and high-performance hybrid light-emitting transistors using a new solution processable luminescent material (PTNT). The OLEDs achieve peak brightness of 8×105 cd m−2 and 40 MHz modulation frequency under 10 ns pulse operation, significantly higher than commercial LEDs used for visible light communication.

Low amplified spontaneous emission threshold and efficient electroluminescence from a carbazole derivatized excited-state intramolecular proton transfer dye

This work reports a new organic semiconducting laser dye (HBT-Cz) with remarkably low ASE threshold in both solution and film states, achieving the lowest reported waveguide loss coefficient for solution-processed organic semiconductors, while also demonstrating efficient electroluminescence in OLEDs.

High Performance p‐and n‐Type Light‐Emitting Field‐Effect Transistors Employing Thermally Activated Delayed Fluorescence

This work presents an alternative strategy for triplet usage in LEFETs using thermally activated delayed fluorescence (TADF). The study demonstrates devices employing a TADF capable material (4CzIPN) in both n-type and p-type configurations, showing excellent electrical characteristics.

This work introduces oxygen plasma treatment as a simple method to modify the surface energy and work function of hydrophobic polymer interlayers for use as p-contacts in perovskite solar cells, enabling improved processing and device performance.

Factors Affecting the Performance of Champion Silyl‐Anchor Carbazole Dye Revealed in the Femtosecond to Second Studies of Complete ADEKA‐1 Sensitized Solar Cells

This study investigates key preparation factors affecting ADEKA-1 solar cells, including dye synthesis routes, co-adsorbent addition, and electrode passivation, revealing the crucial role of electron recombination from titania to the dye in cell performance.

Effect of solvent variations in the alcothermal synthesis of template-free mesoporous titania for dye-sensitized solar cells applications

This study investigates the synthesis of mesoporous titania materials using various alcohols as solvents in a template-free alcothermal method, characterizing their properties and performance in dye-sensitized solar cells with efficiency values ranging from 0.54% to 4.6%.

Effect of different photoanode nanostructures on the initial charge separation and electron injection process in dye sensitized solar cells: a photophysical study with indoline dyes

This study investigates ultrafast and fast charge separation processes in complete cells based on various ZnO-based photoanode nanostructures and standard TiO2 nanoparticle layers sensitized with the indoline dye D358, examining different ZnO morphologies and synthesis methods.

Carbazole Dye‐Sensitized Solar Cells Studied from Femtoseconds to Seconds—Effect of Additives in Cobalt‐and Iodide‐Based Electrolytes

This comprehensive study examines charge-separation processes in carbazole dye-sensitized solar cells, revealing the importance of fast electron recombination from semiconductor nanoparticles to oxidized dye, and investigating how additives affect charge-transfer dynamics in different electrolytes.

Transient states and the role of excited state self-quenching of indoline dyes in complete dye-sensitized solar cells

This study investigates the photobehavior of indoline dye D149 on different metal oxide nanoparticles in functioning solar cells, identifying locally excited and charge transfer excited states in electron injection and dye deactivation mechanisms, while examining the effects of coadsorbent concentration and aging on cell performance.

Comparison of TiO<sub>2</sub> and ZnO solar cells sensitized with an indoline dye: time-resolved laser spectroscopy studies of partial charge separation processes

This study compares TiO2 and ZnO-based dye-sensitized solar cells using time-resolved laser spectroscopy, revealing that the superior performance of TiO2 cells is primarily due to more efficient electron injection in the first 100 ps rather than differences in charge collection or dye regeneration processes.

Optimization of absorption bands of dye-sensitized and perovskite tandem solar cells based on loss-in-potential values

This numerical study investigates optimal bandgaps of light absorbers in tandem solar cell configurations, focusing on dye-sensitized solar cells (DSSCs) and perovskite solar cells (PSCs), examining efficiency limits and potential improvements as functions of loss-in-potential, incident photon to current efficiency, and fill factor.

Courses

AI Devs 3 - Agents
AI Devs December 2024 - January 2025

This course was a comprehensive, hands-on introduction to building multi-model LLM agentic systems. It covered prompt engineering and injection, multi-modal processing, graph and vector databases, different versions of RAG - all using both cloud based and local solutions.

Python Software Design Mindset
ArjanCodes October 2023 - November 2023

This is probably the best Python specific course when it comes to good software design practices. It covers not only modern software patterns in Pythonic flavour, but also provides solid guidelines on how to write robust, understandable and decoupled code. in general (both in Object Oriented and Functional paradigms).

Fast.ai course V2
Fast AI November 2020 - January 2021

This course is a great blend of theory and practice. It goes deep into understanding (and implementing from scratch) the most important concepts of deep learning. On the other hand it offers a high level abstractions enabling easy prototyping and deployment of complex models.

Complete Machine Learning and Data Science Bootcamp
Zero to Mastery on Udemy July 2020 - September 2020

This was my first general ML course. It provided a broad introduction to machine learning in different domains - from supervised learning to unsupervised learning- and using different data types (tabular data, images, NLP tasks).