Open to opportunities

Business Data Scientist

Mathias

Salomonsen Juul

Cand. merc. Business Data Science · Aalborg University

Business Data Science student with a passion for data, statistics, and software engineering. A curious, goal-oriented individual building at the intersection of analytical rigour and practical systems.

Location Aalborg, Denmark

"I am 25 years old and finishing a master's in Business Data Science. Curious and focused when encountering new knowledge, methods, or theories. Beyond programming, I bring a background in business management and operations — and having worked in groups, I understand how to collaborate and make teams work."

— Mathias Salomonsen Juul

8 Projects
5+ Years studying
AAU University
NLP Thesis domain
01

Selected Projects

Full-Stack

Aurum Tempus

End-to-end luxury e-commerce portfolio — FastAPI backend, SQLite database, synthetic data pipeline, KMeans/RF/GBM ML models, and a static BI dashboard on GitHub Pages.

FastAPIscikit-learnSQLiteStreamlitDocker
LLM Pipeline

Sustainability Consultant

Automated weekly pipeline fetching sustainability news, summarising with Gemini 2.5, and generating strategic business consultations — orchestrated by GitHub Actions.

Gemini 2.5GitHub ActionsFastAPISQLite
NLP · RAG

Employee Assistant

RAG chat application for internal documents — OpenAI embeddings in ChromaDB, GPT-4o grounded generation, and a FastAPI backend.

OpenAIChromaDBFastAPIRAG
Graph · NLP

Organisational Knowledge Graph

Communication network from the Enron dataset — PageRank, betweenness centrality, community detection, LDA topic modelling, and an interactive PyVis graph.

NetworkXscikit-learnLDAPyVis
Time Series

Wind & Solar Causality Analysis

Granger causality analysis of Danish renewable energy vs. electricity spot prices — VAR modelling, impulse response functions, and a Dockerised FastAPI results service.

statsmodelsFastAPIDockerEnerginet API
Generative AI

PDF-to-Podcast Pipeline

Transforms academic PDFs into two-host podcast audio — Llama 3.1 for script generation, Suno Bark TTS for synthesis, plus a knowledge graph and quiz generator.

Llama 3.1Bark TTSHugging Face
LLM · Streamlit

Customer Message Generator

Two-stage Gemini pipeline classifying customer region and emotional type, retrieving behavioural context, and generating a personalised outreach message in a tradesperson's voice.

GeminiStreamlitPrompt Engineering
In Progress

CDSS — Clinical NLP

Master's thesis. Synthetic Danish PSOAP clinical note generation using GPT-4o, evaluated through a downstream patient diagnosis pipeline within a Design Science Research framework.

GPT-4oClinical NLPDSRDanish
02

Core Skills

01 🐍

Python

Primary language across all projects — from data pipelines and statistical modelling to REST APIs and LLM orchestration. Comfortable with the full ecosystem: pandas, NumPy, FastAPI, scikit-learn, and more.

02 📊

Data Processing & Analysis

End-to-end data work: ingestion, cleaning, transformation, and exploratory analysis. Experience with structured tabular data, time series, text corpora, and graph data — always focused on reliable, reproducible outputs.

03 📐

Statistics & Mathematics

Applied statistical reasoning underpins the work — hypothesis testing, stationarity analysis, Granger causality, VAR modelling, regression, and probabilistic modelling. Equally comfortable with theory and implementation.

04 ⚙️

Prototype Construction

Rapid, end-to-end system building — from notebook proof-of-concept to deployed application. Includes API design, containerisation with Docker, GitHub Actions automation, and lightweight frontends.

05 🗄️

SQL

Relational database design and querying for analytical and transactional use cases. Experience with SQLite and SQLAlchemy ORM — structuring schemas, seeding data, and integrating databases into backend services.

06 🤖

AI

Hands-on with the modern LLM stack — OpenAI, Gemini, and open-source models via Hugging Face. Covers prompt engineering, RAG pipelines, function calling, vector databases, and evaluation workflows across multiple projects.

03

About Me

I am a Business Data Science student at Aalborg University, working toward a cand. merc. at the intersection of analytical methods, software engineering, and domain knowledge.

My work spans the full stack — statistical modelling, machine learning, LLM pipelines, backend APIs, and production deployments. I am drawn to projects where rigorous methodology and practical systems thinking come together to produce something that genuinely works.

I approach problems with a goal-oriented mindset and a preference for building end-to-end rather than in isolation — equally at home in a Jupyter notebook, a FastAPI service, or a CI/CD pipeline.

My thesis in clinical NLP reflects a deeper interest in applying language models responsibly in high-stakes domains — combining solid methodology with practical system design.

8 Portfolio projects
5+ Years at AAU
PBL Problem-Based Learning
04

Education

2024 — present
ongoing
Master's

Cand. merc. Business Data Science

Aalborg University

Master's programme combining business strategy, statistics, and software engineering. Thesis in clinical NLP: synthetic Danish PSOAP note generation and evaluation within a Design Science Research framework.

One semester (Sep 2025 – Jan 2026) completed as an exchange within Marketing & Sales Management, before returning to the Business Data Science programme.

2021 — 2024
Bachelor's

HA — Business Administration

Aalborg University

Foundation in business economics, organisational theory, and quantitative methods — the domain context that grounds the data science work.

2017 — 2020
Upper Secondary

Higher Commercial Examination (HHX)

Business College

Upper secondary education with a business and economics focus — providing the commercial foundation that underpins subsequent academic and professional development.