Skip to main content

Hi, I'm Michael

I'm a data engineer specializing in Python and SQL, building scalable pipelines and structured datasets for analytics and decision-making. Experienced across the full data lifecycle, from ingestion and transformation to modeling and delivery, with a focus on reliability and maintainability.

I also develop full-stack applications, APIs, and automation tools, enabling end-to-end solutions from backend systems to user-facing interfaces. Background in scientific research informs a rigorous approach to data quality, analysis, and technical communication.

Check my skills

Formal Education (education-list)

2010-2014
Florida State University
Bachelor of Science (BSc), Biochemistry
2023-2025
Florida State University
Master of Science (BSc), Biochemistry

Key Job Experience

Sep 2023 - May 2025
Research Assistant
Florida State University. Tallahassee, FL

Research Assistant (Biophysics)

Florida State University

  • Architechted framework for protein conformational database using Apache Spark and Python
  • Used framework to pipeline protein data from multiple sources for cross analysis
  • Ensured code quality with pre-commit, black, isort, flake8 and Github Actions
  • Wrote intermediate data to Apache Parquet and consolidated data catalog using Apache Iceberg
  • Wrote Python libraries for serializing/deserializing PDB files and transforming protein data as Pandas Dataframes

Sep 2018 - Oct 2023
Software Developer
Anju Software. Ft. Lauderdale, FL

Software Developer

Anju Software, Inc.

  • Built and Maintained data pipelines in Azure Databricks, Microsoft SQL, Spark-Python, and Unity Catalog
  • Investigated and corrected source-code defects for clinical testing software written in C# and tSQL
  • Debug codebase software in C# and tSQL to identify and fix problems in product
  • Automated setup of development environment using PowerShell scripting and Python
  • Architechted framework for protein conformational database using Apache Spark and Python

Jan 2017 - Sep 2018
Software Developer
Strategic Database Research. Deerfield Beach, FL

Software Developer

Anju Software, Inc.

  • Drove initiative to build a Data Warehouse
  • Built reporting system used daily by operations department in prioritizing tasks
  • Built and maintained data ingestion pipelines using REST API best practices
  • Built and Maintained data pipelines in Azure Databricks, Microsoft SQL, Spark-Python, and Unity Catalog

A selection of my work

Cryptocurrency Tracker

After recent success in the cryptocurrency market. I decided to build an efficient cryptocurrency tracker. It grabs the latest prices for any and all cryptocurrencis that the user defines. Here are a few features:

  • An alerting sytem, that lets you know when a currency’s price has moved out of preset bounds, or if a currency’s price has changed by more than a given percentage in a given time.
  • Graph price history of a currency in USD
  • Graph price history of a currency relative to another currency BTC/GOLD
  • “Last at” queries, which enable you to see at a glance when a currency was, if ever, at a given price.
Case Study

Molecular DataBase (a Protein Conformational Pipeline)

The lab where I worked at FSU was studying protein dynamis - the ways that proteins move in order to do their particular functions. This has been studied by many others. But when each research group database of protein movements, they change the definition of what kind of movement is taking place.

In order to make possible a cross-comparison of conformational studies I created a unified data pipeline capable of injesting data from:

  • html formats for DynDom and Protein Structual Change (PSC) databases
  • Protein Data Bank api json results
  • SCOPe categorization database custom file formats

Once this data was gathered, PySpark transformed it into tables which could be queried with ordinary SQL, enabling easy cross-comparison of databases. One of the most advanced features of this code is the of subclasses and graphlib to handle recursive dependencies, which is a particular issue with the Protein Data Bank.

Case Study

Clioreader

Clioreader is a full-stack web application using Flask, BootStrap, and Amazon Web Services to enable a user to upload text, including pdf format books, and convert them into audiobooks using Amazon’s state-of-the-art Text-To-Speech engine.

(not actively maintained)

Case Study

What you seek

If you are looking for someone to take care of any of the following needs, I may just be the right person for you.