The Long Resume

Skills

Python

My preferred language to do just about everything. The bulk of my professional Python work involves using Pandas or Pyspark for data pipelines. I have extensive experience with several visual plotting libraries, Plotly being my prefered. As well I have used Django and Flask to prop up websites or REST APIs.

Database

I have experience configuring Oracle databases for maximizing hardware efficiency, and inspecting the master files of SQL Server and Teradata to recreate data lineage. In the cloud I prefer Databricks as an on-prem database replacement.

Cloud

Databricks is my favorite cloud environment for most data solutions. My favorite stack for most data solutions is any cloud offering using Spark. I have more experience with Azure, especially since MS has made their UI very similar to their on-prem products. I have also used AWS serverless features.

Unix

My understanding of the infrastructure behind my data pipelines comes from my experience deploying Unix-like VMs or containers running data intensive applications. Thas had made more comfortable with a command line than a GUI, and all the protocols for using HTTP to send/receive data.

Data Quality

A lesson I had learned in my experience with data science projects has shown that the first step is dealing with poor data quality. I make sure to impement quality checks between the modules of my pipelines.

Spark

A few years ago I knew I would have to learn a method to handle 'big data' and I found PySpark's documentation to be the easiest to read. Since then I have become certified in Spark. I find a great deal of satisfaction refactoring queries to reach max cluster efficiency.

Awards & Projects

JAN 2019

Reddit Code Scraper

I wrote a Python script that scraped the forum pages I expected to have Steam video game codes published occasionally in giveaways. This script made regular checks scanning for string patterns that matched Steam code patterns, then redeemed immediately online whenever valid codes were found.

JAN 2020

Conference Survey Text Mining

As a capstone project for a 20-week Python-data science bootcamp, Rosenthal Media requested consultation on analysis of thousands of pages of open-ended survey responses toward preferences in conference amenities. Using NLP libraries available in Python, topics were modeled based on word correlations and audience demands.

JULY 2020

Civtech-San Antonio Datathon 2nd Place

Participated in a group that created slide deck and infographic with data points procured from publicly available data about city bus ridership demographics. I extracted the data into a Tableau Public dashboard that pointed colleagues to geospatial data and visualizations of targeted data.

DEC 2020

Wells Fargo Invention of The Year

The director of the bank’s contact center wanted a more stable view of devastating external events to operations, usually on a regional or state level. I rapidly developed a SharePoint site that I could process with Pandas, which eventually grew into a SQL Server Integration Service job from SharePoint to populate a database feeding Tableau dashboards. The patent application number is 17/124074.

Testimonials

Fred and his team did a great job completing a data science consulting project to help improve Codeup's marketing pipeline. Their use of NLP on customer surveys, construction of customer profiles, and summary report helped us zero in our marketing. Fred did a fantastic job!

Jason Straughan

CEO at Codeup

Jason Straughan

Fred was part of a data science team that performed a high-level analysis—both quantitative and qualitative—of data critical to my company. The analysis was immensely helpful. Also, I was pleasantly surprised at how the data science team was able break a highly complex analysis into small pieces intelligible to a general audience. I highly recommend Fred to future employers.

Louis Rosenfeld

Publisher at Rosenfeld Media

Louis Rosenfeld

Fred has been a fantastic peer and contributor to get to know. He is self motivated, dedicated to quality, tenacious, and always wanting to learn more. I have witnessed Fred bring alive entire worlds of data that our business uses to make critical decisions. He's particularly effective with Python amongst his many talents

Dustin Hernandez

Senior Lead Analytics Consultant at Wells Fargo

Louis Rosenfeld