Scraping, analyzing and generating companies, founders and job postings from YC's Work at a Startup
Scraping, analyzing and generating companies, founders and job postings from YC's Work at a Startup
Last updated January 16, 2021
Weekend project update: Open SEC Data
This project uses Django, DRF and Celery to read public SEC filings from sec.gov, build it into an API which is consumed through a Vue.js application.
Last updated November 29, 2020
Reading 13F SEC filings with python
How to read SEC filing data with Python
Last updated January 30, 2018
Analysis of AUR and Official Arch Repository data
Arch Linux provides packages through the official Arch Linux repositories and the Arch User Repository (AUR). I recently gathered data on ~50,000 packages from these repositories on archlinux.org to better understand the makeup of the packages. In this article I will share some visualizations I made as well as some key takeaways about the data set I gathered.
Last updated December 2, 2017
Related subreddit graph exploration with NetworkX
This notebook explores some basic concepts of graph theory. A few weeks ago I set up a script to scrape data from reddit.com with the goal of visualizing the network of related subreddits (forums on specific topics) and related data.
Last updated March 3, 2017
PCPartPicker data
In the summer of 2016 I built two high-end computers, something I haven't done since 2011. I used PCPartPicker to research the components and read about PC builds similar to the ones I had in mind. It's a relatively new site that has a strong community of builders, helpful tools to help with part compatibility as well as extensive user reviews on PC components.
Last updated January 1, 2017