Working On

  • Processing Data, Storing, Analyzing ASAP
    • Python – HTML Parsing
    • Databases – MongoDB, ?? Redis
    • Displaying – ?? Django, Flask, D3, JavaScript, …
  • Virtual Machines / App Managers / Message Queuing – Running multiple Processes
    • Docker – Linux / Windows Containers
    • Hyper-V – Windows Hypervisor for Virtual Machines
    • RabbitMQ – Messaging Server
  • Text Processing / Document Searching – How can I find stuff faster?
    • Apache Lucene – indexing library; documents are indexed into “segments” (Solr and Elasticsearch split an index across “shards”)
    • Apache Tika – Document Conversion (e.g. PDF to text, XLSX to CSV)
    • Apache Solr – Document Search Engine on top of Lucene & Tika
    • Elasticsearch – Newer Document Search Engine, very similar to Solr and also built on Lucene
  • Interesting App
    • https://github.com/samuelclay/NewsBlur
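
To make the "find stuff faster" idea concrete, here is a rough sketch of the two steps from the list above: pull the text out of HTML with Python, then build an inverted index (word to documents), which is the basic idea behind what Lucene does. This is a toy using only the standard library, not how Lucene, Solr, or Elasticsearch are actually implemented. The names `TextExtractor`, `html_to_text`, `build_index`, and `search`, along with the sample pages, are made up for illustration.

```python
from collections import defaultdict
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects the visible text of an HTML document, skipping script/style."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string as one space-joined string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def build_index(docs):
    """Map each lowercased word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, html in docs.items():
        for word in re.findall(r"[a-z0-9]+", html_to_text(html).lower()):
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every word in the query."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical sample pages, made up for this example.
docs = {
    "a": "<html><body><h1>Docker notes</h1><p>Running containers on Linux</p></body></html>",
    "b": "<html><body><p>RabbitMQ and Docker together</p><script>var x = 'linux';</script></body></html>",
}
index = build_index(docs)
print(search(index, "docker"))
print(search(index, "docker linux"))
```

A lookup is a few set intersections instead of a scan over every document, which is why an index pays off once the document count grows. A real engine adds a lot on top (tokenizing rules, ranking, on-disk storage), so treat this as a sketch of the concept only.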
