Alcid Analytics LLC

Data Science and Business Consulting Services by Mike Cunha

Services

I help clients use data to achieve their goals. My work covers everything from data collection, processing, and analysis to communicating the results and building out data-driven products.

Machine Learning and Statistics

I can help you apply machine learning techniques and statistics to your business processes: classification, A/B testing, prediction, recommendation, sentiment analysis, search, natural language processing, entity resolution, and more.

Dashboards and Automation

Many clients have a slow but critical reporting process that can be sped up by automating manual and Excel workflows and by making the output more versatile. Common deliverables for these types of projects are data processing pipelines, automated reporting, and dashboards. Read more about common reasons clients hire me, or take a look at the web interface my clients have access to.

Retainer

For some clients, I serve as an on-call partner by providing the statistical and technical expertise they need to test new strategies on a regular basis or diagnose problems as they appear.

Training

I offer workshops and one-on-one mentoring on topics like data visualization, data munging, automated reporting, and hypothesis testing. I also give crash courses on using tools like Python, Jupyter Notebook, Tableau, and more.

Learn More

Contact Me and Discuss What I Can Help You With.

I am currently taking on new clients. Reach out to me for a free consultation.

Projects

Some common problems and opportunities I can help you with:


Too Much

Having problems scaling an existing data-related process? Spreadsheets getting too complicated? Or maybe you have so much data you don't know where to start. I automate time-consuming, repetitive tasks and help you decide which opportunities to pursue now and in the long term. I can also help you build tools to search through your data, whether you are an analyst bogged down with slow SQL queries or an HR specialist unable to locate a form on your intranet.

Too Messy

Many businesses struggle with poor data quality, from sole proprietors to multinational corporations. It's hard to make informed decisions when you can't get what you need, when you need it. I help clients assess data quality and how it's impacting their business, improve it with better data collection practices, and make the most of what they already have. Perfect data isn't practical; often, great value can be had from data that's just good enough.

Optimize

Some obvious questions can be very difficult to answer: What price should I charge? Which combination of products in a bundled offer will yield the most profit? Which part of my inventory can I liquidate to free up the most cash with the least risk? These are all questions I can help you answer.

Cutting Edge

Thanks to widely available open source libraries and cheap cloud computing, you don't need to hire a team of deep learning experts or buy an expensive proprietary appliance to benefit from the unstructured data you probably already have: emails, feedback, blog posts, documents, images, audio recordings, or social media posts. Software that can automatically summarize, classify, and act on unstructured data is not just for Fortune 500 companies.

From Reactive to Proactive

Reporting tells you what happened, real-time reporting tells you what is happening, and forecasting tells you what could happen. By training a predictive model on historical data, I can help you make informed decisions before your normal reporting would usually be available.
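
As a rough illustration of the idea, here is a minimal sketch that fits a straight-line trend to historical values and extends it a few periods ahead. It is a deliberately simple stand-in rather than the model I would use on a real project, where a proper time-series method (ARIMA, for example) and a held-out validation period would be more appropriate. The function names and the monthly sales figures are hypothetical.

```python
def fit_linear_trend(values):
    """Fit y = intercept + slope * t by ordinary least squares, with t = 0, 1, 2, ..."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two historical values to fit a trend")
    mean_t = (n - 1) / 2
    mean_y = sum(values) / n
    # Sums of squares and cross-products around the means.
    s_tt = sum((t - mean_t) ** 2 for t in range(n))
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in enumerate(values))
    slope = s_ty / s_tt
    intercept = mean_y - slope * mean_t
    return slope, intercept


def forecast(values, periods):
    """Extend the fitted trend line `periods` steps past the end of the history."""
    slope, intercept = fit_linear_trend(values)
    n = len(values)
    return [intercept + slope * (n + k) for k in range(periods)]


# Hypothetical monthly sales figures, for illustration only.
monthly_sales = [100, 110, 120, 130, 140, 150]
next_quarter = forecast(monthly_sales, 3)
print(next_quarter)
```

A straight line ignores seasonality and uncertainty, so treat the output as a starting point for discussion, not a number to plan around.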

Customer Insight

Sometimes you have to make decisions on how to interact with your customers, regardless of available data. Validating the assumptions you make about your customers can be critical. I can help you learn more about your customers and validate the assumptions you have already made by: conducting text analysis of customer reviews, surveys, and comments; mining log files for pain points and buying patterns; segmenting customers into personas; and scoring leads.

Siloes

A very common problem for larger businesses is "siloed" data. Each department has its own database, for its own purposes, often undocumented outside of that department. How do you know if the 'John Smith' in your marketing database is the same customer as the 'John Smith' in your sales database? I'll partner with your in-house teams and help you set up unique identifiers to pass between systems, build out a company-wide data dictionary, and perform entity resolution on your existing data.
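
To make the 'John Smith' question concrete, here is a minimal sketch of one way to compare records across two systems, using only Python's standard library. The record layout (`id`, `name`, `email`, `zip`), the matching rule, and the 0.85 threshold are all assumptions chosen for illustration; real entity resolution usually needs more fields, blocking to avoid comparing every pair, and manual review of borderline matches.

```python
from difflib import SequenceMatcher


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences don't block a match."""
    return " ".join(value.lower().split())


def similarity(a, b):
    """Return a similarity ratio between 0 and 1 for two normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_records(marketing, sales, threshold=0.85):
    """Pair records with identical emails, or with similar names and the same ZIP code."""
    matches = []
    for m in marketing:
        for s in sales:
            same_email = normalize(m["email"]) == normalize(s["email"])
            similar_name = similarity(m["name"], s["name"]) >= threshold
            same_zip = m["zip"] == s["zip"]
            if same_email or (similar_name and same_zip):
                matches.append((m["id"], s["id"]))
    return matches


# Hypothetical records from two departmental databases.
marketing = [
    {"id": "M1", "name": "John Smith", "email": "jsmith@example.com", "zip": "14604"},
    {"id": "M2", "name": "John Smith", "email": "john.smith@example.org", "zip": "90210"},
]
sales = [
    {"id": "S1", "name": "Jon  Smith", "email": "j.smith@example.net", "zip": "14604"},
    {"id": "S2", "name": "Jane Smythe", "email": "jane@example.com", "zip": "14604"},
]

print(match_records(marketing, sales))
```

Here the two 'John Smith' marketing records are told apart by ZIP code: only the first lines up with the sales record for 'Jon Smith'. Once matches like this are confirmed, a shared unique identifier can replace the fuzzy comparison going forward.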

Guidance

There has been an explosion of data-related vendors and services in the past few years, and likewise, there is no shortage of salespeople eagerly telling you how you're a perfect fit for their one-click solution. I provide clients with technical advice that allows them to see through the hype and find the right tools and services for their business.

Hosted Apps

Dashboards and interfaces to automated data pipelines are available via a secure web app on alcidanalytics.com. Contact me for access to demo.alcidanalytics.com and you can see for yourself.


Dashboards

Custom, interactive, D3.js-based.

Animated GIF showing an Alcid Analytics interactive dashboard consisting of a bar chart, donut chart, line chart, and several subtotals. As a user clicks on items in one chart, that chart and the other visualizations and totals are updated and filtered in real time.

Run ETLs

An easy-to-use interface for running complex automated processes.

Animated GIF showing a user navigating through a multi-step web form, submitting the form, and then receiving status updates from the automated process the form submission triggered. It demonstrates an easy-to-use web interface to automated processes.

Responsive

A consistent interface across screen sizes allows clients to access the solutions I build on almost any device.

Secure

Apps adhere to security best practices and can range from being public-facing to a private extension of a client's intranet over VPN, complete with role-based user management.

Live Editing

Display an Excel-like view of data underlying a dashboard and edit it in-place.

Animated screen capture showing a user clicking on a cell in a data table and updating its value.

Other Deliverables

Not every project needs an app. I also deliver one-off analyses and other software.

Notebooks

Jupyter notebooks allow me to show a client exactly how I performed each step of an analysis and tailor the explanation to the client's technical ability. They are an important tool I use to make data accessible to a wide variety of teams within a client organization. Decision makers can use them to see specific results and visualizations while hiding complex details. The detail is easily revealed, allowing the client's in-house team to quickly verify the results.

Code

When I provide code to clients looking to outsource software development, I like to deliver a Python package versioned in a Git repository, alongside any scripts or Jupyter notebooks I've written. Docstrings and test coverage where appropriate are par for the course as well.

Articles

Latest posts and guides from the Alcid blog.

Making a Geographic Heatmap with Python

How to make an interactive geographic heatmap using Python and free tools. This example uses geopandas and Folium, a Python wrapper for Leaflet.js maps.


About Me

I am a data consultant based in Rochester, NY. I have an MS in Natural Resource Management from HSU and a BS in Biology from SUNY ESF. My academic background has provided me with a strong foundation in experimental design and statistics. I have worked in industry for the past few years with a variety of teams at a Fortune 1000 company. I regularly managed vendors, worked alongside developers and IT, and communicated results to non-technical decision makers, including executives. The following is a summary of my experience and skills:

Mike Cunha
Natural Language Processing
Collection and analysis of unstructured texts including: parsing, tagging, categorizing, and scraping.
Machine Learning, Data Mining
Classification, clustering, prediction, optimization, and recommendation.
SQL
I have worked with most popular relational databases and MongoDB.
Data Munging and EDA
When I have a choice, I tend to use Unix command-line tools and the Pandas library for Python inside a Jupyter Notebook.
Visualization
Tableau, Matplotlib for Python, ggplot2 for R, and D3.js for JavaScript-based HTML dashboards.
Development and Operations
First and foremost: Python. Also JavaScript, HTML/CSS, Octave, PHP, and R. I am very comfortable with the Unix command line, Amazon EC2 and other virtual environments, and most web stacks like LAMP and Windows Server + IIS. I am proficient in Git and a very quick study.
Web Analytics
Google Analytics, conversion rate optimization via A/B testing and multi-armed bandits, custom tracking pixels, Google Tag Manager, and many more.
Pay-per-Click Advertising
Google AdWords, Bing/Yahoo Adcenter, bid-management platforms, and offline phone-call tracking.

Contact Me
