Collecting my attempts to improve at tech, art, and life

Data Analysis

Examining and summarizing large data sets to identify trends and help organizations make strategic decisions.

Jots

Related to and often tagged as Data Science. Data analysis is more about using past data to inform current decisions, while data science is trying to predict future outcomes based on available data.

Kinds of data: categorical or numerical, discrete or continuous

Kind Examples
categorical gender, location
numerical number of customers, active users
discrete number of applicants to a job
continuous infinite possible outcomes

Data characteristics:

Characteristic Explanation
cross-sectional snapshot of pattern or trend
time series test scores, wages over time
panel data multiple subjects and multiple points in time

Specializations

business analyst
streamline IT processes, organizational structures, or staff development
financial analyst
guide investment opportunities, identify revenue opportunities, and mitigate financial risk
health care analyst
use data from health records, cost reports, and patient surveys to help providers improve their quality of care
market research analyst
analyze market trends to help determine product and service offerings, price points, and target customers
operations analyst
collaborative role working with teams to identify and solve technical, structural, and procedural issues in order to optimize org performance
systems analyst
use cost-benefit analysis to help match technological solutions to company needs

Terms

business intelligence (BI)
infrastructure to support collection and analysis of business ops data
business analytics
turning an org’s raw data into useful information to identify trends, predict outcomes, etc
data warehouse
central repository of data integrated from one or more sources
ETL
Extract, Transform, Load; munging data from multiple sources into a single set for further processing by multiple processes
ELT
Extract, Load, Transfer; grab it and store it, letting consumers transform it as needed when they grab it
data blending
munging data from multiple sources into a single set or warehouse for a specific use case

Tools


Added to vault 2024-03-15. Updated on 2024-03-23