FinOps Program
Audience: Cloud Engineering, Platform Engineering, and IT Finance
Purpose: Centralized resources for cloud cost management and optimization
Quick Links
| Resource | Description |
|---|---|
| Cloudability | FinOps platform for cost visibility |
| FinOps Foundation | Industry framework and best practices |
| AWS Cost Management | Native AWS cost tools |
| Azure Cost Management | Native Azure cost tools |
What is FinOps?
FinOps (Financial Operations) is an operational framework and cultural practice that brings financial accountability to cloud spending. It combines systems, best practices, and culture to increase an organization's ability to understand cloud costs and make data-driven decisions.
Program Resources
Orientation Materials
New team members should review the FinOps orientation materials to understand cloud cost management principles and TAMU-specific practices.
Orientation Topics
-
FinOps Framework Overview
- The three phases: Inform, Optimize, Operate
- Key personas and responsibilities
- Maturity model and progression
-
TAMU Cloud Environment
- Multi-cloud strategy (AWS, Azure, GCP)
- Account structure and governance
- Tagging standards for cost allocation
-
Tooling and Reporting
- Cloudability dashboard navigation
- Native cloud cost tools
- Custom reports and alerts
-
Optimization Strategies
- Reserved instances and savings plans
- Right-sizing recommendations
- Waste elimination tactics
Standard Operating Procedures
| SOP | Description |
|---|---|
| Monthly Cost Review | Process for reviewing and analyzing monthly cloud costs |
| Reserved Instance Management | Purchasing and managing RIs across platforms |
| Cost Anomaly Investigation | Steps for investigating unexpected cost spikes |
| Chargeback Reporting | Generating cost allocation reports for units |
Cost Visibility
Tagging Standards
Proper tagging is essential for accurate cost allocation. All cloud resources should include the following tags:
| Tag Key | Description | Example |
|---|---|---|
CostCenter | Financial cost center code | 12345 |
Environment | Deployment environment | production, development, test |
Owner | Team or individual responsible | cloud-engineering |
Project | Project or application name | student-portal |
DataClassification | Data sensitivity level | public, confidential |
Reporting Cadence
| Report | Frequency | Audience |
|---|---|---|
| Daily Spend Summary | Daily | Cloud Engineering |
| Weekly Optimization Report | Weekly | Platform Engineering |
| Monthly Cost Analysis | Monthly | IT Leadership |
| Quarterly Business Review | Quarterly | Executive Leadership |
Optimization Strategies
Reserved Instances & Savings Plans
Reserved capacity purchases provide significant discounts (up to 72%) for predictable workloads:
AWS
- Reserved Instances for EC2, RDS, ElastiCache
- Savings Plans for compute flexibility
- Analyze usage patterns before purchasing
Azure
- Reserved VM Instances
- Reserved capacity for SQL, Cosmos DB
- Azure Hybrid Benefit for Windows/SQL licensing
GCP
- Committed Use Discounts (CUDs)
- Sustained use discounts (automatic)
Right-Sizing Recommendations
Regularly review compute resources to ensure appropriate sizing:
-
Identify Underutilized Resources
- CPU utilization < 20% average
- Memory utilization < 40% average
- Network throughput minimal
-
Evaluate Right-Sizing Options
- Downsize to smaller instance types
- Consider burstable instances (T-series)
- Consolidate workloads where appropriate
-
Implement Changes
- Schedule changes during maintenance windows
- Monitor performance after changes
- Document decisions and outcomes
Waste Elimination
Common sources of cloud waste:
| Waste Type | Detection | Resolution |
|---|---|---|
| Orphaned volumes | Unattached EBS/managed disks | Delete or snapshot and remove |
| Idle load balancers | No registered targets | Remove or consolidate |
| Stale snapshots | Old backups beyond retention | Delete per retention policy |
| Unused IPs | Unattached elastic IPs | Release back to pool |
| Dev/test running 24/7 | Non-production always on | Implement schedules |
Governance & Policies
Budget Alerts
Configure budget alerts to monitor spending:
- 50% threshold — Informational notification
- 80% threshold — Warning to stakeholders
- 100% threshold — Alert to leadership and action required
- Forecasted 100% — Proactive warning based on trend
Cost Anomaly Detection
Automated anomaly detection helps identify unexpected spending:
- Enable native anomaly detection (AWS, Azure)
- Configure Cloudability alerts for custom thresholds
- Establish investigation workflow for anomalies
- Document root cause and remediation
Related Resources
- Cloudability Tool Guide — Platform access and navigation
- AWS Documentation — AWS service guides
- Azure Documentation — Azure IaaS guides
- GCP Documentation — GCP service guides