Data Standardization for Salesforce: What Journalism Taught Me About CRM Data

Apply editorial standards to your database and watch data quality transform

Before I was a Salesforce architect, I was a journalist. And the thing that prepared me most for data quality work wasn't a certification. It was the AP Stylebook.

In journalism, the Associated Press Stylebook is the bible. It standardizes how you write dates, abbreviate states, format phone numbers, and handle thousands of other details. "March 9, 2026" not "3/9/26." "N.Y." in headlines, "New York" in text. Consistency isn't optional.

When I moved into Salesforce, I found databases that looked like they'd been edited by a hundred different writers with a hundred different style guides. "California" and "CA" and "Calif." in the same field. Phone numbers in every conceivable format. Dates that could mean anything.

The fix wasn't technical. It was editorial. Apply consistent standards to your data, enforce them on entry, and transform the outliers.

Here's the data standardization playbook I've developed over 14 years, built on principles every newspaper copy editor would recognize.

Why Standardization Matters

Unstandardized data creates problems:

Reporting Fails:

A report filtering on State = "California" misses records where State = "CA." Your California pipeline report is wrong.

Automation Breaks:

A Flow checking if Industry = "Healthcare" doesn't trigger for records where Industry = "Health Care" or "Medical."

AI Gets Confused:

AI trying to segment "VP-level contacts in the Northeast" can't work if job titles and regions aren't consistent.

Users Lose Trust:

When search doesn't find records that exist, users stop trusting the system.

Standardization isn't about being picky. It's about making data usable.

The Data Style Guide

Just as publications have style guides, your Salesforce org needs one. Here's how to build it:

Section 1: Geographic Data

State Fields:

Standard: Two-letter postal abbreviations (CA, NY, TX)

Don't Use | Use

California | CA

Calif. | CA

Cal | CA

california | CA

Implementation: Convert State fields from text to picklist with standard abbreviations.

Country Fields:

Standard: ISO 3166-1 alpha-2 codes or full names (pick one, enforce it)

Standard | Acceptable Variations to Convert

US | USA, United States, U.S.A., United States of America

UK | United Kingdom, Great Britain, England (if appropriate)

CA | Canada, CAN

Implementation: Picklist with standard values. Validation rule preventing non-standard entry.

Address Format:

Standard:

• Street addresses: Number, Street Name, Street Suffix (123 Main St)

• Street suffixes: St, Ave, Blvd, Dr, Rd, Ln (abbreviated, no periods)

• Unit numbers: Ste 100, Apt 2B, Unit 5

Implementation: Consider address validation tools (Smarty, Melissa) for real-time standardization.

Section 2: Phone Numbers

Standard Format:

(XXX) XXX-XXXX for US numbers

+1 (XXX) XXX-XXXX for international display

Store as digits only: XXXXXXXXXX

Input | Stored | Displayed

555-123-4567 | 5551234567 | (555) 123-4567

(555) 123-4567 | 5551234567 | (555) 123-4567

555.123.4567 | 5551234567 | (555) 123-4567

5551234567 | 5551234567 | (555) 123-4567

Implementation:

• Validation rule requiring 10 digits (for US)

• Formula field for formatted display

• Flow to strip non-numeric characters on save

Section 3: Names and Titles

Name Capitalization:

Standard: Title Case (John Smith, not JOHN SMITH or john smith)

Exception: Respect unusual capitalization when known (McDonald, DeWitt, van der Berg)

Implementation: Use Flow or Apex to apply title case, with exception list for known variations.

Name Suffixes:

Standard: Jr., Sr., III, IV (with periods for Jr./Sr., no comma before suffix)

Don't Use | Use

Jr | Jr.

junior | Jr.

JR. | Jr.

, Jr. | Jr.

Salutations:

Standard: Mr., Ms., Dr., Prof. (periods, followed by space)

Don't Use | Use

Mr | Mr.

Mister | Mr.

DR. | Dr.

doctor | Dr.

Implementation: Picklist for Salutation field with standard values.

Job Titles:

Standardization is harder here because titles vary legitimately. Focus on:

Title Level Field (picklist):

• C-Suite

• VP

• Director

• Manager

• Individual Contributor

This enables segmentation even when exact titles differ.

Common Title Cleanup:

Input | Standardized

Vice President | VP

V.P. | VP

Vice-President | VP

Vp | VP

Section 4: Company Names

Legal Suffixes:

Standard: Inc., LLC, Corp., Ltd. (with periods, preceded by comma)

Don't Use | Use

Incorporated | Inc.

Inc | Inc.

, INC. | , Inc.

Corporation | Corp.

Common Variations:

Create a mapping table for known companies:

Variation | Standard

IBM | IBM

I.B.M. | IBM

International Business Machines | IBM

MSFT | Microsoft

Microsoft Corporation | Microsoft Corp.

The vs. No The:

Standard: Omit "The" unless it's legally part of the company name.

Don't Use | Use

The Coca-Cola Company | Coca-Cola Company

The New York Times | The New York Times (legal name includes "The")

Section 5: Dates and Times

Date Fields:

Salesforce handles date storage consistently. Focus on:

• Text fields that should be dates (convert them)

• Date entry validation (no future birthdates, no close dates in the past for new opportunities)

Time References in Text Fields:

If capturing times in text fields:

Standard: 9:00 AM, 2:30 PM (colon, space before AM/PM)

Don't Use | Use

9am | 9:00 AM

9:00AM | 9:00 AM

9:00 a.m. | 9:00 AM

0900 | 9:00 AM

Section 6: Email Addresses

Standard:

• Lowercase (john.smith@company.com, not John.Smith@Company.com)

• Trimmed (no leading/trailing spaces)

• Validated format

Implementation:

• Validation rule for format: REGEX(Email, "^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\.[a-zA-Z]{2,}$")

• Flow to lowercase on save: LOWER(Email)

Section 7: Industry and Picklist Values

Industry:

Create a standard taxonomy. Example:

Broad Categories:

• Healthcare

• Financial Services

• Technology

• Manufacturing

• Retail

• Education

• Government

• Nonprofit

Avoid:

• Overlapping categories (Healthcare vs. Medical vs. Health Services)

• Overly specific categories that create fragmentation

• Spelling variations (Financial Services vs. Financial services)

Implementation: Picklist, not text. Map existing variations to standard values.

Implementing Your Style Guide

Phase 1: Document Standards

Write down your standards. A simple document covering:

• Each field requiring standardization

• The standard format

• Common variations to convert

• Implementation approach

Get stakeholder buy-in on standards before implementing.

Phase 2: Prevent New Violations

Implement technical controls for new records:

Validation Rules:

Block entry that doesn't meet standards.

Picklists:

Replace text fields where enumerated values work.

Flows:

Automatically standardize data on save (lowercase emails, format phone numbers).

Phase 3: Clean Historical Data

Use Data Loader or similar tools to mass update existing records:

1. Export records with non-standard values

2. Transform in Excel/Google Sheets using formulas

3. Import updates

For complex transformations, consider tools like:

• Informatica

• OpenRefine

• Custom Python scripts

Phase 4: Monitor Ongoing

Create reports showing:

• Records not matching standards

• Fields with unusual values

• New violations since last cleanup

Review weekly. Catch drift before it becomes a problem.

The Quick Wins

If you can only do three things:

1. Standardize State Fields

Convert to picklist with 2-letter abbreviations. Highest ROI for effort.

2. Enforce Email Formatting

Validation rule + Flow to lowercase. Prevents bounce issues.

3. Clean Phone Numbers

Validation rule for 10 digits. Store raw, display formatted.

These three changes improve data quality dramatically with minimal effort.

The Editorial Mindset

The best data stewards think like editors, not engineers.

Ask "Is this clear?" Would someone unfamiliar with this record understand it?

Ask "Is this consistent?" Does this match how we've formatted similar records?

Ask "Is this correct?" Does this data reflect reality?

These questions, applied at data entry and during review, prevent most quality issues.

Next Steps

1. Pick three fields to standardize first

2. Document standards for those fields

3. Implement validation rules preventing violations

4. Clean existing records using Data Loader

5. Build a report monitoring compliance

6. Expand to additional fields

If you're building a comprehensive data style guide or need help implementing standardization, Clear Concise Consulting offers data quality services. My journalism background informs my approach: treat your data with the same care you'd treat published content.


Jeremy Carmona is a 13x certified Salesforce Architect whose journalism background shapes his approach to data quality. He applies editorial standards to CRM data, treating every field as copy that needs consistent formatting.

Jeremy Carmona

13x certified Salesforce Architect and founder of Clear Concise Consulting. Specializing in data governance, data quality, and AI governance for nonprofit, government, healthcare, and enterprise organizations. Former instructor of NYU Tandon's first Salesforce Administration course. Published in Salesforce Ben on AI governance and data quality. Based in New York.

https://www.clearconciseconsulting.com
Next
Next

How Long Does a Salesforce Implementation Take?