From Text to Numbers: How Computers Convert Strings to Integers Under the Hood

An in-depth exploration of how computers transform textual representations of numbers into actual integers, with code examples in multiple languages and performance considerations.

Posted May 4, 2025 Updated Jul 20, 2025

By Mohit Mishra

views 15 min read

How Computers Cast Strings to Numbers

Ever wondered what happens when you convert a string like “123” into a number that a computer can actually do math with? At first glance, it seems simple—after all, we humans can look at the characters “123” and instantly recognize the number 123.

But for computers, which can only process binary data, transforming text into computable numbers requires some fascinating translation work under the hood.

Why We Need String-to-Number Conversion

Before diving into implementation details, let’s understand why we need to convert strings to numbers in the first place.

Computers receive input in various forms—keystrokes, file contents, network data—and much of this data arrives as text. When users type numbers into a form field or when we read data from a CSV file, what we’re actually getting are character sequences, not numerical values.

The distinction matters because:

  
# String concatenation vs. numerical addition
"5" + "10" = "510"  # Joining text
5 + 10 = 15         # Mathematical operation

That’s a huge difference! To perform mathematical operations, we need actual numbers, not textual representations of them. This is where string-to-number conversion becomes essential.

The ASCII Connection

At the heart of string-to-number conversion is a fundamental concept: character encoding. In most programming contexts, we’re working with ASCII (or its superset, Unicode) where each character corresponds to a specific numerical value.

For instance, in ASCII:

Character ‘0’ corresponds to decimal value 48
Character ‘1’ corresponds to decimal value 49
Character ‘2’ corresponds to decimal value 50
And so on…

What’s convenient here is that numeric characters are grouped and ordered sequentially—the character ‘0’ is encoded as 48, ‘1’ as 49, ‘2’ as 50, and so forth. This pattern means we can extract the actual digit value by simply subtracting 48 (or the value of ‘0’) from any numeric character’s ASCII value:

'5' → ASCII value 53 → 53 - 48 = 5

But converting one character only gets us a single digit. What about multi-digit numbers?

The Algorithm Behind String-to-Number Conversion

Let’s break down how computers typically convert a string like “2573” into the number 2573:

Start with a result variable initialized to 0
Process each character from left to right
For each character:
- Convert it to its numeric value (by subtracting the ASCII value of ‘0’)
- Multiply the existing result by 10
- Add the new digit value
After processing all characters, result contains the final number

Let’s walk through this with “2573”:

Start: result = 0
Character '2': 
   - Numeric value = ASCII('2') - ASCII('0') = 50 - 48 = 2
   - result = 0 * 10 + 2 = 2

Character '5':
   - Numeric value = ASCII('5') - ASCII('0') = 53 - 48 = 5
   - result = 2 * 10 + 5 = 25

Character '7':
   - Numeric value = ASCII('7') - ASCII('0') = 55 - 48 = 7
   - result = 25 * 10 + 7 = 257

Character '3':
   - Numeric value = ASCII('3') - ASCII('0') = 51 - 48 = 3
   - result = 257 * 10 + 3 = 2573

Final result: 2573

What we’re doing here is building up the number position by position, where each digit gets properly weighted according to its place value. This is essentially implementing the place-value system we learned in elementary school, where:

2573 = 2×10³ + 5×10² + 7×10¹ + 3×10⁰

String-to-Number Conversion in Different Languages

Now that we understand the basic principle, let’s look at how various programming languages implement string-to-number conversion.

Python

Python offers straightforward functions for converting strings to numbers:

  
# Integer conversion
num_str = "123"
num_int = int(num_str)  # Returns 123

# Float conversion
float_str = "123.45"
num_float = float(float_str)  # Returns 123.45

# Error handling
try:
    int("abc")  # This will raise a ValueError
except ValueError:
    print("Cannot convert non-numeric string to integer")
    
# Base conversion
binary_str = "1010"
binary_to_int = int(binary_str, 2)  # Returns 10
hex_str = "1A"
hex_to_int = int(hex_str, 16)  # Returns 26

Python’s conversion functions are robust—they handle whitespace, validate input, and throw appropriate exceptions for invalid conversions.

JavaScript

JavaScript provides multiple ways to convert strings to numbers:

  
// Using Number constructor
let num1 = Number("123");  // Returns 123
let num2 = Number("123.45");  // Returns 123.45
let invalid = Number("abc");  // Returns NaN (Not a Number)

// Using parseInt and parseFloat
let int1 = parseInt("123", 10);  // Returns 123 (base 10)
let int2 = parseInt("123abc", 10);  // Returns 123 (ignores trailing non-numeric)
let hex = parseInt("1A", 16);  // Returns 26 (hexadecimal)
let float1 = parseFloat("123.45");  // Returns 123.45
let float2 = parseFloat("123.45abc");  // Returns 123.45

// Using unary plus operator
let num3 = +"123";  // Returns 123

// Using math operations
let num4 = "123" * 1;  // Returns 123

Each method has its quirks. Number() is strict and returns NaN for invalid inputs, while parseInt() extracts whatever valid numeric characters it can find at the beginning of the string and ignores the rest.

C++

C++ offers several options for converting strings to numbers:

  
#include <string>
#include <iostream>

int main() {
    // Using stoi for integers
    std::string str1 = "123";
    int num1 = std::stoi(str1);  // Returns 123
    
    // Using stof for floats
    std::string str2 = "123.45";
    float num2 = std::stof(str2);  // Returns 123.45
    
    // Using strtol for long integers with base
    const char* str3 = "1A";
    char* end;
    long num3 = std::strtol(str3, &end, 16);  // Returns 26 (hex)
    
    // Using sscanf (from C)
    const char* str4 = "123";
    int num4;
    sscanf(str4, "%d", &num4);  // Sets num4 to 123
    
    return 0;
}

C++ functions like stoi() and stof() throw exceptions for invalid inputs, while C-style functions like strtol() provide pointers to the end of the parsed number, helping identify parsing errors.

C#

C# provides parsing methods and conversion utilities:

  
// Using Parse methods
string str1 = "123";
int num1 = int.Parse(str1);  // Returns 123

string str2 = "123.45";
double num2 = double.Parse(str2);  // Returns 123.45

// Using TryParse for safe conversion
string str3 = "abc";
int num3;
bool success = int.TryParse(str3, out num3);  // success = false, num3 = 0

// Using Convert class
string str4 = "123";
int num4 = Convert.ToInt32(str4);  // Returns 123

C#’s TryParse methods are particularly useful because they don’t throw exceptions—they simply return a boolean indicating whether the conversion succeeded.

Under the Hood: Manual String-to-Number Conversion

Let’s implement our own basic string-to-integer conversion function to better understand the process:

  
int string_to_int(const char* str) {
    int result = 0;
    int sign = 1;
    
    // Handle negative numbers
    if (*str == '-') {
        sign = -1;
        str++;
    }
    
    // Process each character
    while (*str != '\0') {
        // Check if character is a digit
        if (*str >= '0' && *str <= '9') {
            // Convert character to digit and add to result
            result = result * 10 + (*str - '0');
        } else {
            // Invalid character found
            return -1;  // Error code
        }
        str++;
    }
    
    return result * sign;
}

This function demonstrates the core algorithm we discussed earlier. For each character:

We verify it’s a valid digit
We convert it to its numeric value by subtracting the ASCII value of ‘0’
We incorporate it into our running total by multiplying the existing total by 10 and adding the new digit

This approach works for most basic integer conversions, but production-ready implementations need additional features like:

Handling different bases (binary, octal, hexadecimal)
Overflow detection
Specific error reporting
Locale handling for decimals (e.g., “123,45” in European notation)

Common Challenges in String-to-Number Conversion

String-to-number conversion isn’t always straightforward. Here are some common challenges and how to address them:

Invalid Input

Not all strings represent valid numbers. Invalid characters, empty strings, or malformed input need proper handling:

  
def safe_int_conversion(s):
    try:
        return int(s.strip()), None
    except ValueError as e:
        return None, str(e)
        
# Usage
num, error = safe_int_conversion("  123  ")  # Returns (123, None)
num, error = safe_int_conversion("abc")  # Returns (None, "invalid literal for int() with base 10: 'abc'")

Trailing Characters

Different conversion functions handle trailing non-numeric characters differently:

  
// JavaScript's parseInt ignores trailing characters
parseInt("123abc", 10);  // Returns 123

// But Number() rejects strings with trailing non-numeric characters
Number("123abc");  // Returns NaN

Locale-Specific Formats

Number formats vary across regions—decimal points, thousands separators, and even digit representations differ:

  
// US: 1,234.56
// Many European countries: 1.234,56

// Using Intl.NumberFormat in JS
const usNum = new Intl.NumberFormat('en-US').format(1234.56);  // "1,234.56"
const deNum = new Intl.NumberFormat('de-DE').format(1234.56);  // "1.234,56"

// Parsing with locale support
const usValue = parseFloat("1,234.56".replace(/,/g, ''));  // 1234.56
const deValue = parseFloat("1.234,56".replace(/\./g, '').replace(',', '.'));  // 1234.56

Floating-Point Precision

Converting decimal strings to floating-point numbers can introduce precision issues:

  
parseFloat("0.1") + parseFloat("0.2");  // 0.30000000000000004

This is due to how floating-point numbers are represented in binary, not an issue with the string conversion itself.

Performance Considerations

Different conversion methods have varying performance characteristics:

JavaScript Performance

  
// Benchmarking different conversion methods
console.time('Number');
for (let i = 0; i < 1000000; i++) {
    Number("12345");
}
console.timeEnd('Number');

console.time('parseInt');
for (let i = 0; i < 1000000; i++) {
    parseInt("12345", 10);
}
console.timeEnd('parseInt');

console.time('Unary Plus');
for (let i = 0; i < 1000000; i++) {
    +"12345";
}
console.timeEnd('Unary Plus');

Typically, the unary plus operator (+) is fastest, followed by Number(), with parseInt() being slightly slower due to its more complex parsing logic.

Batch Processing

For high-volume string-to-number conversions, batch processing can be more efficient than individual conversions:

  
# Less efficient: individual conversions
numbers = []
with open('data.csv', 'r') as file:
    for line in file:
        numbers.append(int(line.strip()))

# More efficient: batch processing with list comprehension
with open('data.csv', 'r') as file:
    numbers = [int(line.strip()) for line in file]

Practical Applications

String-to-number conversion is essential in many real-world scenarios:

User Input Validation

When collecting numerical input from users, conversion and validation are critical:

  
function processUserInput() {
    const input = document.getElementById('age').value;
    const age = parseInt(input, 10);
    
    if (isNaN(age) || age <= 0) {
        alert("Please enter a valid age");
        return;
    }
    
    // Process valid age...
}

Data Processing

When working with external data sources like CSV files or API responses:

  
import csv

total_sales = 0
with open('sales.csv', 'r') as file:
    reader = csv.DictReader(file)
    for row in reader:
        # Convert string to float for calculation
        sale_amount = float(row['amount'])
        total_sales += sale_amount
        
print(f"Total sales: ${total_sales:.2f}")

Configuration Parsing

When reading configuration files:

  
def load_config(filename):
    config = {}
    with open(filename, 'r') as file:
        for line in file:
            if line.strip() and not line.startswith('#'):
                key, value = line.split('=', 1)
                key = key.strip()
                value = value.strip()
                
                # Try to convert numeric values
                try:
                    if '.' in value:
                        config[key] = float(value)
                    else:
                        config[key] = int(value)
                except ValueError:
                    config[key] = value
    
    return config

Building Your Own Parser

For educational purposes or specific requirements, you might need to build your own string-to-number parser. Here’s a more comprehensive example in C++:

  
#include <string>
#include <stdexcept>
#include <limits>

// Custom string to integer conversion
int custom_stoi(const std::string& str, int base = 10) {
    if (str.empty()) {
        throw std::invalid_argument("Empty string");
    }
    
    size_t i = 0;
    int sign = 1;
    long long result = 0; // Using long long to check for overflow
    
    // Handle sign
    if (str[0] == '-') {
        sign = -1;
        i++;
    } else if (str[0] == '+') {
        i++;
    }
    
    // Process digits
    for (; i < str.length(); i++) {
        char c = str[i];
        int digit;
        
        if (c >= '0' && c <= '9') {
            digit = c - '0';
        } else if (base > 10 && c >= 'A' && c <= 'Z') {
            digit = c - 'A' + 10;
        } else if (base > 10 && c >= 'a' && c <= 'z') {
            digit = c - 'a' + 10;
        } else {
            throw std::invalid_argument("Invalid character in input");
        }
        
        if (digit >= base) {
            throw std::invalid_argument("Digit out of range for given base");
        }
        
        // Check for overflow before updating result
        if (result > (std::numeric_limits<int>::max() - digit) / base) {
            throw std::overflow_error("Integer overflow");
        }
        
        result = result * base + digit;
    }
    
    return static_cast<int>(result * sign);
}

This implementation handles:

Different bases (decimal, hexadecimal, etc.)
Sign (positive or negative)
Input validation
Overflow detection

Beyond Basic Conversion: Advanced Topics

Fixed-Point Arithmetic

For precise decimal arithmetic, especially in financial applications, fixed-point arithmetic is often preferred over floating-point:

  
# Simple fixed-point representation (2 decimal places)
def string_to_fixed_point(s):
    # Convert "12.34" to 1234 (integer cents)
    parts = s.split('.')
    if len(parts) == 1:
        return int(parts[0]) * 100
    elif len(parts) == 2:
        return int(parts[0]) * 100 + int(parts[1].ljust(2, '0')[:2])
    else:
        raise ValueError("Invalid number format")

# Usage
dollars_cents = string_to_fixed_point("12.34")  # 1234 (representing $12.34)

Locale-Aware Parsing

For international applications, locale-aware parsing is crucial:

  
import locale

# Set locale to German
locale.setlocale(locale.LC_ALL, 'de_DE.UTF-8')

# Parse German format (1.234,56)
german_num = locale.atof("1.234,56")  # Returns 1234.56

# Reset to default locale
locale.setlocale(locale.LC_ALL, '')

Binary, Octal, and Hexadecimal Conversion

Different number bases require specialized handling:

  
# Binary to int
binary = "1010"
binary_val = int(binary, 2)  # 10

# Octal to int
octal = "777"
octal_val = int(octal, 8)  # 511

# Hex to int
hexadecimal = "1A"
hex_val = int(hexadecimal, 16)  # 26

# Custom base (base 36 includes 0-9 and A-Z)
base36 = "Z"
base36_val = int(base36, 36)  # 35

Practice Exercises

Ready to test your understanding? Try these exercises:

Basic Converter: Write a function that converts a string to an integer without using built-in conversion functions.
Robust Parser: Enhance your converter to handle negative numbers, whitespace, and invalid characters.
Base Converter: Extend your function to support different number bases (binary, octal, hexadecimal).
Calculator: Build a simple calculator that takes string expressions like “12 + 34” and computes the result.
CSV Processor: Write a program that reads a CSV file with numeric data, converts strings to appropriate number types, and performs calculations.

Summary

We’ve explored how computers convert strings to numbers, from high-level language functions to the underlying algorithms. Key takeaways:

Computers store text as character codes (ASCII/Unicode), which must be converted to binary numbers for mathematical operations.
The core conversion algorithm involves:
- Processing each character in the string
- Converting it to its numeric value
- Building up the final number using place values
Different programming languages offer various functions for string-to-number conversion, each with unique behaviors for edge cases.
Common challenges include handling invalid input, trailing characters, locale-specific formats, and maintaining precision.
Implementing your own converter provides insights into the conversion process and gives you control over error handling and edge cases.

Understanding string-to-number conversion is fundamental to programming, especially when working with user input, file data, or external APIs. By mastering these concepts, you’ll write more robust code that handles numerical data with confidence.

References

ASCII Table: ASCII Table and Description
Python Documentation: Built-in Functions
JavaScript MDN: Number
C++ Reference: String to Number Conversion
Unicode Standard: Unicode 14.0 Character Database

Programming, Computer Science

This post is licensed under CC BY 4.0 by the author.