ML Interview Q Series: How would you normalise Longitude/Latitude feature?

Apr 02, 2025

📚 Browse the full ML Interview series here.

Comprehensive Explanation

When faced with raw latitude and longitude features, a straightforward approach such as standard normalization or min-max scaling often fails to capture the cyclical nature of these coordinates. Longitude in particular wraps around seamlessly at -180 and +180 degrees, meaning that -180 and +180 are effectively the same geographic location, yet a naïve normalization method would treat them as being far apart on a numerical scale.

Connect with me on X (Twitter)

Similarly, if latitude values are near +90 (the North Pole) or -90 (the South Pole), certain distance metrics become distorted due to the spherical geometry of Earth. Below are several strategies, with deeper reasoning, to handle and normalise longitude/latitude features.

Transforming into a 2D Cyclical Representation

Longitude ranges from -180 to +180, and latitude ranges from -90 to +90. One way to manage their cyclical nature is to convert each angle into sine and cosine components:

Each of these features (lon_x, lon_y, lat_x, lat_y) can then be fed into your model. The sin/cos transformation helps the model understand the wrap-around effect; for instance, -180 and +180 end up having nearly identical (sin, cos) pairs. You might then apply a standard scaler or min-max scaler to these new sin/cos features if numerical scaling is still desired.

Parameters explanation (inline text-based):

longitude in radians means longitude * (pi / 180).
latitude in radians means latitude * (pi / 180).
The sin function transforms the angle to a range in [-1, +1].
The cos function also transforms the angle to a range in [-1, +1].
The resulting (lon_x, lon_y) or (lat_x, lat_y) pairs preserve angular information.

Transforming into a 3D Spherical Representation

Another common strategy is to place each (latitude, longitude) pair on a 3D sphere, which can be viewed as embedding them into Cartesian coordinates (x, y, z). Specifically:

This representation preserves the global spherical structure of the Earth and ensures that points that are geographically close remain close in these 3D coordinates. Then you can choose to apply a further scaling mechanism (like standard scaling) if needed.

Using Local Projections

If your application involves a relatively small geographic area, you might sometimes adopt a local coordinate system (e.g., UTM projection or a simpler local projection like a meter-based or kilometer-based system). This local projection flattens the sphere locally, and you could then apply typical scaling approaches (like min-max scaling). This strategy avoids the complexities of global wrap-around because the region of interest is not large enough to encounter significant wrap-around errors.

Potential Pitfalls and Considerations

Handling the wrap-around effect:

Directly applying standard scaling to raw lat/long might cause discontinuities at boundaries like +180 and -180. This is a crucial reason for the sin/cos transformation or an alternative approach.

Handling polar regions:

Near the poles (latitude near +90 or -90), small differences in longitude might have less geographic significance in terms of actual distance. A spherical or 3D embedding approach handles this more gracefully than a naïve 2D scaling.

Geodesic distances:

If your model relies on real-world distances, consider the geometry of the Earth. Direct Euclidean distance in raw lat/long space does not represent actual “great-circle” distance. The 3D spherical embedding or explicit haversine formula is often necessary if precise distance calculations are needed.

Practical data range:

If the dataset covers a tiny region (e.g., a single city), standard scaling might be sufficient, but it is still not robust if your model depends on distances across the boundary of your coordinate range.

Follow-up Questions

How do you decide if it is better to use the 2D sin/cos approach or the 3D spherical representation?

It depends on the nature and scope of your problem. For a purely global application or when you want to preserve accurate distances across large portions of the globe, the 3D spherical approach is often superior because it naturally preserves proximity on the sphere. If your problem data is somewhat local or if you only care about general cyclical effects (like wrap-around), the sin/cos approach for both latitude and longitude can be simpler and still effective.

If you also need to compute or approximate distances between coordinates, a 3D representation or an explicit use of the haversine formula might be advisable. In many practical contexts, either representation is acceptable if the model is primarily using these variables as input features rather than directly computing distances among them.

How would you handle distance calculations once you have normalized coordinates?

If you have transformed your coordinates to a sin/cos 2D representation, using Euclidean distance on (lon_x, lon_y, lat_x, lat_y) does not strictly reflect great-circle distance. You might still need the haversine formula or a 3D Euclidean approach from the spherical coordinates if you want distances that mirror the Earth's surface. For an application that relies heavily on distance metrics (like nearest-neighbor queries, clustering, or any distance-based algorithm), using a geodesic-aware distance calculation is essential.

What if your data covers only a small region like a single city?

When focusing on a small region, the curvature of the Earth and the wrap-around at ±180 degrees might not be a significant factor. In such cases, you could project your lat/long to a planar coordinate system (for example, a UTM zone or a local projection used by GIS systems) and apply standard min-max or z-score normalization. This approach reduces the complexity of global coordinates. However, if there is a possibility that your data might expand to larger geographic coverage, planning for a robust approach (like spherical embedding) can save re-engineering effort later.

Could you use a standard scaler on raw lat/long if you only have local data?

Yes, if the area is small enough that you never cross the ±180-degree boundary and do not come near the poles, you can treat latitude and longitude as if they were effectively linear coordinates within that zone. A standard scaler would not break anything if the geographic coverage is small, and you do not require precise great-circle distances. However, keep in mind that this is a workaround that only applies in narrow contexts. For broader coverage or more accurate distance-based modeling, sin/cos transformations or 3D spherical representations are generally preferred.

Does the Earth's ellipsoidal shape affect these transformations?

Strictly speaking, yes. The Earth is not a perfect sphere; it is an oblate spheroid. Spherical approximations can introduce small errors when converting lat/long into x,y,z coordinates. For most machine learning tasks, especially if not operating at extremely high-precision geospatial scales, the spherical approximation is usually sufficient. If you require more geodetic accuracy, you can use more precise ellipsoidal formulas or local projections that match your region of interest closely.

Why does the 2D sin/cos transformation help with cyclical features?

Longitude is cyclical because -180 degrees is effectively the same place as +180 degrees. A similar concept applies in other cyclical contexts, such as hours on a clock (where 23 and 0 are just one hour apart). By converting angles to (sin, cos) pairs, the model sees that -180 and +180 have nearly identical values, capturing the cyclical wrap-around. Without this transformation, the model might learn an incorrect representation that places -180 and +180 too far apart in the feature space.

Example of Code for Sin/Cos Normalization

import numpy as np

def sin_cos_transform(lat, lon):
    """
    lat, lon are in degrees.
    Returns (lat_x, lat_y, lon_x, lon_y).
    """
    # Convert degrees to radians
    lat_rad = np.radians(lat)
    lon_rad = np.radians(lon)

    lat_x = np.sin(lat_rad)
    lat_y = np.cos(lat_rad)
    lon_x = np.sin(lon_rad)
    lon_y = np.cos(lon_rad)

    return lat_x, lat_y, lon_x, lon_y

# Example usage:
latitudes = [34.05, 36.12, 42.36, -23.55]
longitudes = [-118.24, -115.17, -71.06, -46.63]

for lat, lon in zip(latitudes, longitudes):
    lx, ly, ox, oy = sin_cos_transform(lat, lon)
    print(f"Lat: {lat}, Lon: {lon} -> (lat_x, lat_y, lon_x, lon_y) = ({lx:.3f}, {ly:.3f}, {ox:.3f}, {oy:.3f})")

You can further scale or feed these features into a deep learning model, tree-based model, or any other ML pipeline.