GraphQL Fundamentals

Ask for exactly what you need, get exactly that

What is GraphQL?

GraphQL is a query language and runtime for APIs developed by Facebook (now Meta) in 2012 and open-sourced in 2015. Unlike REST APIs that return fixed data structures, GraphQL allows clients to specify exactly what data they need, reducing over-fetching and under-fetching.

The Core Problem GraphQL Solves

Traditional REST APIs have a fundamental limitation: they return fixed data structures. If you need user data, you call /users/123 and get all user fields—even if you only need the name. If you need related data (like a user’s orders), you make multiple requests. This leads to:

Over-fetching: Getting data you don’t need (wasting bandwidth)
Under-fetching: Not getting enough data (requiring multiple requests)
Multiple round trips: Slower performance due to network latency

GraphQL solves this by providing a single endpoint where clients can request exactly the data they need, including related data, in a single request.

Key Concepts

Single Endpoint - One URL for all operations
Client-Specified Queries - Client decides what data to fetch
Strongly Typed Schema - Schema defines all possible queries
Introspection - Schema is self-documenting
Real-time Subscriptions - Live updates pushed from the server, typically over WebSockets

GraphQL vs REST: The Core Difference

REST: Multiple Endpoints, Fixed Responses

GET /users/123          → Returns full user object
GET /users/123/orders   → Returns all orders
GET /users/123/profile  → Returns full profile

Problems:

Over-fetching (get data you don’t need) - Mobile app might only need name, but gets full user object with 20+ fields
Under-fetching (need multiple requests) - Need user and orders? That’s 2+ requests
Multiple round trips - Each request adds network latency, slowing down the app

GraphQL: Single Endpoint, Flexible Queries

query {
  user(id: 123) {
    name
    email
    orders {
      id
      total
    }
  }
}

Benefits:

Fetch exactly what you need - Request only the fields you need, nothing more
Get related data in one request - Fetch user and their orders in a single query
Single round trip - One network request instead of multiple, reducing latency
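To make the "single endpoint, single request" idea concrete, here is a minimal sketch of how a client might package the query above for HTTP. It assumes the common convention of a JSON POST body with `query` and `variables` keys. The endpoint URL and the operation name `UserWithOrders` are placeholders, not part of any real API, and the request is built but never sent.

```python
import json
import urllib.request

# Hypothetical endpoint; replace with your server's GraphQL URL.
GRAPHQL_URL = "https://api.example.com/graphql"

# Same shape as the query above, with the id passed as a variable.
QUERY = """
query UserWithOrders($id: ID!) {
  user(id: $id) {
    name
    email
    orders {
      id
      total
    }
  }
}
"""


def build_graphql_request(query: str, variables: dict) -> urllib.request.Request:
    """Build (but do not send) the one POST request that carries the whole query."""
    body = json.dumps({"query": query, "variables": variables}).encode("utf-8")
    return urllib.request.Request(
        GRAPHQL_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_graphql_request(QUERY, {"id": "123"})
```

Passing `request` to `urllib.request.urlopen` would send it. Every operation goes to the same URL, and only the body changes.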

GraphQL Schema: The Foundation

The schema defines what data is available and how to query it.

Basic Schema Example

type User {
  id: ID!
  name: String!
  email: String!
  orders: [Order!]!
}

type Order {
  id: ID!
  total: Float!
  items: [OrderItem!]!
}

type Query {
  user(id: ID!): User
  users: [User!]!
}

type Mutation {
  createUser(name: String!, email: String!): User!
  updateUser(id: ID!, name: String): User!
}

Type System

Type	Meaning	Example
`String`	Text	`"John Doe"`
`Int`	Signed 32-bit integer	`42`
`Float`	Floating-point number	`99.99`
`Boolean`	True/False	`true`
`ID`	Unique identifier (serialized as a string)	`"123"`
`!`	Required (non-null)	`String!`
`[Type]`	List	`[String!]!`
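The `!` and `[Type]` modifiers nest, so `[String!]!` reads as "a required list whose items are all required". The sketch below checks only nullability and list shape for a type reference written as a string. The helper name `conforms` is made up for this illustration, and scalar type checking is deliberately left out.

```python
def conforms(type_ref: str, value) -> bool:
    """Check nullability and list shape of `value` against a GraphQL type reference."""
    if type_ref.endswith("!"):
        # Non-null wrapper: None is rejected, then the inner type is checked.
        return value is not None and conforms(type_ref[:-1], value)
    if value is None:
        return True  # Nullable position, so None is acceptable
    if type_ref.startswith("[") and type_ref.endswith("]"):
        # List wrapper: every item must conform to the inner type.
        return isinstance(value, list) and all(
            conforms(type_ref[1:-1], item) for item in value
        )
    return True  # Named type; scalar validation is out of scope for this sketch


print(conforms("[String!]!", ["a", "b"]))   # True
print(conforms("[String!]!", ["a", None]))  # False: items are non-null
print(conforms("[String!]!", None))         # False: the list itself is non-null
print(conforms("[String]", [None]))         # True: nullable items, nullable list
```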

Queries: Fetching Data

Queries are for reading data. They’re like GET requests in REST.

Simple Query

query {
  user(id: "123") {
    name
    email
  }
}

Response:

{
  "data": {
    "user": {
      "name": "John Doe",
      "email": "[email protected]"
    }
  }
}
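The response mirrors the shape of the query: only the selected fields come back. A rough way to picture this is a function that trims a stored record down to a selection. In this sketch the selection is modeled as a nested dictionary and the record contents are invented. A real server works from the parsed query and calls resolvers rather than filtering a finished record.

```python
def select_fields(record, selection):
    """Return only the requested fields; nested selections recurse into child records."""
    if isinstance(record, list):
        return [select_fields(item, selection) for item in record]
    result = {}
    for field, sub_selection in selection.items():
        value = record[field]
        if sub_selection and value is not None:
            value = select_fields(value, sub_selection)
        result[field] = value
    return result


# Hypothetical stored record with more fields than the client asked for.
USER = {
    "id": "123",
    "name": "John Doe",
    "phone": "555-0100",
    "orders": [{"id": "1", "total": 99.99, "status": "shipped"}],
}

# Mirrors: { user { name orders { id total } } }
SELECTION = {"name": None, "orders": {"id": None, "total": None}}

response = {"data": {"user": select_fields(USER, SELECTION)}}
```

Here `phone` and `status` never reach the client because the selection did not ask for them.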

Query with Arguments

query {
  users(limit: 10, offset: 0) {
    id
    name
  }
}

Query with Nested Data

query {
  user(id: "123") {
    name
    email
    orders {
      id
      total
      items {
        product {
          name
          price
        }
        quantity
      }
    }
  }
}

This single query replaces multiple REST calls:

GET /users/123
GET /users/123/orders
GET /orders/456/items
GET /products/789

Query with Aliases

Aliases let you request the same field more than once with different arguments:

query {
  user1: user(id: "123") {
    name
  }
  user2: user(id: "456") {
    name
  }
}

Query with Fragments

Fragments define reusable sets of fields:

fragment UserInfo on User {
  id
  name
  email
}

query {
  user(id: "123") {
    ...UserInfo
    orders {
      id
    }
  }
}

Mutations: Modifying Data

Mutations create, update, or delete data, much like POST, PUT, and DELETE requests in REST.

Create Mutation

mutation {
  createUser(name: "Jane Doe", email: "[email protected]") {
    id
    name
    email
  }
}

Response:

{
  "data": {
    "createUser": {
      "id": "456",
      "name": "Jane Doe",
      "email": "[email protected]"
    }
  }
}

Update Mutation

mutation {
  updateUser(id: "123", name: "John Smith") {
    id
    name
    email
  }
}

Delete Mutation

mutation {
  deleteUser(id: "123") {
    id
  }
}

Multiple Mutations

Execute multiple mutations in one request. Top-level mutation fields run serially, in the order they are written:

mutation {
  createUser(name: "Alice", email: "[email protected]") {
    id
  }
  createOrder(userId: "123", items: [...]) {
    id
  }
}

Subscriptions: Real-Time Updates

Subscriptions push real-time data from the server to the client, typically over WebSockets.

Subscription Example

subscription {
  userUpdated(userId: "123") {
    id
    name
    email
  }
}

How it works:

Client subscribes via WebSocket
Server sends updates when data changes
Client receives real-time updates
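The three steps above can be sketched without a network layer. In this illustration an in-memory `PubSub` class stands in for whatever broker a real server would use, and an async generator plays the role of the `userUpdated` subscription. The class, the topic naming scheme, and the `limit` parameter are all assumptions made for the example.

```python
import asyncio


class PubSub:
    """In-memory stand-in for the broker behind a subscription field."""

    def __init__(self):
        self._subscribers = {}  # topic -> list of asyncio.Queue

    def subscribe(self, topic):
        queue = asyncio.Queue()
        self._subscribers.setdefault(topic, []).append(queue)
        return queue

    def publish(self, topic, payload):
        for queue in self._subscribers.get(topic, []):
            queue.put_nowait(payload)


async def user_updated(pubsub, user_id, limit):
    """Async generator that yields one event per update for the given user."""
    queue = pubsub.subscribe(f"user:{user_id}")
    for _ in range(limit):  # A real subscription runs until the client disconnects
        yield await queue.get()


async def demo():
    pubsub = PubSub()
    received = []

    async def client():
        async for event in user_updated(pubsub, "123", limit=2):
            received.append(event)

    task = asyncio.create_task(client())
    await asyncio.sleep(0)  # Let the client subscribe before anything is published
    pubsub.publish("user:123", {"id": "123", "name": "John Smith"})
    pubsub.publish("user:456", {"id": "456", "name": "Someone Else"})  # Other topic
    pubsub.publish("user:123", {"id": "123", "name": "Johnny Smith"})
    await task
    return received


events = asyncio.run(demo())
```

Only the two events for user `123` reach the subscriber. The update for user `456` goes to a topic nobody is listening on.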

Use Cases

Live chat messages
Real-time notifications
Stock price updates
Collaborative editing
Live dashboards

The N+1 Query Problem

The N+1 query problem is the most common performance pitfall in GraphQL servers.

The Problem

query {
  users {
    name
    orders {  # N+1 problem!
      id
      total
    }
  }
}

What happens:

Query 1: SELECT * FROM users (gets 100 users)
Query 2: SELECT * FROM orders WHERE user_id = 1
Query 3: SELECT * FROM orders WHERE user_id = 2
… (98 more queries, one for each remaining user)

Total: 1 + 100 = 101 queries. This is the N+1 query problem: one query to get the list, then N queries (one per item) to get the related data.
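The count is easy to reproduce. The sketch below uses an in-memory SQLite database with made-up `users` and `orders` tables, resolves the query the naive way, and counts each `SELECT` it issues. The table layout and function names are assumptions for illustration only.

```python
import sqlite3


def make_db(num_users=100):
    """Create an in-memory database with one order per user."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
    db.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, user_id INTEGER, total REAL)"
    )
    db.executemany(
        "INSERT INTO users VALUES (?, ?)",
        [(i, f"user{i}") for i in range(1, num_users + 1)],
    )
    db.executemany(
        "INSERT INTO orders (user_id, total) VALUES (?, ?)",
        [(i, 10.0 * i) for i in range(1, num_users + 1)],
    )
    return db


def fetch_users_with_orders_naive(db):
    """Resolve users { name orders { id total } } naively, counting SELECTs."""
    query_count = 0
    users = db.execute("SELECT id, name FROM users").fetchall()
    query_count += 1
    result = []
    for user_id, name in users:
        # One extra query per user: this loop is the "N" in N+1.
        orders = db.execute(
            "SELECT id, total FROM orders WHERE user_id = ?", (user_id,)
        ).fetchall()
        query_count += 1
        result.append(
            {"name": name, "orders": [{"id": o[0], "total": o[1]} for o in orders]}
        )
    return result, query_count


result, query_count = fetch_users_with_orders_naive(make_db(100))
print(query_count)  # 101
```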

Solution: DataLoader (Batching)

DataLoader is a utility that batches the individual lookups made while resolving a query into one request to the data source.

How DataLoader works:

Collects all requests in a batch
Waits for next event loop tick
Executes single batched query
Distributes results to individual requests

DataLoader Implementation
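The four steps listed above can be sketched with `asyncio`. This is not the DataLoader library itself: `SimpleDataLoader` is a stripped-down illustration written for this section, with no per-request cache or batch-size limit, and the order data and `batch_load_orders` function are invented stand-ins for a real database call.

```python
import asyncio


class SimpleDataLoader:
    """Collects keys requested in the same event-loop tick into one batch call."""

    def __init__(self, batch_fn):
        self._batch_fn = batch_fn  # async: list of keys -> list of values, same order
        self._pending = {}         # key -> Future awaiting its value
        self._dispatch_task = None

    def load(self, key):
        loop = asyncio.get_running_loop()
        if not self._pending:
            # First key of a new batch: schedule a single dispatch for the next tick.
            self._dispatch_task = loop.create_task(self._dispatch())
        if key not in self._pending:
            self._pending[key] = loop.create_future()
        return self._pending[key]

    async def _dispatch(self):
        batch, self._pending = self._pending, {}
        keys = list(batch)
        try:
            values = await self._batch_fn(keys)  # One batched query for all keys
        except Exception as exc:
            for future in batch.values():
                future.set_exception(exc)
            return
        # Hand each caller the value that matches its key.
        for key, value in zip(keys, values):
            batch[key].set_result(value)


# Hypothetical data and batch function, standing in for
# SELECT * FROM orders WHERE user_id IN (...)
ORDERS_BY_USER = {1: [{"id": 10, "total": 25.0}], 2: [{"id": 11, "total": 40.0}]}
batch_calls = []


async def batch_load_orders(user_ids):
    batch_calls.append(list(user_ids))
    return [ORDERS_BY_USER.get(uid, []) for uid in user_ids]


async def resolve_orders(user_id, loader):
    """What a User.orders resolver would do: ask the loader, not the database."""
    return await loader.load(user_id)


async def main():
    loader = SimpleDataLoader(batch_load_orders)
    return await asyncio.gather(*(resolve_orders(uid, loader) for uid in (1, 2, 3)))


orders = asyncio.run(main())
```

Three resolvers ask for orders, but `batch_load_orders` runs once with all three user IDs, so 1 + N queries become 2. Production loaders are usually created per request so that cached values do not leak between users.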

Resolvers: The Implementation

Resolvers are functions that fetch data for each field.

Resolver Structure
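A resolver conventionally receives the parent object, a context or info value, and the field's arguments. The sketch below shows that shape with a hand-written resolver map and a tiny recursive executor. The data, the `(type, field)` map layout, and the `execute` function are assumptions for illustration, and GraphQL libraries provide their own versions of all three.

```python
# Hypothetical in-memory data.
USERS = {"123": {"id": "123", "name": "John Doe"}}
ORDERS = [{"id": "1", "user_id": "123", "total": 99.99}]


def resolve_user(parent, info, id):
    """Root resolver for Query.user; parent is None at the root."""
    return USERS.get(id)


def resolve_user_orders(parent, info):
    """Resolver for User.orders; parent is the user returned by resolve_user."""
    return [order for order in ORDERS if order["user_id"] == parent["id"]]


# Resolver map keyed by (type name, field name).
RESOLVERS = {
    ("Query", "user"): resolve_user,
    ("User", "orders"): resolve_user_orders,
}

# Return type of each object-valued field, so the executor knows which
# resolvers apply one level down.
FIELD_TYPES = {("Query", "user"): "User", ("User", "orders"): "Order"}


def execute(type_name, parent, selection, info=None):
    """Resolve each selected field, falling back to a plain dictionary lookup."""
    result = {}
    for field, (args, sub_selection) in selection.items():
        resolver = RESOLVERS.get((type_name, field))
        value = resolver(parent, info, **args) if resolver else parent.get(field)
        if sub_selection and value is not None:
            child_type = FIELD_TYPES[(type_name, field)]
            if isinstance(value, list):
                value = [execute(child_type, item, sub_selection, info) for item in value]
            else:
                value = execute(child_type, value, sub_selection, info)
        result[field] = value
    return result


# Mirrors: { user(id: "123") { name orders { id total } } }
selection = {
    "user": (
        {"id": "123"},
        {"name": ({}, None), "orders": ({}, {"id": ({}, None), "total": ({}, None)})},
    )
}
data = execute("Query", None, selection)
```

Fields without an entry in the map, such as `name`, fall through to a dictionary lookup, which is how default resolvers typically behave.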

When to Use GraphQL vs REST

Use GraphQL When:

Multiple clients with different data needs - Web app needs all fields, mobile app needs minimal fields
Mobile apps where bandwidth matters - Reducing data transfer saves battery and improves performance
Complex relationships between data - Easier to fetch related data in one query
Rapidly evolving API requirements - New fields and types can be added without breaking existing queries (removing or renaming fields is still a breaking change)
Real-time updates needed (subscriptions) - Subscriptions are part of the language and are typically served over WebSockets

Use REST When:

Simple CRUD operations - REST is simpler for basic create/read/update/delete
Caching is critical (HTTP caching) - REST benefits from HTTP caching infrastructure
File uploads (not part of the GraphQL specification) - REST is better suited for file operations
Simple APIs where over-fetching isn’t a problem - If bandwidth isn’t a concern, REST is simpler
Existing REST infrastructure - If you already have REST APIs, migration might not be worth it

GraphQL Best Practices

1. Use Pagination

Bad:

query {
  users {  # Could return millions!
    id
    name
  }
}

Why it’s bad: Without pagination, a query could return millions of records, causing performance issues, memory problems, and timeouts.

Good:

query {
  users(first: 10, after: "cursor123") {
    edges {
      node {
        id
        name
      }
    }
    pageInfo {
      hasNextPage
      endCursor
    }
  }
}
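On the server side, a resolver for a paginated `users(first, after)` field might look roughly like the sketch below. It assumes an in-memory list and an opaque cursor that is simply a base64-encoded list index. The `idx:` cursor format and the function names are invented for the example, and a real implementation would usually derive cursors from a stable sort key in the database.

```python
import base64
from typing import Optional

# Hypothetical data set of 25 users.
USERS = [{"id": str(i), "name": f"User {i}"} for i in range(1, 26)]


def encode_cursor(index: int) -> str:
    """Turn a list position into an opaque cursor string."""
    return base64.b64encode(f"idx:{index}".encode()).decode()


def decode_cursor(cursor: str) -> int:
    """Recover the list position from a cursor produced by encode_cursor."""
    return int(base64.b64decode(cursor).decode().split(":", 1)[1])


def paginate_users(first: int, after: Optional[str] = None) -> dict:
    """Return one page in the edges / pageInfo shape used by the query above."""
    start = decode_cursor(after) + 1 if after else 0
    page = USERS[start:start + first]
    edges = [
        {"cursor": encode_cursor(start + offset), "node": user}
        for offset, user in enumerate(page)
    ]
    return {
        "edges": edges,
        "pageInfo": {
            "hasNextPage": start + len(page) < len(USERS),
            "endCursor": edges[-1]["cursor"] if edges else None,
        },
    }


page1 = paginate_users(first=10)
page2 = paginate_users(first=10, after=page1["pageInfo"]["endCursor"])
```

The client passes each page's `endCursor` back as `after` to fetch the next page, and stops when `hasNextPage` is false.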

2. Limit Query Depth

Prevent deeply nested queries:

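from graphql import GraphQLError  # Assumes the graphql-core package

MAX_QUERY_DEPTH = 10

def validate_query_depth(query, max_depth=MAX_QUERY_DEPTH):
    """Reject queries whose selection sets nest deeper than max_depth."""
    # calculate_depth is assumed to be defined elsewhere and to return
    # the deepest nesting level found in the query
    depth = calculate_depth(query)
    if depth > max_depth:
        raise GraphQLError("Query too deep")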
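The snippet above leaves `calculate_depth` undefined. One simplified way to write it, assuming the query arrives as a string, is to track brace nesting while skipping strings, comments, and argument lists. This is a sketch rather than a full parser: it does not follow fragment spreads, and a production server would normally walk the parsed query AST instead.

```python
def calculate_depth(query: str) -> int:
    """Return the deepest selection-set nesting in a query string (simplified)."""
    depth = max_depth = paren_depth = 0
    i = 0
    while i < len(query):
        ch = query[i]
        if ch == "#":
            # Comment: skip to the end of the line.
            newline = query.find("\n", i)
            i = len(query) if newline == -1 else newline
        elif query.startswith('"""', i):
            # Block string: skip to the closing triple quote.
            end = query.find('"""', i + 3)
            i = len(query) if end == -1 else end + 2
        elif ch == '"':
            # Regular string: skip to the closing quote, honoring escapes.
            i += 1
            while i < len(query) and query[i] != '"':
                i += 2 if query[i] == "\\" else 1
        elif ch == "(":
            paren_depth += 1
        elif ch == ")":
            paren_depth -= 1
        elif ch == "{" and paren_depth == 0:
            # Braces inside argument lists are input objects, not selection sets.
            depth += 1
            max_depth = max(max_depth, depth)
        elif ch == "}" and paren_depth == 0:
            depth -= 1
        i += 1
    return max_depth


NESTED_QUERY = """
query {
  user(id: "123") {
    orders {
      items {
        product { name }
      }
    }
  }
}
"""
print(calculate_depth(NESTED_QUERY))  # 5
```

The operation's own braces count as level 1 here, so `query { user { name } }` has depth 2.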

3. Use Field-Level Authorization

Authorize at field level:

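# query, user_repository, and can_access_field are assumed to be defined elsewhere
@query.field("user")
def resolve_user(_, info, id: str):
    user = user_repository.find_by_id(id)
    if user is None:
        return None  # The schema declares Query.user as nullable

    # Check if user can access email field
    if not can_access_field(info, "email"):
        user = dict(user)        # Copy so the stored record is not mutated
        user.pop("email", None)  # Remove email from response

    return user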

4. Implement Query Complexity Analysis

Prevent expensive queries:

def calculate_complexity(query):
    """Sum the cost of every field, scaling list fields by their size."""
    complexity = 0
    for field in query.fields:
        cost = field.complexity
        if field.has_list:
            # Scale only this field's cost, not the running total
            cost *= field.list_size
        complexity += cost
    return complexity

if calculate_complexity(query) > MAX_COMPLEXITY:
    raise GraphQLError("Query too complex")

5. Use Introspection Wisely

Disable introspection in production (or limit it):

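# Disable introspection (assumes Ariadne, as the make_executable_schema call suggests)
from ariadne import make_executable_schema
from ariadne.asgi import GraphQL

schema = make_executable_schema(type_defs, resolvers)
app = GraphQL(schema, introspection=False)  # In production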

LLD Connection: Implementing GraphQL

At the code level, GraphQL translates to resolvers, schema definitions, and DataLoader patterns.

Complete Example

Key Takeaways

Ask for What You Need

GraphQL lets clients specify exactly what data they need, reducing over-fetching and under-fetching.

Watch for N+1

The N+1 query problem is GraphQL’s biggest performance issue. Use DataLoader to batch requests.

Resolvers Fetch Data

Resolvers are functions that fetch data for each field. They’re where your business logic lives.

Schema is Contract

GraphQL schema defines your API contract. It’s self-documenting and strongly typed.

Next Steps

Learn gRPC & Protocol Buffers - high-performance binary protocol
Understand API Gateway Pattern - single entry point for APIs
Master Rate Limiting - controlling API usage
