Skip to content

OOM caused by numerous crawls #33520

Open
@H0llyW00dzZ

Description

@H0llyW00dzZ

Description

In the latest versions, 1.23.2 and 1.23.3, memory leaks occur. (update: see below, not memory leak, not regression)

These OOMs are caused by numerous crawls, such as those used by Facebook Inc. (Meta), Amazon (AWS), and other entities that fetch data excessively for AI training.

My Gitea self-hosted configuration:

  • Sessions using files
  • Cache using Redis with a TTL of 5 hours, and the last commit cache is 10K
  • No SSH

Screenshots

Image
Image

The logs exemplify how these companies use crawls for their AI.

Image

Essentially, memory leaks occur when there are many fetch requests, leading to crashes due to excessive memory consumption (thanks to OOM Kubernetes).

Metadata

Metadata

Assignees

No one assigned

    Labels

    issue/needs-feedbackFor bugs, we need more details. For features, the feature must be described in more detail

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions