Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-10649

Fix RMNodeImpl.updateExistContainers leak

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      YARN-5168 the patch added RMNodeImpl.updatedExistContainers, but it didn't remove completed containers.

      These objects (ContainerStatusPBImpl & ContainerIdPBImpl ) stay in RMNodeImpl.updatedExistContainers forever.
      Because of this leak, ResourceManager in our production environment encountered OOM issue. We found 13 million ContainerStatusPBImpl objects in the heap dump file of ResourceManager.

      The patch has been applied in our production env and so far it works well.

       

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            max2049 Max Xie
            max2049 Max Xie
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 2h
                2h

                Slack

                  Issue deployment