oom killer - 其它 - 程序员开发

TOP

oom killer(五)

2019-09-01 23:08:40 【大中小】浏览:115次

ONTINUE: continue; case OOM_SCAN_ABORT: rcu_read_unlock(); return (struct task_struct *)(-1UL); case OOM_SCAN_OK: break; }; points = oom_badness(p, NULL, nodemask, totalpages); if (!points || points < chosen_points) continue; /* Prefer thread group leaders for display purposes */ if (points == chosen_points && thread_group_leader(chosen)) continue; chosen = p; chosen_points = points; } if (chosen) get_task_struct(chosen); rcu_read_unlock(); *ppoints = chosen_points * 1000 / totalpages; return chosen; }

此函数通过for_each_process_thread()宏遍历所有进程，进而借用oom_scan_process_thread()获得进程扫描类型然后通过switch-case作特殊化处理，例如存在某进程退出中则中断扫描、某进程占用内存过多且被标识为优先kill掉则优选等特殊处理。而正常情况则会通过oom_badness()计算出进程的分值，然后根据最高分值将进程控制块返回回去。

顺便研究一下oom_badness()的实现:

【file:/ mm/oom_kill.c】
/**
 * oom_badness - heuristic function to determine which candidate task to kill
 * @p: task struct of which task we should calculate
 * @totalpages: total present RAM allowed for page allocation
 *
 * The heuristic for determining which task to kill is made to be as simple and
 * predictable as possible. The goal is to return the highest value for the
 * task consuming the most memory to avoid subsequent oom failures.
 */
unsigned long oom_badness(struct task_struct *p, struct mem_cgroup *memcg,
              const nodemask_t *nodemask, unsigned long totalpages)
{
    long points;
    long adj;
 
    if (oom_unkillable_task(p, memcg, nodemask))
        return 0;
 
    p = find_lock_task_mm(p);
    if (!p)
        return 0;
 
    adj = (long)p->signal->oom_score_adj;
    if (adj == OOM_SCORE_ADJ_MIN) {
        task_unlock(p);
        return 0;
    }
 
    /*
     * The baseline for the badness score is the proportion of RAM that each
     * task's rss, pagetable and swap space use.
     */
    points = get_mm_rss(p->mm) + atomic_long_read(&p->mm->nr_ptes) +
         get_mm_counter(p->mm, MM_SWAPENTS);
    task_unlock(p);
 
    /*
     * Root processes get 3% bonus, just like the __vm_enough_memory()
     * implementation used by LSMs.
     */
    if (has_capability_noaudit(p, CAP_SYS_ADMIN))
        points -= (points * 3) / 100;
 
    /* Normalize to oom_score_adj units */
    adj *= totalpages / 1000;
    points += adj;
 
    /*
     * Never return 0 for an eligible task regardless of the root bonus and
     * oom_score_adj (oom_score_adj can't be OOM_SCORE_ADJ_MIN here).
     */
    return points > 0 ? points : 1;
}

计算进程分值的函数中，首先排除了不可OOM kill的进程以及oom_score_adj值为OOM_SCORE_ADJ_MIN（即-1000）的进程，其中oom_score_adj取值范围是-1000到1000；接着就是计算进程的RSS、页表以及SWAP空间的使用量占RAM的比重，如果该进程是超级进程，则去除3%的权重；最后将oom_score_adj和points归一后，但凡小于0值的都返回1，其他的则返回原值。由此可知，分值越低的则越不会被kill，而且该值可以通过修改oom_score_adj进行调整。

最后分析一下找到了最“bad”的进程后，其享受的“待遇”oom_kill_process()：

【file:/ mm/oom_kill.c】
/*
 * Must be called while holding a reference to p, which will be released upon
 * returning.
 */
void oom_kill_process(struct task_struct *p, gfp_t gfp_mask, int order,
              unsigned int points, unsigned long totalpages,
              struct mem_cgroup *memcg, nodemask_t *nodemask,
              const char *message)
{
    struct task_struct *victim = p;
    struct task_struct *child;
    struct task_struct *t;
    struct mm_struct *mm;
    unsigned int victim_points = 0;
    static DEFINE_RATELIMIT_STATE(oom_rs, DEFAULT_RATELIMIT_INTERVAL,
                          DEFAULT_RATELIMIT_BURST);
 
    /*
     * If the task is already exiting, don't alarm the sysadmin or kill
     * its children or threads, just set TIF_MEMDIE so it can die quickly
     */
    if (p->flags & PF_EXITING) {
        set_tsk_thread_flag(p, TIF_MEMDIE);
        put

首页上一页 2 3 4 5 6 下一页尾页 5/6/6
【大中小】【打印】【繁体】【投稿】【收藏】【推荐】【举报】【评论】【关闭】【返回顶部】

上一篇：Linux 匿名页的反向映射	下一篇：在keil中添加stc系列单片机型号(..