unregister a net device

unregister_netdev is used to delete a net device. In fact, it equals:

1
2
3
4
rtnl_lock();
rollback_registered_many( a temp list with a single net device);
list_add_tail(&dev->todo_list, &net_todo_list);
rtnl_unlock();

a temporary list stores a single net device, which is to be deleted.
net_todo_list stores all the net devices are being deleted.

The core function is rollback_registered_many, which efficiently deletes many devices in a list.
But here, in this case, one a single netdevice in the list.

Read More

draft: how to pick next task

summary

There are four sched_class,
stop_sched_class --> rt_sched_class --> fair_sched_class --> idle_sched_class

They are linked one by one staticly by struct sched_class->next by their defination.

Each sched_class has method pick_next_task, which is used to select
a perfect process to run from each sched_class‘s runqueue.

When we need schedule, the four pick_next_task will be called one by one.

As a optimization, most time there is no rt task in running state,
in this case we can directly call fair_sched_class.

Read More

draft: isolcpus

Isolcpus

kthread has wrong affinity when use isolcpus in bootline

when boot kernel with isolcpus in grub command lines, only init thread has expected affinity, which exclude the isolated cpus.

while the kthreads affinity still includes isolated cpus.

Read More

PROMISC in net device->flag

summary

promisc is one bit of struct net_device’s flag, which is used to indicate if a device is in promisc status.

1
2
3
4
5
6
7
8
9
10
11
12
30 /* Standard interface flags (netdevice->flags). */
31 #define IFF_UP 0x1 /* interface is up */
32 #define IFF_BROADCAST 0x2 /* broadcast address valid */
33 #define IFF_DEBUG 0x4 /* turn on debugging */
34 #define IFF_LOOPBACK 0x8 /* is a loopback net */
35 #define IFF_POINTOPOINT 0x10 /* interface is has p-p link */
36 #define IFF_NOTRAILERS 0x20 /* avoid use of trailers */
37 #define IFF_RUNNING 0x40 /* interface RFC2863 OPER_UP */
38 #define IFF_NOARP 0x80 /* no ARP protocol */
39 #define IFF_PROMISC 0x100 /* receive all packets */
40 #define IFF_ALLMULTI 0x200 /* receive all multicast packets*/
...

There are two kinds of operataion, could cause a NIC enter/leave promisc status.

  1. ip command
    run mutli on command, just need one off to recover.

    1
    2
      	ip link set dev eth0 promisc on
    ip link set dev eth0 promisc off
  2. tcpdump command
    When tcpdump starts, it let dev to promisc,
    and just before exit, tcpdump let dev left promisc.
    All these is done by call kernel api dev_set_promiscuity.

Read More