- ½¹µãÊÖÒÕ
- ÒÔÔ´´ÊÖÒÕϵͳΪ»ù±¾£¬£¬£¬£¬£¬£¬£¬£¬SenseCoreÉÌÌÀAI´ó×°ÖÃΪ½¹µã»ù×ù£¬£¬£¬£¬£¬£¬£¬£¬½á¹¹¶àÁìÓò¡¢¶àÆ«ÏòÇ°ÑØÑо¿£¬£¬£¬£¬£¬£¬£¬£¬
¿ìËÙÂòͨAIÔÚ¸÷¸ö±ÊÖ±³¡¾°ÖеÄÓ¦Ó㬣¬£¬£¬£¬£¬£¬£¬ÏòÐÐÒµ¸³ÄÜ¡£¡£¡£¡£¡£¡£
Ïã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒAAAIÈÎÃüÂÛÎÄÏê½â£ºST-GCNʱ¿Õͼ¾í»ýÍøÂçÄ£×Ó
¿ËÈÕ£¬£¬£¬£¬£¬£¬£¬£¬Ïã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒµÄ×îРAAAI ¾Û»áÂÛÎÄ¡¸Spatial Temporal Graph Convolutional Networks for Skeleton Based Action Recognition¡¹Ìá³öÁËÒ»ÖÖÐ嵀 ST-GCN£¬£¬£¬£¬£¬£¬£¬£¬¼´Ê±¿Õͼ¾í»ýÍøÂçÄ£×Ó£¬£¬£¬£¬£¬£¬£¬£¬ÓÃÓÚ½â¾ö»ùÓÚÈËÌå¹Ç¼ÜÒªº¦µãµÄÈËÀàÐж¯Ê¶±ðÎÊÌâ¡£¡£¡£¡£¡£¡£¸ÃÒªÁì³ýÁË˼Ð÷ÐÂÓ±Ö®Í⣬£¬£¬£¬£¬£¬£¬£¬ÔÚ±ê×¼µÄÐж¯Ê¶±ðÊý¾Ý¼¯ÉÏҲȡµÃÁ˽ϴóµÄÐÔÄÜÌáÉý¡£¡£¡£¡£¡£¡£±¾ÎÄÖÐÎÒÃǽ«ÏêϸÏÈÈݸÃÂÛÎÄÖÐÌá³öµÄÒªÁ죬£¬£¬£¬£¬£¬£¬£¬²¢ÏÈÈÝһЩÍýÏëÖеĽøÒ»²½ÊÂÇéµÈ¡£¡£¡£¡£¡£¡£
×÷ÕߣºÑÕ˼½Ý£¬£¬£¬£¬£¬£¬£¬£¬ÐÜÔª¿¥£¬£¬£¬£¬£¬£¬£¬£¬Áִﻪ

ͼ 1. ST-GCN µÄÄ£×ӽṹʾÒâͼ


ͼ 2. ¶Ô ST-GCN ×îÄ©¾í»ý²ãµÄÏìÓ¦
¾ÙÐпÉÊÓ»¯µÄЧ¹û
(via GitHub)
»ùÓڹǼÜÒªº¦µãµÄÐж¯Ê¶±ð
Ëæ×ÅÈç Microsoft Kinect¡¢OpenPose µÈÈËÌå×Ë̬¼ì²âϵͳµÄ³ÉÊ죬£¬£¬£¬£¬£¬£¬£¬»ùÓڹǼÜÒªº¦µãµÄÈËÀàÐж¯Ê¶±ð³ÉÁËÅÌËã»úÊÓ¾õ£¬£¬£¬£¬£¬£¬£¬£¬ÌØÊâÊÇÈËÀàÐж¯Ê¶±ðÑо¿ÖеÄÒ»¸öÖ÷ҪʹÃü¡£¡£¡£¡£¡£¡£¸ÃʹÃüÒªÇóÊäÈëÔÚÒ»Á¬µÄÊÓÆµÖ¡Öмì²âµ½µÄÈËÌå¹Ç¼ÜÒªº¦µãÐòÁУ¬£¬£¬£¬£¬£¬£¬£¬Êä³öÕýÔÚ±¬·¢µÄÈËÀàÐж¯Öֱ𡣡£¡£¡£¡£¡£×÷ΪÐж¯Ê¶±ðϵͳÖеÄÖ÷Ҫģ̬£¬£¬£¬£¬£¬£¬£¬£¬»ùÓڹǼܵÄÐж¯Ê¶±ðÒѾչÏÖ³öÖ÷ÒªµÄÊÊÓüÛÖµÓëÑо¿¼ÛÖµ¡£¡£¡£¡£¡£¡£±¾ÂÛÎÄÕýÊÇÕë¶ÔÕâ¸öʹÃüÌá³öÁËÒ»ÖÖȫеÄÉî¶Èѧϰģ×Ó£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃdzÆÖ®Îª¡¸Ê±¿Õͼ¾í»ýÍøÂ硹(ST-GCN)¡£¡£¡£¡£¡£¡£
¹¹½¨Ê±¿Õͼ
ST-GCN µÄ»ù´¡ÊÇʱ¿Õͼ½á¹¹¡£¡£¡£¡£¡£¡£´Ó¹Ç¼ÜÒªº¦µãÐòÁй¹½¨Ê±¿Õͼ (spatial-temporal graph) µÄÏ뷨ȪԴÓÚÎÒÃǶÔÏÖÓеĹǼÜÐж¯Ê¶±ðÒªÁìÒÔ¼°Í¼Ïñʶ±ðÒªÁìµÄÊӲ졣¡£¡£¡£¡£¡£ÎÒÃÇ·¢Ã÷£¬£¬£¬£¬£¬£¬£¬£¬ÏÖÓеĻùÓڹǼܵÄÐж¯Ê¶±ðÒªÁìÖÐΪÁËÌá¸ßʶ±ð¾«¶È´ó¶¼ÒýÈëÁËһЩ¿Õ¼ä½á¹¹ÐÅÏ¢£¬£¬£¬£¬£¬£¬£¬£¬°üÀ¨ÏàÁÚÒªº¦µãµÄÅþÁ¬¹ØÏµ»òÉíÌ岿¼þµÈ(ÈçÊÖ-ÊÖÖâ-¼ç°òµÄÅþÁ¬¹ØÏµ)¡£¡£¡£¡£¡£¡£
ΪÁ˽¨Ä£ÕâЩ¿Õ¼äÐÅÏ¢£¬£¬£¬£¬£¬£¬£¬£¬ÏÖÓÐÒªÁì¾³£Ê¹Óà RNN µÈÐòÁÐÄ£×ÓÀ´±éÀúÏàÁ¬µÄÒªº¦µã¡£¡£¡£¡£¡£¡£Õâ¾ÍÒªÇóÄ£×ÓÉè¼ÆÕß½ç˵һÖÖ±éÀúµÄ¹æÔò£¬£¬£¬£¬£¬£¬£¬£¬»òÕßÊÖ¶¯½ç˵һЩÉíÌ岿¼þ¡£¡£¡£¡£¡£¡£ÎÒÃÇÖ¸³ö£¬£¬£¬£¬£¬£¬£¬£¬ÔÚÕâÖÖÉè¼ÆÖУ¬£¬£¬£¬£¬£¬£¬£¬ºÜÄÑ»ñµÃÒ»¸ö×îÓŵıéÀú¹æÔò»òÕß²¿¼þ»®·Ö¡£¡£¡£¡£¡£¡£¿ÉÊÇ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇ·¢Ã÷£¬£¬£¬£¬£¬£¬£¬£¬Òªº¦µãÖ®¼ä×ÔÈ»µÄÅþÁ¬¹ØÏµ£¬£¬£¬£¬£¬£¬£¬£¬×Åʵ×é³ÉÁËÒ»¸ö×ÔÈ»µÄͼ½á¹¹(graph)¡£¡£¡£¡£¡£¡£ÄÇô£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÔõôÄܹ»¸ßЧµØÊ¹ÓÃÕâЩͼ½á¹¹À´¾ÙÐÐÐж¯Ê¶±ðÄØ?
ÔÚ ST-GCN µÄÊÂÇéÖÐÎÒÃÇÌá³ö£¬£¬£¬£¬£¬£¬£¬£¬¿ÉÒÔ´ÓÊäÈëµÄÒªº¦µãÐòÁÐÖн¨ÉèÒ»¸öʱ¿Õͼ(spatial-temporal graph)¡£¡£¡£¡£¡£¡£Õâ¸öͼ½á¹¹Æ¾Ö¤ÈçÏµĹæÔòÀ´¹¹½¨¡£¡£¡£¡£¡£¡£
1. ÔÚÿһ֡ÄÚ²¿£¬£¬£¬£¬£¬£¬£¬£¬Æ¾Ö¤ÈËÌåµÄ×ÔÈ»¹Ç¼ÜÅþÁ¬¹ØÏµ½á¹¹¿Õ¼äͼ;
2. ÔÚÏàÁÚÁ½Ö¡µÄÏàͬҪº¦µãÅþÁ¬ÆðÀ´£¬£¬£¬£¬£¬£¬£¬£¬×é³ÉʱÐò±ß;
3. ËùÓÐÊäÈëÖ¡ÖÐÒªº¦µã×é³É½Úµã¼¯(node set)£¬£¬£¬£¬£¬£¬£¬£¬°ì·¨ 1¡¢2 ÖеÄËùÓбß×é³É±ß¼¯(edge set)£¬£¬£¬£¬£¬£¬£¬£¬¼´×é³ÉËùÐèµÄʱ¿Õͼ¡£¡£¡£¡£¡£¡£
ÔÚÆ¾Ö¤ÉÏÊö¹æÔò»ñµÃµÄʱ¿ÕͼÉÏ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇ×ÔÈ»µØ±£´æÁ˹ǼÜÒªº¦µãµÄ¿Õ¼äÐÅÏ¢£¬£¬£¬£¬£¬£¬£¬£¬²¢Ê¹µÃÒªº¦µãµÄÔ˶¯¹ì¼£(trajectory)ÒÔʱÐò±ßµÄÐÎʽ»ñµÃÌåÏÖ¡£¡£¡£¡£¡£¡£ÕâʹµÃÎÒÃÇ¿ÉÒÔÉè¼ÆÒ»¸öͳһµÄÄ£×ÓÀ´ÍêÕûµØ¶ÔÕâЩÐÅÏ¢¾ÙÐн¨Ä£¡£¡£¡£¡£¡£¡£ÔÚͼ 3 ÖÐÎÒÃÇչʾÁËÒ»ÖÖʱ¿ÕͼµÄ½á¹¹¡£¡£¡£¡£¡£¡£

ͼ 3. NTU-RGBD Êý¾Ý¼¯ÉϽ¨ÉèµÄʱ¿ÕͼʾÀý
ͼ½á¹¹Éϵľí»ýÍøÂç
ΪÁËÔÚʱ¿ÕͼÉ϶ÔÈËÀàÐж¯µÄÐÅÏ¢¾ÙÐÐÆÊÎö£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÌá³öʹÓÃͼ¾í»ýÍøÂç (graph convolutional networks ¨C GCN)¡£¡£¡£¡£¡£¡£Í¼ÉϵÄÉñ¾ÍøÂçÄ£×ÓÊÇ»úеѧϰÑо¿µÄÒ»¸öÈÈÃÅÁìÓò¡£¡£¡£¡£¡£¡£±¾ÎÄÖÐʹÓõÄͼ¾í»ýÍøÂç¼´ÊÇͼÉÏÉñ¾ÍøÂçÖеÄÒ»ÖÖ£¬£¬£¬£¬£¬£¬£¬£¬ÆäÔÚÍøÂçÆÊÎö¡¢ÎÄÌìÖ°ÀàµÈÎÊÌâ¶¼ÓÐÀÖ³ÉÓ¦Óᣡ£¡£¡£¡£¡£
ÔÚÏÈÈÝͼ¾í»ýÍøÂçµÄ¿´·¨Ö®Ç°£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÏÈÍù·µÊ×ͼÏñÉϵľí»ý²Ù×÷¡£¡£¡£¡£¡£¡£ÔÚͼÏñÉÏ£¬£¬£¬£¬£¬£¬£¬£¬¾í»ý²Ù×÷ʹÓÃһЩÀο¿¾ÞϸµÄ¾í»ýºË(filter/kernel)À´É¨ÃèÊäÈëµÄͼÏñ¡£¡£¡£¡£¡£¡£Èçͼ 3 Ëùʾ£¬£¬£¬£¬£¬£¬£¬£¬ÔÚÿ¸öɨÃèµÄÖÐÐÄλÖÃÏñËØÖÜΧ£¬£¬£¬£¬£¬£¬£¬£¬³éȡһ¸öÓëÈ¨ÖØ¾ØÕó¾ÞϸÏàͬµÄÏñËØ¾ØÕ󣬣¬£¬£¬£¬£¬£¬£¬½«ÕâЩÏñËØÉϵÄÌØÕ÷ÏòÁ¿°´¿Õ¼ä˳ÐòÆ´½Ó²¢Óë¾í»ýºËµÄ²ÎÊýÏòÁ¿×öÄÚ»ýÒÔ»ñµÃ¸ÃλÖõľí»ýÊä³öÖµ¡£¡£¡£¡£¡£¡£ÔÚÕâÀ£¬£¬£¬£¬£¬£¬£¬¡¸ÖÜΧÏñËØ¡¹¿ÉÒÔ½ç˵ΪÏñËØÍø¸ñ(grid)ÉϵÄÒ»¸öÁÚÓò(neighborhood)¡£¡£¡£¡£¡£¡£½«Í¼ÏñÉϵľí»ý²Ù×÷ÍÆ¹ãµ½í§Òâ½á¹¹µÄͼ½á¹¹ÉÏʱ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇͬÑù¿ÉÒÔ½ç˵ÈκÎÒ»¸ö½ÚµãµÄÁÚÓò£¬£¬£¬£¬£¬£¬£¬£¬ÓëһϵÁÐÈ¨ÖØ¾ØÕ󡣡£¡£¡£¡£¡£Õâ¾ÍÊÇͼ¾í»ýÍøÂçµÄ»ù±¾Í·ÄÔ¡£¡£¡£¡£¡£¡£
¿ÉÊÇ£¬£¬£¬£¬£¬£¬£¬£¬ÓëͼÏñ²î±ðµÄÊÇ£¬£¬£¬£¬£¬£¬£¬£¬Í¨Ë×µÄͼ½á¹¹ÉÏÈôÊÇʹÓÃÅþÁÚ¾ØÕó(Adjacency matrix)À´½ç˵ÁÚÓòʱ£¬£¬£¬£¬£¬£¬£¬£¬Ã¿¸ö½ÚµãµÄÁÚÓòÖнڵãµÄÊýÄ¿²¢²»ÊÇÀο¿µÄ(˼Á¿²¹ 0 ʱ£¬£¬£¬£¬£¬£¬£¬£¬Í¼ÏñÉÏÏñËØÖÜΧµÄÏñËØÊÇ×ÜÊÇÀο¿µÄ)¡£¡£¡£¡£¡£¡£Õâ¾ÍʹµÃÎÒÃÇÎÒÃǺÜÄÑÈ·¶¨£º 1)ÐèҪʹÓõľí»ýºËµÄ²ÎÊýά¶È;2)ÈôÊÇÅÅÁÐÈ¨ÖØ¾ØÕóÓëÁÚÓòÄڵĽڵãÒÔ¾ÙÐÐÄÚ»ýÔËËã¡£¡£¡£¡£¡£¡£ÔÚÔʼµÄ GCN ÎÄÕÂÖУ¬£¬£¬£¬£¬£¬£¬£¬×÷ÕßÌá³öÁ˽«ÄÚ»ý²Ù×÷±äΪÕâÑùÒ»¸ö²Ù×÷£ºÊ¹ÓÃͳһ¸öÏòÁ¿ÓëËùÓÐÁÚÓòÄڵĵãÉϵÄÌØÕ÷ÏòÁ¿ÅÌËãÄÚ»ý²¢½«Ð§¹ûÇó¾ùÖµ¡£¡£¡£¡£¡£¡£ÕâʹµÃ£º 1)¾í»ýºËµÄ²ÎÊý¿ÉÒÔÈ·¶¨ÎªÒ»¸öÀο¿³¤¶ÈµÄÏòÁ¿;2)²»ÐèҪ˼Á¿ÁÚÓòÄÚ½ÚµãµÄ˳Ðò¡£¡£¡£¡£¡£¡£Õâ¸öÉè¼ÆÊ¹µÃ GCN ¿ÉÒÔÔÚí§ÒâÅþÁ¬¹ØÏµµÄͼÉÏʹÓ㬣¬£¬£¬£¬£¬£¬£¬²¢ÔÚһЩʹÃü£¬£¬£¬£¬£¬£¬£¬£¬ÈçÍøÂçÆÊÎöºÍ°ë¼àÊÓѧϰÖÐÈ¡µÃÁ˽ϺõÄÐÔÄÜ¡£¡£¡£¡£¡£¡£
ÐèÒª×¢ÖØµÄÊÇ£¬£¬£¬£¬£¬£¬£¬£¬Í¼ÉÏÉñ¾ÍøÂç³ýÁËÉÏÃæÌáµ½»ùÓÚͼµÄ¿Õ¼ä½á¹¹µÄ˼Ð÷Ö®Í⣬£¬£¬£¬£¬£¬£¬£¬ÉÐÓÐÒ»ÖÖ»ùÓÚÆ×ÆÊÎö (spectral analysis) µÄ½á¹¹Ë¼Ð÷¡£¡£¡£¡£¡£¡£¹ØÓÚÕâÒ»ÀàÒªÁ죬£¬£¬£¬£¬£¬£¬£¬Çë¼û²Î¿¼ÎÄÏס¾2¡¿¡£¡£¡£¡£¡£¡£ÔÚ ST-GCN ÖУ¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇҲʹÓÃÁË»ùÓÚͼµÄ¿Õ¼ä½á¹¹µÄ˼Ð÷¡£¡£¡£¡£¡£¡£
ʱ¿Õͼ¾í»ýÍøÂçÓëÐж¯Ê¶±ð
Òª½«Í¼¾í»ýÍøÂçÔËÓÃÓÚ»ùÓڹǼÜÒªº¦µãµÄÐж¯Ê¶±ðÖУ¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇ»¹ÐèÒª×ÐϸÆÊÎöÕâ¸öʹÃüµÄÌØµãÓëÄѵ㣬£¬£¬£¬£¬£¬£¬£¬¶ø²»ÊÇÖ±½Ó½«ÒÑÓÐÒªÁìÉú°áÓ²´Õ¡£¡£¡£¡£¡£¡£ÔÚ±¾ÎÄÖУ¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÖ¸³öÁËÔʼ GCN µÄÒ»¸öÖ÷ÒªÐÔ×Ó£º¸ÃÎÄÖÐÌá³öµÄ¾í»ý²Ù×÷£¬£¬£¬£¬£¬£¬£¬£¬ÊµÖÊÉϵȼÛÓÚÏȽ«ÁÚÓòÄÚËùÓнڵãµÄÌØÕ÷ÏòÁ¿Ç󯽾ù£¬£¬£¬£¬£¬£¬£¬£¬ÔÙÓë¾í»ýºËµÄ²ÎÊýÏòÁ¿ÅÌËãÄÚ»ý¡£¡£¡£¡£¡£¡£ÕâÖÖÆ½¾ù²Ù×÷ÔڹǼÜÐж¯Ê¶±ð»áÓöµ½Ò»¸öÖ÷ÒªÎÊÌ⣬£¬£¬£¬£¬£¬£¬£¬¼´£ºËüÎÞ·¨½¨Ä£Òªº¦µãÖ®¼äÏà¶ÔλÖÃת±äµÄÇéÐΣ¬£¬£¬£¬£¬£¬£¬£¬»òËùνµÄ¡¸Î¢·ÖÐÔ×Ó¡¹(differential properties)¡£¡£¡£¡£¡£¡£Òò´Ë»ùÓÚÔʼ GCN µÄÄ£×Ó£¬£¬£¬£¬£¬£¬£¬£¬Ê¶±ðÐÔÄܲ¢²»»áºÜÀíÏë¡£¡£¡£¡£¡£¡£
Õë¶ÔÕâ¸öÎÊÌ⣬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÒÔΪ£¬£¬£¬£¬£¬£¬£¬£¬ÒªÕæÕýÔöÇ¿µÄÄ£×ÓµÄÐÔÄÜ£¬£¬£¬£¬£¬£¬£¬£¬±ØÐèÌø³öÔʼ GCN µÄ¡¸Æ½¾ùÍ·ÄÔ¡¹¡£¡£¡£¡£¡£¡£ÎªÏàʶ¾öÕâ¸öÎÊÌ⣬£¬£¬£¬£¬£¬£¬£¬ÎÒÃǽ«Ã÷È·ÁËͼÏñÉϵľí»ý²Ù×÷Ã÷ȷΪ°ÑÖÐÐÄÏñËØÏàÁÚµÄÏñËØÜöÝÍ(ÁÚÓò¼¯-neighbor set)ƾ֤¿Õ¼ä˳Ðò£¬£¬£¬£¬£¬£¬£¬£¬Èç´Ó×óÖÁÓÒ£¬£¬£¬£¬£¬£¬£¬£¬´ÓÉÏÖÁÏ£¬£¬£¬£¬£¬£¬£¬£¬»®·ÖΪһϵÁÐÜöÝÍ¡£¡£¡£¡£¡£¡£ÔÚͼÏñµÄÇéÐÎÖУ¬£¬£¬£¬£¬£¬£¬£¬Ã¿¸öÜöÝÍÕýºÃ°üÀ¨Ò»¸öÏñËØ¡£¡£¡£¡£¡£¡£ÕâЩÜöÝ;Í×é³ÉÁËÁÚÓò¼¯µÄÒ»¸ö»®·Ö(partition)¡£¡£¡£¡£¡£¡£¾í»ýºËµÄ²ÎÊýÖ»ÓëÕâ¸ö»®·ÖÖеÄ×Ó¼¯¸öÊýÒÔ¼°ÌØÕ÷ÏòÁ¿³¤¶ÈÓйء£¡£¡£¡£¡£¡£ÄÇôÔÚͨË×µÄͼ½á¹¹ÖУ¬£¬£¬£¬£¬£¬£¬£¬Ö»Òª½ç˵ÁËijÖÖ»®·Ö¹æÔò(partitioning strategy)£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃǾÍÒ²¿ÉÒÔ²ÎÕÕͼÏñ¾í»ýÀ´½ç˵¾í»ýºËµÄ²ÎÊý¡£¡£¡£¡£¡£¡£ÀàËÆµÄÍ·ÄÔÒ²Ó¦ÓÃÔÚÁËÈç deformable CNN µÈ½üÆÚÊÂÇéÖС£¡£¡£¡£¡£¡£
ÓÐÁËÕâ¸öÍ·ÄÔ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃǾͿÉÒÔÕë¶Ô¹Ç¼ÜÐж¯Ê¶±ð£¬£¬£¬£¬£¬£¬£¬£¬ÒÔÖÂÈκÎͼ¾í»ýÍøÂçËùÃæÁÙµÄÎÊÌâÀ´½ç˵ÓÐÕë¶ÔÐԵľí»ý²Ù×÷¡£¡£¡£¡£¡£¡£¶ø½ç˵¾í»ý²Ù×÷¾Í¼ò»¯ÎªÁËÉè¼Æ¶ÔÓ¦µÄ»®·Ö¹æÔò¡£¡£¡£¡£¡£¡£¶ÔÒ»¸ö±£´æ K ¸ö×Ó¼¯µÄ»®·Ö¹æÔò£¬£¬£¬£¬£¬£¬£¬£¬¾í»ýºËµÄ²ÎÊý°üÀ¨ K ¸ö²¿·Ö£¬£¬£¬£¬£¬£¬£¬£¬Ã¿¸ö²¿·Ö²ÎÊýÊýÄ¿ÓëÌØÕ÷ÏòÁ¿Ò»Ñù¡£¡£¡£¡£¡£¡£ÈÔÈ»ÒÔͼÏñÉϵľí»ýΪÀý£¬£¬£¬£¬£¬£¬£¬£¬ÔÚÒ»¸ö´°¿Ú¾ÞϸΪ 3x3 µÄ¾í»ý²Ù×÷ÖУ¬£¬£¬£¬£¬£¬£¬£¬Ò»¸öÏñËØµÄÁÚÓò(´°¿Ú)ƾ֤¿Õ¼ä˳Ðò±»»®·ÖΪ 9 ¸ö×Ó¼¯(×óÉÏ£¬£¬£¬£¬£¬£¬£¬£¬ÉÏ£¬£¬£¬£¬£¬£¬£¬£¬ÓÒÉÏ£¬£¬£¬£¬£¬£¬£¬£¬×󣬣¬£¬£¬£¬£¬£¬£¬ÖУ¬£¬£¬£¬£¬£¬£¬£¬ÓÒ£¬£¬£¬£¬£¬£¬£¬£¬×óÏ£¬£¬£¬£¬£¬£¬£¬£¬Ï£¬£¬£¬£¬£¬£¬£¬£¬ÓÒÏÂ)£¬£¬£¬£¬£¬£¬£¬£¬Ã¿¸ö×Ó¼¯°üÀ¨Ò»¸öÏñËØ¡£¡£¡£¡£¡£¡£¾í»ýºËµÄ²ÎÊý°üÀ¨ 9 ¸ö²¿·Ö£¬£¬£¬£¬£¬£¬£¬£¬Ã¿¸ö²¿·ÖÓëÌØÕ÷ͼ(feature map)µÄÌØÕ÷ÏòÁ¿³¤¶È(number of channel)Ò»Ö¡£¡£¡£¡£¡£¡£Ò²¾ÍÊÇ˵£¬£¬£¬£¬£¬£¬£¬£¬Í¼Ïñ¾í»ý¿ÉÒÔÚ¹ÊÍΪͨË×ͼÉϾí»ýÔÚ¹æÔòÍø¸ñͼ(regular grid)ÉϵÄÒ»ÖÖÓ¦Óᣡ£¡£¡£¡£¡£
ΪÁËÔÚʱ¿ÕͼÉϾÙÐйǼÜÐж¯Ê¶±ð£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÌá³öÁËÈýÖÖ¿Õ¼äµÄ»®·Ö¹æÔò¡£¡£¡£¡£¡£¡£
µÚÒ»ÖÖ³ÆÎª¡¸Î¨Ò»»®·Ö¡¹(uni-labeling)¡£¡£¡£¡£¡£¡£ÆäÓëÔʼ GCN Ïàͬ£¬£¬£¬£¬£¬£¬£¬£¬½«½ÚµãµÄ 1 ÁÚÓò»®·ÖΪһ¸ö×Ó¼¯¡£¡£¡£¡£¡£¡£
µÚ¶þÖÖ³ÆÎª¡¸»ùÓÚ¾àÀëµÄ»®·Ö¡¹(distance partitioning)£¬£¬£¬£¬£¬£¬£¬£¬Ëü½«½ÚµãµÄ 1 ÁÚÓò·ÖΪÁ½¸ö×Ó¼¯£¬£¬£¬£¬£¬£¬£¬£¬¼´½Úµã×Ô¼º×Ó¼¯ÓëÁÚ½Úµã×Ó¼¯¡£¡£¡£¡£¡£¡£ÒýÈë»ùÓÚ¾àÀëµÄ»®·ÖʹµÃÎÒÃÇ¿ÉÒÔÆÊÎö¹Ç¼ÜÒªº¦µãÖ®¼äµÄ΢·ÖÐÔ×Ó¡£¡£¡£¡£¡£¡£
½øÒ»²½£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÕë¶ÔÐж¯Ê¶±ðµÄÌØµã£¬£¬£¬£¬£¬£¬£¬£¬Ìá³öÁ˵ÚÈýÖÖ£¬£¬£¬£¬£¬£¬£¬£¬¡¸¿Õ¼ä¹¹ÐÍ»®·Ö¡¹(spatial configuration partitioning)¡£¡£¡£¡£¡£¡£ÕâÖÖ»®·Ö¹æÔò½«½ÚµãµÄ 1 ÁÚÓò»®·ÖΪ 3 ¸ö×Ó¼¯£¬£¬£¬£¬£¬£¬£¬£¬µÚÒ»¸ö×Ó¼¯Îª½Úµã×Ô¼º£¬£¬£¬£¬£¬£¬£¬£¬µÚ¶þ¸öΪ¿Õ¼äλÖÃÉϱȱ¾½Úµã¸ü¿¿½üÕû¸ö¹Ç¼ÜÖØÐĵÄÁÚ½ÚµãÜöÝÍ£¬£¬£¬£¬£¬£¬£¬£¬µÚÈý¸öÔòΪ¸üÔ¶ÀëÖØÐĵÄÁÚ½ÚµãÜöÝÍ¡£¡£¡£¡£¡£¡£½¨ÉèÕâÖÖ»®·Ö¹æÔòÔÚÆ¾Ö¤Ô˶¯ÆÊÎöµÄÑо¿ÖжÔÏòÐÄÔ˶¯ÓëÀëÐÄÔ˶¯µÄ½ç˵¡£¡£¡£¡£¡£¡£ÈýÖÖ»®·Ö¹æÔòµÄʾÒâͼÇë¼ûͼ 4¡£¡£¡£¡£¡£¡£

ͼ 4.ÈýÖÖ¿Õ¼äµÄ»®·Ö¹æÔòʾÒâͼ
³ýÁËͳһ֡ÄÚ²¿µÄ¿Õ¼ä»®·Ö¹æÔò£¬£¬£¬£¬£¬£¬£¬£¬ÔÚʱ¼äÉÏ£¬£¬£¬£¬£¬£¬£¬£¬ÓÉÓÚʱÐò±ß×é³ÉÁËÒ»¸öÍø¸ñ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇ¿ÉÒÔÖ±½ÓʹÓÃÀàËÆÓÚʱÐò¾í»ý(temporal convolution)µÄ»®·Ö¹æÔò¡£¡£¡£¡£¡£¡£×îÖÕ£¬£¬£¬£¬£¬£¬£¬£¬Ê±¿ÕͼÉÏʹÓõĻ®·Ö¹æÔò»ñµÃµÄ×Ó¼¯ÜöÝÍ»áÊǿռ仮·ÖÓëʱÐò»®·ÖµÄµÑ¿¨¶û»ý¡£¡£¡£¡£¡£¡£
½ç˵ºÃÁËʱ¿ÕͼÉϵľí»ý²Ù×÷£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃǾͿÉÒÔÉè¼Æ¾í»ýÍøÂçÁË¡£¡£¡£¡£¡£¡£ÎªÁËչʾ ST-GCN µÄÐÔÄÜ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÖ±½Ó´ÓÒ»¸öÒÑÓеÄʱÐò¾í»ýÍøÂç½á¹¹µÄ»ù´¡ÉÏÉè¼ÆÁËÎÄÖÐÓõ½µÄ ST-GCN µÄÍøÂç½á¹¹¡£¡£¡£¡£¡£¡£ÎÒÃǽ«ËùÓÐʱÐò¾í»ý²Ù×÷תΪʱ¿ÕͼµÄ¾í»ý²Ù×÷£¬£¬£¬£¬£¬£¬£¬£¬Ã¿Ò»¸ö¾í»ý²ãµÄÊä³öÊÇÒ»¸öʱ¿Õͼ£¬£¬£¬£¬£¬£¬£¬£¬Í¼ÉÏÿһ¸ö½Úµã±£ÓÐÒ»¸öÌØÕ÷ÏòÁ¿¡£¡£¡£¡£¡£¡£×îÖÕ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃǺϲ¢ËùÓнڵãÉϵÄÌØÕ÷²¢Ê¹ÓÃÏßÐÔ·ÖÀà²ã¾ÙÐÐÐж¯·ÖÀà¡£¡£¡£¡£¡£¡£ÑµÁ·Ê¹Óñê×¼µÄ SoftMax ½»Ö¯ìØËðʧº¯Êý¾ÙÐмàÊÓ¡£¡£¡£¡£¡£¡£²ÎÊýѧϰʹÓñê×¼Ëæ»úÌݶÈϽµËã·¨(SGD)¡£¡£¡£¡£¡£¡£
ʵÑéЧ¹û
ÎÒÃÇÔÚÁ½¸öÐÔ×ÓåÄÒìµÄ¹Ç¼ÜÐж¯Ê¶±ðÊý¾Ý¼¯ÉϾÙÐÐÁËʵÑéÀ´ÑéÖ¤ ST-GCN µÄÐÔÄÜ¡£¡£¡£¡£¡£¡£
µÚÒ»¸öÊý¾Ý¼¯ÊÇ Kinetics-Skeleton£¬£¬£¬£¬£¬£¬£¬£¬ËüÀ´×ÔÓÚ×î½üÓÉ Google DeepMind Т˳µÄ Kinetics ÊÓÆµÈËÀàÐж¯Ê¶±ðÊý¾Ý¼¯¡£¡£¡£¡£¡£¡£ÎÒÃÇʹÓà OpenPose ×Ë̬Ԥ¼ÆÈí¼þ»ñµÃÊÓÆµÖÐËùÓеĹǼÜÒªº¦µãÐÅÏ¢À´×é³É Kinetics-Skeleton¡£¡£¡£¡£¡£¡£¸ÃÊý¾Ý¼¯¹²ÓÐÔ¼ 30 Íò¸öÊÓÆµÓë 400 ÀàÐж¯¡£¡£¡£¡£¡£¡£
µÚ¶þ¸öÊý¾Ý¼¯ÊÇ NTU-RGB+D£¬£¬£¬£¬£¬£¬£¬£¬ÕâÊÇÈýά¹Ç¼ÜÐж¯Ê¶±ðµÄ±ê×¼²âÊÔÊý¾Ý¼¯¡£¡£¡£¡£¡£¡£Ëü°üÀ¨ÁËÓà Microsoft Kinect ÊÕÂÞµÄÈýά¹Ç¼ÜÐòÁС£¡£¡£¡£¡£¡£¸ÃÊý¾Ý¼¯¹²ÓÐÔ¼ 6 Íò¸öÊÓÆµ£¬£¬£¬£¬£¬£¬£¬£¬60 ¸öÐж¯Öֱ𡣡£¡£¡£¡£¡£Õâ¸öÊý¾Ý¼¯°üÀ¨ÁËÁ½¸ö²âÊÔÐÒ飬£¬£¬£¬£¬£¬£¬£¬¼´¿çÑݳöÈË(X-Sub)Óë¿çÊÓ½Ç(X-View)ÐÒé¡£¡£¡£¡£¡£¡£ÔÚÁ½¸öÊý¾Ý¼¯µÄÈý¸ö²âÊÔÐÒéÉÏ£¬£¬£¬£¬£¬£¬£¬£¬ST-GCN Ïà±ÈÏÖÓÐÒªÁìÔÚʶ±ð¾«¶ÈÉϾùÓкܴóÌá¸ß£¬£¬£¬£¬£¬£¬£¬£¬ÏêϸЧ¹û¿É¼û±íͼ 1¡£¡£¡£¡£¡£¡£

±íͼ 1. ¹Ç¼ÜÐж¯Ê¶±ðЧ¹û
³ýÁË»ñµÃ¸üºÃµÄÐÔÄÜ£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÒ²ÏêϸÆÊÎöÁËÈýÖÖ»®·Ö¹æÔò¶Ôʶ±ð¾«¶ÈµÄÓ°Ïì¡£¡£¡£¡£¡£¡£Èç±í 2 Ëùʾ£¬£¬£¬£¬£¬£¬£¬£¬ÕýÈçÎÒÃÇËùÆÚÍûµÄ£¬£¬£¬£¬£¬£¬£¬£¬¾àÀë»®·ÖÓë¿Õ¼ä¹¹ÐÍ»®·ÖÏà¹ØÓÚÔʼ GCN ʹÓõÄΨһ»®·ÖÔÚ¾«¶ÈÉϾùÓнϴóÌá¸ß¡£¡£¡£¡£¡£¡£Õâ֤ʵÎúÒýÈëÐµĻ®·Ö¹æÔòµÄÖ÷ÒªÐÔ¡£¡£¡£¡£¡£¡£ÌرðµÄ£¬£¬£¬£¬£¬£¬£¬£¬Õë¶ÔÐж¯Ê¶±ðʹÃüÉè¼ÆµÄ¿Õ¼ä¹¹ÐÍ»®·ÖÈ¡µÃÁË×î¸ßµÄÐÔÄÜ£¬£¬£¬£¬£¬£¬£¬£¬²¢±»×îºóÓ¦ÓÃÓÚ ST-GCN µÄÏà¹ØÊµÑéÖС£¡£¡£¡£¡£¡£
ÎÒÃÇ»¹½« ST-GCN µÄ×îºóÒ»²ãÉñ¾ÔªÏìÓ¦¾ÙÐÐÁË¿ÉÊÓ»¯(±í 2)¡£¡£¡£¡£¡£¡£ÔÚЧ¹ûÖÐÎÒÃÇ¿ÉÒÔÏÔ×Å¿´µ½ ST-GCN Äܹ»×·×Ù²¢ÉîÈëÆÊÎöÔÚij¸öʱ¼ä¶ÎÓëÐж¯×îÏà¹ØµÄÉíÌ岿·ÖµÄÔ˶¯£¬£¬£¬£¬£¬£¬£¬£¬ÕâÚ¹ÊÍÁËΪºÎ ST-GCN Ïà¹ØÓÚÆäËû²»¹Ø×¢¿Õ¼ä½á¹¹µÄÏÖÓÐÒªÁìÄÜ»ñµÃºÜ´óµÄÐÔÄÜÌá¸ß¡£¡£¡£¡£¡£¡£

±íͼ 2.²î±ð»®·Ö¹æÔòµÄÓ°Ïì
˼Ë÷Óë×îºóµÄ»°
»ØÊ× ST-GCN µÄÌá³ö£¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇ×ܽáÁËÁ½¸öÖ÷ÒªµÄÍ·ÄÔ¿çÔ½¡£¡£¡£¡£¡£¡£
µÚÒ»¸öÊÇ´Ó½«¹Ç¼ÜÐòÁÐÃ÷ȷΪһ֡֡µÄ¹Ç¼ÜÑݽøÎª½«Õû¸öÊÓÆµÃ÷ȷΪһ¸öÕûÌåµÄʱ¿Õͼ£¬£¬£¬£¬£¬£¬£¬£¬ÕâʹµÃÓÃÒ»¸öͳһµÄÄ£×ÓÀ´ÆÊÎöÐж¯³ÉΪ¿ÉÄÜ¡£¡£¡£¡£¡£¡£
µÚ¶þ¸öÊÇ´ÓÔʼ GCN µÄÖÊÆÓÍ·ÄÔÑݽøÎªÊ¹ÓûùÓÚ»®·Ö¹æÔòµÄ¾í»ý½ç˵¡£¡£¡£¡£¡£¡£Õâ¸öÍ·ÄÔʹµÃÎÒÃÇ¿ÉÒÔÓâÔ½Ôʼ GCN ²¢»ñµÃÖØ´óµÄÐÔÄÜÌáÉý£¬£¬£¬£¬£¬£¬£¬£¬¸ÃÍ·ÄÔÒ²ÔÚ MoNet¡¾3¡¿µÄÊÂÇéÖб»Ìá¼°¹ý¡£¡£¡£¡£¡£¡£ÎÒÃǽ«ÆäÔÔò»¯Îª ÜöÝ͵Ļ®·Ö²Ù×÷¡£¡£¡£¡£¡£¡£ÕâҲʹµÃÕâ¸öÍ·ÄÔ¿ÉÒÔÓ¦ÓÃÆäËûµÄÆÊÎöʹÃüÖС£¡£¡£¡£¡£¡£
ÔÚδÀ´µÄÊÂÇéÖУ¬£¬£¬£¬£¬£¬£¬£¬ÎÒÃÇÍýÏëÔËÓà ST-GCN µÄÎÞаÐÔÀ´´¦Öóͷ£¸ü¶àµÄͼÆÊÎöÎÊÌâ¡£¡£¡£¡£¡£¡£Í¬Ê±£¬£¬£¬£¬£¬£¬£¬£¬Õë¶ÔÐж¯Ê¶±ðʹÃü£¬£¬£¬£¬£¬£¬£¬£¬Ò»¸ö×ÔÈ»µÄÑݽø¾ÍÊÇÔڹǼÜÒªº¦µã×ø±êµÄ»ù´¡ÉÏÒýÈëÊÓ¾õÌØÕ÷£¬£¬£¬£¬£¬£¬£¬£¬ÈçͼÏñÌØÕ÷£¬£¬£¬£¬£¬£¬£¬£¬ÒÔÖ³¡¾°Í¼(scene-graph)µÈ£¬£¬£¬£¬£¬£¬£¬£¬²¢½«ËüÃÇͳһÔÚ ST-GCN µÄÆÊÎö¿ò¼ÜÏ¡£¡£¡£¡£¡£¡£¹þ¹þ(haha)ÌåÓý×îÖÕÄ¿µÄÔòÊÇÒ»¸öÐÔÄܸü¸ß£¬£¬£¬£¬£¬£¬£¬£¬¸ü¾ßÓпÉÚ¹ÊÍÐÔµÄͳһµÄÊÓÆµÐж¯Ê¶±ðÄ£×Ó¡£¡£¡£¡£¡£¡£
Ïà¹ØÎÄÏ×£º
1.¡¸Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition¡¹, Sijie Yan, Yuanjun Xiong and Dahua Lin, AAAI 2018.
2.¡¸Convolutional neural networks on graphs with fast localized spectral filtering.¡¹£¬£¬£¬£¬£¬£¬£¬£¬Defferrard, et. al., NIPS 2016.
3.¡¸Geometric deep learning on graphs and manifolds using mixture model CNNs.¡¹, Monti, Federico, et al. CVPR 2017.
Github ´úÂ룺
https://github.com/yysijie/st-gcn






·µ»Ø