- ½¹µãÊÖÒÕ
- ÒÔÔ´´ÊÖÒÕϵͳΪ»ù±¾£¬£¬£¬£¬£¬£¬£¬£¬SenseCoreÉÌÌÀAI´ó×°ÖÃΪ½¹µã»ù×ù£¬£¬£¬£¬£¬£¬£¬£¬½á¹¹¶àÁìÓò¡¢¶àÆ«ÏòÇ°ÑØÑо¿£¬£¬£¬£¬£¬£¬£¬£¬
¿ìËÙÂòͨAIÔÚ¸÷¸ö±ÊÖ±³¡¾°ÖеÄÓ¦Ó㬣¬£¬£¬£¬£¬£¬£¬ÏòÐÐÒµ¸³ÄÜ¡£¡£¡£¡£¡£
CVPR 2017 | ÉÌÌÀ¿Æ¼¼¼°Ïã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒ¾«Ñ¡ÂÛÎĽâ¶Á£¨¶þ£©
ÔÚ½ñÄêµÄCVPRѧÊõ¾Û»áÉÏÉÌÌÀ¿Æ¼¼ÓëÏã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒÅäºÏ½ÒÏþµÄÂÛÎĶà´ï23ƪ£¬£¬£¬£¬£¬£¬£¬£¬Ïã¸ÛÖдóÁִﻪ½ÌÊÚÇ××Ô¾«Ñ¡ÎåÆªÂÛÎÄÓë¸÷ÈË·ÖÏí¡£¡£¡£¡£¡£ÕâÎåÆªÂÛÎĵÄÑо¿Ð§¹ûÔÚ¸÷¸öרҵÊý¾Ý¼¯¡¢¾ºÈüÈçImageNet¡¢Visual Genome¡¢DAVIS Challenge¡¢MPIIÖУ¬£¬£¬£¬£¬£¬£¬£¬È¡µÃµÚÒ»ÃûµÄЧ¹û¡£¡£¡£¡£¡£
¹ú¼ÊÅÌËã»úÊÓ¾õÓëģʽʶ±ð´ó»á (CVPR) ÊÇÈ˹¤ÖÇÄÜÓÈÆäÊÇÅÌËã»úÊÓ¾õÁìÓòµÄ¶¥¼¶Ñ§Êõ¾Û»á¡£¡£¡£¡£¡£½ñÄêCVPRµÄÎÄÕ¼ȡÂÊԼĪ29%¡£¡£¡£¡£¡£ÔÚCVPRÉϽÒÏþµÄÂÛÎı»¹«ÒÔΪ´ú±íÁ˸ÃÁìÓò¿Æ¼¼Éú³¤µÄ×î¸ßˮƽ¡£¡£¡£¡£¡£´Ó½ñÄê7ÔÂ21ÈÕ×îÏÈÕâ¸öÒ»ÄêÒ»¶ÈµÄÊ¢»á¾ÍÔÚÆ¯ÁÁµÄÏÄÍþÒÄ»ðū³³ÕÙ¿ª¡£¡£¡£¡£¡£±¾½ìCVPR´ó̸ÅÐÌÀ¿Æ¼¼ÓëÏã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒ¹²½ÒÏþÁË23ƪÂÛÎÄ£¬£¬£¬£¬£¬£¬£¬£¬Áè¼ÝÁËÒÔÈ˹¤ÖÇÄÜÊÖÒÕÖø³ÆµÄ¿Æ¼¼¾ÞÍ·Google (21ƪ)ºÍFacebook AI Research (7ƪ)¡£¡£¡£¡£¡£Õâ23ƪÂÛÎĺ¸ÇÁËÅÌËã»úÊÓ¾õµÄ¶à¸öÁìÓò£¬£¬£¬£¬£¬£¬£¬£¬²»µ«Ìá³öÁËÐí¶àÐÂÐ͵ÄÓ¦Ó㬣¬£¬£¬£¬£¬£¬£¬Ò²ÔÚ½¹µãÊÖÒÕµÄÑз¢ÉÏÈ¡µÃÁ˶àÏî¹ú¼ÊÁìÏȵÄЧ¹û¡£¡£¡£¡£¡£ÔÚ±¾ÎÄÖÐÎÒÃǽ«ÒÔÆäÖÐ5ƪÂÛÎÄΪ´ú±íÒ»¿úÉÌÌÀÕâ¸öAI¶À½ÇÊÞ±³ºóµÄ½¹µãÊÖÒÕÁ¢Òì֮·¡£¡£¡£¡£¡£
PolyNet£ºÉî¶ÈÍøÂçÉè¼ÆµÄÀï³Ì±® ¶àÏî½ÇÖð¶á¹Ú

Éî¶ÈѧϰÔÚÕ⼸ÄêÈ¡µÃÁËÁîÖÚÈËÖõÄ¿µÄ³É¼¨¡£¡£¡£¡£¡£ÕâЩ³É¼¨±³ºóµÄ×îÖ÷ÒªµÄÊÖÒÕÒòËØ¾ÍÊÇÉî¶ÈÍøÂçÉè¼ÆÊÖÒÕµÄÒ»Ö±Í»ÆÆ¡£¡£¡£¡£¡£Éî¶ÈÍøÂçµÄǰ½øÊ×ÏÈ·´Ó¦ÔÚImageNet1000ÀàͼÏñ·ÖÀàµÄ׼ȷÂÊÉÏËüÊÇÒµ½ç¹«ÈϵÄȨºâÉî¶ÈÍøÂçÐÔÄܵÄÒ»¸öÖ÷ÒªÖ¸±ê¡£¡£¡£¡£¡£´ÓAlexNet¡¢GoogLeNetµ½ResNetÍøÂçÔ½À´Ô½Éî׼ȷÂÊÒ²ÔÚÒ»Ö±ÌáÉý¡£¡£¡£¡£¡£ÔÚÒÑÍùÎåÄêImageNet·ÖÀà׼ȷÂÊ´Ó84%Ìá¸ßµ½ÁË97%¡£¡£¡£¡£¡£¿£¿£¿£¿£¿£¿ÉÊÇËæ×ÅÍøÂçÉî¶ÈÒ»Á¬ÔöÌíÐÔÄܵĽøÒ»²½ÌáÉý×îÏÈÓöµ½ÁËÆ¿¾±¡£¡£¡£¡£¡£ºÃ±ÈResNet´ÓÒ»°Ù¶à²ãÔöÌíµ½Ò»Ç§²ãÐÔÄÜҲûÓÐÏÔÖøµÄÌá¸ß¡£¡£¡£¡£¡£
ÃæÁÙÕâÑùµÄÄÑÌâ´ËÂÛÎÄÌá³öÁËÒ»ÖÖÐÂµÄÆÆ¾Ö˼Ð÷ÔöÇ¿ÍøÂçµÄ½á¹¹¶àÑùÐÔ(Structural diversity)¡£¡£¡£¡£¡£Õâ¸ö˼Ð÷Ô´×ÔÓÚÎÒÃÇÔÚʵ¼ùÖеÄÒ»¸öÖ÷ÒªÊÓ²ì¶à¸ö²î±ð½á¹¹µÄ³¬ÉîÍøÂçµÄ¼òÆÓÍŽáÍùÍùÄÜ´øÀ´ÐÔÄܵĽøÒ»²½ÏÔÖøÌáÉý¡£¡£¡£¡£¡£ÑØ×ÅÕâ¸ö˼Ð÷ÎÒÃÇÉè¼ÆÁËPolyNet¡£¡£¡£¡£¡£ËüÓɶà¸ö²î±ðÐÎ̬µÄ³ÆÎªPolyInceptionµÄ»ù±¾Ôª×éµþºÏÔÚÒ»Æð×é³É¡£¡£¡£¡£¡£ÓÉÓÚÿÖÖÔª×é¶¼¿ÉÒÔͨ¹ýÒ»¸öÊýѧ¶àÏîʽ±í´ï³öËüµÄ½á¹¹ÒÔÊdzÆÎªPolynomial Inception¼ò³ÆPolyInception¡£¡£¡£¡£¡£ÈçÉÏͼËùʾÿ¸öPolyInceptionÊÇÄÚ²¿ºÏ³ÉÁ˶à¸ö²î±ðÉî¶ÈµÄ²¢ÐÐͨ·¡£¡£¡£¡£¡£Õâ¸öÉè¼Æ°Ñ½á¹¹¶àÑùÐÔµÄÍ·ÄÔÍùÇ°ÍÆ½øÁËÖ÷ÒªÒ»²½°ÑËüÈÚÈ뵽ÿ¸öµ¥Î»µÄÉè¼ÆÖ®ÖС£¡£¡£¡£¡£
´ËÂÛÎĵÄ×÷Õß͸¹ýϵͳÐԵıÈÕÕʵÑé¶ÔPolyNetµÄÉè¼Æ¾ÙÐвâÊÔ·¢Ã÷ËüÏà±ÈÓÚÖ÷Á÷µÄÉè¼Æ°üÀ¨ResNetÒÔ¼°ÓÉGoogleÌá³öµÄInceptionResNet v2Óиü¸ßµÄЧÂʼ´ÔÚÏàͬµÄÅÌËãÁ¿ÏÂÄÜ»ñµÃÏÔןü¸ßµÄÐÔÄÜ¡£¡£¡£¡£¡£ÎÄÖÐÐÎòµÄVery Deep PolyNetÔÚImageNet´ó¹æÄ£Í¼Ïñ·ÖÀà²âÊÔ¼¯ÉÏ»ñµÃÁËsingle-crop¹ýʧÂÊ4.25%ºÍmulti-crop¹ýʧÂÊ3.45%µÄÐÔÄÜÏÔÖøÁè¼ÝÁËGoogleÌá³öµÄInceptionResNet v2ºÍFacebookÔÚͳһʱ¼ä½ÒÏþµÄResNeXt³ÉΪÒѽÒÏþµÄÐÔÄÜ×î¸ßµÄÍøÂçÒ²ÊÇÉî¶ÈÍøÂçÉè¼ÆÁìÓòµÄбê¸Ë¡£¡£¡£¡£¡£
ÔÚImageNet2016µÄ½ÇÖðÖÐÉÌÌÀ¿Æ¼¼ÓëÏã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒÔÚ¶àÏî½ÇÖðÖÐÑ¡ÓÃÁËÕâÖÖÍøÂç½á¹¹²¢È¡µÃÁËÈý¸öµ¥ÏîµÚÒ»µÄÓÅÒìЧ¹û¡£¡£¡£¡£¡£ÕâһЧ¹û·´Ó¦ÁËÒ»¸ö¸ßЧµÄÉî¶ÈÍøÂç¼Ü¹¹¹ØÓÚÌáÉýÓ¦ÓÃÐÔÄܵÄÖ÷Òª×÷Óᣡ£¡£¡£¡£PolyNetµÄÄ£×ӽṹÒÑÔÚGithub¹ûÕæ
https://github.com/CUHK-MMLAB/polynet
ÂÛÎÄÎÊÌ⣺PolyNet:A Pursuit of Structural Diversity in Very Deep Networks
ÂÛÎÄ×÷ÕߣºXingcheng Zhang, Zhizhong Li, ChenChange Loy, Dahua Lin
Relational Net£ºVisual GenomeÉϰѹØÏµÊ¶±ð׼ȷÂÊ´Ó48%´ó·ù¶ÈÌá¸ßµ½82%

ÎïÌå¼ì²âÓëʶ±ðÒ»Ö±ÊÇÅÌËã»úÊÓ¾õÑо¿µÄÖ÷ҪƫÏò¡£¡£¡£¡£¡£½üÄêÀ´Ëæ×ÅFaster RCNNµÈ»ùÓÚÉî¶ÈѧϰµÄ¼ì²â¿ò¼ÜµÄÉú³¤ÎïÌå¼ì²âµÄÐÔÄÜÓÐÁ˳¤×ãµÄǰ½øµ«Õâ¹ØÓÚͼÏñÃ÷È·ÈÔÈ»ÊÇȱ·¦¹»µÄ¡£¡£¡£¡£¡£·ºÆðÁËͬÀàÐÍÎïÌåµÄͼƬËüÃDZ³ºóת´ïµÄ¼ÄÒå¿ÉÄÜÊǽØÈ»²î±ðµÄ¡£¡£¡£¡£¡£ÒªÃ÷È·ÕâÑùµÄͼƬÎÒÃDz»µ«ÐèÒªÅжÏͼÖзºÆðµÄÎïÌåµÄÀàÐͲ¢ÇÒÐèÒªÃ÷È·ÎïÌåÖ®¼äµÄ¹ØÏµ¡£¡£¡£¡£¡£
×î½üÒ»¶Îʱ¼ä¹ØÏµÊ¶±ðÖð²½½øÈëÅÌËã»úÊÓ¾õÑо¿µÄÊÓÒ°¡£¡£¡£¡£¡£ÔÚÕâ¸öÎÊÌâÉϵÄÒ»¸öÓдú±íÐÔµÄÊÂÇéÊÇ˹̹¸£´óѧÀî·É·ÉÍŶÓÔÚ2016Äê10Ô½ÒÏþÔÚECCVÉϵÄÂÛÎÄ¡£¡£¡£¡£¡£¸ÃÊÂÇé°ÑÓïÑÔÏÈÑéÓÃÓÚ¹ØÏµÊ¶±ðÏà±ÈÓÚ֮ǰµÄÒªÁìÈ¡µÃÁ˲»´íµÄÏ£Íû¹ØÏµÊ¶±ðµÄ׼ȷÂÊÌá¸ßµ½ÁË48%¡£¡£¡£¡£¡£È»¶øÕâÑùµÄ׼ȷÂÊÏÔÈ»ÀëÖª×ãÊÊÓÃÐèÇóÈÔÓнϴóµÄ¾àÀëͬʱËüÒ²·´Ó¦µÄÎÊÌâµÄÄѶȡ£¡£¡£¡£¡£Õâ¸öÎÊÌâµÄÖ÷ÒªÌôÕ½ÔÚÓÚ¹ØÏµÓëÎïÌåÓÐ×ÅÏÔÖø²î±ðµÄÐÔ×Ó¡£¡£¡£¡£¡£Ä³ÖÖÀàÐ͵ÄÎïÌåËüµÄÍâ¹ÛÌåÏÖÍùÍù½ÏÁ¿Ò»Ö¶øÍ³Ò»ÖÖ¹ØÏµµÄÍâ¹ÛÌåÏÖÍùÍùǧ²îÍò±ð¡£¡£¡£¡£¡£
´ËÂÛÎÄÌá³öÁËÒ»ÖÖеĽ¨Ä£ÒªÁì°Ñͳ¼ÆÑ§Ï°ºÍÉî¶ÈÍøÂçÓлúµØÍŽáÆðÀ´¡£¡£¡£¡£¡£´ËÒªÁì°ÑÒ»¸ö»ùÓÚÌõ¼þËæ»ú³¡(Conditional Random Field)µÄµü´úÍÆ¶ÏÒªÁì½â¿ª´Ó¶ø°ÑËü±í´ïΪһ¸öǰÏòÅÌËãÍøÂç¡£¡£¡£¡£¡£Õâ¸öǰÏòÍøÂçºÍCRFÒ»Ñù¾ßÓÐͳ¼ÆÍƶϵÄÄÜÁ¦¿ÉÊÇËü¿ÉÒÔ±»Ç¶Èëµ½Ò»¸ö¸ü´óµÄÉî¶ÈѧϰϵͳÖÐºÍÆäËüÄ£¿£¿£¿£¿£¿£¿éÒ»Æð¾ÙÐж˶Զ˵Äѧϰ¡£¡£¡£¡£¡£»£»£»£»£»ùÓÚÕâÖÖÒªÁì´ËÂÛÎÄÌá³öÁËÒ»¸ö³ÆÎªDeep Relational NetµÄ¹ØÏµÃ÷È·¿ò¼Ü°ÑÌØÕ÷ѧϰÒÔ¼°¿çÁìÓò¹ØÏµÑ§Ï°ÍŽá³ÉÒ»¸öÍ³Ò»ÍøÂç¡£¡£¡£¡£¡£Õâ¸ö¿ò¼ÜÔÚ¹ØÏµÃ÷È·ÁìÓòµÄÖ÷ÒªÊý¾Ý¼¯Visual GenomeÉϰѹØÏµÊ¶±ð׼ȷÂÊ´Ó48%´ó·ù¶ÈÌá¸ßµ½82%´Ó¶øÎªÕâ¸öÎÊÌâÉèÁ¢ÁËеÄÐÔÄܱê¸Ë¡£¡£¡£¡£¡£ÕâÆªÂÛÎÄÔÚCVPR 2017±»ÎüÊÕΪOral presentation¡£¡£¡£¡£¡£¸ÃÂÛÎÄÏà¹Ø´úÂëÓëÄ£×ӽṹÒѾÔÚGithub¹ûÕæ
https://github.com/doubledaibo/drnet
ÂÛÎÄÎÊÌ⣺Detecting Visual Relationships with Deep Relational Networks
ÂÛÎÄ×÷ÕߣºBo Dai, Yuqi Zhang, Dahua Lin
Deep Layer Cascade£ºÕ¶»ñµÚÒ»½ìDAVIS Challenge ÊÓÆµÓïÒåÖ§½â½ÇÖð¹Ú¾ü

ͼÏñÓïÒåÖ§½âÊÇͼÏñÃ÷È·µÄ»ùʯÐÔÊÖÒÕÔÚ×Ô¶¯¼ÝʻϵͳµÈÇ£Éæ½Ö¾°»ò³¡¾°Ê¶±ðÓ¦ÓÃÖÐÊÎÑݾÙ×ãÇáÖØµÄ½ÇÉ«¡£¡£¡£¡£¡£»£»£»£»£»ùÓÚÉî¶ÈѧϰµÄÓïÒåÖ§½âÊÖÒÕËäÈ»¿ÉÒÔÈ¡µÃÔ¶³¬¹Å°åÒªÁìµÄÐÔÄÜ¿ÉÊÇÍùÍùÒÀÀµÓÚ¼«ÉîµÄÍøÂç»òÕßÖØ´óµÄºó´¦Öóͷ£ÊÖÒÕÌá¸ßÄ£×ÓµÄÕ¹Íû¾«¶È¡£¡£¡£¡£¡£Òò¶ø´¦Öóͷ£Àú³Ìͨ³£ÐèÒªãýÃð´ó×ÚµÄÅÌËã×ÊÔ´²¢ÇÒËÙÂʲ»¸ß¡£¡£¡£¡£¡£
´ËÂÛÎÄÌá³öÁËÒ»¸öºÜ¾ßÆô·¢ÐÔµÄÁ¢Òìµã¶ÔÄѶȲî±ðµÄÇøÓòÇø±ð¿´´ý ¡ª ÈüòÆÓµÄ¹éÓÚ¼òÆÓ°ÑÃû¹óµÄÅÌËã×ÊԴͶ·ÅÔÚÄÑÌâµÄ²¿·Ö¡£¡£¡£¡£¡£Ïêϸ¶øÑÔ´ó¿éƽ»¬µÄÇøÓòÖ§½âÊǽÏΪÈÝÒ׵ĶøÎïÌå½çÏß»òÕßÓÐÖØ´óÎÆÀíµÄÇøÓòµÄÖ§½âÔò½ÏΪÄÑÌâ¡£¡£¡£¡£¡£»£»£»£»£»ùÓÚÕâÒ»ÊÓ²ì´ËÎÄÌá³öÁËÒ»¸öÐÂÐ͵ÄÓïÒåÖ§½âÄ£×ÓDeep Layer Cascade (LC)¡£¡£¡£¡£¡£Ëü½«Í¼ÏñÖеÄÏñËØ·ÖΪ¼òÆÓ¡¢Öеȡ¢ÄÑÌâÈý¸ö²¿·Ö»®·ÖÓÉÍøÂçÖеÄdz²ã¡¢Öв㡢¸ß²ãÀ´Õë¶ÔÐÔÍêÉíÖ§½â¡£¡£¡£¡£¡£
´ËÎÄ»¹Ìá³öÁËRegion ConvolutionÀ´Ö»´¦Öóͷ£Ã¿¸ö½×¶Î¸ÐÐËȤµÄÇøÓò¶øºöÊÓÆäËûÇøÓò¡£¡£¡£¡£¡£¹ØÓÚ¼òÆÓÇøÓòµÄµÍ²ã´¦Öóͷ£Ð§¹ûͨ¹ýskip connectionÖ±½Ó´«Ë͵½Êä³ö¶Ë²¢ÓëÖØ´óÇøÓò´¦Öóͷ£µÄЧ¹ûÏàÈÚºÏÒò´ËÈý¸öÌõÀíµÄ´¦Öóͷ£ÎÞ·ìÏνÓ×é³ÉÁËÒ»¸ö¿ÉÒԶ˶ԶËѵÁ·µÄÍ³Ò»ÍøÂç¡£¡£¡£¡£¡£Deep Layer Cascade²»µ«Ìá¸ßÅÌËãËÙÂʲ¢ÇÒÓÉÓÚÿ¸ö½×¶Îֻרע´¦Öóͷ£Ä³ÀàÄѶȵÄÏñËØ¾«¶ÈÒ²»áÏà¶ÔµØÌá¸ß¡£¡£¡£¡£¡£LC¿ÉÒÔÎÞаµØÓë²î±ðµÄÉî¶ÈÍøÂç¼Ü¹¹ÍŽáÔËÓᣡ£¡£¡£¡£Inception-ResNet-v2Ó¦ÓÃLCºóËÙÂÊÌá¸ßÁË42.8% ¾«¶ÈÌá¸ßÁË1.7%¡£¡£¡£¡£¡£¾²âÊÔLCÔÚ¶à¸ö²âÊÔ¼¯ÉÏ´ó·ù¶ÈÌá¸ßËÙÂʲ¢µÖ´ïÉõÖÁÓâÔ½state-of-the-artµÄ¾«¶ÈˮƽLCÔÚVOC12²âÊÔ¼¯ÉÏmIoU¿ÉµÖ´ï80.3ÔÚCOCOÉÏԤѵÁ·¿ÉµÖ´ï82.7ÔÚCityscapes²âÊÔ¼¯ÉϿɵִï71.1¡£¡£¡£¡£¡£
³ýÁËÉÏÊöÂÛÎÄÖµµÃÒ»ÌáµÄÊÇÉÌÌÀ¿Æ¼¼ÓëÏã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒÔÚÕâ´ÎµÄCVPRÕ¶»ñÁËÓÉVideoNet(http://videonet.team/)Ö÷ÀíµÄµÚÒ»½ìDAVIS Challenge ÊÓÆµÓïÒåÖ§½â½ÇÖð¹Ú¾ü http://davischallenge.org/ ¡£¡£¡£¡£¡£²ÎÈüÕßÐèÒªÌá³öË㷨׼ȷµØ°ÑÊÓÆµÀïÖ¸¶¨µÄÎïÌå¾ÙÐÐÓïÒåʵÀýÖ§½â¡£¡£¡£¡£¡£ÓÉÓÚÎïÌå¿ìËÙÒÆ¶¯Ôì³ÉµÄÄ£ºý״̬ºÍ¶àÎïÌåÖ®¼äµÄÕÚµ²ÎÊÌâ½ÇÖðµÄÌôÕ½ÄѶÈÌØÊâ¸ß¡£¡£¡£¡£¡£ÉÌÌÀ¿Æ¼¼ÓëÏã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒÏàÖúÌá³öµÄVideo Object Segmentation with Re-identificationVS-ReIDËã·¨ÈÃËûÃÇÔÚÈ«Çò¹² 22Ö§²½¶ÓÀïÍÑÓ±¶ø³ö»ñµÃ¹Ú¾ü¡£¡£¡£¡£¡£
ÂÛÎÄÎÊÌ⣺Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade
ÂÛÎÄ×÷ÕߣºXiaoxiao Li, Ziwei Liu, Ping Luo, ChenChange Loy, Xiaoou Tang
±ðµÄ£¬£¬£¬£¬£¬£¬£¬£¬ÉÌÌÀ¿Æ¼¼ÓëÏã¸ÛÖдó-ÉÌÌÀÍŽáʵÑéÊÒ¼ÓÈëDAVIS Challenge½ÇÖðµÄÊÖÒռƻ®Ò²ÔÚDAVIS Challenge workshopÕûÀí½ÒÏþ£º
ÂÛÎÄÎÊÌ⣺Video Object Segmentation with Re-identification
ÂÛÎÄ×÷ÕߣºXiaoxiaoLi, Yuankai Qi, Zhe Wang, Kai Chen, Ziwei Liu, Jianping Shi, Ping Luo, ChenChange Loy, Xiaoou Tang
Tubelet Proposal Network£ºImageNet¹Ú¾ü¼ì²âÊÖÒÕµ½ÊÓÆµµÄԾǨ

ÊÓÆµÖеÄÎïÌå¼ì²âÎÊÌâÓÉÓÚÆäÔڸ߼¶AIϵͳÈç×Ô¶¯¼ÝÊ»¡¢¼Ò¾Ó»úеÈËÖеÄDZÔÚÓÃ;½üÄêÀ´»ñµÃÁËÆÕ±éµÄ¹Ø×¢¡£¡£¡£¡£¡£Ïà½ÏÓڹŰ徲ֹͼƬµÄÎïÌå¼ì²âÎÊÌâÏÖÓеÄÊÓÆµÎïÌå¼ì²âÒªÁìͨ³£»£»£»£»£»ùÓÚʱ¿Õ¡¸¹ÜµÀ¡¹(tubelets)¼´¿çʱ¼äÅþÁ¬µÄ¼ì²â¿òÀ´ÓÐÓõÄÓ¦ÓÃÊÓÆµÖеÄʱÓòÐÅÏ¢¡£¡£¡£¡£¡£¿ÉÊÇÏÖÓÐÒªÁìÖÐʱ¿Õ¹ÜµÀÌìÉúµÄÖÊÁ¿ºÍЧÂÊÍùÍù²îÇ¿ÈËÒâ»ùÓÚÔ˶¯ÐÅÏ¢µÄÌìÉúÒªÁìÖ»ÄÜÌìÉú½Ï¶ÌµÄʱ¿Õ¹ÜµÀ¶ø»ùÓÚͼÏñÐÅÏ¢µÄÌìÉúÒªÁìÐè񻮮·Ñ´ó×ÚµÄÅÌËãÁ¿Ò²²»¿É°ü¹Ü¹ØÓÚÎïÌå½Ï¸ßµÄÕÙ»ØÂÊ¡£¡£¡£¡£¡£
¸ÃÂÛÎĽ«¹Å°åÕë¶Ô¾²Ö¹Í¼ÏñµÄFasterRCNN¿ò¼Ü¾ÙÐÐÁËÀ©Õ¹½«ÊÓÆµÎïÌå¼ì²â¿ò¼ÜÀ©Õ¹Îª¡¸ºòѡʱ¿Õ¹ÜµÀÌìÉú¡¹ºÍ¡¸ºòѡʱ¿Õ¹ÜµÀʶ±ð¡¹Á½¸öÄ£¿£¿£¿£¿£¿£¿éÌá³öÁËÒ»ÖÖ¸ßЧÂʵĺòѡʱ¿Õ¹ÜµÀÌìÉúÒªÁìÄܹ»ÔÚ°ü¹Üʱ¿Õ¹ÜµÀ½Ï³¤³¤¶ÈµÄͬʱ¾¡¿ÉÄܵı£´æ²î±ðʱ¿Õ¹ÜµÀµÄ¶àÑùÐÔ´Ó¶øÌá¸ßÎïÌåµÄÕÙ»ØÂÊ¡£¡£¡£¡£¡£»£»£»£»£»ùÓÚÕâЩ¸ßÖÊÁ¿µÄºòѡʱ¿Õ¹ÜµÀÓ¦ÓñàÂë-½âÂëLSTMÍøÂç¾ÙÐÐʱ¿Õ¹ÜµÀµÄʶ±ðÄܹ»ÓÐÓõÄÌáÉý¼ì²âÕûÌåµÄ׼ȷÂÊ¡£¡£¡£¡£¡£×÷Õß»¹¶ÔTubelet Proposal Network³õʼ»¯ºÍ²î±ðÉèÖþÙÐÐÁËÏ꾡ÆÊÎö»ùÓÚTPNµÄÎïÌå¼ì²âƽ¾ù׼ȷÂÊÏà½ÏÓÚ¾²Ö¹Í¼Ïñ¼ì²â¿ò¼ÜÓÐ>5%µÄÌáÉý¡£¡£¡£¡£¡£
¸ÃÂÛÎÄ×÷ÕßÔÚ2015ºÍ2016Ò»Á¬Á½ÄêÈ¡µÃImageNetÊÓÆµÎïÌå¼ì²âÏîÄ¿µÚÒ»¡£¡£¡£¡£¡£¸ÃÂÛÎÄÊÇËûÃÇÔÚ2016ÄêImageNet¾ºÈüµÚÒ»ÊÂÇéÉϾÙÐÐÀ©Õ¹ºóÌá³öµÄÈ«ÐÂÊÓÆµÎïÌå¼ì²â¿ò¼ÜÏà½ÏÓÚÏÖÓÐËã·¨¶ÔÊÓÆµÖÐÎïÌå¼ì²âµÄЧÂʺÍ׼ȷÂʽøÒ»²½ÌáÉý¡£¡£¡£¡£¡£
ÂÛÎÄÎÊÌ⣺Object Detection in Videos with Tubelet Proposal Networks
ÂÛÎÄ×÷ÕߣºKai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, XihuiLiu, Xiaogang Wang
Multi-Context Attention£ºµ¥ÈËÌå×Ë̬ʶ±ðÊý¾Ý¼¯MPII׼ȷÂʵÚÒ»

ÈËÌå×Ë̬Ԥ¼ÆÖ¼ÔÚ¼ì²â³öͼÏñ»òÊÓÆµÖÐÈËÌå¸÷Òªº¦µãµÄλÖþßÓкܴóµÄÓ¦ÓüÛÖµÈçÌå¸ÐÓÎÏ·ÈË»ú½»»¥»úеÈËÐéÄâÏÖʵװ±¸Ðж¯²¶»ñ»úеÊÓ¾õµÈ¡£¡£¡£¡£¡£È»¶øÓÉÓÚÈËÌå×Ë̬ºÜÊǸ»ºñͼÏñÊÓÆµÅä¾°ÈßÔÓÈËÌåÕÚµ²µÈÇéÐγ£Óб¬·¢ÒÔÊÇÈËÌå×Ë̬Ԥ¼ÆÎÊÌ⼫¾ßÌôÕ½ÐÔ¡£¡£¡£¡£¡£Òª½â¾öÉÏÊöÎÊÌâÐèÒª³äÇå³þȷͼÏñµÄÉÏÏÂÎÄÐÅÏ¢¹Å°åÒªÁìͨ³£Ê¹Óöà¸ö²î±ð±ê×¼µÄͼÏñ¿éÀ´¶Ô¶à±ê×¼ÐÅÏ¢½¨Ä£ÕâÑù»ñµÃµÄ¶à±ê×¼ÐÅÏ¢ÍùÍùȱÉÙÎÞаÐԺͶàÑùÐÔ¡£¡£¡£¡£¡£ ÈËÄÔÊÓ¾õ×¢ÖØÁ¦»úÖÆÊÇÈËÄÔ¸ßЧÃ÷È·×ÔÈ»³¡¾°µÄÓÐÓûúÖÆ¡£¡£¡£¡£¡£Í¨¹ý½«×¢ÖØÁ¦¼¯Öе½½¹µãÇøÓòÈËÄÔÄÜÓÐÓÃɨ³ýÓëʹÃüÎÞ¹ØµÄÆäËû×ÌÈÅÇøÓò²¢×ÅÖØÆÊÎöÓëʹÃüÏà¹ØµÄÒªº¦ÇøÓò¡£¡£¡£¡£¡£
´ËÂÛÎÄÌá³öµÄ¶àÇé¾³×¢ÖØÁ¦»úÖÆÍøÂçmulti-context attention networkÊ״ν«×¢ÖØÁ¦»úÖÆÄ£×ÓÓëÈËÌå×Ë̬Ԥ¼ÆÊ¹ÃüÓÐÓÃÍŽáͨ¹ýÉè¼ÆÈýÖÖ²î±ðµÄ×¢ÖØÁ¦»úÖÆÄ£×Ó¡ª¡ª¶à±ê×¼×¢ÖØÁ¦»úÖÆmulti-resolution attention¶àÓïÒå×¢ÖØÁ¦»úÖÆmulti-semantics attentionÈËÌåÈ«¾Ö-¾Ö²¿×¢ÖØÁ¦»úÖÆhierarchical global-part attention¡ª¡ªÀ´Ñ§Ï°Í¼ÏñÉÏÏÂÎÄÐÅÏ¢Äܹ»ÓÐÓÃÈ¥³ýÈËÌå×Ë̬Ԥ¼ÆÊ¹ÃüÀïµÄÈßÓàÅä¾°Ìá¸ß¶ÔÒ×»ìÏýÈËÌå²¿Î»Çø·ÖÄÜÁ¦´Ó¶øÈËÌåÒªº¦²¿Î»µÄ¼ì²â¾«¶È¡£¡£¡£¡£¡£ÔÚ×îÆÕ±éʹÓõĵ¥ÈËÌå×Ë̬ʶ±ðÊý¾Ý¼¯MPIIÉϸÃÒªÁì׼ȷ¶ÈÔÚÒѽÒÏþµÄÊÂÇéÖÐ×î¸ß¡£¡£¡£¡£¡£¸ÃÒªÁìÏÖÔÚÒѾ¿ªÔ´´úÂë¿ÉÔÚÈçÏÂÍøÕ¾»ñµÃ
https://github.com/bearpaw/pose-attention
ÂÛÎÄÎÊÌ⣺Multi-Context Attention for Human Pose Estimation
¸½Â¼
ÉÌÌÀ¿Æ¼¼¼°Ïã¸ÛÖдó-ÉÌÌÀ¿Æ¼¼ÍŽáʵÑéÊÒ¹²ÓÐ23ƪÂÛÎı»ÎüÊÕ£¬£¬£¬£¬£¬£¬£¬£¬¸½ÉÏCVPR2017ÉÏSessionʱ¼ä
Multi-Context Attention for Human Pose Estimation - Saturday, July 22, 2017, 09:00¨C10:30
Multi-Scale Continuous CRFs as Sequential Deep Networksfor Monocular Depth Estimation - Saturday, July 22, 2017, 09:00¨C10:30
Accurate Single Stage Detector Using Recurrent Rolling Convolution - Saturday, July 22, 2017, 10:30¨C12:30
Mimicking Very Efficient Network for Object Detection - Saturday, July 22, 2017, 10:30¨C12:30
Object Detection in Videos with Tubelet Proposal Networks - Saturday, July 22, 2017, 10:30¨C12:30
Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion - Saturday, July 22, 2017, 10:30¨C12:30
Discover and Learn New Objects from Documentaries - Saturday, July 22, 2017, 13:30¨C15:00
Learning object interactions and descriptions for Semantic Image Segmentation - Saturday, July 22, 2017, 13:30¨C15:00
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification Saturday, July 22, 2017, 15:00¨C17:00
Scale-Aware Face Detection - Saturday, July 22, 2017, 15:00¨C17:00
Interpretable Structure-Evolving LSTM - Sunday, July 23, 2017, 08:30¨C10:00
Detecting Visual Relationships with Deep Relational Networks - Sunday, July 23, 2017, 13:00¨C14:30
Joint Detection and Identification Feature Learning for Person Search - Sunday, July 23, 2017, 13:00¨C14:30
Learning Cross-Modal Deep Representations for Robust Pedestrian Detection - Sunday, July 23, 2017, 14:30¨C16:30
PolyNet: A Pursuit of Structural Diversity in Very Deep Networks - Sunday, July 23, 2017, 14:30¨C16:30
Pyramid Scene Parsing Network - Sunday, July 23, 2017, 14:30¨C16:30
Person Search with Natural Language Description - Monday, July 24, 2017, 10:00¨C12:00
Quality Aware Network for Set to Set Recognition - Monday, July 24, 10:00¨C12:00
Untrimmed Nets for Weakly Supervised Action Recognitionand Detection - Tuesday, July 25, 2017, 10:00¨C12:00
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade - Tuesday, July 25, 2017, 13:00¨C14:30
Residual Attention Network for Image Classification- Tuesday, July 25, 13:00¨C14:30
ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection - Tuesday, July 25, 2017, 14:30¨C16:30
Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing - Tuesday, July 25, 2017, 14:30¨C16:30






·µ»Ø