(Data) Image Preprocessing: Image Augmentation with Cutout, Mixup, and CutMix, and Their Implementation

Author: 甘霖佳佳  Updated: 2022-01-31  Category: Programming Languages

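Image augmentation

Image augmentation generates a large number of "new samples" by applying a series of random transformations to the training images, which reduces the risk of overfitting. It is now an indispensable part of training deep convolutional neural networks.

Common augmentation methods

Image augmentation methods generally fall into two categories: geometric transformations of the image, and changes to its color.
Code and implementations for these general methods are covered in the article linked below, so we do not repeat them here.
Summary of image data augmentation methods for deep learning
Below we implement three more advanced augmentation methods: Cutout, Mixup, and CutMix.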

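Mixup method

Mixup is a generic, data-agnostic augmentation principle. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the network to favor simple linear behavior in between training examples.
As the figure shows, Mixup blends two images together with a transparency-like weight, so the model learns from interpolated samples as well as the originals.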
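
To make the "convex combination" concrete, here is a minimal NumPy sketch on toy data. The 2x2 arrays, the one-hot labels, and the fixed weight `lam = 0.7` are illustrative assumptions; in Mixup the weight is drawn from a Beta(alpha, alpha) distribution.

```python
import numpy as np

# Two tiny 2x2 "images" and their one-hot labels (toy values for illustration).
x_a = np.zeros((2, 2))          # all-black image, class 0
x_b = np.ones((2, 2))           # all-white image, class 1
y_a = np.array([1.0, 0.0])
y_b = np.array([0.0, 1.0])

lam = 0.7  # fixed here for readability; Mixup samples it from Beta(alpha, alpha)

# The same weight is applied to the inputs and to the labels.
x_mix = lam * x_a + (1 - lam) * x_b
y_mix = lam * y_a + (1 - lam) * y_b

print(x_mix)  # every pixel is (approximately) 0.3
print(y_mix)  # approximately [0.7 0.3]
```

The mixed label is no longer one-hot: it says the sample is 70% class 0 and 30% class 1, matching how much each image contributes to the pixels.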

Code implementation

# mixup function
import numpy as np
import torch


def mixup_data(x, y, alpha=1.0, use_cuda=True):
    '''Returns mixed inputs, pairs of targets, and lambda'''
    if alpha > 0:
        lam = np.random.beta(alpha, alpha)  # mixing weight drawn from Beta(alpha, alpha)
    else:
        lam = 1  # alpha <= 0 disables mixing

    batch_size = x.size()[0]
    if use_cuda:
        index = torch.randperm(batch_size).cuda()  # random permutation of 0..batch_size-1
    else:
        index = torch.randperm(batch_size)

    mixed_x = lam * x + (1 - lam) * x[index, :]  # blend each sample with a shuffled partner
    y_a, y_b = y, y[index]  # original labels and the partners' labels
    return mixed_x, y_a, y_b, lam
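
The function returns two label sets and `lam`, but the text does not show how they enter the loss. A common pattern is to weight the loss against each label set by `lam` and `1 - lam`. The sketch below illustrates this in plain NumPy; `cross_entropy` and `mixup_loss` are hypothetical helper names, and in a PyTorch training loop the framework's own loss function would take the place of `cross_entropy`.

```python
import numpy as np

def cross_entropy(logits, targets):
    """Mean cross-entropy of integer class targets given raw logits."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(targets)), targets].mean()

def mixup_loss(logits, y_a, y_b, lam):
    """Weight the losses against both label sets by lam and (1 - lam)."""
    return lam * cross_entropy(logits, y_a) + (1 - lam) * cross_entropy(logits, y_b)

# Toy batch: 3 samples, 4 classes (random logits, made-up labels).
rng = np.random.default_rng(0)
logits = rng.normal(size=(3, 4))
y_a = np.array([0, 1, 2])
y_b = y_a[rng.permutation(3)]   # labels of the shuffled partners
lam = rng.beta(1.0, 1.0)

loss = mixup_loss(logits, y_a, y_b, lam)
print(loss)
```

With `lam = 1` the expression reduces to the ordinary loss on the original labels, which is consistent with `mixup_data` returning `lam = 1` when mixing is disabled.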

Cutout method

Cutout is a simple regularization method for convolutional neural networks that masks out random parts of the input image during training. The technique simulates occluded examples and encourages the model to take more minor features into account when making decisions, rather than relying on the presence of a few dominant features.
As the figure shows, Cutout randomly selects one or more square regions of the image and removes them.

Implementation

import numpy as np
import torch


class Cutout(object):
    """Randomly mask out one or more patches from an image.

    Args:
        n_holes (int): Number of patches to cut out of each image.
        length (int): The length (in pixels) of each square patch.
    """
    def __init__(self, n_holes, length):
        self.n_holes = n_holes
        self.length = length

    def __call__(self, img):
        """
        Args:
            img (Tensor): Tensor image of size (C, H, W).
        Returns:
            Tensor: Image with n_holes of dimension length x length cut out of it.
        """
        h = img.size(1)  # image height, e.g. 32
        w = img.size(2)  # image width, e.g. 32

        mask = np.ones((h, w), np.float32)  # h x w mask of ones, e.g. 32 x 32

        for n in range(self.n_holes):  # e.g. n_holes=2, length=4: two patches with side 4
            y = np.random.randint(h)  # random row in 0..h-1, e.g. y=4
            x = np.random.randint(w)  # random column in 0..w-1, e.g. x=24

            y1 = np.clip(y - self.length // 2, 0, h)  # clip(2, 0, 32) -> 2
            y2 = np.clip(y + self.length // 2, 0, h)  # clip(6, 0, 32) -> 6
            x1 = np.clip(x - self.length // 2, 0, w)  # clip(22, 0, 32) -> 22
            x2 = np.clip(x + self.length // 2, 0, w)  # clip(26, 0, 32) -> 26

            mask[y1: y2, x1: x2] = 0.  # zero out this patch

        mask = torch.from_numpy(mask)
        # expand_as() works like expand(): both expand the size of a tensor's dimensions.
        # The difference is that expand_as() takes another tensor as its argument
        # and expands the input to that tensor's size.
        mask = mask.expand_as(img)
        img = img * mask

        return img

Links that help in understanding the code:
Usage of np.clip() in Python's numpy module
The expand() and expand_as() functions in PyTorch
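
The two steps that tend to cause confusion, clipping the patch at the image border and applying one (H, W) mask to every channel, can be tried without PyTorch. This is a minimal NumPy sketch, not the class above: `cutout_mask` is a hypothetical helper, the all-ones image is toy data, and NumPy broadcasting stands in for `expand_as`.

```python
import numpy as np

def cutout_mask(h, w, n_holes, length, rng):
    """Build an (h, w) mask of ones with n_holes square patches set to zero."""
    mask = np.ones((h, w), dtype=np.float32)
    for _ in range(n_holes):
        y = rng.integers(h)  # patch centre, row index in 0..h-1
        x = rng.integers(w)  # patch centre, column index in 0..w-1
        # Clipping keeps the patch inside the image, so patches near a border shrink.
        y1, y2 = np.clip([y - length // 2, y + length // 2], 0, h)
        x1, x2 = np.clip([x - length // 2, x + length // 2], 0, w)
        mask[y1:y2, x1:x2] = 0.0
    return mask

rng = np.random.default_rng(0)
img = np.ones((3, 32, 32), dtype=np.float32)   # toy (C, H, W) image
mask = cutout_mask(32, 32, n_holes=2, length=4, rng=rng)
out = img * mask   # broadcasting applies the same mask to all channels

removed = int((mask == 0).sum())
print(removed)     # at most 2 * 4 * 4 = 32 pixels per channel
```

A patch centred at row 1 with `length=4` would span `np.clip([-1, 3], 0, 32)`, that is rows 0 to 2, which is why fewer than `length * length` pixels can be removed near the edges.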
 

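CutMix
In CutMix, a randomly chosen rectangular patch is cut from one training image and pasted onto another, and the ground-truth labels are mixed in proportion to the area of the patch. By making efficient use of the training pixels while retaining the regularization effect of regional dropout, CutMix consistently outperforms state-of-the-art augmentation strategies on CIFAR classification tasks.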
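
Written as a formula, the operation described above looks roughly as follows. The symbols are assumptions borrowed from the usual CutMix notation rather than from this article: x_A, x_B are two training images of width W and height H, y_A, y_B their labels, M is a binary mask that is 0 inside the pasted box and 1 elsewhere, and r_w, r_h are the box width and height.

```latex
\tilde{x} = \mathbf{M} \odot x_A + (\mathbf{1} - \mathbf{M}) \odot x_B,
\qquad
\tilde{y} = \lambda\, y_A + (1 - \lambda)\, y_B,
\qquad
1 - \lambda = \frac{r_w\, r_h}{W H}
```

Choosing r_w = W\sqrt{1 - \lambda} and r_h = H\sqrt{1 - \lambda} gives a box whose area is exactly the fraction 1 - \lambda of the image, as long as the box is not clipped at the border.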

Code implementation
import numpy as np
import torch


def rand_bbox(size, lam):
    # size is the batch shape (N, C, H, W); W and H below simply name dims 2 and 3,
    # which matches the indexing used when the patch is pasted
    W = size[2]
    H = size[3]
    cut_rat = np.sqrt(1. - lam)  # side ratio, so the patch covers about (1 - lam) of the image
    cut_w = int(W * cut_rat)
    cut_h = int(H * cut_rat)

    # sample the patch centre uniformly
    cx = np.random.randint(W)
    cy = np.random.randint(H)

    # clip the box to the image bounds
    bbx1 = np.clip(cx - cut_w // 2, 0, W)
    bby1 = np.clip(cy - cut_h // 2, 0, H)
    bbx2 = np.clip(cx + cut_w // 2, 0, W)
    bby2 = np.clip(cy + cut_h // 2, 0, H)

    return bbx1, bby1, bbx2, bby2

# generate mixed sample
# (fragment of a training loop: images, labels and args.beta come from the surrounding code)
lam = np.random.beta(args.beta, args.beta)
rand_index = torch.randperm(images.size()[0]).cuda()
labels_a = labels
labels_b = labels[rand_index]
bbx1, bby1, bbx2, bby2 = rand_bbox(images.size(), lam)
images[:, :, bbx1:bbx2, bby1:bby2] = images[rand_index, :, bbx1:bbx2, bby1:bby2]
# adjust lambda to exactly match pixel ratio
lam = 1 - ((bbx2 - bbx1) * (bby2 - bby1) / (images.size()[-1] * images.size()[-2]))
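
The final "adjust lambda" step matters because the box is clipped at the image border, so its real area can be smaller than the sampled `1 - lam`. The sketch below shows this with fixed box centres instead of random ones; `clipped_box` and `adjusted_lam` are hypothetical helpers written for this illustration, and the 32x32 size and `lam = 0.75` are arbitrary choices.

```python
import numpy as np

def clipped_box(H, W, lam, cy, cx):
    """Box with side ratio sqrt(1 - lam), centred at row cy / column cx, clipped to the image."""
    cut_rat = np.sqrt(1.0 - lam)
    cut_h, cut_w = int(H * cut_rat), int(W * cut_rat)
    r1, r2 = np.clip([cy - cut_h // 2, cy + cut_h // 2], 0, H)
    c1, c2 = np.clip([cx - cut_w // 2, cx + cut_w // 2], 0, W)
    return int(r1), int(c1), int(r2), int(c2)

def adjusted_lam(box, H, W):
    """Recompute lam from the pixels actually replaced."""
    r1, c1, r2, c2 = box
    return 1.0 - (r2 - r1) * (c2 - c1) / (H * W)

H = W = 32
lam = 0.75                                   # sampled weight: the patch should cover 25%
center_box = clipped_box(H, W, lam, 16, 16)  # fits entirely inside the image
corner_box = clipped_box(H, W, lam, 0, 0)    # three quarters of it fall outside

print(adjusted_lam(center_box, H, W))  # 0.75
print(adjusted_lam(corner_box, H, W))  # 0.9375
```

With the adjusted value in hand, the loss is typically weighted the same way as for Mixup: `lam` times the loss against `labels_a` plus `1 - lam` times the loss against `labels_b`.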


Original link: https://blog.csdn.net/weixin_45928096/article/details/122406271
        
      