1. <strong id="7actg"></strong>
    2. <table id="7actg"></table>

    3. <address id="7actg"></address>
      <address id="7actg"></address>
      1. <object id="7actg"><tt id="7actg"></tt></object>

        人工智能領域100+數(shù)據(jù)集分享,趕緊收藏!

        共 9484字,需瀏覽 19分鐘

         ·

        2022-02-17 21:55




        學習數(shù)據(jù)分析需要持續(xù)進行實操,但很多讀者找不到合適的數(shù)據(jù)集來練手,小編整理了人工智能領域100+數(shù)據(jù)集,總有一個是適合你練手的數(shù)據(jù)集!趕緊收藏點贊吧!


        01


        NLP語料庫數(shù)據(jù)集





        1.2016-2019新聞聯(lián)播語料庫(11.3MB)

        https://www.heywhale.com/mw/dataset/5d2d344c688d36002c5da8e5


        2.中文謠言語料庫(32.6MB)

        https://www.heywhale.com/mw/dataset/5d257f87688d36002c579342


        3.中國對聯(lián)數(shù)據(jù)集(28.2MB)

        https://www.heywhale.com/mw/dataset/5c46e6f42d8ef5002b736d6d


        4.1998人民日報標注語料庫(PFR)(10.2MB)

        https://www.heywhale.com/mw/dataset/5ce7983cd10470002b334de3


        5.人民日報文章數(shù)據(jù)集(1979-2010)(811.9MB)

        https://www.heywhale.com/mw/dataset/5c862b1ad635ff002ca2eb19


        6.人民日報文章數(shù)據(jù)集(1949-1978)(559.4MB)

        https://www.heywhale.com/mw/dataset/5c8626031e7104002b380a4b


        7.中文新聞數(shù)據(jù)集(70.3MB)

        https://www.heywhale.com/mw/dataset/5d8878638499bc002c1148f7


        8.耶魯文本轉SQL語句挑戰(zhàn)數(shù)據(jù)集(95.1MB)

        https://www.heywhale.com/mw/dataset/5d48f322c143cf002bf36319


        9.新加坡國立大學SMS語料庫(23.4MB)

        https://www.heywhale.com/mw/dataset/5d3ea76acf76a600361e9aa0


        10.中文經典典籍語料

        https://www.heywhale.com/mw/dataset/5d11e717708b90002c4d2983


        11.非正式漢語數(shù)據(jù)集(214.5MB)

        https://www.heywhale.com/mw/dataset/5d1c45459f53a9002ce35b61


        12.維基百科中文語料庫(518.7MB)

        https://www.heywhale.com/mw/dataset/5d1ee7939f53a9002ce5910e


        13.頻率最高的9933個最常用漢字數(shù)據(jù)集(1.0MB)

        https://www.heywhale.com/mw/dataset/5d8dd076037db3002d3a715c


        14.聊天語料庫數(shù)據(jù)集(210.7MB)

        https://www.heywhale.com/mw/dataset/5dee1459953ca8002c9678a6


        15.短文本分類數(shù)據(jù)集(13.1MB)

        https://www.heywhale.com/mw/dataset/5dd645fca0cb22002c94e65d/file


        16.成語閱讀理解數(shù)據(jù)集(195.8MB)

        https://www.heywhale.com/mw/dataset/5ddf91e8ca27f8002c4ad48d


        17.論文自動評分數(shù)據(jù)集(78.8MB)

        https://www.heywhale.com/mw/dataset/5de0c5ccca27f8002c4b178a


        18.翻譯語料(595.9MB)

        https://www.heywhale.com/mw/dataset/5de5fcafca27f8002c4ca993


        19.中文科學文獻摘要數(shù)據(jù)集(92.9MB)

        https://www.heywhale.com/mw/dataset/5de72db2ca27f8002c4ce7b4


        20.維基百科英文語料庫(89.0MB)

        https://www.heywhale.com/mw/dataset/5ddba2c9f41512002cebfef6


        21.Lord of the Rings指環(huán)王數(shù)據(jù)(223.9KB)

        https://www.heywhale.com/mw/dataset/5da83b27c83fb400420c5707


        22.中文機器閱讀理解的跨度提取數(shù)據(jù)集(CMRC 2018)

        https://www.heywhale.com/mw/dataset/5e7b180798d4a8002d2d3af6


        23.36氪新聞數(shù)據(jù)集(42.5MB)

        https://www.heywhale.com/mw/dataset/5eb68e91366f4d002d77d08d


        24.1萬條亞馬遜樂器的評測/評論(13MB)

        https://www.heywhale.com/mw/dataset/5e980ce4ebb37f002c5feccc


        25.1萬條互聯(lián)網專欄資訊數(shù)據(jù)集(75.7MB)

        https://www.heywhale.com/mw/dataset/5ebba2de0bff1b002ce6d6a7


        26.2萬條中文金融新聞數(shù)據(jù)集(66.6MB)

        https://www.heywhale.com/mw/dataset/5eb69242366f4d002d77d2b7


        27.中文圖書分類數(shù)據(jù)集(49.8MB)

        https://www.heywhale.com/mw/dataset/5ecf5a25162df90036ddec65


        28.英文歌詞數(shù)據(jù)集(69.1MB)

        https://www.heywhale.com/mw/dataset/5aab8085afaabd5e93e4e027


        29.特朗普政府發(fā)表的聲明和簡報(63.6MB)

        https://www.heywhale.com/mw/dataset/5fae515f7d1e6d0030d68088







        02


        問答類數(shù)據(jù)集





        1.金融行業(yè)問答數(shù)據(jù)集(245.5MB)


        https://www.heywhale.com/mw/dataset/5e9588f8e7ec38002d0331b1


        2.社區(qū)問答數(shù)據(jù)集(1.7GB)

        https://www.heywhale.com/mw/dataset/5de601f3ca27f8002c4cac47


        3.中文醫(yī)學問答數(shù)據(jù)集(85MB)

        https://www.heywhale.com/mw/dataset/5d313070cf76a60036e4b023


        4.CNN?新聞文章中的 12?萬個問答對數(shù)據(jù)集(17.3MB)

        https://www.heywhale.com/mw/dataset/5eef1408caa99b002d6e37cc






        03


        情感分析類數(shù)據(jù)集




        1.斯坦福情緒樹庫:帶有情感注釋的標準情緒數(shù)據(jù)集(6.1MB)

        https://www.heywhale.com/mw/dataset/5daa748c1035d8002c35cdee


        2.關于美國的航空公司的推特的情緒分析數(shù)據(jù)集(2.6MB)

        https://www.heywhale.com/mw/dataset/5dab23781035d8002c3634c9


        3.中文對話情緒語料(1.1MB)

        https://www.heywhale.com/mw/dataset/5d00c390e727f8002c4599ad


        4.多域情感數(shù)據(jù)集(51.2MB)

        https://www.heywhale.com/mw/dataset/5de5ce0aca27f8002c4c9ee8


        5.sentiment140?情感分析數(shù)據(jù)集(72.6KB)

        https://www.heywhale.com/mw/dataset/5ca46f1a8408c1002b498cca







        04


        爬蟲類數(shù)據(jù)集





        1.6000條周杰倫微博超話數(shù)據(jù)!(1.1MB)

        https://www.heywhale.com/mw/dataset/5d3551bdcf76a60036f605aa


        2.《中餐廳3》19W彈幕數(shù)據(jù)(12.8MB)

        https://www.heywhale.com/mw/dataset/5d7b69798499bc002c0d3ec5


        3.bilibili流行動漫影評數(shù)據(jù)(2.3MB)

        https://www.heywhale.com/mw/dataset/5d3a76dfcf76a600360e19c9


        4.淘寶某店鋪電風扇評論(273.9KB)

        https://www.heywhale.com/mw/dataset/5d442caec143cf002bdb687c


        5.7K條馬蜂窩國內熱門景點游記(140+MB)

        https://www.heywhale.com/mw/dataset/5e7c55db98d4a8002d2db5a2


        6.IMDB電影評論數(shù)據(jù)(32.0MB)

        https://www.heywhale.com/mw/dataset/5d143d41708b90002c5f7021


        7.未名BBS熱門話題(3.6MB)

        https://www.heywhale.com/mw/dataset/5dad84b375df5c002b20d79f


        8.咪蒙所有公眾號文章(3.9MB)

        https://www.heywhale.com/mw/dataset/5c8723441e7104002b3831d3


        9.6000條周杰倫微博超話數(shù)據(jù)(1.1MB)

        https://www.heywhale.com/mw/dataset/5d3551bdcf76a60036f605aa


        10.麥當勞就餐負面評論數(shù)據(jù)集(891.1KB)

        https://www.heywhale.com/mw/dataset/5dab2c0b1035d8002c36372b







        05


        實體識別類數(shù)據(jù)集





        1.用于命名實體識別的帶注釋語料庫(26.4MB)

        https://www.heywhale.com/mw/dataset/5de9be34953ca8002c95c35f


        2.使用LatticeLSTM的中文NER數(shù)據(jù)(191.5KB)

        https://www.heywhale.com/mw/dataset/5d564ea0c143cf002b235181


        3.醫(yī)療命名實體識別數(shù)據(jù)集(5.1MB)

        https://www.heywhale.com/mw/dataset/5dedef59953ca8002c96667a


        4.中文實體關系抽取數(shù)據(jù)集(8.1MB)

        https://www.heywhale.com/mw/dataset/5dde487dca27f8002c4a8352


        5.金融信息負面及主體判定比賽數(shù)據(jù)集(17MB)

        https://www.heywhale.com/mw/dataset/5e09a9eb2823a10036b126c0





        06


        CV類數(shù)據(jù)集





        1.Pronto共享單車數(shù)據(jù)集(70.8MB)

        https://www.heywhale.com/mw/dataset/58a515c48460306efcce2e96



        1.Fashion-MNIST圖像數(shù)據(jù)集(200.4MB)

        https://www.heywhale.com/mw/dataset/5a0cfcf860680b295c28a753


        2.CIFAR100數(shù)據(jù)集(161.3MB)

        https://www.heywhale.com/mw/dataset/5e96da12e7ec38002d03bf51


        3.車輛數(shù)據(jù)集(車輛識別與分類)(62.5MB)

        https://www.heywhale.com/mw/dataset/5bc316173631bc00109d2abf


        4.垃圾分類數(shù)據(jù)集

        https://www.heywhale.com/mw/dataset/5d133d11708b90002c570588


        5.另一個垃圾分類數(shù)據(jù)集(40.9MB)

        https://www.heywhale.com/mw/dataset/5d1578e4708b90002c6a3238


        6.CIFAR10數(shù)據(jù)集(148MB)

        https://www.heywhale.com/mw/dataset/5ab3403bfdf6b86c23f259e3


        7.GTSRB-德國交通標志識別圖像數(shù)據(jù)(253.3MB)

        https://www.heywhale.com/mw/dataset/5d8db5ca037db3002d3a5ba0


        8.手勢識別數(shù)據(jù)庫(1.1GB)

        https://www.heywhale.com/mw/dataset/5d8da999037db3002d3a523


        9.情緒的面部表情(170MB+)

        https://www.heywhale.com/mw/dataset/5d773bd68499bc002c0c4a6f


        10.槍支目標檢測(2.4MB)

        https://www.heywhale.com/mw/dataset/5d576984c143cf002b238528


        11.人臉圖像數(shù)據(jù)(294.1MB)

        https://www.heywhale.com/mw/dataset/5da6d4cac83fb4004206edf0


        12.RMFD口罩遮擋人臉數(shù)據(jù)集(610.3MB)

        https://www.heywhale.com/mw/dataset/5e81a24b246a590036b884d5


        13.中國交警手勢數(shù)據(jù)集(1.8GB)

        https://www.heywhale.com/mw/dataset/5de75df5ca27f8002c4cf1bb


        14.場景分類數(shù)據(jù)集(105.9MB)

        https://www.heywhale.com/mw/dataset/5ddb4adef41512002cebc776


        15.87種寶石圖片數(shù)據(jù)(50.9MB)

        https://www.heywhale.com/mw/dataset/5ddccb4aca27f8002c4a26db


        16.驗證碼數(shù)據(jù)集(13.5MB)

        https://www.heywhale.com/mw/dataset/5e143ef32823a10036b34846/file


        17.硬幣圖像數(shù)據(jù)集(326.7MB)

        https://www.heywhale.com/mw/dataset/5e7c7b3e98d4a8002d2dd15c


        18.LabelMe圖像語義分割數(shù)據(jù)集(102.6MB)

        https://www.heywhale.com/mw/dataset/5e7b770698d4a8002d2d7925


        19.車牌識別數(shù)據(jù)集(62.8MB)

        https://www.heywhale.com/mw/dataset/5e96cf8fe7ec38002d03b8ed


        20.Biwi頭姿勢數(shù)據(jù)庫(449.7MB)

        https://www.heywhale.com/mw/dataset/5ea16e79105d91002d4f4a25


        動物

        21.Butterfly-200細粒度圖像分類數(shù)據(jù)集(828MB)

        https://www.heywhale.com/mw/dataset/5e85d97095b029002ca7e904


        22.寵物圖像數(shù)據(jù)集(783.5MB)

        https://www.heywhale.com/mw/dataset/5d76060b8499bc002c0bdd66


        23.狗狗種類圖像數(shù)據(jù)集(919.5MB)

        https://www.heywhale.com/mw/dataset/5def46de2823a10036aab2f5


        24.黑猩猩圖片數(shù)據(jù)集(604.4MB)

        https://www.heywhale.com/mw/dataset/5e7b62da98d4a8002d2d6deb


        植物:

        25.水稻葉子疾病圖片集(36.7MB)

        https://www.heywhale.com/mw/dataset/5d522069c143cf002b21edbb


        26.植物幼苗圖片數(shù)據(jù)集

        https://www.heywhale.com/mw/dataset/5d4cd153c143cf002b0a8d4f


        27.花卉識別數(shù)據(jù)集(224.9MB)

        https://www.heywhale.com/mw/dataset/5cc127b98c90d7002c8375d7


        28.花卉圖像分類

        https://www.heywhale.com/mw/dataset/5d3320afcf76a60036ec5fd1


        29.可食用野外植物數(shù)據(jù)集

        https://www.heywhale.com/mw/dataset/5d4a5e88c143cf002bfbf5d0


        30.葉片計數(shù)圖像數(shù)據(jù)集(882.3MB)

        https://www.heywhale.com/mw/dataset/5e841900246a590036b954bd


        氣象:

        31.颶風損害的衛(wèi)星圖像數(shù)據(jù)集(63MB)

        https://www.heywhale.com/mw/dataset/5da6e8a6c83fb40042073e4f


        32.從衛(wèi)星圖像理解云層數(shù)據(jù)集(42MB)

        https://www.heywhale.com/mw/dataset/5dafcd9275df5c002b2189c6


        字符識別:

        33.TibetanMNIST藏文手寫數(shù)字數(shù)據(jù)集(53.2MB)

        https://www.heywhale.com/mw/dataset/5bfe734a954d6e0010683839


        34.MNIST手寫識別數(shù)據(jù)集(9.5MB)

        https://www.heywhale.com/mw/dataset/58a7c84c803d1a0d2e26441a


        35.Chars74K字符識別數(shù)據(jù)集(188.3MB)

        https://www.heywhale.com/mw/dataset/5d89c020e3ffb2002c44fb0a


        36.信用卡卡面圖像及標注數(shù)據(jù)(42.9MB)

        https://www.heywhale.com/mw/dataset/5954cf1372ead054a5e25870


        37.手寫數(shù)學表達式識別(29MB)

        https://www.heywhale.com/mw/dataset/5daff66f75df5c002b219fb0


        38.圖片與單詞匹配數(shù)據(jù)集(31.1MB)

        https://www.heywhale.com/mw/dataset/5dab27631035d8002c3635eb


        39.密集不規(guī)則文本行數(shù)據(jù)集(353MB)

        https://www.heywhale.com/mw/dataset/5da95febc83fb400420f3631


        40.視覺文字識別數(shù)據(jù)集

        https://www.heywhale.com/mw/dataset/5df058762823a10036aae665


        41.HASY手寫符號圖片數(shù)據(jù)集(127.2MB)

        https://www.heywhale.com/mw/dataset/5dee0b25953ca8002c9671ea


        42.麻將圖片數(shù)據(jù)集(7.5MB)

        https://www.heywhale.com/mw/dataset/5de76474ca27f8002c4cf44c



        醫(yī)療:

        43.犬球蟲病寄生蟲圖片集(18.1MB)

        https://www.heywhale.com/mw/dataset/5d4cd9c7c143cf002b0abead


        44.頭部CT圖像數(shù)據(jù)(24.4MB)

        https://www.heywhale.com/mw/dataset/5d7213eb8499bc002c0af1e8


        45.肺部CT圖像數(shù)據(jù)(529.0MB)

        https://www.heywhale.com/mw/dataset/5d71de448499bc002c0ae1fc


        46.心血管疾病預測(2.7MB)

        https://www.heywhale.com/mw/dataset/5db00b9175df5c002b21af83/file


        47.深圳醫(yī)院胸片檢查掩膜圖片數(shù)據(jù)集(19.8MB)

        https://www.heywhale.com/mw/dataset/5daff18575df5c002b219c89


        48.肺部CT圖像數(shù)據(jù)(529MB)

        https://www.heywhale.com/mw/dataset/5d71de448499bc002c0ae1fc


        49.結核病圖像數(shù)據(jù)集(456.8MB)

        https://www.heywhale.com/mw/dataset/5efc4de063975d002c9792de



        行人識別:

        50.行人檢測數(shù)據(jù)集ETHZ(146MB)

        https://www.heywhale.com/mw/dataset/5db2680f75df5c002b23b755


        51.行人重識別數(shù)據(jù)集Market-1501(145.7MB)

        https://www.heywhale.com/mw/dataset/5db148b375df5c002b2295dd


        52.行人重識別數(shù)據(jù)集RAiD(140.1MB)

        https://www.heywhale.com/mw/dataset/5db11bf775df5c002b226914


        53.行人重識別數(shù)據(jù)集prid_2011(1015.3MB)

        https://www.heywhale.com/mw/dataset/5db10d1a75df5c002b2259dd


        54.汽車后視攝像頭視角行人數(shù)據(jù)集(799.7MB)

        https://www.heywhale.com/mw/dataset/5dafc75175df5c002b2186a0





        07


        語音類數(shù)據(jù)集






        1.Mozilla語音數(shù)據(jù)集-中文(358.2MB)

        https://www.heywhale.com/mw/dataset/5d6f91678499bc002c0a722b


        2.2000個英語讀數(shù)字的錄音(8.9MB)

        https://www.heywhale.com/mw/dataset/5ddde933ca27f8002c4a6013






        如果您覺得我們的文章還不錯,請分享,點贊,再看,一鍵三連?。?!


        END


        數(shù)據(jù)分析求職面試相關資訊持續(xù)分享,盡請關注數(shù)據(jù)萬花筒和數(shù)據(jù)百曉生



        相關閱讀:



        瀏覽 105
        點贊
        評論
        收藏
        分享

        手機掃一掃分享

        分享
        舉報
        評論
        圖片
        表情
        推薦
        點贊
        評論
        收藏
        分享

        手機掃一掃分享

        分享
        舉報
        1. <strong id="7actg"></strong>
        2. <table id="7actg"></table>

        3. <address id="7actg"></address>
          <address id="7actg"></address>
          1. <object id="7actg"><tt id="7actg"></tt></object>
            中文字幕成人在线 | 黑人肏逼 | 日本免费黄色小说 | 日本A片短视频 | 欧美美穴 | 国产精品三级在线观看 | 久久久综合视频 | 日韩做爱免费视频 | a级毛片无码视频AAAA流出91 | 青青草逼视频 |