ChatGPT Explored | An Analysis of ChatGPT's Technical Development and Strengths

Published On: 2023/04/03 | Categories: Technology

ChatGPT is an artificial intelligence chatbot launched in November 2022 by OpenAI, an artificial intelligence company headquartered in San Francisco that was co-founded as a non-profit in 2015 by Elon Musk and Samuel H. Altman. The company's mission is to lay a foundation for future artificial intelligence research by solving the scientific and technological dilemmas and problems facing humanity. It brings together leading researchers and scholars from around the world who develop artificial intelligence technologies through machine learning and academic research, with the aim of creating value for the global community and helping people understand their natural environment more fully.

In 2018, Elon Musk stepped down from the board of directors in consideration of the potential conflict of interest between Tesla's self-driving technology development and OpenAI, and Altman has since been responsible for the company's operations. Because it could not afford the high long-term cost of training models, OpenAI shifted to a capped-profit model in 2019. Soon after the reorganization, it received a $1 billion investment from Microsoft, which obtained priority rights to commercialize some of OpenAI's AI technologies, and the two companies began cooperating on artificial intelligence technologies for the Azure cloud platform service. After ChatGPT became popular, Microsoft invested billions of dollars more in 2023 and integrated ChatGPT's language model into its search engine Bing and its browser Edge in order to capture the huge search market opportunity. This move made Google feel threatened, and founders Sergey Brin and Larry Page have even returned to oversee the company's research and development of artificial intelligence technology. Currently, OpenAI's main AI technologies include:

Machine learning: automatically learning and applying new knowledge to improve AI performance.
Deep learning: enabling deeper and more diverse applications that more effectively understand and simulate the behavior of artificial intelligence.
Natural language processing: deeper understanding and simulation of human speech behavior.
Autonomous behavior: understanding, modeling, and simulating human behavior.

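GPT stands for Generative Pre-Trained Transformer. As the name suggests, it is a general-purpose text model built on the Transformer and obtained through pre-training. The models OpenAI has published so far are GPT-1 through GPT-3, which are pre-trained on text, and iGPT, which is pre-trained on images; GPT-1 through GPT-3 are compared in Table 2.

GPT-1 constructs its pre-training task as left-to-right generation and yields a general pre-trained model which, like Bidirectional Encoder Representations from Transformers (BERT), can be fine-tuned for downstream tasks. It matched the most advanced models of the time on 9 natural language processing tasks, but because the amount of data it handled was relatively small, GPT-2 was developed. GPT-2 did not substantially change the model structure; it simply used a model with more parameters and more training data. Its design philosophy is that all supervised learning is a subset of unsupervised language modeling. GPT-2 caused quite a stir when it first appeared: the news articles it generated could pass for real ones, so it was called "the most dangerous weapon in the artificial intelligence world," and many web portals banned fake news generated with GPT-2.

The subsequent GPT-3 model greatly increased the number of parameters and the amount of training data. Its in-context learning mode can find a suitable initialization range from a small amount of data, allowing the model to fit quickly on a limited dataset and achieve good results. Besides common natural language processing tasks, it can also be used to write code in languages such as SQL and JavaScript.

GPT-4 was released on March 14, 2023. Its biggest differences from the previous generation are multimodality and content generation. Earlier GPT models accepted only text input, whereas GPT-4 can recognize the content of images and answer accordingly. It can output up to 25,000 words, makes fewer errors in its answers, and responds more sensibly to ethical and sensitive questions.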
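The in-context learning mode mentioned above can be illustrated with a minimal sketch: instead of updating the model's weights, a handful of worked examples is placed in the prompt ahead of the new query. The function name `build_few_shot_prompt`, the "Input:/Output:" layout, and the sample data are assumptions made for illustration; they are not a format prescribed by OpenAI.

```python
def build_few_shot_prompt(task, examples, query):
    """Assemble a few-shot prompt: task description, worked examples, new input."""
    lines = [task, ""]
    for source, target in examples:
        lines.append(f"Input: {source}")
        lines.append(f"Output: {target}")
        lines.append("")  # blank line between examples
    # The final block is left open so the model continues with the answer.
    lines.append(f"Input: {query}")
    lines.append("Output:")
    return "\n".join(lines)


# Hypothetical demonstration data.
examples = [("cheese", "fromage"), ("house", "maison")]
prompt = build_few_shot_prompt("Translate English to French.", examples, "book")
print(prompt)
```

The resulting string would be sent to a language model as ordinary text; the examples steer the model toward the task without any additional training.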

Table 2. Comparison of key parameters of the three GPT models

Source: OpenAI

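Because a trained model is not very controllable, the distribution of the training data fed into a generative model is the most important factor affecting the quality of what it generates. Developers sometimes want the model to be shaped not only by its training data but also by human control, so that the generated content is useful, truthful, and harmless. OpenAI therefore used Reinforcement Learning from Human Feedback (RLHF) to improve the GPT-3 model, producing a model called InstructGPT. The method fine-tunes the model on prompts that users submitted to the application programming interface (API): labelers provide demonstrations of the desired behavior and rank the model's outputs. InstructGPT follows human instructions better and produces far less harmful content. Although it has only 1.3 billion parameters, far fewer than GPT-3, researchers who measured its ability with natural language processing performance evaluation methods found the two to be about the same.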
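One common way to turn labeler rankings into a training signal is a pairwise ranking loss: for every pair of outputs to the same prompt, the scoring model is penalized when the preferred output does not score higher than the rejected one. The sketch below shows that loss in plain Python. The function name and the sample scores are hypothetical, and whether this matches OpenAI's training code in detail is an assumption.

```python
import math
from itertools import combinations


def pairwise_ranking_loss(scores_best_to_worst):
    """Mean of -log(sigmoid(preferred - rejected)) over all pairs in one ranking.

    scores_best_to_worst: scalar scores for outputs to the same prompt,
    ordered from the labeler's most preferred output to the least preferred.
    """
    # combinations() keeps list order, so the first element of each pair
    # is always the output the labeler ranked higher.
    pairs = list(combinations(scores_best_to_worst, 2))
    total = 0.0
    for preferred, rejected in pairs:
        margin = preferred - rejected
        # -log(sigmoid(margin)) is equal to log(1 + exp(-margin)).
        total += math.log1p(math.exp(-margin))
    return total / len(pairs)


# Hypothetical scores: the first list agrees with the labeler's ranking,
# the second list contradicts it.
agrees = pairwise_ranking_loss([2.0, 1.0, 0.0])
contradicts = pairwise_ranking_loss([0.0, 1.0, 2.0])
print(agrees, contradicts)
```

The loss is small when the scores agree with the human ranking and large when they contradict it, so minimizing it pushes the scores toward the labelers' preferences.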

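ChatGPT is built on GPT-3.5, a model derived from GPT-3, and is likewise trained with reinforcement learning from human feedback. Its training procedure has three steps, as shown in Figure 2. First, GPT-3 is given supervised fine-tuning (SFT) on a collected dataset. Second, human-labeled comparison data is collected to train a reward model (RM). Finally, the reward model serves as the optimization objective for reinforcement learning, and the model is fine-tuned with the Proximal Policy Optimization (PPO) algorithm. ChatGPT's data collection method differs slightly from InstructGPT's, and it adds reinforcement learning with proximal policy optimization, which can be understood as adding a human feedback system on top of human-like thinking. As a result, its writing is more realistic, its coding ability is stronger, and the model's harmlessness is slightly improved.

ChatGPT's technical strength is its self-attention mechanism, which lets it understand context better and take earlier parts of the conversation into account when generating text. Besides producing high-quality text quickly, it can be used in many different domains without any additional training, and it can perform a variety of dialogue tasks such as sentiment analysis, relation inference, and context modeling.

The technology still has limitations to overcome, including:
(1) The timeliness of ChatGPT's output depends on how often OpenAI updates the model's data and on its data sources, so answers may not match the current situation.
(2) ChatGPT can only output text based on existing data; if the data lacks information on a particular field, the generated text will inevitably not be professional enough.
(3) When training ChatGPT, OpenAI typically uses large amounts of text that have been manually or automatically filtered to exclude inappropriate content. However, as the model is used publicly, some inappropriate data may be used in generating results, which lowers accuracy.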
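The self-attention mechanism credited above with letting the model take earlier text into account can be sketched as scaled dot-product attention with a causal mask, where each position mixes information only from itself and the positions before it. This is a simplified illustration: it omits the learned query, key, and value projections and the multiple heads of a real Transformer, and the function names and toy vectors are assumptions.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(vectors):
    """Scaled dot-product self-attention; position i attends to positions <= i.

    Simplification: each input vector is used directly as query, key, and value.
    Returns the mixed output vectors and the attention weights per position.
    """
    dim = len(vectors[0])
    outputs, all_weights = [], []
    for i, query in enumerate(vectors):
        visible = vectors[: i + 1]  # causal mask: no access to later positions
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in visible
        ]
        weights = softmax(scores)
        mixed = [
            sum(w * value[d] for w, value in zip(weights, visible))
            for d in range(dim)
        ]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights


# Toy 2-dimensional "token" vectors (hypothetical values).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = causal_self_attention(tokens)
print(weights[2])  # how much the third position draws on each earlier position
```

Because the weights for each position are computed over everything that came before it, later tokens in a conversation can draw directly on earlier ones.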

Figure 2. Training procedure for the GPT-3.5 and InstructGPT models
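The third training step relies on Proximal Policy Optimization, whose central idea is a clipped objective that keeps each update from moving the model too far from its previous behavior. The sketch below shows only that objective for scalar inputs. The function names, the default `epsilon=0.2`, and the sample numbers are assumptions for illustration, and everything else in a real training loop is omitted.

```python
def clipped_surrogate(ratio, advantage, epsilon=0.2):
    """PPO clipped objective for a single sample.

    ratio: probability of the sampled output under the updated model divided
           by its probability under the model before the update.
    advantage: how much better the output was than expected; in the setting
               described here it would be derived from the reward model.
    """
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    # Taking the minimum removes any gain from pushing the ratio outside
    # [1 - epsilon, 1 + epsilon] in the direction that raises the objective.
    return min(ratio * advantage, clipped_ratio * advantage)


def batch_objective(ratios, advantages, epsilon=0.2):
    """Mean clipped objective over a batch; training maximizes this value."""
    values = [clipped_surrogate(r, a, epsilon) for r, a in zip(ratios, advantages)]
    return sum(values) / len(values)


# Hypothetical numbers: the first sample's ratio is capped at 1 + epsilon.
objective = batch_objective([1.5, 0.9, 0.5], [1.0, 2.0, -1.0])
print(objective)
```

The clipping is what makes the optimization "proximal": once an output's probability has changed by more than the allowed margin, the objective stops rewarding further change in that direction.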