[Lex Computer & Tech Group/LCTG] New York rabbi delivers sermon written by artificial intelligence

Adam Broun abroun at gmail.com
Thu Feb 16 09:06:13 PST 2023


Here’s what I could find on the corpus: https://gist.github.com/veekaybee/6f8885e9906aa9c5408ebe5c7e870698   See about 40% of the way down under “Training Data”.  Excerpt:
===
The model was trained on:

Books1 <https://github.com/soskek/bookcorpus/issues/27#issuecomment-716104208> - also known as BookCorpus[…] which maintains that it's free books scraped from smashwords.com.
Books2 - No one knows exactly what this is, people suspect it's libgen
Common Crawl <https://en.wikipedia.org/wiki/Common_Crawl>
WebText2 <https://www.eleuther.ai/projects/owt2/> - an internet dataset created by scraping URLs extracted from Reddit submissions with a minimum score of 3 as a proxy for quality, deduplicated at the document level with MinHash <https://boringml.com/docs/recsys/minhash/>
What's in MyAI Paper <https://lifearchitect.ai/whats-in-my-ai-paper/>, Source <https://twitter.com/kdamica/status/1600328844753240065> - Detailed dive into these datasets.
===


And here’s a guy who trained a GPT on texts with right-wing viewpoints:  https://davidrozado.substack.com/p/rightwinggpt




> On Feb 16, 2023, at 11:53, Ted Kochanski <tedpkphd at gmail.com> wrote:
> 
> All
> 
> As I mentioned yesterday -- I applied for a demo and am on the waiting list
> 
> I thought of asking about the recent breakthrough announcements in Fusion
> 
> But I may ask the generic question suggested yesterday as part of our discussion:
> How was the corpus used for training ChatGPT created
> 
> Ted
> 
> On Wed, Feb 15, 2023 at 8:05 PM Stephen Quatrano <stefanoq at gmail.com <mailto:stefanoq at gmail.com>> wrote:
>> I'd feel better about this assertion, Ted, if you framed it as a question:  How was the corpus used for training ChatGPT created?  That is a great question.
>> 
>> Or, on the other hand, of course, you could provide evidence of what you claim.
>> 
>> Personally, I have no evidence one way or the other to share.
>> 
>> Regards,
>> 
>> Stephen Quatrano
>> CEO and Cofounder | Meema, Inc
>> web: http://meemastories.com <http://meemastories.com/>
>> email: stephen.quatrano at meemastories.com <mailto:stephen.quatrano at meemastories.com>
>> cell: +1 781-266-8799
>> https://www.linkedin.com/in/quatrano/
>> 
>> Board Member | The Right Question Institute
>> http://www.rightquestion.org <http://www.rightquestion.org/>
>> 
>> Lifelong Learner
>> http://www.howdoweknow.info/p/home.html
>> https://stefano.quatrano.us/2004/05/antonios-liberation-story-by-steve.html
>> 
>>> On Feb 15, 2023, at 7:02 PM, Ted Kochanski <tedpkphd at gmail.com <mailto:tedpkphd at gmail.com>> wrote:
>>> 
>>> All,
>>> 
>>>  the statement attributed to Adam Broun:
>>>> Remember, ChatGPT wasn’t “programmed” with any responses and doesnt know anything.
>>> is not true -- the corpus of material which ChatGPT has access to is its programming and someone defined that corpus
>>> 
>>> So for example if you exclude anything positive which has been written about cats because you are a caninophile  -- if you ask ChatGPT to compare cats and dogs -- you will get nothing but negatives about cats as ChatGPT will not be "aware" that anything positive can be said about cats
>>> 
>>> This selection bias has already been tested when comparing Donald Trump and Joe Biden -- ChatGPT treats Mr. Trump the same way as my hypothetical about cats
>>> 
>>> Ted 
>>> 
>>> On Wed, Feb 15, 2023 at 5:31 PM Adam Broun <abroun at gmail.com <mailto:abroun at gmail.com>> wrote:
>>>> Remember, ChatGPT wasn’t “programmed” with any responses and doesnt know anything. It’s easy to read ‘knowledge’ into its responses because we’re wired to interpret intelligible sentences as coming from an intelligence.  It’s parroting back words that sound like an answer to your prompt because the text is was trained on has those words, nothing more. 
>>>> 
>>>> 
>>>> 
>>>> 
>>>>> On Feb 15, 2023, at 16:41, Marvin Menzin <mmenzin at icloud.com <mailto:mmenzin at icloud.com>> wrote:
>>>>> 
>>>>> While on AI , here is a thought experiment I saw in oped in wsj. 
>>>>> The new AI program was asked to reply to this : 
>>>>> 
>>>>> You can prevent the explosion of a nuke that will kill millions of innocent people but to do that you must utter a terrible racial slur . What should you do? 
>>>>> 
>>>>> The answer came back that “you must never utter a racial slur because we must protect all races and minorities etc etc . “ So the ethics in AI are programmed in by the authors . At least right now .. 
>>>>> Marvin 
>>>>> 
>>>>> 
>>>>> Sent from my iPad
>>>>> 
>>>>>> On Feb 15, 2023, at 4:30 PM, jjrudy1 at comcast.net <mailto:jjrudy1 at comcast.net> wrote:
>>>>>> 
>>>>>> 
>>>>>> www.thejc.com/news/world/new-york-rabbi-delivers-sermon-written-by-artificial-intelligence-6BkwDEHc2ZWR63tmoOdvvf <http://www.thejc.com/news/world/new-york-rabbi-delivers-sermon-written-by-artificial-intelligence-6BkwDEHc2ZWR63tmoOdvvf>
>>>>>>  
>>>>>> There is a more recent article by a rabbi saying that the sermon wasn’t very good and they don’t have to worry about their jobs.  I think he is partially wrong.  Let’s say a rabbi takes 8 hours to write a sermon.  With the right prompts AI can toss out 3000 words in a few minutes.  Now the rabbi can tune and/or expand and it will take ½ the time or less, and the rabbi can have the AI do some of the content tuning.
>>>>>>  
>>>>>> Sermons, of course, are a small percentage of the job, so I suppose that they are OK
>>>>>>  
>>>>>>  
>>>>>>  
>>>>>> ===============================================
>>>>>> ::The Lexington Computer and Technology Group Mailing List::
>>>>>> Reply goes to sender only; Reply All to send to list.
>>>>>> Send to the list: LCTG at lists.toku.us <mailto:LCTG at lists.toku.us>      Message archives: http://lists.toku.us/pipermail/lctg-toku.us/
>>>>>> To subscribe: email lctg-subscribe at toku.us <mailto:lctg-subscribe at toku.us>  To unsubscribe: email lctg-unsubscribe at toku.us <mailto:lctg-unsubscribe at toku.us>
>>>>>> Future and Past meeting information: http://LCTG.toku.us <http://lctg.toku.us/>
>>>>>> List information: http://lists.toku.us/listinfo.cgi/lctg-toku.us
>>>>>> This message was sent to mmenzin at icloud.com <mailto:mmenzin at icloud.com>.
>>>>>> Set your list options: http://lists.toku.us/options.cgi/lctg-toku.us/mmenzin@icloud.com
>>>>> ===============================================
>>>>> ::The Lexington Computer and Technology Group Mailing List::
>>>>> Reply goes to sender only; Reply All to send to list.
>>>>> Send to the list: LCTG at lists.toku.us <mailto:LCTG at lists.toku.us>      Message archives: http://lists.toku.us/pipermail/lctg-toku.us/
>>>>> To subscribe: email lctg-subscribe at toku.us <mailto:lctg-subscribe at toku.us>  To unsubscribe: email lctg-unsubscribe at toku.us <mailto:lctg-unsubscribe at toku.us>
>>>>> Future and Past meeting information: http://LCTG.toku.us <http://lctg.toku.us/>
>>>>> List information: http://lists.toku.us/listinfo.cgi/lctg-toku.us
>>>>> This message was sent to abroun at gmail.com <mailto:abroun at gmail.com>.
>>>>> Set your list options: http://lists.toku.us/options.cgi/lctg-toku.us/abroun@gmail.com
>>>> 
>>>> ===============================================
>>>> ::The Lexington Computer and Technology Group Mailing List::
>>>> Reply goes to sender only; Reply All to send to list.
>>>> Send to the list: LCTG at lists.toku.us <mailto:LCTG at lists.toku.us>      Message archives: http://lists.toku.us/pipermail/lctg-toku.us/
>>>> To subscribe: email lctg-subscribe at toku.us <mailto:lctg-subscribe at toku.us>  To unsubscribe: email lctg-unsubscribe at toku.us <mailto:lctg-unsubscribe at toku.us>
>>>> Future and Past meeting information: http://LCTG.toku.us <http://lctg.toku.us/>
>>>> List information: http://lists.toku.us/listinfo.cgi/lctg-toku.us
>>>> This message was sent to tedpkphd at gmail.com <mailto:tedpkphd at gmail.com>.
>>>> Set your list options: http://lists.toku.us/options.cgi/lctg-toku.us/tedpkphd@gmail.com
>>> ===============================================
>>> ::The Lexington Computer and Technology Group Mailing List::
>>> Reply goes to sender only; Reply All to send to list.
>>> Send to the list: LCTG at lists.toku.us <mailto:LCTG at lists.toku.us>      Message archives: http://lists.toku.us/pipermail/lctg-toku.us/
>>> To subscribe: email lctg-subscribe at toku.us <mailto:lctg-subscribe at toku.us>  To unsubscribe: email lctg-unsubscribe at toku.us <mailto:lctg-unsubscribe at toku.us>
>>> Future and Past meeting information: http://LCTG.toku.us <http://lctg.toku.us/>
>>> List information: http://lists.toku.us/listinfo.cgi/lctg-toku.us
>>> This message was sent to stefanoq at gmail.com <mailto:stefanoq at gmail.com>.
>>> Set your list options: http://lists.toku.us/options.cgi/lctg-toku.us/stefanoq@gmail.com
>> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.toku.us/pipermail/lctg-toku.us/attachments/20230216/78d01dc8/attachment.htm>


More information about the LCTG mailing list