Sentence Similarity
sentence-transformers
Safetensors
bert
feature-extraction
dense
Generated from Trainer
dataset_size:5000
loss:MultipleNegativesRankingLoss
text-embeddings-inference
Instructions to use aasifali4813/bert-summarizer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use aasifali4813/bert-summarizer with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("aasifali4813/bert-summarizer") sentences = [ "(CNN) -- A third minute goal from Gonzalo Higuain gave Real Madrid a 1-0 victory over Valencia, coach Juande Ramos' first Primera Liga win since taking charge of the Spanish champions. Gonzalo Higuain is congratulated after scoring Real Madrid's only goal against Valencia. The win ended Madrid's run of three straight league defeats and moved them up to 29 points, nine behind leaders Barcelona -- who visit Villarreal on Sunday. With captain Raul Gonzalez only on the bench befcause of the flu, Higuain played alone in attack, and he soon made his mark with the early goal -- the Argentine collecting Arjen Robben's pass to fire home his 11th goal of the season. Valencia almost fell two goals behind in the 16th minute when Rafael van der Vaart struck the post with a curling drive. Higuain had a chance to double Madrid's tally in the second-half, but his close-range shot struck the bar. The visitors' hopes were effectively ended when captain Carlos Marchena was sent off after picking up his second yellow card, for a foul on Robben. The defeat, only Valencia's second of the season after also going down to Barcelona, leaves them on 30 points, just one above Madrid.", "California governor honors U.S. Airways pilot Capt. Chesley Sullenberger .\nSchwarzenegger: \"We need a great hero\" in state legislature like Sullenberger .\nSullenberger safely landed flight 1549 in the Hudson River in New York .\nSullenberger accepts recognition on behalf of crew members, others .", "Gonzalo Higuain scoresd the only goal as Real Madrid defeat Valencia 1-0 .\nThe victory marks first success in the Primera Liga for coach Juande Ramos .\nValencia, who had Carlos Marchena sent off, were losing only second match .", "By definition, a premature baby is born before the 37th week of pregnancy .\nPremature babies cost the U.S. at least $26 billion each year .\nThere are risk factors, but not all premature births can be prevented .\nIn tough economic times, pregnant women should not cut back on health care ." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
Add new SentenceTransformer model
Browse files- 1_Pooling/config.json +10 -0
- README.md +661 -0
- config.json +30 -0
- config_sentence_transformers.json +14 -0
- model.safetensors +3 -0
- modules.json +14 -0
- sentence_bert_config.json +4 -0
- tokenizer.json +0 -0
- tokenizer_config.json +16 -0
1_Pooling/config.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"word_embedding_dimension": 384,
|
| 3 |
+
"pooling_mode_cls_token": false,
|
| 4 |
+
"pooling_mode_mean_tokens": true,
|
| 5 |
+
"pooling_mode_max_tokens": false,
|
| 6 |
+
"pooling_mode_mean_sqrt_len_tokens": false,
|
| 7 |
+
"pooling_mode_weightedmean_tokens": false,
|
| 8 |
+
"pooling_mode_lasttoken": false,
|
| 9 |
+
"include_prompt": true
|
| 10 |
+
}
|
README.md
ADDED
|
@@ -0,0 +1,661 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
tags:
|
| 3 |
+
- sentence-transformers
|
| 4 |
+
- sentence-similarity
|
| 5 |
+
- feature-extraction
|
| 6 |
+
- dense
|
| 7 |
+
- generated_from_trainer
|
| 8 |
+
- dataset_size:5000
|
| 9 |
+
- loss:MultipleNegativesRankingLoss
|
| 10 |
+
base_model: sentence-transformers/paraphrase-MiniLM-L6-v2
|
| 11 |
+
widget:
|
| 12 |
+
- source_sentence: (CNN) -- A third minute goal from Gonzalo Higuain gave Real Madrid
|
| 13 |
+
a 1-0 victory over Valencia, coach Juande Ramos' first Primera Liga win since
|
| 14 |
+
taking charge of the Spanish champions. Gonzalo Higuain is congratulated after
|
| 15 |
+
scoring Real Madrid's only goal against Valencia. The win ended Madrid's run of
|
| 16 |
+
three straight league defeats and moved them up to 29 points, nine behind leaders
|
| 17 |
+
Barcelona -- who visit Villarreal on Sunday. With captain Raul Gonzalez only on
|
| 18 |
+
the bench befcause of the flu, Higuain played alone in attack, and he soon made
|
| 19 |
+
his mark with the early goal -- the Argentine collecting Arjen Robben's pass to
|
| 20 |
+
fire home his 11th goal of the season. Valencia almost fell two goals behind in
|
| 21 |
+
the 16th minute when Rafael van der Vaart struck the post with a curling drive.
|
| 22 |
+
Higuain had a chance to double Madrid's tally in the second-half, but his close-range
|
| 23 |
+
shot struck the bar. The visitors' hopes were effectively ended when captain Carlos
|
| 24 |
+
Marchena was sent off after picking up his second yellow card, for a foul on
|
| 25 |
+
Robben. The defeat, only Valencia's second of the season after also going down
|
| 26 |
+
to Barcelona, leaves them on 30 points, just one above Madrid.
|
| 27 |
+
sentences:
|
| 28 |
+
- 'California governor honors U.S. Airways pilot Capt. Chesley Sullenberger .
|
| 29 |
+
|
| 30 |
+
Schwarzenegger: "We need a great hero" in state legislature like Sullenberger
|
| 31 |
+
.
|
| 32 |
+
|
| 33 |
+
Sullenberger safely landed flight 1549 in the Hudson River in New York .
|
| 34 |
+
|
| 35 |
+
Sullenberger accepts recognition on behalf of crew members, others .'
|
| 36 |
+
- 'Gonzalo Higuain scoresd the only goal as Real Madrid defeat Valencia 1-0 .
|
| 37 |
+
|
| 38 |
+
The victory marks first success in the Primera Liga for coach Juande Ramos .
|
| 39 |
+
|
| 40 |
+
Valencia, who had Carlos Marchena sent off, were losing only second match .'
|
| 41 |
+
- 'By definition, a premature baby is born before the 37th week of pregnancy .
|
| 42 |
+
|
| 43 |
+
Premature babies cost the U.S. at least $26 billion each year .
|
| 44 |
+
|
| 45 |
+
There are risk factors, but not all premature births can be prevented .
|
| 46 |
+
|
| 47 |
+
In tough economic times, pregnant women should not cut back on health care .'
|
| 48 |
+
- source_sentence: (CNN) -- The U.S. Olympic Committee and the International Olympic
|
| 49 |
+
Committee are expected to ask a federal judge Monday to shut down Web sites they
|
| 50 |
+
allege scam customers trying to buy Olympic tickets, according to court documents.
|
| 51 |
+
The Web site www.beijingticketing.com is accused in a lawsuit of scamming Olympic
|
| 52 |
+
ticket buyers. The IOC and the USOC filed lawsuits on July 22 against several
|
| 53 |
+
Web sites -- primarily www.beijingticketing.com and www.beijing-2008tickets.com
|
| 54 |
+
-- for illegally using Olympic trademarks to dupe customers into giving them credit
|
| 55 |
+
card, passport and banking information. Lawyers for the IOC and USOC are expected
|
| 56 |
+
to petition U.S. District Court Judge Jeffrey White on Monday to permanently shut
|
| 57 |
+
down several sites listed in the lawsuit. The lawsuit names the company XL & H
|
| 58 |
+
Ltd, known as Xclusive Leisure & Hospitality Ltd. and six other Web sites believed
|
| 59 |
+
to be fraudulent. Several consumers who purchased tickets from the site contacted
|
| 60 |
+
the USOC when they did not receive tickets, despite numerous calls and e-mails
|
| 61 |
+
to the Web sites founder, according to a USOC press release. The scam has hit
|
| 62 |
+
Olympic fans in the United States, Australia, New Zealand, England, Japan, China
|
| 63 |
+
and Norway, according to media reports. Australian Olympic Committee President
|
| 64 |
+
John Coates told local media that relatives of the country's softball team had
|
| 65 |
+
been victims of the site. The Sydney Morning Herald reported that the New South
|
| 66 |
+
Wales Government set up a hotline for those duped out of tickets. The hotline
|
| 67 |
+
has received hundreds of calls from around the globe with consumers detailing
|
| 68 |
+
losses as high as $57,000. The IOC and USOC successfully secured a restraining
|
| 69 |
+
order on July 23 in federal court in Phoenix, Arizona, that shut down www.beijing-2008tickets.com,
|
| 70 |
+
according to court documents. That site is now shut down and no contact information
|
| 71 |
+
is available. The site www.BeijingTicketing.com priced tickets for Friday's Olympic
|
| 72 |
+
opening ceremony at about $2,000, with events such as swimming selling for between
|
| 73 |
+
$300 and $500. The site is the first entry that comes up for a Google search for
|
| 74 |
+
"Olympic tickets," second only to the authorized dealer of Olympic tickets, www.cosport.com.
|
| 75 |
+
The Web site ww.beijingticketing.com site lists a London phone number, which rang
|
| 76 |
+
unanswered. The site lists an office address in Arizona. The site boasts tickets
|
| 77 |
+
for nearly every Olympic event, with some events showing sell-outs already. The
|
| 78 |
+
site also looked to assure ticket buyers of its authenticity with a note at the
|
| 79 |
+
bottom of the site. "Beijing 2008 Ticketing is a well known tickets agent and
|
| 80 |
+
a trusted market place for buyers to find book Olympic tickets," the note stated.
|
| 81 |
+
"We assure 100% guaranteed tickets delivery."
|
| 82 |
+
sentences:
|
| 83 |
+
- 'French National Assembly backs president''s plan to rejoin NATO .
|
| 84 |
+
|
| 85 |
+
President Nicolas Sarkozy''s plan is politically sensitive .
|
| 86 |
+
|
| 87 |
+
France was a founding member of NATO but left in 1966 .
|
| 88 |
+
|
| 89 |
+
President Charles de Gaulle said in 1966 membership meant losing independence
|
| 90 |
+
.'
|
| 91 |
+
- 'Harry Nicolaides, 41, was arrested last August over his 2005 book .
|
| 92 |
+
|
| 93 |
+
The book includes a paragraph about the king and crown prince .
|
| 94 |
+
|
| 95 |
+
Authorities deemed it a violation of a law that makes it illegal to insult the
|
| 96 |
+
crown .
|
| 97 |
+
|
| 98 |
+
Man''s lawyer says he was relieved and left for Australia at about midnight .'
|
| 99 |
+
- 'USOC, IOC claim fake ticket sites have bilked customers using their trademark
|
| 100 |
+
.
|
| 101 |
+
|
| 102 |
+
Sites www.BeijingTicketing.com and www.beijing-2008tickets.com named in suit .
|
| 103 |
+
|
| 104 |
+
One site claims "We assure 100 % guaranteed tickets delivery"'
|
| 105 |
+
- source_sentence: (CNN) -- Police are searching for any other alleged victims of
|
| 106 |
+
a Sunday school teacher accused of raping and killing an 8-year-old girl from
|
| 107 |
+
Tracy, California. Melissa Huckaby is charged with killing 8-year-old Sandra Cantu,
|
| 108 |
+
who was a friend of her own daughter. "We are asking the public if they have any
|
| 109 |
+
indication that any of their children may have had inappropriate contact with
|
| 110 |
+
[the suspect] to contact us," Tracy Police Sgt. Tony Sheneman said Tuesday evening.
|
| 111 |
+
"There has been no indication that this has happened. But she is a Sunday school
|
| 112 |
+
teacher and did have contact with children, so that is why we are asking." Melissa
|
| 113 |
+
Huckaby, 28, was arrested and charged with killing and raping 8-year-old Sandra
|
| 114 |
+
Cantu, a friend of her 5-year-old daughter. Huckaby, wearing red jail scrubs over
|
| 115 |
+
a white T-shirt, was in court Tuesday for an arraignment. She was charged with
|
| 116 |
+
murder, kidnapping, the performance of a lewd and lascivious act on a child under
|
| 117 |
+
14 and rape by instrument. If convicted, Huckaby would face the death penalty
|
| 118 |
+
or life in prison without parole, San Joaquin County District Attorney James Willett
|
| 119 |
+
told reporters after the hearing. A decision on whether to seek the death penalty
|
| 120 |
+
will be made later, he said. Huckaby did not enter a plea or speak during the
|
| 121 |
+
court hearing, although she flinched when Sandra Cantu's name was spoken, and
|
| 122 |
+
she cried at times. Sandra's body was found April 6, stuffed into a suitcase and
|
| 123 |
+
submerged in a pond at a dairy farm. Huckaby was arrested Friday night after questioning
|
| 124 |
+
by police. Sandra was last seen alive March 27 in the mobile home park where she
|
| 125 |
+
lived with her family -- the same mobile home park where Huckaby lives with her
|
| 126 |
+
daughter. The two children were close friends and played together frequently,
|
| 127 |
+
police said. Huckaby is the granddaughter of Clifford Lane Lawless, pastor of
|
| 128 |
+
Clover Road Baptist Church near the mobile home park, and she taught Sunday school
|
| 129 |
+
at the church, police have said. The church was searched as part of the investigation
|
| 130 |
+
into Sandra's disappearance and death. Before her arrest, Huckaby acknowledged
|
| 131 |
+
to a newspaper reporter that she owned the suitcase that contained Sandra's body.
|
| 132 |
+
But Huckaby said the suitcase had been stolen. Willett declined to comment on
|
| 133 |
+
any of the evidence or allegations in the case, saying evidence would be presented
|
| 134 |
+
in court. A memorial ceremony is scheduled for Sandra at 1 p.m. Thursday at a
|
| 135 |
+
high school in Tracy, police said. -- CNN's Ninette Sosa contributed to this report.
|
| 136 |
+
sentences:
|
| 137 |
+
- 'President Bush continues a Christmas tradition by issuing pardons .
|
| 138 |
+
|
| 139 |
+
Convicted Cheney aide Lewis "Scooter" Libby is not among those pardoned .
|
| 140 |
+
|
| 141 |
+
Posthumous pardon issued for man who ran guns to Israeli Jews in late 1940s .
|
| 142 |
+
|
| 143 |
+
Life sentence of man convicted of methamphetamine commuted .'
|
| 144 |
+
- 'Police comb community for other possible victims .
|
| 145 |
+
|
| 146 |
+
Melissa Huckaby was arrested and charged with killing and raping Sandra Cantu
|
| 147 |
+
.
|
| 148 |
+
|
| 149 |
+
Sandra, 8, was friends with Huckaby''s 5-year-old daughter .'
|
| 150 |
+
- 'San Antonio, Texas, 12-year-old wins "Doodle for Google" national contest .
|
| 151 |
+
|
| 152 |
+
Christin Engelberth''s work will grace Google''s main site for 24 hours Thursday
|
| 153 |
+
.
|
| 154 |
+
|
| 155 |
+
Other prizes: $25,000 college scholarship, laptop, tech grant for her school .
|
| 156 |
+
|
| 157 |
+
When she grows up, she said, she hopes to do "anything to do with art"'
|
| 158 |
+
- source_sentence: 'Editor''s note: Fawaz A. Gerges holds the Christian A. Johnson
|
| 159 |
+
Chair in Middle Eastern Studies and International Affairs at Sarah Lawrence College.
|
| 160 |
+
His most recent book is "The Far Enemy: Why Jihad Went Global." Fawaz Gerges says
|
| 161 |
+
the elite running Iran has lost the support of two key groups -- women and young
|
| 162 |
+
voters. (CNN) -- With an apparent political coup in Iran by President Mahmoud
|
| 163 |
+
Ahmadinejad and his supporters over the weekend, the ruling mullahs have dispensed
|
| 164 |
+
with all democratic pretense and joined the ranks of traditional dictators in
|
| 165 |
+
the Middle East. The hardliners in Tehran, led by the Revolutionary Guards and
|
| 166 |
+
ultraconservatives, have won the first round against reformist conservatives but
|
| 167 |
+
at an extravagant cost: loss of public support. Widespread accusations of fraud
|
| 168 |
+
and manipulation are calling into question the very legitimacy and authority of
|
| 169 |
+
the mullahs'' Islamic-based regime. The electoral crisis has exposed a deepening
|
| 170 |
+
divide between female and young voters, who represent about 70 percent of the
|
| 171 |
+
population, and a radical conservative ruling elite out of touch with the hopes,
|
| 172 |
+
fears and aspirations of young Iranians. The consensus in Iran, particularly among
|
| 173 |
+
young voters, is that the election was stolen from reformist candidate, Mir Hossein
|
| 174 |
+
Moussavi, and that the outcome did not reflect the electorate''s genuine will.
|
| 175 |
+
After the Interior Ministry announced the final election results showing a nearly
|
| 176 |
+
2-to-1 landslide for Ahmadinejad (62.63 to 33.75 percent), thousands of young
|
| 177 |
+
protesters took to the streets and clashed with police and set trash bins and
|
| 178 |
+
tires ablaze. Shock and disbelief turned to anger and rage. What poured gasoline
|
| 179 |
+
on a simmering fire among the youth was a belief of widespread fraud. "It''s like
|
| 180 |
+
taking 10 million votes away from Moussavi and giving them to Ahmadinejad," said
|
| 181 |
+
Tehran-based analyst Saeed Leilaz. That is equivalent to more than 30 million
|
| 182 |
+
votes in the American electorate. "Death to the dictatorship" and "down with the
|
| 183 |
+
dictator" shouted the demonstrators while being clubbed by police and anti-riot
|
| 184 |
+
squads. For the first time, young Iranians showed defiance against their tormentors
|
| 185 |
+
and overcame their fear. Many young men reportedly roamed Tehran streets looking
|
| 186 |
+
for a fight with the police. The ruling mullahs face an unprecedentedly serious
|
| 187 |
+
crisis at home. While it does not threaten their existence, the crisis risks undermining
|
| 188 |
+
their ability to govern effectively and pursue a successful foreign policy. In
|
| 189 |
+
his first response to the results of the election, Moussavi nailed it on the head.
|
| 190 |
+
"I won''t surrender to this manipulation," he wrote on his Web site. "The result
|
| 191 |
+
of such performance by some officials will jeopardize the pillars of the Islamic
|
| 192 |
+
Republic and will establish tyranny." Moussavi''s warning to the mullahs that
|
| 193 |
+
stealing the election would weaken the very foundation of their regime and ultimately
|
| 194 |
+
bring about its collapse carries weight because he has been part of the political
|
| 195 |
+
inner circle of the Islamic Republic, not an outsider. Moussavi is a former prime
|
| 196 |
+
minister admired for the way he managed the country''s economy during the prolonged
|
| 197 |
+
and bloody Iran-Iraq war in the 1980s, a conflict which cost Iran over $500 billion.
|
| 198 |
+
He worked closely with Ali Khamenei, then Iran''s president and today supreme
|
| 199 |
+
leader, and clashed with him over political authority and powers. Moussavi is
|
| 200 |
+
a member of Iran''s Expediency Council, which mediates between the parliament
|
| 201 |
+
and the non-elected Guardian Council led by Ayatollah Khamenei. Initially, many
|
| 202 |
+
reformists were skeptical about Moussavi''s reform credentials and feared that
|
| 203 |
+
he was too conservative for their taste. Throughout the presidential campaign,
|
| 204 |
+
Moussavi labored hard to portray his proposals on social policy and foreign affairs
|
| 205 |
+
as an extension of the Islamic system in order to disarm conservative critics,
|
| 206 |
+
even denying that he is a mainstream reformist candidate in the hope of winning
|
| 207 |
+
the support of reformers and moderate conservatives. Indeed, as the presidential
|
| 208 |
+
campaign progressed, Moussavi won the backing not only of an important conservative
|
| 209 |
+
segment of the electorate but also the formidable youth constituency. His charismatic
|
| 210 |
+
wife, Zahra Rahnavard, electrified the female vote and won the hearts and minds
|
| 211 |
+
of women voters who flooded their campaign rallies. In the past two weeks, Moussavi''s
|
| 212 |
+
campaign gained momentum. There was increasing evidence that the tide was turning
|
| 213 |
+
and that women and young voters would tip the balance of power his way, if they
|
| 214 |
+
turned out to vote in large numbers. Some important facts: There are 46 million
|
| 215 |
+
eligible voters in Iran, half of whom are women. In 1997, more than 60 percent
|
| 216 |
+
of the votes that brought moderate Mohammad Khatami to power were cast by women,
|
| 217 |
+
and in 2000, women voters were instrumental in giving reformists a sweeping majority
|
| 218 |
+
in the Parliament. Promising greater individual freedoms to youths was instrumental
|
| 219 |
+
in the two landslide victories by Khatami in 1997 and 2001. Surely, if the turnout
|
| 220 |
+
was 85 percent as the Interior Ministry said, Moussavi should have done much better
|
| 221 |
+
than the mere 33.8 percent he supposedly received. According to many independent
|
| 222 |
+
media reports from all over Iran, women and young voters turned out in record
|
| 223 |
+
numbers, especially in cities where Moussavi is very popular. That should have
|
| 224 |
+
given Moussavi a comfortable lead over the incumbent. Multiple irregularities
|
| 225 |
+
were reported, including the fact that Moussavi supposedly lost the vote in his
|
| 226 |
+
home province. There is an alarming discrepancy between the final results and
|
| 227 |
+
the voting patterns of the Iranian electorate over more than a decade. Lack of
|
| 228 |
+
transparency in the vote count has reinforced Iranians'' suspicion. The weight
|
| 229 |
+
of evidence is at odds with the final outcome given by the Interior Ministry.
|
| 230 |
+
Before Moussavi formally appealed the election result, supreme leader Khamenei
|
| 231 |
+
closed the door for any possible compromise. In a message on state TV, he urged
|
| 232 |
+
the nation to unite behind Ahmadinejad, calling the result a "divine assessment."
|
| 233 |
+
Yet the Guardian Council, not Khamenei, is the authority that either ratifies
|
| 234 |
+
or annuls the results. It is refreshing that Khamenei has finally ordered an investigation
|
| 235 |
+
into allegations of ballot fraud. It is a good start. A more reassuring intiative
|
| 236 |
+
would be for the Guardian Council to order a full recount with the presence of
|
| 237 |
+
independent observers. That would go a long way to putting to rest Iranians''
|
| 238 |
+
concerns. In the meantime, trying to divert attention from the mess at home, Khamenei
|
| 239 |
+
and Ahmadinejad have blamed external "enemies" and "foreign media" for instigating
|
| 240 |
+
a "full-fledged fight against our people." The truth is that the crisis is internal
|
| 241 |
+
and has nothing to do with foreign media or the West. The Obama administration
|
| 242 |
+
has done the right thing by keeping a low profile and not actively interfering
|
| 243 |
+
in Iran''s unfolding domestic crisis. Unlike his predecessor, Obama has defused
|
| 244 |
+
the minefield of relations between Washington and Tehran, shifting the focus of
|
| 245 |
+
Iranians to their own society. But the disputed result shows that the ultraconservative
|
| 246 |
+
mullahs are not only out of touch with a plurality of their citizens but also
|
| 247 |
+
with reality. Their conduct reflects a deeper crisis of self-confidence and fear
|
| 248 |
+
of the future. Has the Islamic revolution run out of ideological steam? If the
|
| 249 |
+
mullahs fear Moussavi, a loyalist, they must be scared of their shadows and uncertain
|
| 250 |
+
about their authority and power. That speaks volumes about where the Islamic Republic
|
| 251 |
+
is and where it is heading. The mullahs are swimming against the dominant current
|
| 252 |
+
of Iranian society. In the next four years, Iran will likely be engulfed in social
|
| 253 |
+
and political turmoil unless the electoral crisis is resolved in a transparent
|
| 254 |
+
manner. The opinions expressed in this commentary are solely those of Fawaz Gerges.'
|
| 255 |
+
sentences:
|
| 256 |
+
- 'Celtic defeat rivals Rangers 2-0 to win Scottish League Cup final at Hampden
|
| 257 |
+
.
|
| 258 |
+
|
| 259 |
+
Extra-time goals by Darren O''Dea and Aiden McGeady secure cup for Celtic .
|
| 260 |
+
|
| 261 |
+
The result will give Celtic a boost as they battle with Rangers for Scottish title
|
| 262 |
+
.'
|
| 263 |
+
- 'Rare species of freshwater dolphins found in Bangladesh''s waterlogged jungles
|
| 264 |
+
.
|
| 265 |
+
|
| 266 |
+
Fears Irrawaddy had dwindled in number to just a few hundred .
|
| 267 |
+
|
| 268 |
+
However, researchers have now counted almost 6,000 .'
|
| 269 |
+
- 'Fawaz Gerges: Iran''s ruling mullahs forfeited their public support in the election
|
| 270 |
+
.
|
| 271 |
+
|
| 272 |
+
Gerges: Ruling mullahs are out of step with women and young voters .
|
| 273 |
+
|
| 274 |
+
He says hardliners are swimming against the current of Iranian society .'
|
| 275 |
+
- source_sentence: '(CNN) -- Eight Florida teenagers -- six of them girls -- will
|
| 276 |
+
be tried as adults and could be sentenced to life in prison for their alleged
|
| 277 |
+
roles in the videotaped beating of another teen, the state attorney''s office
|
| 278 |
+
said Thursday. The teenagers seen in a video assaulting a 16-year-old could face
|
| 279 |
+
life in prison. The suspects, who range in age from 14 to 18, all face charges
|
| 280 |
+
of kidnapping, which is a first-degree felony, and battery, said Chip Thullbery,
|
| 281 |
+
a spokesman for the Polk County state attorney. Three of them are also charged
|
| 282 |
+
with tampering with a witness. Everyone involved in the case was under a gag order
|
| 283 |
+
imposed by a judge. The only attorney for the teens who has been publicly identified
|
| 284 |
+
did not return calls from CNN, and his assistant cited the gag order as the reason.
|
| 285 |
+
The teens are scheduled for their first appearance in court Friday. The video
|
| 286 |
+
shows a brutal scene: The 16-year-old victim is punched, kneed and slapped by
|
| 287 |
+
other girls. She huddles in the fetal position, or stands and screams at her attackers,
|
| 288 |
+
but the assault continues. Authorities say the eight teens said they were retaliating
|
| 289 |
+
for insults posted on the Internet by the attack victim. Polk County Sheriff Grady
|
| 290 |
+
Judd called the March 30 attack "animalistic." "I''ve been involved in law enforcement
|
| 291 |
+
for 35 years, and I''ve seen a lot of extremely violent events, but I''ve never
|
| 292 |
+
seen children, 14 to 18 years of age, engage in this conduct for a 30-minute period
|
| 293 |
+
of time and then make these video clips," he said. Police say the teens planned
|
| 294 |
+
to post the video on YouTube. Watch the disturbing video » . The victim, a 16-year-old
|
| 295 |
+
from Lakeland, Florida, was hospitalized, and still has blurred vision, hearing
|
| 296 |
+
loss, and a swollen face, her mother told CNN on Wednesday. The video shows only
|
| 297 |
+
girls doing the beating; Judd said the boys acted as lookouts. The idea of girls
|
| 298 |
+
administering a vicious beating so they can post the video online may seem shocking,
|
| 299 |
+
but it''s becoming an increasingly common scenario, according to experts and news
|
| 300 |
+
reports. Watch why more teens are putting fights online » . A search for "girl
|
| 301 |
+
fight" on YouTube gets thousands of results, and a suggestion to also try "girl
|
| 302 |
+
fight at school, boy girl fight" and other search terms. There''s at least one
|
| 303 |
+
Web site devoted exclusively to videos of girls fighting. In 2003, 25 percent
|
| 304 |
+
of high school girls said they had been in a physical fight in the past year,
|
| 305 |
+
according to a survey by the Centers for Disease Control and Prevention. (The
|
| 306 |
+
figure for boys was 40.5 percent.) A Justice Department report released in 2006
|
| 307 |
+
showed that by age 17, 21 percent of girls said they had assaulted someone with
|
| 308 |
+
the intent to cause serious harm. Frank Green is executive director of Keys to
|
| 309 |
+
Safer Schools, a group that studies and tries to prevent school violence. He said
|
| 310 |
+
he''s not sure whether girls have actually become more violent, or whether there''s
|
| 311 |
+
just more awareness of their fights. "In one respect, girls have always been more
|
| 312 |
+
vicious than boys," Green said. "Their violence is of a personal nature." He said
|
| 313 |
+
boys usually have some focus and a concrete goal when they fight. "But girls want
|
| 314 |
+
to cause pain and make the other girl feel bad," he said. Judd, the Polk County
|
| 315 |
+
sheriff, said an important part of the plan in the Lakeland attack was to post
|
| 316 |
+
the video of the beating on YouTube to humiliate and embarrass the victim. "It''s
|
| 317 |
+
the next stage of cyberbullying," psychologist Susan Lipkins said. "They want
|
| 318 |
+
to show what they''re doing." "Our kids are being peer pressured, in another sense
|
| 319 |
+
of a trend, to put these shock videos out there at other peoples'' expense," said
|
| 320 |
+
Talisa Lindsay, the victim''s mother. "And I hope that it doesn''t come to the
|
| 321 |
+
point where there''s more people''s lives that are being affected by having to
|
| 322 |
+
take a beating for entertainment, or possibly being killed." Watch mother describe
|
| 323 |
+
how the victim is doing » . The suspects didn''t have a chance to post the video
|
| 324 |
+
online before police moved in and seized it, Judd said. The Sheriff''s Department
|
| 325 |
+
made it public, and it wound up on YouTube anyway. Judd recognizes the irony.
|
| 326 |
+
"In a perverted sense, we were feeding into exactly what the kids wanted," he
|
| 327 |
+
said. "But according to Florida law, [the video] is public record, and it''s going
|
| 328 |
+
to be in the public domain whether we agree with that or not." Judd said the suspects
|
| 329 |
+
showed no remorse when they were arrested and booked. "They were laughing and
|
| 330 |
+
joking about, ''I guess we won''t get to go to the beach during spring break.''
|
| 331 |
+
And one ... asked whether she could go to cheerleading practice," he said. Lipkins,
|
| 332 |
+
the psychologist, says there''s a "disconnect between their actions and their
|
| 333 |
+
thoughts." "They think the entire society is doing it, and they think it''s funny.
|
| 334 |
+
So they put it on YouTube. And I don''t think they expect kids to get really hurt,
|
| 335 |
+
and they also don''t expect to get really caught." E-mail to a friend . CNN''s
|
| 336 |
+
Rich Phillips contributed to this report.'
|
| 337 |
+
sentences:
|
| 338 |
+
- 'Between 4 and 6 percent of the U.S. population suffers from SAD .
|
| 339 |
+
|
| 340 |
+
Light therapy, psychotherapy, and medications are treatment options .
|
| 341 |
+
|
| 342 |
+
Recent research indicates genetic component to the condition .'
|
| 343 |
+
- 'Cornelia Wallace was in her late 60s .
|
| 344 |
+
|
| 345 |
+
She was with Wallace when would-be assassin shot him in 1972 .
|
| 346 |
+
|
| 347 |
+
"She served as first lady during a very turbulent time," Gov. Bob Riley says .'
|
| 348 |
+
- 'Eight Florida teens to be tried as adults in videotaped beating case .
|
| 349 |
+
|
| 350 |
+
Video shows 16-year-old girl punched by other girls .
|
| 351 |
+
|
| 352 |
+
21 percent of girls age 17 say they''ve assaulted someone, the Justice Dept. reports
|
| 353 |
+
.
|
| 354 |
+
|
| 355 |
+
The teens have "disconnect" between thoughts and actions, psychologist says .'
|
| 356 |
+
pipeline_tag: sentence-similarity
|
| 357 |
+
library_name: sentence-transformers
|
| 358 |
+
---
|
| 359 |
+
|
| 360 |
+
# SentenceTransformer based on sentence-transformers/paraphrase-MiniLM-L6-v2
|
| 361 |
+
|
| 362 |
+
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/paraphrase-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
|
| 363 |
+
|
| 364 |
+
## Model Details
|
| 365 |
+
|
| 366 |
+
### Model Description
|
| 367 |
+
- **Model Type:** Sentence Transformer
|
| 368 |
+
- **Base model:** [sentence-transformers/paraphrase-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2) <!-- at revision c9a2bfebc254878aee8c3aca9e6844d5bbb102d1 -->
|
| 369 |
+
- **Maximum Sequence Length:** 128 tokens
|
| 370 |
+
- **Output Dimensionality:** 384 dimensions
|
| 371 |
+
- **Similarity Function:** Cosine Similarity
|
| 372 |
+
<!-- - **Training Dataset:** Unknown -->
|
| 373 |
+
<!-- - **Language:** Unknown -->
|
| 374 |
+
<!-- - **License:** Unknown -->
|
| 375 |
+
|
| 376 |
+
### Model Sources
|
| 377 |
+
|
| 378 |
+
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
|
| 379 |
+
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
|
| 380 |
+
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
|
| 381 |
+
|
| 382 |
+
### Full Model Architecture
|
| 383 |
+
|
| 384 |
+
```
|
| 385 |
+
SentenceTransformer(
|
| 386 |
+
(0): Transformer({'max_seq_length': 128, 'do_lower_case': False, 'architecture': 'BertModel'})
|
| 387 |
+
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
|
| 388 |
+
)
|
| 389 |
+
```
|
| 390 |
+
|
| 391 |
+
## Usage
|
| 392 |
+
|
| 393 |
+
### Direct Usage (Sentence Transformers)
|
| 394 |
+
|
| 395 |
+
First install the Sentence Transformers library:
|
| 396 |
+
|
| 397 |
+
```bash
|
| 398 |
+
pip install -U sentence-transformers
|
| 399 |
+
```
|
| 400 |
+
|
| 401 |
+
Then you can load this model and run inference.
|
| 402 |
+
```python
|
| 403 |
+
from sentence_transformers import SentenceTransformer
|
| 404 |
+
|
| 405 |
+
# Download from the 🤗 Hub
|
| 406 |
+
model = SentenceTransformer("aasifali4813/bert-summarizer")
|
| 407 |
+
# Run inference
|
| 408 |
+
sentences = [
|
| 409 |
+
'(CNN) -- Eight Florida teenagers -- six of them girls -- will be tried as adults and could be sentenced to life in prison for their alleged roles in the videotaped beating of another teen, the state attorney\'s office said Thursday. The teenagers seen in a video assaulting a 16-year-old could face life in prison. The suspects, who range in age from 14 to 18, all face charges of kidnapping, which is a first-degree felony, and battery, said Chip Thullbery, a spokesman for the Polk County state attorney. Three of them are also charged with tampering with a witness. Everyone involved in the case was under a gag order imposed by a judge. The only attorney for the teens who has been publicly identified did not return calls from CNN, and his assistant cited the gag order as the reason. The teens are scheduled for their first appearance in court Friday. The video shows a brutal scene: The 16-year-old victim is punched, kneed and slapped by other girls. She huddles in the fetal position, or stands and screams at her attackers, but the assault continues. Authorities say the eight teens said they were retaliating for insults posted on the Internet by the attack victim. Polk County Sheriff Grady Judd called the March 30 attack "animalistic." "I\'ve been involved in law enforcement for 35 years, and I\'ve seen a lot of extremely violent events, but I\'ve never seen children, 14 to 18 years of age, engage in this conduct for a 30-minute period of time and then make these video clips," he said. Police say the teens planned to post the video on YouTube. Watch the disturbing video » . The victim, a 16-year-old from Lakeland, Florida, was hospitalized, and still has blurred vision, hearing loss, and a swollen face, her mother told CNN on Wednesday. The video shows only girls doing the beating; Judd said the boys acted as lookouts. The idea of girls administering a vicious beating so they can post the video online may seem shocking, but it\'s becoming an increasingly common scenario, according to experts and news reports. Watch why more teens are putting fights online » . A search for "girl fight" on YouTube gets thousands of results, and a suggestion to also try "girl fight at school, boy girl fight" and other search terms. There\'s at least one Web site devoted exclusively to videos of girls fighting. In 2003, 25 percent of high school girls said they had been in a physical fight in the past year, according to a survey by the Centers for Disease Control and Prevention. (The figure for boys was 40.5 percent.) A Justice Department report released in 2006 showed that by age 17, 21 percent of girls said they had assaulted someone with the intent to cause serious harm. Frank Green is executive director of Keys to Safer Schools, a group that studies and tries to prevent school violence. He said he\'s not sure whether girls have actually become more violent, or whether there\'s just more awareness of their fights. "In one respect, girls have always been more vicious than boys," Green said. "Their violence is of a personal nature." He said boys usually have some focus and a concrete goal when they fight. "But girls want to cause pain and make the other girl feel bad," he said. Judd, the Polk County sheriff, said an important part of the plan in the Lakeland attack was to post the video of the beating on YouTube to humiliate and embarrass the victim. "It\'s the next stage of cyberbullying," psychologist Susan Lipkins said. "They want to show what they\'re doing." "Our kids are being peer pressured, in another sense of a trend, to put these shock videos out there at other peoples\' expense," said Talisa Lindsay, the victim\'s mother. "And I hope that it doesn\'t come to the point where there\'s more people\'s lives that are being affected by having to take a beating for entertainment, or possibly being killed." Watch mother describe how the victim is doing » . The suspects didn\'t have a chance to post the video online before police moved in and seized it, Judd said. The Sheriff\'s Department made it public, and it wound up on YouTube anyway. Judd recognizes the irony. "In a perverted sense, we were feeding into exactly what the kids wanted," he said. "But according to Florida law, [the video] is public record, and it\'s going to be in the public domain whether we agree with that or not." Judd said the suspects showed no remorse when they were arrested and booked. "They were laughing and joking about, \'I guess we won\'t get to go to the beach during spring break.\' And one ... asked whether she could go to cheerleading practice," he said. Lipkins, the psychologist, says there\'s a "disconnect between their actions and their thoughts." "They think the entire society is doing it, and they think it\'s funny. So they put it on YouTube. And I don\'t think they expect kids to get really hurt, and they also don\'t expect to get really caught." E-mail to a friend . CNN\'s Rich Phillips contributed to this report.',
|
| 410 |
+
'Eight Florida teens to be tried as adults in videotaped beating case .\nVideo shows 16-year-old girl punched by other girls .\n21 percent of girls age 17 say they\'ve assaulted someone, the Justice Dept. reports .\nThe teens have "disconnect" between thoughts and actions, psychologist says .',
|
| 411 |
+
'Cornelia Wallace was in her late 60s .\nShe was with Wallace when would-be assassin shot him in 1972 .\n"She served as first lady during a very turbulent time," Gov. Bob Riley says .',
|
| 412 |
+
]
|
| 413 |
+
embeddings = model.encode(sentences)
|
| 414 |
+
print(embeddings.shape)
|
| 415 |
+
# [3, 384]
|
| 416 |
+
|
| 417 |
+
# Get the similarity scores for the embeddings
|
| 418 |
+
similarities = model.similarity(embeddings, embeddings)
|
| 419 |
+
print(similarities)
|
| 420 |
+
# tensor([[1.0000, 0.7511, 0.1292],
|
| 421 |
+
# [0.7511, 1.0000, 0.1549],
|
| 422 |
+
# [0.1292, 0.1549, 1.0000]])
|
| 423 |
+
```
|
| 424 |
+
|
| 425 |
+
<!--
|
| 426 |
+
### Direct Usage (Transformers)
|
| 427 |
+
|
| 428 |
+
<details><summary>Click to see the direct usage in Transformers</summary>
|
| 429 |
+
|
| 430 |
+
</details>
|
| 431 |
+
-->
|
| 432 |
+
|
| 433 |
+
<!--
|
| 434 |
+
### Downstream Usage (Sentence Transformers)
|
| 435 |
+
|
| 436 |
+
You can finetune this model on your own dataset.
|
| 437 |
+
|
| 438 |
+
<details><summary>Click to expand</summary>
|
| 439 |
+
|
| 440 |
+
</details>
|
| 441 |
+
-->
|
| 442 |
+
|
| 443 |
+
<!--
|
| 444 |
+
### Out-of-Scope Use
|
| 445 |
+
|
| 446 |
+
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
| 447 |
+
-->
|
| 448 |
+
|
| 449 |
+
<!--
|
| 450 |
+
## Bias, Risks and Limitations
|
| 451 |
+
|
| 452 |
+
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
|
| 453 |
+
-->
|
| 454 |
+
|
| 455 |
+
<!--
|
| 456 |
+
### Recommendations
|
| 457 |
+
|
| 458 |
+
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
| 459 |
+
-->
|
| 460 |
+
|
| 461 |
+
## Training Details
|
| 462 |
+
|
| 463 |
+
### Training Dataset
|
| 464 |
+
|
| 465 |
+
#### Unnamed Dataset
|
| 466 |
+
|
| 467 |
+
* Size: 5,000 training samples
|
| 468 |
+
* Columns: <code>sentence_0</code> and <code>sentence_1</code>
|
| 469 |
+
* Approximate statistics based on the first 1000 samples:
|
| 470 |
+
| | sentence_0 | sentence_1 |
|
| 471 |
+
|:--------|:-------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
| 472 |
+
| type | string | string |
|
| 473 |
+
| details | <ul><li>min: 29 tokens</li><li>mean: 127.89 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 26 tokens</li><li>mean: 56.37 tokens</li><li>max: 93 tokens</li></ul> |
|
| 474 |
+
* Samples:
|
| 475 |
+
| sentence_0 | sentence_1 |
|
| 476 |
+
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
| 477 |
+
| <code>(CNN) -- It may take a lot of frequent-flier miles, a penchant for cold places, a tolerance of taxes and regular doses of chocolate, but happiness could be within reach. However, it's not where most people might expect. Journalist Eric Weiner says he wanted to explore the relationship between place and happiness. Just ask Eric Weiner, who made it his mission to find the most content places around the globe, uncovering lots of surprises along the way. Hungering for a tropical paradise? A warm climate doesn't necessarily make a happy nation, Weiner said. Thinking of moving to a wealthy state? Money can degrade happiness, he found. Weiner, who wrote the book, "The Geography of Bliss: One Grump's Search for the Happiest Places in the World," began his quest for very personal reasons. "I'm an unhappy person, so it's kind of what prompts a hungry person to search for food," he said. Weiner spent 10 years as a foreign correspondent for National Public Radio, a job that took him to some of the...</code> | <code>Journalist spent a year looking for the world's happiest countries .<br>Eric Weiner: Bhutan is probably the closest thing on Earth to Shangri-La .<br>He marvels at the creativity and "coziness" of Iceland .<br>Self-described "grump:" chocolate contributes to happiness in Switzerland .</code> |
|
| 478 |
+
| <code>(CNN) -- Hamburg have put one foot in the UEFA Cup final after a header from Germany winger Piotr Trochowski proved enough to give them a 1-0 win at Bundesliga rivals Werder Bremen in the first leg of their semifinal. Piotr Trochowski celebrates the only goal as Hamburg took a major step towards the UEFA Cup final. Martin Jol's side scored the only goal of a pulsating match in the 38th minute when Trochowski, the smallest player on the pitch, rose superbly at the back post to head Guy Demel's right-wing cross past goalkeeper Tim Wiese. Both side had countless half-chances to score but Hamburg wasted the best of them on the hour mark when Bayern Munich-bound striker Ivica Olic broke through in acres of space, but fired his shot straight at Wiese. Hamburg's victory puts them on line for their first European final since 1983, when they beat Juventus 1-0 to win the European Cup. This was the second of four matches in quick succession between the north Germany neighbors -- who met each othe...</code> | <code>Hamburg in line for first European final since 1983 after defeating Werder 1-0 .<br>Winger Piotr Trochowski heads in the only goal of their UEFA Cup semifinal .<br>Eventual winners will face Shakhtar Donetsk or Dynamo Kiev in Istanbul final .</code> |
|
| 479 |
+
| <code>(CNET) -- Suleman Ali cashed out just in time. Suleman Ali sold Esgut, his portfolio of Facebook applications, for seven figures in April. The 26-year-old, a former Microsoft employee who helped put together the Windows Home Server product, founded a company called Esgut within months of the debut of Facebook's developer platform in May 2007. Esgut is a portfolio of Facebook applications, and a few of them, like Superlatives and Entourage, became genuine viral hits. In April, Ali sold the 12-employee Esgut to the Social Gaming Network, a Silicon Valley company backed by the likes of Bezos Expeditions, the Founders Fund, and Greylock Partners. He said the price was in the seven figures. But Ali is the first to acknowledge that for upstart social-platform developers, hailed just months ago as the Valley's hottest breed of bright young things, the condition has taken a significant turn for the worse. "Most people are not counting on anything," the lanky and bespectacled Ali said over lunc...</code> | <code>Suleman Ali sold Esgut, his tech startup, for seven figures in April .<br>Esgut is a portfolio of Facebook applications; a few of them became big viral hits .<br>Suleman "started building Facebook apps just out of restlessness"<br>He sold his company just before the social-platform craze subsided .</code> |
|
| 480 |
+
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
| 481 |
+
```json
|
| 482 |
+
{
|
| 483 |
+
"scale": 20.0,
|
| 484 |
+
"similarity_fct": "cos_sim",
|
| 485 |
+
"gather_across_devices": false,
|
| 486 |
+
"directions": [
|
| 487 |
+
"query_to_doc"
|
| 488 |
+
],
|
| 489 |
+
"partition_mode": "joint",
|
| 490 |
+
"hardness_mode": null,
|
| 491 |
+
"hardness_strength": 0.0
|
| 492 |
+
}
|
| 493 |
+
```
|
| 494 |
+
|
| 495 |
+
### Training Hyperparameters
|
| 496 |
+
#### Non-Default Hyperparameters
|
| 497 |
+
|
| 498 |
+
- `per_device_train_batch_size`: 16
|
| 499 |
+
- `per_device_eval_batch_size`: 16
|
| 500 |
+
- `num_train_epochs`: 1
|
| 501 |
+
- `multi_dataset_batch_sampler`: round_robin
|
| 502 |
+
|
| 503 |
+
#### All Hyperparameters
|
| 504 |
+
<details><summary>Click to expand</summary>
|
| 505 |
+
|
| 506 |
+
- `do_predict`: False
|
| 507 |
+
- `eval_strategy`: no
|
| 508 |
+
- `prediction_loss_only`: True
|
| 509 |
+
- `per_device_train_batch_size`: 16
|
| 510 |
+
- `per_device_eval_batch_size`: 16
|
| 511 |
+
- `gradient_accumulation_steps`: 1
|
| 512 |
+
- `eval_accumulation_steps`: None
|
| 513 |
+
- `torch_empty_cache_steps`: None
|
| 514 |
+
- `learning_rate`: 5e-05
|
| 515 |
+
- `weight_decay`: 0.0
|
| 516 |
+
- `adam_beta1`: 0.9
|
| 517 |
+
- `adam_beta2`: 0.999
|
| 518 |
+
- `adam_epsilon`: 1e-08
|
| 519 |
+
- `max_grad_norm`: 1
|
| 520 |
+
- `num_train_epochs`: 1
|
| 521 |
+
- `max_steps`: -1
|
| 522 |
+
- `lr_scheduler_type`: linear
|
| 523 |
+
- `lr_scheduler_kwargs`: None
|
| 524 |
+
- `warmup_ratio`: None
|
| 525 |
+
- `warmup_steps`: 0
|
| 526 |
+
- `log_level`: passive
|
| 527 |
+
- `log_level_replica`: warning
|
| 528 |
+
- `log_on_each_node`: True
|
| 529 |
+
- `logging_nan_inf_filter`: True
|
| 530 |
+
- `enable_jit_checkpoint`: False
|
| 531 |
+
- `save_on_each_node`: False
|
| 532 |
+
- `save_only_model`: False
|
| 533 |
+
- `restore_callback_states_from_checkpoint`: False
|
| 534 |
+
- `use_cpu`: False
|
| 535 |
+
- `seed`: 42
|
| 536 |
+
- `data_seed`: None
|
| 537 |
+
- `bf16`: False
|
| 538 |
+
- `fp16`: False
|
| 539 |
+
- `bf16_full_eval`: False
|
| 540 |
+
- `fp16_full_eval`: False
|
| 541 |
+
- `tf32`: None
|
| 542 |
+
- `local_rank`: -1
|
| 543 |
+
- `ddp_backend`: None
|
| 544 |
+
- `debug`: []
|
| 545 |
+
- `dataloader_drop_last`: False
|
| 546 |
+
- `dataloader_num_workers`: 0
|
| 547 |
+
- `dataloader_prefetch_factor`: None
|
| 548 |
+
- `disable_tqdm`: False
|
| 549 |
+
- `remove_unused_columns`: True
|
| 550 |
+
- `label_names`: None
|
| 551 |
+
- `load_best_model_at_end`: False
|
| 552 |
+
- `ignore_data_skip`: False
|
| 553 |
+
- `fsdp`: []
|
| 554 |
+
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
|
| 555 |
+
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
|
| 556 |
+
- `parallelism_config`: None
|
| 557 |
+
- `deepspeed`: None
|
| 558 |
+
- `label_smoothing_factor`: 0.0
|
| 559 |
+
- `optim`: adamw_torch_fused
|
| 560 |
+
- `optim_args`: None
|
| 561 |
+
- `group_by_length`: False
|
| 562 |
+
- `length_column_name`: length
|
| 563 |
+
- `project`: huggingface
|
| 564 |
+
- `trackio_space_id`: trackio
|
| 565 |
+
- `ddp_find_unused_parameters`: None
|
| 566 |
+
- `ddp_bucket_cap_mb`: None
|
| 567 |
+
- `ddp_broadcast_buffers`: False
|
| 568 |
+
- `dataloader_pin_memory`: True
|
| 569 |
+
- `dataloader_persistent_workers`: False
|
| 570 |
+
- `skip_memory_metrics`: True
|
| 571 |
+
- `push_to_hub`: False
|
| 572 |
+
- `resume_from_checkpoint`: None
|
| 573 |
+
- `hub_model_id`: None
|
| 574 |
+
- `hub_strategy`: every_save
|
| 575 |
+
- `hub_private_repo`: None
|
| 576 |
+
- `hub_always_push`: False
|
| 577 |
+
- `hub_revision`: None
|
| 578 |
+
- `gradient_checkpointing`: False
|
| 579 |
+
- `gradient_checkpointing_kwargs`: None
|
| 580 |
+
- `include_for_metrics`: []
|
| 581 |
+
- `eval_do_concat_batches`: True
|
| 582 |
+
- `auto_find_batch_size`: False
|
| 583 |
+
- `full_determinism`: False
|
| 584 |
+
- `ddp_timeout`: 1800
|
| 585 |
+
- `torch_compile`: False
|
| 586 |
+
- `torch_compile_backend`: None
|
| 587 |
+
- `torch_compile_mode`: None
|
| 588 |
+
- `include_num_input_tokens_seen`: no
|
| 589 |
+
- `neftune_noise_alpha`: None
|
| 590 |
+
- `optim_target_modules`: None
|
| 591 |
+
- `batch_eval_metrics`: False
|
| 592 |
+
- `eval_on_start`: False
|
| 593 |
+
- `use_liger_kernel`: False
|
| 594 |
+
- `liger_kernel_config`: None
|
| 595 |
+
- `eval_use_gather_object`: False
|
| 596 |
+
- `average_tokens_across_devices`: True
|
| 597 |
+
- `use_cache`: False
|
| 598 |
+
- `prompts`: None
|
| 599 |
+
- `batch_sampler`: batch_sampler
|
| 600 |
+
- `multi_dataset_batch_sampler`: round_robin
|
| 601 |
+
- `router_mapping`: {}
|
| 602 |
+
- `learning_rate_mapping`: {}
|
| 603 |
+
|
| 604 |
+
</details>
|
| 605 |
+
|
| 606 |
+
### Framework Versions
|
| 607 |
+
- Python: 3.12.13
|
| 608 |
+
- Sentence Transformers: 5.3.0
|
| 609 |
+
- Transformers: 5.0.0
|
| 610 |
+
- PyTorch: 2.10.0+cu128
|
| 611 |
+
- Accelerate: 1.13.0
|
| 612 |
+
- Datasets: 4.0.0
|
| 613 |
+
- Tokenizers: 0.22.2
|
| 614 |
+
|
| 615 |
+
## Citation
|
| 616 |
+
|
| 617 |
+
### BibTeX
|
| 618 |
+
|
| 619 |
+
#### Sentence Transformers
|
| 620 |
+
```bibtex
|
| 621 |
+
@inproceedings{reimers-2019-sentence-bert,
|
| 622 |
+
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
| 623 |
+
author = "Reimers, Nils and Gurevych, Iryna",
|
| 624 |
+
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
| 625 |
+
month = "11",
|
| 626 |
+
year = "2019",
|
| 627 |
+
publisher = "Association for Computational Linguistics",
|
| 628 |
+
url = "https://arxiv.org/abs/1908.10084",
|
| 629 |
+
}
|
| 630 |
+
```
|
| 631 |
+
|
| 632 |
+
#### MultipleNegativesRankingLoss
|
| 633 |
+
```bibtex
|
| 634 |
+
@misc{oord2019representationlearningcontrastivepredictive,
|
| 635 |
+
title={Representation Learning with Contrastive Predictive Coding},
|
| 636 |
+
author={Aaron van den Oord and Yazhe Li and Oriol Vinyals},
|
| 637 |
+
year={2019},
|
| 638 |
+
eprint={1807.03748},
|
| 639 |
+
archivePrefix={arXiv},
|
| 640 |
+
primaryClass={cs.LG},
|
| 641 |
+
url={https://arxiv.org/abs/1807.03748},
|
| 642 |
+
}
|
| 643 |
+
```
|
| 644 |
+
|
| 645 |
+
<!--
|
| 646 |
+
## Glossary
|
| 647 |
+
|
| 648 |
+
*Clearly define terms in order to be accessible across audiences.*
|
| 649 |
+
-->
|
| 650 |
+
|
| 651 |
+
<!--
|
| 652 |
+
## Model Card Authors
|
| 653 |
+
|
| 654 |
+
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
|
| 655 |
+
-->
|
| 656 |
+
|
| 657 |
+
<!--
|
| 658 |
+
## Model Card Contact
|
| 659 |
+
|
| 660 |
+
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
|
| 661 |
+
-->
|
config.json
ADDED
|
@@ -0,0 +1,30 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_cross_attention": false,
|
| 3 |
+
"architectures": [
|
| 4 |
+
"BertModel"
|
| 5 |
+
],
|
| 6 |
+
"attention_probs_dropout_prob": 0.1,
|
| 7 |
+
"bos_token_id": null,
|
| 8 |
+
"classifier_dropout": null,
|
| 9 |
+
"dtype": "float32",
|
| 10 |
+
"eos_token_id": null,
|
| 11 |
+
"gradient_checkpointing": false,
|
| 12 |
+
"hidden_act": "gelu",
|
| 13 |
+
"hidden_dropout_prob": 0.1,
|
| 14 |
+
"hidden_size": 384,
|
| 15 |
+
"initializer_range": 0.02,
|
| 16 |
+
"intermediate_size": 1536,
|
| 17 |
+
"is_decoder": false,
|
| 18 |
+
"layer_norm_eps": 1e-12,
|
| 19 |
+
"max_position_embeddings": 512,
|
| 20 |
+
"model_type": "bert",
|
| 21 |
+
"num_attention_heads": 12,
|
| 22 |
+
"num_hidden_layers": 6,
|
| 23 |
+
"pad_token_id": 0,
|
| 24 |
+
"position_embedding_type": "absolute",
|
| 25 |
+
"tie_word_embeddings": true,
|
| 26 |
+
"transformers_version": "5.0.0",
|
| 27 |
+
"type_vocab_size": 2,
|
| 28 |
+
"use_cache": true,
|
| 29 |
+
"vocab_size": 30522
|
| 30 |
+
}
|
config_sentence_transformers.json
ADDED
|
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"__version__": {
|
| 3 |
+
"sentence_transformers": "5.3.0",
|
| 4 |
+
"transformers": "5.0.0",
|
| 5 |
+
"pytorch": "2.10.0+cu128"
|
| 6 |
+
},
|
| 7 |
+
"model_type": "SentenceTransformer",
|
| 8 |
+
"prompts": {
|
| 9 |
+
"query": "",
|
| 10 |
+
"document": ""
|
| 11 |
+
},
|
| 12 |
+
"default_prompt_name": null,
|
| 13 |
+
"similarity_fn_name": "cosine"
|
| 14 |
+
}
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f1cdf86a4d878f88e6f568ff78dc5e6254912494d0d04265d09bfc83c33fef65
|
| 3 |
+
size 90864176
|
modules.json
ADDED
|
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
[
|
| 2 |
+
{
|
| 3 |
+
"idx": 0,
|
| 4 |
+
"name": "0",
|
| 5 |
+
"path": "",
|
| 6 |
+
"type": "sentence_transformers.models.Transformer"
|
| 7 |
+
},
|
| 8 |
+
{
|
| 9 |
+
"idx": 1,
|
| 10 |
+
"name": "1",
|
| 11 |
+
"path": "1_Pooling",
|
| 12 |
+
"type": "sentence_transformers.models.Pooling"
|
| 13 |
+
}
|
| 14 |
+
]
|
sentence_bert_config.json
ADDED
|
@@ -0,0 +1,4 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"max_seq_length": 128,
|
| 3 |
+
"do_lower_case": false
|
| 4 |
+
}
|
tokenizer.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"backend": "tokenizers",
|
| 3 |
+
"cls_token": "[CLS]",
|
| 4 |
+
"do_basic_tokenize": true,
|
| 5 |
+
"do_lower_case": true,
|
| 6 |
+
"is_local": false,
|
| 7 |
+
"mask_token": "[MASK]",
|
| 8 |
+
"model_max_length": 128,
|
| 9 |
+
"never_split": null,
|
| 10 |
+
"pad_token": "[PAD]",
|
| 11 |
+
"sep_token": "[SEP]",
|
| 12 |
+
"strip_accents": null,
|
| 13 |
+
"tokenize_chinese_chars": true,
|
| 14 |
+
"tokenizer_class": "BertTokenizer",
|
| 15 |
+
"unk_token": "[UNK]"
|
| 16 |
+
}
|