aasifali4813 commited on
Commit
49a4099
·
verified ·
1 Parent(s): 561238e

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 384,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,661 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:5000
9
+ - loss:MultipleNegativesRankingLoss
10
+ base_model: sentence-transformers/paraphrase-MiniLM-L6-v2
11
+ widget:
12
+ - source_sentence: (CNN) -- A third minute goal from Gonzalo Higuain gave Real Madrid
13
+ a 1-0 victory over Valencia, coach Juande Ramos' first Primera Liga win since
14
+ taking charge of the Spanish champions. Gonzalo Higuain is congratulated after
15
+ scoring Real Madrid's only goal against Valencia. The win ended Madrid's run of
16
+ three straight league defeats and moved them up to 29 points, nine behind leaders
17
+ Barcelona -- who visit Villarreal on Sunday. With captain Raul Gonzalez only on
18
+ the bench befcause of the flu, Higuain played alone in attack, and he soon made
19
+ his mark with the early goal -- the Argentine collecting Arjen Robben's pass to
20
+ fire home his 11th goal of the season. Valencia almost fell two goals behind in
21
+ the 16th minute when Rafael van der Vaart struck the post with a curling drive.
22
+ Higuain had a chance to double Madrid's tally in the second-half, but his close-range
23
+ shot struck the bar. The visitors' hopes were effectively ended when captain Carlos
24
+ Marchena was sent off after picking up his second yellow card, for a foul on
25
+ Robben. The defeat, only Valencia's second of the season after also going down
26
+ to Barcelona, leaves them on 30 points, just one above Madrid.
27
+ sentences:
28
+ - 'California governor honors U.S. Airways pilot Capt. Chesley Sullenberger .
29
+
30
+ Schwarzenegger: "We need a great hero" in state legislature like Sullenberger
31
+ .
32
+
33
+ Sullenberger safely landed flight 1549 in the Hudson River in New York .
34
+
35
+ Sullenberger accepts recognition on behalf of crew members, others .'
36
+ - 'Gonzalo Higuain scoresd the only goal as Real Madrid defeat Valencia 1-0 .
37
+
38
+ The victory marks first success in the Primera Liga for coach Juande Ramos .
39
+
40
+ Valencia, who had Carlos Marchena sent off, were losing only second match .'
41
+ - 'By definition, a premature baby is born before the 37th week of pregnancy .
42
+
43
+ Premature babies cost the U.S. at least $26 billion each year .
44
+
45
+ There are risk factors, but not all premature births can be prevented .
46
+
47
+ In tough economic times, pregnant women should not cut back on health care .'
48
+ - source_sentence: (CNN) -- The U.S. Olympic Committee and the International Olympic
49
+ Committee are expected to ask a federal judge Monday to shut down Web sites they
50
+ allege scam customers trying to buy Olympic tickets, according to court documents.
51
+ The Web site www.beijingticketing.com is accused in a lawsuit of scamming Olympic
52
+ ticket buyers. The IOC and the USOC filed lawsuits on July 22 against several
53
+ Web sites -- primarily www.beijingticketing.com and www.beijing-2008tickets.com
54
+ -- for illegally using Olympic trademarks to dupe customers into giving them credit
55
+ card, passport and banking information. Lawyers for the IOC and USOC are expected
56
+ to petition U.S. District Court Judge Jeffrey White on Monday to permanently shut
57
+ down several sites listed in the lawsuit. The lawsuit names the company XL & H
58
+ Ltd, known as Xclusive Leisure & Hospitality Ltd. and six other Web sites believed
59
+ to be fraudulent. Several consumers who purchased tickets from the site contacted
60
+ the USOC when they did not receive tickets, despite numerous calls and e-mails
61
+ to the Web sites founder, according to a USOC press release. The scam has hit
62
+ Olympic fans in the United States, Australia, New Zealand, England, Japan, China
63
+ and Norway, according to media reports. Australian Olympic Committee President
64
+ John Coates told local media that relatives of the country's softball team had
65
+ been victims of the site. The Sydney Morning Herald reported that the New South
66
+ Wales Government set up a hotline for those duped out of tickets. The hotline
67
+ has received hundreds of calls from around the globe with consumers detailing
68
+ losses as high as $57,000. The IOC and USOC successfully secured a restraining
69
+ order on July 23 in federal court in Phoenix, Arizona, that shut down www.beijing-2008tickets.com,
70
+ according to court documents. That site is now shut down and no contact information
71
+ is available. The site www.BeijingTicketing.com priced tickets for Friday's Olympic
72
+ opening ceremony at about $2,000, with events such as swimming selling for between
73
+ $300 and $500. The site is the first entry that comes up for a Google search for
74
+ "Olympic tickets," second only to the authorized dealer of Olympic tickets, www.cosport.com.
75
+ The Web site ww.beijingticketing.com site lists a London phone number, which rang
76
+ unanswered. The site lists an office address in Arizona. The site boasts tickets
77
+ for nearly every Olympic event, with some events showing sell-outs already. The
78
+ site also looked to assure ticket buyers of its authenticity with a note at the
79
+ bottom of the site. "Beijing 2008 Ticketing is a well known tickets agent and
80
+ a trusted market place for buyers to find book Olympic tickets," the note stated.
81
+ "We assure 100% guaranteed tickets delivery."
82
+ sentences:
83
+ - 'French National Assembly backs president''s plan to rejoin NATO .
84
+
85
+ President Nicolas Sarkozy''s plan is politically sensitive .
86
+
87
+ France was a founding member of NATO but left in 1966 .
88
+
89
+ President Charles de Gaulle said in 1966 membership meant losing independence
90
+ .'
91
+ - 'Harry Nicolaides, 41, was arrested last August over his 2005 book .
92
+
93
+ The book includes a paragraph about the king and crown prince .
94
+
95
+ Authorities deemed it a violation of a law that makes it illegal to insult the
96
+ crown .
97
+
98
+ Man''s lawyer says he was relieved and left for Australia at about midnight .'
99
+ - 'USOC, IOC claim fake ticket sites have bilked customers using their trademark
100
+ .
101
+
102
+ Sites www.BeijingTicketing.com and www.beijing-2008tickets.com named in suit .
103
+
104
+ One site claims "We assure 100 % guaranteed tickets delivery"'
105
+ - source_sentence: (CNN) -- Police are searching for any other alleged victims of
106
+ a Sunday school teacher accused of raping and killing an 8-year-old girl from
107
+ Tracy, California. Melissa Huckaby is charged with killing 8-year-old Sandra Cantu,
108
+ who was a friend of her own daughter. "We are asking the public if they have any
109
+ indication that any of their children may have had inappropriate contact with
110
+ [the suspect] to contact us," Tracy Police Sgt. Tony Sheneman said Tuesday evening.
111
+ "There has been no indication that this has happened. But she is a Sunday school
112
+ teacher and did have contact with children, so that is why we are asking." Melissa
113
+ Huckaby, 28, was arrested and charged with killing and raping 8-year-old Sandra
114
+ Cantu, a friend of her 5-year-old daughter. Huckaby, wearing red jail scrubs over
115
+ a white T-shirt, was in court Tuesday for an arraignment. She was charged with
116
+ murder, kidnapping, the performance of a lewd and lascivious act on a child under
117
+ 14 and rape by instrument. If convicted, Huckaby would face the death penalty
118
+ or life in prison without parole, San Joaquin County District Attorney James Willett
119
+ told reporters after the hearing. A decision on whether to seek the death penalty
120
+ will be made later, he said. Huckaby did not enter a plea or speak during the
121
+ court hearing, although she flinched when Sandra Cantu's name was spoken, and
122
+ she cried at times. Sandra's body was found April 6, stuffed into a suitcase and
123
+ submerged in a pond at a dairy farm. Huckaby was arrested Friday night after questioning
124
+ by police. Sandra was last seen alive March 27 in the mobile home park where she
125
+ lived with her family -- the same mobile home park where Huckaby lives with her
126
+ daughter. The two children were close friends and played together frequently,
127
+ police said. Huckaby is the granddaughter of Clifford Lane Lawless, pastor of
128
+ Clover Road Baptist Church near the mobile home park, and she taught Sunday school
129
+ at the church, police have said. The church was searched as part of the investigation
130
+ into Sandra's disappearance and death. Before her arrest, Huckaby acknowledged
131
+ to a newspaper reporter that she owned the suitcase that contained Sandra's body.
132
+ But Huckaby said the suitcase had been stolen. Willett declined to comment on
133
+ any of the evidence or allegations in the case, saying evidence would be presented
134
+ in court. A memorial ceremony is scheduled for Sandra at 1 p.m. Thursday at a
135
+ high school in Tracy, police said. -- CNN's Ninette Sosa contributed to this report.
136
+ sentences:
137
+ - 'President Bush continues a Christmas tradition by issuing pardons .
138
+
139
+ Convicted Cheney aide Lewis "Scooter" Libby is not among those pardoned .
140
+
141
+ Posthumous pardon issued for man who ran guns to Israeli Jews in late 1940s .
142
+
143
+ Life sentence of man convicted of methamphetamine commuted .'
144
+ - 'Police comb community for other possible victims .
145
+
146
+ Melissa Huckaby was arrested and charged with killing and raping Sandra Cantu
147
+ .
148
+
149
+ Sandra, 8, was friends with Huckaby''s 5-year-old daughter .'
150
+ - 'San Antonio, Texas, 12-year-old wins "Doodle for Google" national contest .
151
+
152
+ Christin Engelberth''s work will grace Google''s main site for 24 hours Thursday
153
+ .
154
+
155
+ Other prizes: $25,000 college scholarship, laptop, tech grant for her school .
156
+
157
+ When she grows up, she said, she hopes to do "anything to do with art"'
158
+ - source_sentence: 'Editor''s note: Fawaz A. Gerges holds the Christian A. Johnson
159
+ Chair in Middle Eastern Studies and International Affairs at Sarah Lawrence College.
160
+ His most recent book is "The Far Enemy: Why Jihad Went Global." Fawaz Gerges says
161
+ the elite running Iran has lost the support of two key groups -- women and young
162
+ voters. (CNN) -- With an apparent political coup in Iran by President Mahmoud
163
+ Ahmadinejad and his supporters over the weekend, the ruling mullahs have dispensed
164
+ with all democratic pretense and joined the ranks of traditional dictators in
165
+ the Middle East. The hardliners in Tehran, led by the Revolutionary Guards and
166
+ ultraconservatives, have won the first round against reformist conservatives but
167
+ at an extravagant cost: loss of public support. Widespread accusations of fraud
168
+ and manipulation are calling into question the very legitimacy and authority of
169
+ the mullahs'' Islamic-based regime. The electoral crisis has exposed a deepening
170
+ divide between female and young voters, who represent about 70 percent of the
171
+ population, and a radical conservative ruling elite out of touch with the hopes,
172
+ fears and aspirations of young Iranians. The consensus in Iran, particularly among
173
+ young voters, is that the election was stolen from reformist candidate, Mir Hossein
174
+ Moussavi, and that the outcome did not reflect the electorate''s genuine will.
175
+ After the Interior Ministry announced the final election results showing a nearly
176
+ 2-to-1 landslide for Ahmadinejad (62.63 to 33.75 percent), thousands of young
177
+ protesters took to the streets and clashed with police and set trash bins and
178
+ tires ablaze. Shock and disbelief turned to anger and rage. What poured gasoline
179
+ on a simmering fire among the youth was a belief of widespread fraud. "It''s like
180
+ taking 10 million votes away from Moussavi and giving them to Ahmadinejad," said
181
+ Tehran-based analyst Saeed Leilaz. That is equivalent to more than 30 million
182
+ votes in the American electorate. "Death to the dictatorship" and "down with the
183
+ dictator" shouted the demonstrators while being clubbed by police and anti-riot
184
+ squads. For the first time, young Iranians showed defiance against their tormentors
185
+ and overcame their fear. Many young men reportedly roamed Tehran streets looking
186
+ for a fight with the police. The ruling mullahs face an unprecedentedly serious
187
+ crisis at home. While it does not threaten their existence, the crisis risks undermining
188
+ their ability to govern effectively and pursue a successful foreign policy. In
189
+ his first response to the results of the election, Moussavi nailed it on the head.
190
+ "I won''t surrender to this manipulation," he wrote on his Web site. "The result
191
+ of such performance by some officials will jeopardize the pillars of the Islamic
192
+ Republic and will establish tyranny." Moussavi''s warning to the mullahs that
193
+ stealing the election would weaken the very foundation of their regime and ultimately
194
+ bring about its collapse carries weight because he has been part of the political
195
+ inner circle of the Islamic Republic, not an outsider. Moussavi is a former prime
196
+ minister admired for the way he managed the country''s economy during the prolonged
197
+ and bloody Iran-Iraq war in the 1980s, a conflict which cost Iran over $500 billion.
198
+ He worked closely with Ali Khamenei, then Iran''s president and today supreme
199
+ leader, and clashed with him over political authority and powers. Moussavi is
200
+ a member of Iran''s Expediency Council, which mediates between the parliament
201
+ and the non-elected Guardian Council led by Ayatollah Khamenei. Initially, many
202
+ reformists were skeptical about Moussavi''s reform credentials and feared that
203
+ he was too conservative for their taste. Throughout the presidential campaign,
204
+ Moussavi labored hard to portray his proposals on social policy and foreign affairs
205
+ as an extension of the Islamic system in order to disarm conservative critics,
206
+ even denying that he is a mainstream reformist candidate in the hope of winning
207
+ the support of reformers and moderate conservatives. Indeed, as the presidential
208
+ campaign progressed, Moussavi won the backing not only of an important conservative
209
+ segment of the electorate but also the formidable youth constituency. His charismatic
210
+ wife, Zahra Rahnavard, electrified the female vote and won the hearts and minds
211
+ of women voters who flooded their campaign rallies. In the past two weeks, Moussavi''s
212
+ campaign gained momentum. There was increasing evidence that the tide was turning
213
+ and that women and young voters would tip the balance of power his way, if they
214
+ turned out to vote in large numbers. Some important facts: There are 46 million
215
+ eligible voters in Iran, half of whom are women. In 1997, more than 60 percent
216
+ of the votes that brought moderate Mohammad Khatami to power were cast by women,
217
+ and in 2000, women voters were instrumental in giving reformists a sweeping majority
218
+ in the Parliament. Promising greater individual freedoms to youths was instrumental
219
+ in the two landslide victories by Khatami in 1997 and 2001. Surely, if the turnout
220
+ was 85 percent as the Interior Ministry said, Moussavi should have done much better
221
+ than the mere 33.8 percent he supposedly received. According to many independent
222
+ media reports from all over Iran, women and young voters turned out in record
223
+ numbers, especially in cities where Moussavi is very popular. That should have
224
+ given Moussavi a comfortable lead over the incumbent. Multiple irregularities
225
+ were reported, including the fact that Moussavi supposedly lost the vote in his
226
+ home province. There is an alarming discrepancy between the final results and
227
+ the voting patterns of the Iranian electorate over more than a decade. Lack of
228
+ transparency in the vote count has reinforced Iranians'' suspicion. The weight
229
+ of evidence is at odds with the final outcome given by the Interior Ministry.
230
+ Before Moussavi formally appealed the election result, supreme leader Khamenei
231
+ closed the door for any possible compromise. In a message on state TV, he urged
232
+ the nation to unite behind Ahmadinejad, calling the result a "divine assessment."
233
+ Yet the Guardian Council, not Khamenei, is the authority that either ratifies
234
+ or annuls the results. It is refreshing that Khamenei has finally ordered an investigation
235
+ into allegations of ballot fraud. It is a good start. A more reassuring intiative
236
+ would be for the Guardian Council to order a full recount with the presence of
237
+ independent observers. That would go a long way to putting to rest Iranians''
238
+ concerns. In the meantime, trying to divert attention from the mess at home, Khamenei
239
+ and Ahmadinejad have blamed external "enemies" and "foreign media" for instigating
240
+ a "full-fledged fight against our people." The truth is that the crisis is internal
241
+ and has nothing to do with foreign media or the West. The Obama administration
242
+ has done the right thing by keeping a low profile and not actively interfering
243
+ in Iran''s unfolding domestic crisis. Unlike his predecessor, Obama has defused
244
+ the minefield of relations between Washington and Tehran, shifting the focus of
245
+ Iranians to their own society. But the disputed result shows that the ultraconservative
246
+ mullahs are not only out of touch with a plurality of their citizens but also
247
+ with reality. Their conduct reflects a deeper crisis of self-confidence and fear
248
+ of the future. Has the Islamic revolution run out of ideological steam? If the
249
+ mullahs fear Moussavi, a loyalist, they must be scared of their shadows and uncertain
250
+ about their authority and power. That speaks volumes about where the Islamic Republic
251
+ is and where it is heading. The mullahs are swimming against the dominant current
252
+ of Iranian society. In the next four years, Iran will likely be engulfed in social
253
+ and political turmoil unless the electoral crisis is resolved in a transparent
254
+ manner. The opinions expressed in this commentary are solely those of Fawaz Gerges.'
255
+ sentences:
256
+ - 'Celtic defeat rivals Rangers 2-0 to win Scottish League Cup final at Hampden
257
+ .
258
+
259
+ Extra-time goals by Darren O''Dea and Aiden McGeady secure cup for Celtic .
260
+
261
+ The result will give Celtic a boost as they battle with Rangers for Scottish title
262
+ .'
263
+ - 'Rare species of freshwater dolphins found in Bangladesh''s waterlogged jungles
264
+ .
265
+
266
+ Fears Irrawaddy had dwindled in number to just a few hundred .
267
+
268
+ However, researchers have now counted almost 6,000 .'
269
+ - 'Fawaz Gerges: Iran''s ruling mullahs forfeited their public support in the election
270
+ .
271
+
272
+ Gerges: Ruling mullahs are out of step with women and young voters .
273
+
274
+ He says hardliners are swimming against the current of Iranian society .'
275
+ - source_sentence: '(CNN) -- Eight Florida teenagers -- six of them girls -- will
276
+ be tried as adults and could be sentenced to life in prison for their alleged
277
+ roles in the videotaped beating of another teen, the state attorney''s office
278
+ said Thursday. The teenagers seen in a video assaulting a 16-year-old could face
279
+ life in prison. The suspects, who range in age from 14 to 18, all face charges
280
+ of kidnapping, which is a first-degree felony, and battery, said Chip Thullbery,
281
+ a spokesman for the Polk County state attorney. Three of them are also charged
282
+ with tampering with a witness. Everyone involved in the case was under a gag order
283
+ imposed by a judge. The only attorney for the teens who has been publicly identified
284
+ did not return calls from CNN, and his assistant cited the gag order as the reason.
285
+ The teens are scheduled for their first appearance in court Friday. The video
286
+ shows a brutal scene: The 16-year-old victim is punched, kneed and slapped by
287
+ other girls. She huddles in the fetal position, or stands and screams at her attackers,
288
+ but the assault continues. Authorities say the eight teens said they were retaliating
289
+ for insults posted on the Internet by the attack victim. Polk County Sheriff Grady
290
+ Judd called the March 30 attack "animalistic." "I''ve been involved in law enforcement
291
+ for 35 years, and I''ve seen a lot of extremely violent events, but I''ve never
292
+ seen children, 14 to 18 years of age, engage in this conduct for a 30-minute period
293
+ of time and then make these video clips," he said. Police say the teens planned
294
+ to post the video on YouTube. Watch the disturbing video » . The victim, a 16-year-old
295
+ from Lakeland, Florida, was hospitalized, and still has blurred vision, hearing
296
+ loss, and a swollen face, her mother told CNN on Wednesday. The video shows only
297
+ girls doing the beating; Judd said the boys acted as lookouts. The idea of girls
298
+ administering a vicious beating so they can post the video online may seem shocking,
299
+ but it''s becoming an increasingly common scenario, according to experts and news
300
+ reports. Watch why more teens are putting fights online » . A search for "girl
301
+ fight" on YouTube gets thousands of results, and a suggestion to also try "girl
302
+ fight at school, boy girl fight" and other search terms. There''s at least one
303
+ Web site devoted exclusively to videos of girls fighting. In 2003, 25 percent
304
+ of high school girls said they had been in a physical fight in the past year,
305
+ according to a survey by the Centers for Disease Control and Prevention. (The
306
+ figure for boys was 40.5 percent.) A Justice Department report released in 2006
307
+ showed that by age 17, 21 percent of girls said they had assaulted someone with
308
+ the intent to cause serious harm. Frank Green is executive director of Keys to
309
+ Safer Schools, a group that studies and tries to prevent school violence. He said
310
+ he''s not sure whether girls have actually become more violent, or whether there''s
311
+ just more awareness of their fights. "In one respect, girls have always been more
312
+ vicious than boys," Green said. "Their violence is of a personal nature." He said
313
+ boys usually have some focus and a concrete goal when they fight. "But girls want
314
+ to cause pain and make the other girl feel bad," he said. Judd, the Polk County
315
+ sheriff, said an important part of the plan in the Lakeland attack was to post
316
+ the video of the beating on YouTube to humiliate and embarrass the victim. "It''s
317
+ the next stage of cyberbullying," psychologist Susan Lipkins said. "They want
318
+ to show what they''re doing." "Our kids are being peer pressured, in another sense
319
+ of a trend, to put these shock videos out there at other peoples'' expense," said
320
+ Talisa Lindsay, the victim''s mother. "And I hope that it doesn''t come to the
321
+ point where there''s more people''s lives that are being affected by having to
322
+ take a beating for entertainment, or possibly being killed." Watch mother describe
323
+ how the victim is doing » . The suspects didn''t have a chance to post the video
324
+ online before police moved in and seized it, Judd said. The Sheriff''s Department
325
+ made it public, and it wound up on YouTube anyway. Judd recognizes the irony.
326
+ "In a perverted sense, we were feeding into exactly what the kids wanted," he
327
+ said. "But according to Florida law, [the video] is public record, and it''s going
328
+ to be in the public domain whether we agree with that or not." Judd said the suspects
329
+ showed no remorse when they were arrested and booked. "They were laughing and
330
+ joking about, ''I guess we won''t get to go to the beach during spring break.''
331
+ And one ... asked whether she could go to cheerleading practice," he said. Lipkins,
332
+ the psychologist, says there''s a "disconnect between their actions and their
333
+ thoughts." "They think the entire society is doing it, and they think it''s funny.
334
+ So they put it on YouTube. And I don''t think they expect kids to get really hurt,
335
+ and they also don''t expect to get really caught." E-mail to a friend . CNN''s
336
+ Rich Phillips contributed to this report.'
337
+ sentences:
338
+ - 'Between 4 and 6 percent of the U.S. population suffers from SAD .
339
+
340
+ Light therapy, psychotherapy, and medications are treatment options .
341
+
342
+ Recent research indicates genetic component to the condition .'
343
+ - 'Cornelia Wallace was in her late 60s .
344
+
345
+ She was with Wallace when would-be assassin shot him in 1972 .
346
+
347
+ "She served as first lady during a very turbulent time," Gov. Bob Riley says .'
348
+ - 'Eight Florida teens to be tried as adults in videotaped beating case .
349
+
350
+ Video shows 16-year-old girl punched by other girls .
351
+
352
+ 21 percent of girls age 17 say they''ve assaulted someone, the Justice Dept. reports
353
+ .
354
+
355
+ The teens have "disconnect" between thoughts and actions, psychologist says .'
356
+ pipeline_tag: sentence-similarity
357
+ library_name: sentence-transformers
358
+ ---
359
+
360
+ # SentenceTransformer based on sentence-transformers/paraphrase-MiniLM-L6-v2
361
+
362
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/paraphrase-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
363
+
364
+ ## Model Details
365
+
366
+ ### Model Description
367
+ - **Model Type:** Sentence Transformer
368
+ - **Base model:** [sentence-transformers/paraphrase-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2) <!-- at revision c9a2bfebc254878aee8c3aca9e6844d5bbb102d1 -->
369
+ - **Maximum Sequence Length:** 128 tokens
370
+ - **Output Dimensionality:** 384 dimensions
371
+ - **Similarity Function:** Cosine Similarity
372
+ <!-- - **Training Dataset:** Unknown -->
373
+ <!-- - **Language:** Unknown -->
374
+ <!-- - **License:** Unknown -->
375
+
376
+ ### Model Sources
377
+
378
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
379
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
380
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
381
+
382
+ ### Full Model Architecture
383
+
384
+ ```
385
+ SentenceTransformer(
386
+ (0): Transformer({'max_seq_length': 128, 'do_lower_case': False, 'architecture': 'BertModel'})
387
+ (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
388
+ )
389
+ ```
390
+
391
+ ## Usage
392
+
393
+ ### Direct Usage (Sentence Transformers)
394
+
395
+ First install the Sentence Transformers library:
396
+
397
+ ```bash
398
+ pip install -U sentence-transformers
399
+ ```
400
+
401
+ Then you can load this model and run inference.
402
+ ```python
403
+ from sentence_transformers import SentenceTransformer
404
+
405
+ # Download from the 🤗 Hub
406
+ model = SentenceTransformer("aasifali4813/bert-summarizer")
407
+ # Run inference
408
+ sentences = [
409
+ '(CNN) -- Eight Florida teenagers -- six of them girls -- will be tried as adults and could be sentenced to life in prison for their alleged roles in the videotaped beating of another teen, the state attorney\'s office said Thursday. The teenagers seen in a video assaulting a 16-year-old could face life in prison. The suspects, who range in age from 14 to 18, all face charges of kidnapping, which is a first-degree felony, and battery, said Chip Thullbery, a spokesman for the Polk County state attorney. Three of them are also charged with tampering with a witness. Everyone involved in the case was under a gag order imposed by a judge. The only attorney for the teens who has been publicly identified did not return calls from CNN, and his assistant cited the gag order as the reason. The teens are scheduled for their first appearance in court Friday. The video shows a brutal scene: The 16-year-old victim is punched, kneed and slapped by other girls. She huddles in the fetal position, or stands and screams at her attackers, but the assault continues. Authorities say the eight teens said they were retaliating for insults posted on the Internet by the attack victim. Polk County Sheriff Grady Judd called the March 30 attack "animalistic." "I\'ve been involved in law enforcement for 35 years, and I\'ve seen a lot of extremely violent events, but I\'ve never seen children, 14 to 18 years of age, engage in this conduct for a 30-minute period of time and then make these video clips," he said. Police say the teens planned to post the video on YouTube. Watch the disturbing video » . The victim, a 16-year-old from Lakeland, Florida, was hospitalized, and still has blurred vision, hearing loss, and a swollen face, her mother told CNN on Wednesday. The video shows only girls doing the beating; Judd said the boys acted as lookouts. The idea of girls administering a vicious beating so they can post the video online may seem shocking, but it\'s becoming an increasingly common scenario, according to experts and news reports. Watch why more teens are putting fights online » . A search for "girl fight" on YouTube gets thousands of results, and a suggestion to also try "girl fight at school, boy girl fight" and other search terms. There\'s at least one Web site devoted exclusively to videos of girls fighting. In 2003, 25 percent of high school girls said they had been in a physical fight in the past year, according to a survey by the Centers for Disease Control and Prevention. (The figure for boys was 40.5 percent.) A Justice Department report released in 2006 showed that by age 17, 21 percent of girls said they had assaulted someone with the intent to cause serious harm. Frank Green is executive director of Keys to Safer Schools, a group that studies and tries to prevent school violence. He said he\'s not sure whether girls have actually become more violent, or whether there\'s just more awareness of their fights. "In one respect, girls have always been more vicious than boys," Green said. "Their violence is of a personal nature." He said boys usually have some focus and a concrete goal when they fight. "But girls want to cause pain and make the other girl feel bad," he said. Judd, the Polk County sheriff, said an important part of the plan in the Lakeland attack was to post the video of the beating on YouTube to humiliate and embarrass the victim. "It\'s the next stage of cyberbullying," psychologist Susan Lipkins said. "They want to show what they\'re doing." "Our kids are being peer pressured, in another sense of a trend, to put these shock videos out there at other peoples\' expense," said Talisa Lindsay, the victim\'s mother. "And I hope that it doesn\'t come to the point where there\'s more people\'s lives that are being affected by having to take a beating for entertainment, or possibly being killed." Watch mother describe how the victim is doing » . The suspects didn\'t have a chance to post the video online before police moved in and seized it, Judd said. The Sheriff\'s Department made it public, and it wound up on YouTube anyway. Judd recognizes the irony. "In a perverted sense, we were feeding into exactly what the kids wanted," he said. "But according to Florida law, [the video] is public record, and it\'s going to be in the public domain whether we agree with that or not." Judd said the suspects showed no remorse when they were arrested and booked. "They were laughing and joking about, \'I guess we won\'t get to go to the beach during spring break.\' And one ... asked whether she could go to cheerleading practice," he said. Lipkins, the psychologist, says there\'s a "disconnect between their actions and their thoughts." "They think the entire society is doing it, and they think it\'s funny. So they put it on YouTube. And I don\'t think they expect kids to get really hurt, and they also don\'t expect to get really caught." E-mail to a friend . CNN\'s Rich Phillips contributed to this report.',
410
+ 'Eight Florida teens to be tried as adults in videotaped beating case .\nVideo shows 16-year-old girl punched by other girls .\n21 percent of girls age 17 say they\'ve assaulted someone, the Justice Dept. reports .\nThe teens have "disconnect" between thoughts and actions, psychologist says .',
411
+ 'Cornelia Wallace was in her late 60s .\nShe was with Wallace when would-be assassin shot him in 1972 .\n"She served as first lady during a very turbulent time," Gov. Bob Riley says .',
412
+ ]
413
+ embeddings = model.encode(sentences)
414
+ print(embeddings.shape)
415
+ # [3, 384]
416
+
417
+ # Get the similarity scores for the embeddings
418
+ similarities = model.similarity(embeddings, embeddings)
419
+ print(similarities)
420
+ # tensor([[1.0000, 0.7511, 0.1292],
421
+ # [0.7511, 1.0000, 0.1549],
422
+ # [0.1292, 0.1549, 1.0000]])
423
+ ```
424
+
425
+ <!--
426
+ ### Direct Usage (Transformers)
427
+
428
+ <details><summary>Click to see the direct usage in Transformers</summary>
429
+
430
+ </details>
431
+ -->
432
+
433
+ <!--
434
+ ### Downstream Usage (Sentence Transformers)
435
+
436
+ You can finetune this model on your own dataset.
437
+
438
+ <details><summary>Click to expand</summary>
439
+
440
+ </details>
441
+ -->
442
+
443
+ <!--
444
+ ### Out-of-Scope Use
445
+
446
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
447
+ -->
448
+
449
+ <!--
450
+ ## Bias, Risks and Limitations
451
+
452
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
453
+ -->
454
+
455
+ <!--
456
+ ### Recommendations
457
+
458
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
459
+ -->
460
+
461
+ ## Training Details
462
+
463
+ ### Training Dataset
464
+
465
+ #### Unnamed Dataset
466
+
467
+ * Size: 5,000 training samples
468
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
469
+ * Approximate statistics based on the first 1000 samples:
470
+ | | sentence_0 | sentence_1 |
471
+ |:--------|:-------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
472
+ | type | string | string |
473
+ | details | <ul><li>min: 29 tokens</li><li>mean: 127.89 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 26 tokens</li><li>mean: 56.37 tokens</li><li>max: 93 tokens</li></ul> |
474
+ * Samples:
475
+ | sentence_0 | sentence_1 |
476
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
477
+ | <code>(CNN) -- It may take a lot of frequent-flier miles, a penchant for cold places, a tolerance of taxes and regular doses of chocolate, but happiness could be within reach. However, it's not where most people might expect. Journalist Eric Weiner says he wanted to explore the relationship between place and happiness. Just ask Eric Weiner, who made it his mission to find the most content places around the globe, uncovering lots of surprises along the way. Hungering for a tropical paradise? A warm climate doesn't necessarily make a happy nation, Weiner said. Thinking of moving to a wealthy state? Money can degrade happiness, he found. Weiner, who wrote the book, "The Geography of Bliss: One Grump's Search for the Happiest Places in the World," began his quest for very personal reasons. "I'm an unhappy person, so it's kind of what prompts a hungry person to search for food," he said. Weiner spent 10 years as a foreign correspondent for National Public Radio, a job that took him to some of the...</code> | <code>Journalist spent a year looking for the world's happiest countries .<br>Eric Weiner: Bhutan is probably the closest thing on Earth to Shangri-La .<br>He marvels at the creativity and "coziness" of Iceland .<br>Self-described "grump:" chocolate contributes to happiness in Switzerland .</code> |
478
+ | <code>(CNN) -- Hamburg have put one foot in the UEFA Cup final after a header from Germany winger Piotr Trochowski proved enough to give them a 1-0 win at Bundesliga rivals Werder Bremen in the first leg of their semifinal. Piotr Trochowski celebrates the only goal as Hamburg took a major step towards the UEFA Cup final. Martin Jol's side scored the only goal of a pulsating match in the 38th minute when Trochowski, the smallest player on the pitch, rose superbly at the back post to head Guy Demel's right-wing cross past goalkeeper Tim Wiese. Both side had countless half-chances to score but Hamburg wasted the best of them on the hour mark when Bayern Munich-bound striker Ivica Olic broke through in acres of space, but fired his shot straight at Wiese. Hamburg's victory puts them on line for their first European final since 1983, when they beat Juventus 1-0 to win the European Cup. This was the second of four matches in quick succession between the north Germany neighbors -- who met each othe...</code> | <code>Hamburg in line for first European final since 1983 after defeating Werder 1-0 .<br>Winger Piotr Trochowski heads in the only goal of their UEFA Cup semifinal .<br>Eventual winners will face Shakhtar Donetsk or Dynamo Kiev in Istanbul final .</code> |
479
+ | <code>(CNET) -- Suleman Ali cashed out just in time. Suleman Ali sold Esgut, his portfolio of Facebook applications, for seven figures in April. The 26-year-old, a former Microsoft employee who helped put together the Windows Home Server product, founded a company called Esgut within months of the debut of Facebook's developer platform in May 2007. Esgut is a portfolio of Facebook applications, and a few of them, like Superlatives and Entourage, became genuine viral hits. In April, Ali sold the 12-employee Esgut to the Social Gaming Network, a Silicon Valley company backed by the likes of Bezos Expeditions, the Founders Fund, and Greylock Partners. He said the price was in the seven figures. But Ali is the first to acknowledge that for upstart social-platform developers, hailed just months ago as the Valley's hottest breed of bright young things, the condition has taken a significant turn for the worse. "Most people are not counting on anything," the lanky and bespectacled Ali said over lunc...</code> | <code>Suleman Ali sold Esgut, his tech startup, for seven figures in April .<br>Esgut is a portfolio of Facebook applications; a few of them became big viral hits .<br>Suleman "started building Facebook apps just out of restlessness"<br>He sold his company just before the social-platform craze subsided .</code> |
480
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
481
+ ```json
482
+ {
483
+ "scale": 20.0,
484
+ "similarity_fct": "cos_sim",
485
+ "gather_across_devices": false,
486
+ "directions": [
487
+ "query_to_doc"
488
+ ],
489
+ "partition_mode": "joint",
490
+ "hardness_mode": null,
491
+ "hardness_strength": 0.0
492
+ }
493
+ ```
494
+
495
+ ### Training Hyperparameters
496
+ #### Non-Default Hyperparameters
497
+
498
+ - `per_device_train_batch_size`: 16
499
+ - `per_device_eval_batch_size`: 16
500
+ - `num_train_epochs`: 1
501
+ - `multi_dataset_batch_sampler`: round_robin
502
+
503
+ #### All Hyperparameters
504
+ <details><summary>Click to expand</summary>
505
+
506
+ - `do_predict`: False
507
+ - `eval_strategy`: no
508
+ - `prediction_loss_only`: True
509
+ - `per_device_train_batch_size`: 16
510
+ - `per_device_eval_batch_size`: 16
511
+ - `gradient_accumulation_steps`: 1
512
+ - `eval_accumulation_steps`: None
513
+ - `torch_empty_cache_steps`: None
514
+ - `learning_rate`: 5e-05
515
+ - `weight_decay`: 0.0
516
+ - `adam_beta1`: 0.9
517
+ - `adam_beta2`: 0.999
518
+ - `adam_epsilon`: 1e-08
519
+ - `max_grad_norm`: 1
520
+ - `num_train_epochs`: 1
521
+ - `max_steps`: -1
522
+ - `lr_scheduler_type`: linear
523
+ - `lr_scheduler_kwargs`: None
524
+ - `warmup_ratio`: None
525
+ - `warmup_steps`: 0
526
+ - `log_level`: passive
527
+ - `log_level_replica`: warning
528
+ - `log_on_each_node`: True
529
+ - `logging_nan_inf_filter`: True
530
+ - `enable_jit_checkpoint`: False
531
+ - `save_on_each_node`: False
532
+ - `save_only_model`: False
533
+ - `restore_callback_states_from_checkpoint`: False
534
+ - `use_cpu`: False
535
+ - `seed`: 42
536
+ - `data_seed`: None
537
+ - `bf16`: False
538
+ - `fp16`: False
539
+ - `bf16_full_eval`: False
540
+ - `fp16_full_eval`: False
541
+ - `tf32`: None
542
+ - `local_rank`: -1
543
+ - `ddp_backend`: None
544
+ - `debug`: []
545
+ - `dataloader_drop_last`: False
546
+ - `dataloader_num_workers`: 0
547
+ - `dataloader_prefetch_factor`: None
548
+ - `disable_tqdm`: False
549
+ - `remove_unused_columns`: True
550
+ - `label_names`: None
551
+ - `load_best_model_at_end`: False
552
+ - `ignore_data_skip`: False
553
+ - `fsdp`: []
554
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
555
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
556
+ - `parallelism_config`: None
557
+ - `deepspeed`: None
558
+ - `label_smoothing_factor`: 0.0
559
+ - `optim`: adamw_torch_fused
560
+ - `optim_args`: None
561
+ - `group_by_length`: False
562
+ - `length_column_name`: length
563
+ - `project`: huggingface
564
+ - `trackio_space_id`: trackio
565
+ - `ddp_find_unused_parameters`: None
566
+ - `ddp_bucket_cap_mb`: None
567
+ - `ddp_broadcast_buffers`: False
568
+ - `dataloader_pin_memory`: True
569
+ - `dataloader_persistent_workers`: False
570
+ - `skip_memory_metrics`: True
571
+ - `push_to_hub`: False
572
+ - `resume_from_checkpoint`: None
573
+ - `hub_model_id`: None
574
+ - `hub_strategy`: every_save
575
+ - `hub_private_repo`: None
576
+ - `hub_always_push`: False
577
+ - `hub_revision`: None
578
+ - `gradient_checkpointing`: False
579
+ - `gradient_checkpointing_kwargs`: None
580
+ - `include_for_metrics`: []
581
+ - `eval_do_concat_batches`: True
582
+ - `auto_find_batch_size`: False
583
+ - `full_determinism`: False
584
+ - `ddp_timeout`: 1800
585
+ - `torch_compile`: False
586
+ - `torch_compile_backend`: None
587
+ - `torch_compile_mode`: None
588
+ - `include_num_input_tokens_seen`: no
589
+ - `neftune_noise_alpha`: None
590
+ - `optim_target_modules`: None
591
+ - `batch_eval_metrics`: False
592
+ - `eval_on_start`: False
593
+ - `use_liger_kernel`: False
594
+ - `liger_kernel_config`: None
595
+ - `eval_use_gather_object`: False
596
+ - `average_tokens_across_devices`: True
597
+ - `use_cache`: False
598
+ - `prompts`: None
599
+ - `batch_sampler`: batch_sampler
600
+ - `multi_dataset_batch_sampler`: round_robin
601
+ - `router_mapping`: {}
602
+ - `learning_rate_mapping`: {}
603
+
604
+ </details>
605
+
606
+ ### Framework Versions
607
+ - Python: 3.12.13
608
+ - Sentence Transformers: 5.3.0
609
+ - Transformers: 5.0.0
610
+ - PyTorch: 2.10.0+cu128
611
+ - Accelerate: 1.13.0
612
+ - Datasets: 4.0.0
613
+ - Tokenizers: 0.22.2
614
+
615
+ ## Citation
616
+
617
+ ### BibTeX
618
+
619
+ #### Sentence Transformers
620
+ ```bibtex
621
+ @inproceedings{reimers-2019-sentence-bert,
622
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
623
+ author = "Reimers, Nils and Gurevych, Iryna",
624
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
625
+ month = "11",
626
+ year = "2019",
627
+ publisher = "Association for Computational Linguistics",
628
+ url = "https://arxiv.org/abs/1908.10084",
629
+ }
630
+ ```
631
+
632
+ #### MultipleNegativesRankingLoss
633
+ ```bibtex
634
+ @misc{oord2019representationlearningcontrastivepredictive,
635
+ title={Representation Learning with Contrastive Predictive Coding},
636
+ author={Aaron van den Oord and Yazhe Li and Oriol Vinyals},
637
+ year={2019},
638
+ eprint={1807.03748},
639
+ archivePrefix={arXiv},
640
+ primaryClass={cs.LG},
641
+ url={https://arxiv.org/abs/1807.03748},
642
+ }
643
+ ```
644
+
645
+ <!--
646
+ ## Glossary
647
+
648
+ *Clearly define terms in order to be accessible across audiences.*
649
+ -->
650
+
651
+ <!--
652
+ ## Model Card Authors
653
+
654
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
655
+ -->
656
+
657
+ <!--
658
+ ## Model Card Contact
659
+
660
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
661
+ -->
config.json ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "add_cross_attention": false,
3
+ "architectures": [
4
+ "BertModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "bos_token_id": null,
8
+ "classifier_dropout": null,
9
+ "dtype": "float32",
10
+ "eos_token_id": null,
11
+ "gradient_checkpointing": false,
12
+ "hidden_act": "gelu",
13
+ "hidden_dropout_prob": 0.1,
14
+ "hidden_size": 384,
15
+ "initializer_range": 0.02,
16
+ "intermediate_size": 1536,
17
+ "is_decoder": false,
18
+ "layer_norm_eps": 1e-12,
19
+ "max_position_embeddings": 512,
20
+ "model_type": "bert",
21
+ "num_attention_heads": 12,
22
+ "num_hidden_layers": 6,
23
+ "pad_token_id": 0,
24
+ "position_embedding_type": "absolute",
25
+ "tie_word_embeddings": true,
26
+ "transformers_version": "5.0.0",
27
+ "type_vocab_size": 2,
28
+ "use_cache": true,
29
+ "vocab_size": 30522
30
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "5.3.0",
4
+ "transformers": "5.0.0",
5
+ "pytorch": "2.10.0+cu128"
6
+ },
7
+ "model_type": "SentenceTransformer",
8
+ "prompts": {
9
+ "query": "",
10
+ "document": ""
11
+ },
12
+ "default_prompt_name": null,
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1cdf86a4d878f88e6f568ff78dc5e6254912494d0d04265d09bfc83c33fef65
3
+ size 90864176
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 128,
3
+ "do_lower_case": false
4
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "backend": "tokenizers",
3
+ "cls_token": "[CLS]",
4
+ "do_basic_tokenize": true,
5
+ "do_lower_case": true,
6
+ "is_local": false,
7
+ "mask_token": "[MASK]",
8
+ "model_max_length": 128,
9
+ "never_split": null,
10
+ "pad_token": "[PAD]",
11
+ "sep_token": "[SEP]",
12
+ "strip_accents": null,
13
+ "tokenize_chinese_chars": true,
14
+ "tokenizer_class": "BertTokenizer",
15
+ "unk_token": "[UNK]"
16
+ }