knguyennguyen commited on
Commit
b5ace37
·
verified ·
1 Parent(s): f8eaf51

Add new SentenceTransformer model.

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,558 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:15123
8
+ - loss:MultipleNegativesRankingLoss
9
+ base_model: sentence-transformers/all-mpnet-base-v2
10
+ widget:
11
+ - source_sentence: gaming laptop featuring a large display, high refresh rate, and
12
+ a powerful processor. it includes multiple storage options, a backlit keyboard,
13
+ and various connectivity ports.
14
+ sentences:
15
+ - 'Title: My Hero Academia 3-Inch Funko POP Enamel Pin, Tsuyu Asui"Froppy" | Officially
16
+ Licensed Boku no Hero Collectible | Metal Pins For Backpacks, Jackets | Anime
17
+ Gifts, Superhero Accessories Descripion: [''GO BEYOND PLUS ULTRA This Tsuyu Asui
18
+ Pop! Pin is the Rainy Season Hero you\''ve been waiting for. Tsuyu Asui\''s leadership
19
+ skills, paired with her unconventional frog-like appearance, make her one of the
20
+ most notable students at U.A. High School. The hero-in-training is hopping into
21
+ your My Hero Academia collection with this metal pin. Molded in Funko\''s trademark
22
+ style, the 3D Pop! portrait features a character-accurate design. The Tsuyu Asui-inspired
23
+ Pop! Pin depicts the hero in her "Froppy" costume, finished with silver-colored
24
+ hardware. Regarded as a "perfect pillar of emotional support," now Froppy can
25
+ support you wherever you go. The three butterfly clasps allow the Quirky accessory
26
+ to be pinned securely onto a variety of items, including backpacks, jackets, and
27
+ more. You can also use the built-in flip stand to display the metal pin on flat-surfaced
28
+ spaces. A HEROIC FUNKO POP COLLECTIBLE Funko\''s My Hero Academia Pop! Pins are
29
+ not your average pins. These fun and collectible accessories come in a variety
30
+ of metal finishes, with unique techniques to help push the boundaries of the designs.
31
+ Inspired by your favorite characters from Class 1-A, each metal pin features the
32
+ signature 3D Pop! heads. Many have specialized variant treatments to put them
33
+ above all other pins in quality and style.'']'
34
+ - "Title: Lenovo Ideapad Gaming3i Gaming Laptop RTX3050| 15.6 FHD 120Hz Refresh\
35
+ \ Rate | Intel Core i5-12500H 12Core| Backlit Keyboard | Wi-Fi 6 | USB Type C\
36
+ \ | Windows 11 | HDMI Cable (32GB RAM | 1TB PCIe SSD) Descripion: ['12th Gen Intel\
37
+ \ Core i5 Powered Laptops - Built for the Next Generation of Gaming'\n 'Lenovo\
38
+ \ Ideapad Gaming 3i Laptop 15.6\" FHD Display with NVIDIA GeForce RTX 3050 Graphics\
39
+ \ that combines 120Hz refresh rate for smooth, crisp and tear-free gameplay; \
40
+ \ Intel’s new performance hybrid architecture integrates two core families into\
41
+ \ a single CPU, keeping everything in your gaming universe running smoothly; \
42
+ \ With an HDMI 2.0 port, you can easily plug in to an extra monitor or TV to get\
43
+ \ the full experience; Enjoy 15% cooling improvements across the board with features\
44
+ \ like keyboard air intake. Larger ventilation rates improve on last generation\
45
+ \ performance by 20%, with 10% increased fan airflow for some really cool and\
46
+ \ quiet gaming; With backlit keyboard to aid in key visibility in any lighting\
47
+ \ situation.'\n 'Specifications' 'Operating System:' 'Windows 11 Home' 'Processor:'\n\
48
+ \ '12th Gen Intel Core i5-12500H, 12 Cores (4Performance-cores + 8Efficient-cores)\
49
+ \ / 16Threads, P-core 2.5 - 4.5GHz, E-core 1.8 - 3.3GHz, 18MB'\n 'Display & Graphics:'\n\
50
+ \ '15.6\" FHD (1920 x 1080) 120Hz IPS Display, Anti-glare; NVIDIA GeForce RTX\
51
+ \ 3050 4GB GDDR6 Video Memory'\n 'Memory:' '32GB DDR4 RAM' 'Drives:' '1TB PCI-e\
52
+ \ NVMe Solid State Drive'\n 'Communications:'\n 'Wi-Fi 6 11ax, 2x2 + Bluetooth\
53
+ \ 5.1; Ethernet 100/1000M (RJ-45)' 'Camera:'\n 'HD 720p with Privacy Shutter'\
54
+ \ 'Speakers:'\n 'Stereo speakers, 2W x2, Nahimic Audio' 'Keyboard:' 'Backlit Keyboard'\n\
55
+ \ 'Ports & Slots:'\n '2x USB 3.2 Gen 1 1x HDMI 2.0 1x Ethernet (RJ-45) 1x Headphone\
56
+ \ / microphone combo jack (3.5mm) 1x Power connector'\n 'Power Supply:' '45Whr\
57
+ \ Li-Polymer Battery' 'Additional Information:'\n 'Dimensions: 14.16 x 10.49 x\
58
+ \ 0.86 inches; Approximate Weight: 5.1 lbs']"
59
+ - "Title: MSI GE75 Raider RTX 2060 6GB 17.3\" 144Hz FHD Gaming Laptop Computer,\
60
+ \ Intel Hexa-Core i7-10750H, 32GB DDR4 RAM, 512GB PCIe SSD + 1TB HDD, RGB Backlight\
61
+ \ KB, Windows 10, iPuzzle External HD Descripion: ['We sell computers with upgraded\
62
+ \ configurations. If the computer has modifications (listed above), then the manufacturer\
63
+ \ box is opened for it to be tested and inspected and to install the upgrades\
64
+ \ to achieve the specifications as advertised. If no modifications are listed,\
65
+ \ the item is unopened and untested. Through our in-depth inspection and testing,\
66
+ \ defects and defects can be significantly reduced.'\n 'Processor & Memory:' '10th\
67
+ \ Gen Intel Core i7-10750H Processor'\n '32GB DDR4 2666MHz RAM' 'Drives:'\n '1TB\
68
+ \ 5400RPM Hard Drive + 512GB NVMe Solid State Drive'\n 'No Optical Drive' 'Operating\
69
+ \ System:' 'Windows 10 Home (64-bit)'\n 'Communications:' 'Intel Wi-Fi 6 AX201\
70
+ \ 2x2 WLAN + Bluetooth 5.0'\n 'Integrated 720P HD Webcam'\n 'Killer Gaming Network\
71
+ \ E3100 (10/100/1000 mbps) Ethernet LAN'\n 'Graphics & Video:' '17.3\" FHD (1920\
72
+ \ x 1080) 144Hz 3ms Display'\n 'NVIDIA GeForce RTX 2060, 6GB' 'Audio:'\n '2x 3W\
73
+ \ Giant Speakers + 2x 3W Subwoofer' 'Keyboard:'\n 'Steel Series RGB Backlight\
74
+ \ Keyboard with Anti-Ghost Key + Silver Lining'\n 'Ports & Slots:' '1x USB 3.2\
75
+ \ Gen 2 Type-C' '2x USB 3.2 Gen 1'\n '1x USB 3.2 Gen 2' '1x HDMI-Out' '1x Media\
76
+ \ Card Reader'\n '1x Mini-Display Port mDP v1.2' '1x Ethernet Lan (10/100/1000\
77
+ \ mbps)'\n '1x Mic-In/Headphone-Out Jack' 'Power Supply:'\n '6-Cell 51Wh Li-Ion\
78
+ \ Battery' 'Additional Information:'\n 'Dimensions: 15.63\"x10.57\"x1.08\"' 'Weight:\
79
+ \ 5.75lbs']"
80
+ - source_sentence: men's jacket with a water-resistant exterior, soft inner lining,
81
+ adjustable features, and multiple storage options.. men's jacket with a water-resistant
82
+ exterior, soft inner lining, adjustable features, and multiple storage options.
83
+ sentences:
84
+ - "Title: Lenovo Chromebook Duet 2-in-1 Tablet 10.1\" FHD Touchscreen Laptop Computer,\
85
+ \ MediaTek Helio P60T Octa-Core, 4GB LPDDR4X RAM, 128GB eMCP, Webcam, Chrome OS,\
86
+ \ BROAGE 16GB Flash Stylus, Online Class Ready Descripion: ['Product Description'\n\
87
+ \ 'Ideal for Home, Student, Professionals, Small Business, School Education, and\
88
+ \ Commercial Enterprise, Online Class, Google Classroom, Remote Learning, Zoom\
89
+ \ Ready.'\n 'Processor' 'MediaTek P60T (8C, 4x A73 @2.0GHz + 4x A53 @2.0GHz)'\n\
90
+ \ 'Graphics' 'Integrated ARM Mali-G72 MP3 GPU' 'Chipset'\n 'MediaTek SoC Platform'\
91
+ \ 'Memory' '4GB LPDDR4X' 'Storage' '128GB eMCP'\n 'Display' '10.1\" FHD (1920x1200)\
92
+ \ WVA 400nits' 'Touchscreen'\n '10-point Multi-touch' 'WLAN + Bluetooth' '11a/b/g/n/ac,\
93
+ \ 2x2 + BT4.2'\n 'WWAN' 'None' 'Case Material' 'Aluminium / Plastic' 'Camera'\n\
94
+ \ 'Front 2.0MP / Rear 8.0MP' 'Microphone' '2x, Array' 'Docking'\n 'Lenovo Keyboard\
95
+ \ Pack' 'Color' 'Ice Blue + Iron Grey' 'Surface Treatment'\n 'Anodizing +Painting'\
96
+ \ 'TPM' 'Google Security Chip H1' 'Keyboard'\n 'Non-backlit, English (US)' 'Battery'\
97
+ \ 'Integrated 7000mAh'\n 'Power Adapter' '5V / 2.0A' 'Dimensions (H x W x D)'\n\
98
+ \ 'Tablet Only:9.44\" x 6.29\" x 0.29\"'\n 'Tablet + Full Keyboard Pack:9.64\"\
99
+ \ x 6.66\" x 0.71\"' 'Weight'\n 'Tablet Only:0.99lbs' 'Tablet + Full Keyboard\
100
+ \ Pack:2.03lbs'\n 'Operating System' 'Chrome OS' 'Accessories'\n 'BROAGE 3 In\
101
+ \ 1 Design Stylus (Stylus Pen + Ballpoint Pen + USB 3.0 16GB Flash Drive)']"
102
+ - 'Title: Columbia Men''s Grand Wall Sherpa Jacket Descripion: [''Constructed with
103
+ water-resistant nylon fully lined in soft sherpa fleece, this warm jacket is ready
104
+ to work hard and play hard. Complete with zippered hand pockets, drawstring adjustable
105
+ hood, drawstring adjustable hem, and soft binding at the cuffs that feels good
106
+ every time you put it on. This men’s all-season jacket is offered in multiple
107
+ sizes and colors. Regular Fit. To ensure the size you choose is right, utilize
108
+ our sizing chart and the following measurement instructions: For the sleeves,
109
+ start at the center back of your neck and measure across the shoulder and down
110
+ to the sleeve. If you come up with a partial number, round up to the next even
111
+ number. For the chest, measure at the fullest part of the chest, under the armpits
112
+ and over the shoulder blades, keeping the tape measure firm and level.'']'
113
+ - "Title: Lenovo Yoga 7i Premium 2-in-1 15 Laptop I 15.6\" FHD IPS Touchscreen I\
114
+ \ 11th Gen Intel 4-Core i5-1135G7 (> i7-10710U) I 8GB DDR4 512GB SSD I Backlit\
115
+ \ FP Thunderbolt Win10 Grey + 32GB MicroSD Card Descripion: ['PRODUCT OVERVIEW:'\n\
116
+ \ 'This sleek 2 in 1 laptop offers a contemporary style. Crafted from sandblasted\
117
+ \ and anodized metal, the Yoga 7i’s subtly rounded edges are designed to feel\
118
+ \ comfortable in your hands. A 360-degree hinge offers stability as you transition\
119
+ \ from tablet to laptop mode and back.'\n 'KEY SPECIFICATIONS:' 'PC Type:' '2-in-1\
120
+ \ Laptop Computer' 'PC Series:'\n 'Lenovo Yoga' 'Processor:'\n '11th Gen Intel\
121
+ \ 4-Core i5-1135G7, Max Turbo\\xa0Frequency Up to 4.2GHz, 8MB Smart Cache, 8 Threads'\n\
122
+ \ 'Memory:' '8GB DDR4' 'Storage:' '512GB SSD' 'Graphics:'\n 'Intel Iris Xe Graphics\
123
+ \ Integrated' 'Display:'\n '15.6 inch Full HD (1920x1080) IPS Touchscreen Display'\
124
+ \ 'Communications:'\n 'Wi-Fi 6(802.11ax 2x2) + Bluetooth 5.0' 'Camera:'\n '720p\
125
+ \ HD Webcam with privacy shutter' 'Keyboard:' 'Backlit Keyboard'\n 'Security Feature:'\
126
+ \ 'Fingerprint Reader' 'Audio:'\n 'Dolby Atmos Speaker System' 'Voice Assistant:'\
127
+ \ 'Alexa'\n 'Operating system:' 'Windows 10 Home 64 bit' 'Ports & Slots:'\n '2\
128
+ \ x USB-C Thunderbolt 4 (DisplayPort 1.4), 2 x USB-A 3.2, 1 x Headphone / microphone\
129
+ \ combo jack (3.5mm)'\n 'Battery:' 'Built-in 71Wh, Up to 16 hours battery life'\n\
130
+ \ 'Additional Information:'\n 'Dimensions: 14.03\" x 9.28\" x 0.76\" Approximate\
131
+ \ Weight: 4.18 lbs'\n 'Accessory:' '32GB MicroSD Card']"
132
+ - source_sentence: a laptop for home and office use
133
+ sentences:
134
+ - 'Title: BHSJ Toddler Baby Girls Duffle Fleece Coats Winter Windproof Thicken Cardigan
135
+ Jackets Casual Warm Lapel Parka Outerwear Sweater Grandpa Turtleneck Green Knit
136
+ Light Summer Knitted Cashmere Skull Descripion: ["Welcome to BHSJ shop Toddler
137
+ Baby Girls Long Sleeve Winter Solid Windproof Coat Warm Outwear Jacket; Feature;
138
+ Fashion design,100% Brand New,high quality! Material:Cotton Blend Pattern Type:Solid
139
+ Sleeve length:Long Sleeve Main Color: As The Picture Show Style:Fashion Stylish
140
+ and fashion design make your baby more attractive Great for casual, Daily, party
141
+ or photoshoot, also a great idea for a baby show gifts It is made of high quality
142
+ materials,Soft hand feeling, no any harm to your baby''s skin Please allow slight
143
+ 1-3cm difference due to manual measurement and a little color variation for different
144
+ display setting thanks for your understanding! 1 inch = 2.54 cm If your kid is
145
+ chubby, we recomend choosing a larger size, thanks. Thank you and nice day! Package
146
+ include:1PC Coat ✨ Standard Shipping: 8-18 Days to Arrive ✨ Expedited Shipping:
147
+ 3-5 Days to Arrive Size: 90 Recommended Age: 18-24 Months Bust: 72cm/28.35''''
148
+ Length: 53cm/20.87'''' Size: 100 Recommended Age: 2-3 Years Bust: 73cm/28.74''''
149
+ Length: 56cm/22.05'''' Size: 110 Recommended Age: 3-4 Years Bust: 76cm/29.92''''
150
+ Length: 59cm/23.23'''' Size: 120 Recommended Age: 4-5 Years Bust: 79cm/31.10''''
151
+ Length: 62cm/24.41'''' Size: 130 Recommended Age: 5-6 Years Bust: 82cm/32.28''''
152
+ Length: 65cm/25.59''"]'
153
+ - "Title: ZeroXposur Men's Lightweight Quilted Puffer Jacket Descripion: [\"Cool\
154
+ \ weather won't keep you from enjoying the outdoors with this men's ZeroXposur\
155
+ \ puffer jacket.\"\n 'PRODUCT FEATURES' 'Midweight design' 'Midweight design'\
156
+ \ 'Zipper front'\n 'Zipper front' 'Wind resistant shell' 'Wind resistant shell'\n\
157
+ \ 'ZX ThermoCloud fill creates a High-Performance Insulation for superior warmth\
158
+ \ that retains heat'\n 'ZX ThermoCloud fill creates a High-Performance Insulation\
159
+ \ for superior warmth that retains heat'\n 'Inner adjustable waistband' 'Inner\
160
+ \ adjustable waistband' 'Long sleeves'\n 'Long sleeves' '2-pocket' '2-pocket'\
161
+ \ 'FIT and SIZING' '27.5-in. length'\n '27.5-in. length' 'Regular fit' 'Regular\
162
+ \ fit' 'FABRIC and CARE' 'Nylon'\n 'Nylon' 'Machine wash' 'Machine wash' 'Imported'\
163
+ \ 'Imported']"
164
+ - "Title: HP 17 17.3\" FHD Laptop Computer for Home and Office, Intel 4-Core i5-1135G7,\
165
+ \ 8GB DDR4 RAM, 256GB PCIe SSD, Intel Iris Xe Graphics, Numeric Pad, Fast Charge,\
166
+ \ BT 4.2, Windows 10 Home (S Mode), w/Battery Descripion: ['Brand:' 'HP' 'Screen\
167
+ \ Size:' '17.3 inches' 'Screen Resolution:'\n '1920 x 1080 (Full HD, IPS, anti-glare,\
168
+ \ 300 nits, 100% sRGB)'\n 'Touchscreen:' 'No' 'Display Type:' 'LED' 'Graphic:'\n\
169
+ \ 'Intel Iris Xe Graphics' 'Processor:'\n 'Intel Core i5-1135G7 (up to 4.2 GHz\
170
+ \ with Intel Turbo Boost Technology, 8 MB L3 cache, 4 cores)'\n 'Processor Cores:'\
171
+ \ 'Quad Core' 'Processor Speed (Base):'\n 'Up to 4.2 Gigahertz' 'System Memory\
172
+ \ (RAM):' '8GB DDR4 RAM'\n 'Total Storage Capacity:' '256GB PCIE SSD' 'Keyboard:'\n\
173
+ \ 'Full-size Keyboard with Integrated Numeric Pad' 'Backlit Keyboard:' 'No'\n\
174
+ \ 'Built-in Microphone:' 'Yes' 'Built-in Bluetooth:' 'Yes'\n 'Built-in Webcam:'\
175
+ \ 'Yes' 'Wireless Connectivity:'\n 'Realtek RTL8821CE 802.11a/b/g/n/ac (1x1) Wi-Fi\
176
+ \ and Bluetooth 4.2 combo'\n 'Ports:'\n '2x SuperSpeed USB 3.0 Type-A, 1x SuperSpeed\
177
+ \ USB 2.0 Type-C, 1x HDMI, 1x Headphone/microphone combo, 1x AC smart pin, 1x\
178
+ \ Ethernet Ports'\n 'Expansion Slots:' '1x Multi-format SD Media Card Reader'\n\
179
+ \ 'Operating System:' 'Windows 10 Home in S Mode' 'Color:' 'Natural Silver'\n\
180
+ \ 'Dimensions:' '16.33\" x 10.72\" x 0.96\"' 'Weight:' '5.25 pounds' 'Bundle:'\n\
181
+ \ 'Lanbertent Rechargeable Battery.The batteries and charger set is a cost-effective\
182
+ \ choice for you to charge the other two while using two batteries uninterruptedly\
183
+ \ for wireless mouse or keyboard.']"
184
+ - source_sentence: men's sunglasses with a durable frame, secure fit, and high-performance
185
+ lenses designed for uv protection and clarity.
186
+ sentences:
187
+ - 'Title: 2018 Lenovo ThinkPad P52 Workstation Laptop - Windows 10 Pro - Intel Hexa-Core
188
+ i7-8850H, 16GB RAM, 500GB SSD, 15.6" FHD IPS 1920x1080 Display, NVIDIA Quadro
189
+ P1000 4GB (Renewed) Descripion: [''This pre-owned or refurbished product has been
190
+ professionally inspected and tested to work and look like new. How a product becomes
191
+ part of Amazon Renewed, your destination for pre-owned, refurbished products:
192
+ A customer buys a new product and returns it or trades it in for a newer or different
193
+ model. That product is inspected and tested to work and look like new by Amazon-qualified
194
+ suppliers. Then, the product is sold as an Amazon Renewed product on Amazon. If
195
+ not satisfied with the purchase, renewed products are eligible for replacement
196
+ or refund under the Amazon Renewed Guarantee.'']'
197
+ - 'Title: adidas Boys'' Zip Front Indicator Hooded Jacket Descripion: ["No need
198
+ to overcomplicate things. This boys'' hoodie keeps his adidas look simple and
199
+ comfortable. Raglan sleeves add a sporty touch. Contrast 3-Stripes and an embroidered
200
+ logo have been cool since forever. Soft fleece feels cozy and warm."]'
201
+ - 'Title: Oakley Men''s Oo9009 Flak Jacket Xlj Rectangular Sunglasses Descripion:
202
+ [''World-class athletes have driven us to create innovation after innovation,
203
+ and flak jacket takes that to the next level with the latest in performance technology.
204
+ The frame offers o matter and unobtanium components for a comfortably secure fit
205
+ and pure plutonite lenses to filter out 100 percent of all uv rays.'']'
206
+ - source_sentence: girls' winter jacket with a waterproof outer layer, a warm inner
207
+ fleece, and adjustable features for comfort and fit.. girls' winter jacket with
208
+ a waterproof outer layer, a warm inner fleece, and adjustable features for comfort
209
+ and fit.
210
+ sentences:
211
+ - 'Title: Columbia Girls'' Bugaboo Ii Fleece Interchange Jacket Descripion: [''Finding
212
+ an all-inclusive girls winter jacket that can be used in different weather conditions
213
+ can be a challenge. Fortunately, our Bugaboo II Fleece Interchange Winter Jacket
214
+ is the perfect multiple-use all-weather coat – utilizing our classic three-in-one
215
+ design. It features a waterproof and breathable outer shell and an inner fleece
216
+ layer that can be worn separately or zipped together for extra protection against
217
+ wet and cold weather. The waterproof outer shell features an inner layer of our
218
+ thermal heat reflective secret sauce we call Omni-HEAT. The engineered silver
219
+ dots are designed to capture natural body heat, reducing weight while increasing
220
+ comfort. The warm inner fleece jacket can be worn separately or zipped into the
221
+ waterproof and breathable outer layer. This three-in-one jacket system features
222
+ zippered hand pockets and adjustable cuffs, a taffeta lined removable storm hood,
223
+ media and goggle pocket, fleece lined zippered hand pockets, and adjustable cuffs
224
+ for added warmth control in evolving weather conditions. Available in a range
225
+ of colors and youth sizes.'']'
226
+ - 'Title: adidas Tiro 21 Track Jacket Descripion: [''With long sleeves and classic
227
+ 3 stripe design, the adidas Tiro 21 Track Jacket will have you ready to work out
228
+ in style. Pairable with several bottoms. Branding on front and down sleeves. Long
229
+ sleeves. Zipper closure. 2-pocket. 100% polyester. Machine wash warm, do not bleach,
230
+ tumble dry low.'']'
231
+ - "Title: ANOKA Yellowstone Jacket for Women Green XXL Descripion: ['size details'\n\
232
+ \ \"Size:S----US:6----Bust:106cm/41.73''----Sleeve:69cm/27.17''----Length:71cm/27.95''\
233
+ \ Size:M----US:8----Bust:110cm/43.31''----Sleeve:70cm/27.56''----Length:72cm/28.35''\
234
+ \ Size:L----US:10----Bust:116cm/45.67''----Sleeve:71cm/27.95''----Length:73cm/28.74''\
235
+ \ Size:XL----US:12----Bust:122cm/48.03''----Sleeve:72cm/28.35''----Length:74cm/29.13''\
236
+ \ Size:XXL----US:14----Bust:128cm/50.39''----Sleeve:73cm/28.74''----Length:75cm/29.53''\
237
+ \ Size:XXXL----US:16----Bust:134cm/52.76''----Sleeve:74cm/29.13''----Length:76cm/29.92''\
238
+ \ Size:XXXXL----US:18----Bust:140cm/55.12''----Sleeve:75cm/29.53''----Length:77cm/30.31''\
239
+ \ Size:XXXXXL----US:20----Bust:146cm/57.48''----Sleeve:76cm/29.92''----Length:78cm/30.71''\"\
240
+ ]"
241
+ pipeline_tag: sentence-similarity
242
+ library_name: sentence-transformers
243
+ ---
244
+
245
+ # SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
246
+
247
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
248
+
249
+ ## Model Details
250
+
251
+ ### Model Description
252
+ - **Model Type:** Sentence Transformer
253
+ - **Base model:** [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2) <!-- at revision 9a3225965996d404b775526de6dbfe85d3368642 -->
254
+ - **Maximum Sequence Length:** 128 tokens
255
+ - **Output Dimensionality:** 768 tokens
256
+ - **Similarity Function:** Cosine Similarity
257
+ <!-- - **Training Dataset:** Unknown -->
258
+ <!-- - **Language:** Unknown -->
259
+ <!-- - **License:** Unknown -->
260
+
261
+ ### Model Sources
262
+
263
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
264
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
265
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
266
+
267
+ ### Full Model Architecture
268
+
269
+ ```
270
+ SentenceTransformer(
271
+ (0): Transformer({'max_seq_length': 128, 'do_lower_case': False}) with Transformer model: MPNetModel
272
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
273
+ )
274
+ ```
275
+
276
+ ## Usage
277
+
278
+ ### Direct Usage (Sentence Transformers)
279
+
280
+ First install the Sentence Transformers library:
281
+
282
+ ```bash
283
+ pip install -U sentence-transformers
284
+ ```
285
+
286
+ Then you can load this model and run inference.
287
+ ```python
288
+ from sentence_transformers import SentenceTransformer
289
+
290
+ # Download from the 🤗 Hub
291
+ model = SentenceTransformer("knguyennguyen/mpnet_laptopjacke")
292
+ # Run inference
293
+ sentences = [
294
+ "girls' winter jacket with a waterproof outer layer, a warm inner fleece, and adjustable features for comfort and fit.. girls' winter jacket with a waterproof outer layer, a warm inner fleece, and adjustable features for comfort and fit.",
295
+ "Title: Columbia Girls' Bugaboo Ii Fleece Interchange Jacket Descripion: ['Finding an all-inclusive girls winter jacket that can be used in different weather conditions can be a challenge. Fortunately, our Bugaboo II Fleece Interchange Winter Jacket is the perfect multiple-use all-weather coat – utilizing our classic three-in-one design. It features a waterproof and breathable outer shell and an inner fleece layer that can be worn separately or zipped together for extra protection against wet and cold weather. The waterproof outer shell features an inner layer of our thermal heat reflective secret sauce we call Omni-HEAT. The engineered silver dots are designed to capture natural body heat, reducing weight while increasing comfort. The warm inner fleece jacket can be worn separately or zipped into the waterproof and breathable outer layer. This three-in-one jacket system features zippered hand pockets and adjustable cuffs, a taffeta lined removable storm hood, media and goggle pocket, fleece lined zippered hand pockets, and adjustable cuffs for added warmth control in evolving weather conditions. Available in a range of colors and youth sizes.']",
296
+ 'Title: ANOKA Yellowstone Jacket for Women Green XXL Descripion: [\'size details\'\n "Size:S----US:6----Bust:106cm/41.73\'\'----Sleeve:69cm/27.17\'\'----Length:71cm/27.95\'\' Size:M----US:8----Bust:110cm/43.31\'\'----Sleeve:70cm/27.56\'\'----Length:72cm/28.35\'\' Size:L----US:10----Bust:116cm/45.67\'\'----Sleeve:71cm/27.95\'\'----Length:73cm/28.74\'\' Size:XL----US:12----Bust:122cm/48.03\'\'----Sleeve:72cm/28.35\'\'----Length:74cm/29.13\'\' Size:XXL----US:14----Bust:128cm/50.39\'\'----Sleeve:73cm/28.74\'\'----Length:75cm/29.53\'\' Size:XXXL----US:16----Bust:134cm/52.76\'\'----Sleeve:74cm/29.13\'\'----Length:76cm/29.92\'\' Size:XXXXL----US:18----Bust:140cm/55.12\'\'----Sleeve:75cm/29.53\'\'----Length:77cm/30.31\'\' Size:XXXXXL----US:20----Bust:146cm/57.48\'\'----Sleeve:76cm/29.92\'\'----Length:78cm/30.71\'\'"]',
297
+ ]
298
+ embeddings = model.encode(sentences)
299
+ print(embeddings.shape)
300
+ # [3, 768]
301
+
302
+ # Get the similarity scores for the embeddings
303
+ similarities = model.similarity(embeddings, embeddings)
304
+ print(similarities.shape)
305
+ # [3, 3]
306
+ ```
307
+
308
+ <!--
309
+ ### Direct Usage (Transformers)
310
+
311
+ <details><summary>Click to see the direct usage in Transformers</summary>
312
+
313
+ </details>
314
+ -->
315
+
316
+ <!--
317
+ ### Downstream Usage (Sentence Transformers)
318
+
319
+ You can finetune this model on your own dataset.
320
+
321
+ <details><summary>Click to expand</summary>
322
+
323
+ </details>
324
+ -->
325
+
326
+ <!--
327
+ ### Out-of-Scope Use
328
+
329
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
330
+ -->
331
+
332
+ <!--
333
+ ## Bias, Risks and Limitations
334
+
335
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
336
+ -->
337
+
338
+ <!--
339
+ ### Recommendations
340
+
341
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
342
+ -->
343
+
344
+ ## Training Details
345
+
346
+ ### Training Dataset
347
+
348
+ #### Unnamed Dataset
349
+
350
+
351
+ * Size: 15,123 training samples
352
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
353
+ * Approximate statistics based on the first 1000 samples:
354
+ | | sentence_0 | sentence_1 |
355
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
356
+ | type | string | string |
357
+ | details | <ul><li>min: 5 tokens</li><li>mean: 27.46 tokens</li><li>max: 101 tokens</li></ul> | <ul><li>min: 30 tokens</li><li>mean: 110.08 tokens</li><li>max: 128 tokens</li></ul> |
358
+ * Samples:
359
+ | sentence_0 | sentence_1 |
360
+ |:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
361
+ | <code>boys' winter jacket with a water-resistant exterior, thermal insulation, and an adjustable hood.</code> | <code>Title: Columbia Boys' Powder Lite Hooded Winter Jacket Descripion: ["Our Powder Lite cold-weather jacket combines a classic fit with technology to keep boys warm and dry. Crafted of a water resistant shell, lined with Omni-HEAT reflective system, and packed with our Thermarator insulation, this coat will keep kids warm and comfortable when the weather turns blustery and cold… ready to help them take winter by storm. \u2028\u2028Complete with hood and a soft chin guard, zipped hand pockets to keep items secure, while a draw cord adjustable hem keeps the cold locked out, making for the perfect fit for active youths. This boy's winter jacket is available in many accommodating colors and boy sizes. To ensure the size you choose is right, utilize our sizing chart and the following measurement instructions: For the sleeves, start at the center back of your neck and measure across the shoulder and down to the sleeve. If you come up with a partial number, round up to the next even number. For the chest, measure at the fullest part of the chest, under the armpits and over the shoulder blades, keeping the tape measure firm and level.\u2028 Imported. \u2028Made from 100% polyester. \u2028Zippered closure. \u2028Machine Wash."]</code> |
362
+ | <code>laptop with a large display, efficient processor, ample memory, and multiple connectivity options.</code> | <code>Title: ASUS New VivoBook 15 15.6 Inch FHD 1080P Laptop (AMD Ryzen 3 3250U up to 3.5GHz, 12GB DDR4 RAM, 256GB SSD, AMD Radeon Vega 3, WiFi, Bluetooth, HDMI, Windows 10) (Grey) Descripion: ['XM sells computers with upgraded configurations. If the computer has modifications (listed above), then the manufacturer box is opened for it to be tested and inspected and to install the upgrades to achieve the specifications as advertised. Operating System: Windows 10 Home 64-bitDisplay: 15.6 inch FHD(1920 x 1080) with four-sided wider NanoEdge bezel displayProcessor: AMD Ryzen 3 3250U Processor (2.6 GHz base frequency up to 3.5 GHz, 2 Cores, 1MB Cache)Memory: Up to 16GB DDR4 RAMHard Drive: Up to 1TB SSDGraphics: AMD Radeon Vega 3Wireless: 802.11ac, Bluetooth 4.1Webcam: YESAudio features: Stereo speakersPorts: 1 x COMBO audio jack1 x Type-A USB 3.0 (USB 3.1 Gen 1)1 x Type-C USB 3.0 (USB 3.1 Gen 1)2 x USB 2.0 port(s)1 x HDMIBattery Type: 2 -Cell 37 Wh BatteryWeight: 3.75lbsDimensions: 14.4 x 9.1 x 0.8 inchesColor: Gray']</code> |
363
+ | <code>men's parka with a hood, featuring a waterproof design, secure storage options, and insulation for warmth.. men's parka with a hood, featuring a waterproof design, secure storage options, and insulation for warmth.</code> | <code>Title: Under Armour Men's Unstoppable Waterproof Hooded Down Parka Project Rock Long STORM Jacket Descripion: ['Under Armour Project Rock Hooded Down Parka Jacket Black 1346093-001 Storm technology: breathable and waterproof, Secure pockets. Zip/snap closure. Down fill.']</code> |
364
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
365
+ ```json
366
+ {
367
+ "scale": 20.0,
368
+ "similarity_fct": "cos_sim"
369
+ }
370
+ ```
371
+
372
+ ### Training Hyperparameters
373
+ #### Non-Default Hyperparameters
374
+
375
+ - `per_device_train_batch_size`: 128
376
+ - `per_device_eval_batch_size`: 128
377
+ - `num_train_epochs`: 5
378
+ - `multi_dataset_batch_sampler`: round_robin
379
+
380
+ #### All Hyperparameters
381
+ <details><summary>Click to expand</summary>
382
+
383
+ - `overwrite_output_dir`: False
384
+ - `do_predict`: False
385
+ - `eval_strategy`: no
386
+ - `prediction_loss_only`: True
387
+ - `per_device_train_batch_size`: 128
388
+ - `per_device_eval_batch_size`: 128
389
+ - `per_gpu_train_batch_size`: None
390
+ - `per_gpu_eval_batch_size`: None
391
+ - `gradient_accumulation_steps`: 1
392
+ - `eval_accumulation_steps`: None
393
+ - `torch_empty_cache_steps`: None
394
+ - `learning_rate`: 5e-05
395
+ - `weight_decay`: 0.0
396
+ - `adam_beta1`: 0.9
397
+ - `adam_beta2`: 0.999
398
+ - `adam_epsilon`: 1e-08
399
+ - `max_grad_norm`: 1
400
+ - `num_train_epochs`: 5
401
+ - `max_steps`: -1
402
+ - `lr_scheduler_type`: linear
403
+ - `lr_scheduler_kwargs`: {}
404
+ - `warmup_ratio`: 0.0
405
+ - `warmup_steps`: 0
406
+ - `log_level`: passive
407
+ - `log_level_replica`: warning
408
+ - `log_on_each_node`: True
409
+ - `logging_nan_inf_filter`: True
410
+ - `save_safetensors`: True
411
+ - `save_on_each_node`: False
412
+ - `save_only_model`: False
413
+ - `restore_callback_states_from_checkpoint`: False
414
+ - `no_cuda`: False
415
+ - `use_cpu`: False
416
+ - `use_mps_device`: False
417
+ - `seed`: 42
418
+ - `data_seed`: None
419
+ - `jit_mode_eval`: False
420
+ - `use_ipex`: False
421
+ - `bf16`: False
422
+ - `fp16`: False
423
+ - `fp16_opt_level`: O1
424
+ - `half_precision_backend`: auto
425
+ - `bf16_full_eval`: False
426
+ - `fp16_full_eval`: False
427
+ - `tf32`: None
428
+ - `local_rank`: 0
429
+ - `ddp_backend`: None
430
+ - `tpu_num_cores`: None
431
+ - `tpu_metrics_debug`: False
432
+ - `debug`: []
433
+ - `dataloader_drop_last`: False
434
+ - `dataloader_num_workers`: 0
435
+ - `dataloader_prefetch_factor`: None
436
+ - `past_index`: -1
437
+ - `disable_tqdm`: False
438
+ - `remove_unused_columns`: True
439
+ - `label_names`: None
440
+ - `load_best_model_at_end`: False
441
+ - `ignore_data_skip`: False
442
+ - `fsdp`: []
443
+ - `fsdp_min_num_params`: 0
444
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
445
+ - `fsdp_transformer_layer_cls_to_wrap`: None
446
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
447
+ - `deepspeed`: None
448
+ - `label_smoothing_factor`: 0.0
449
+ - `optim`: adamw_torch
450
+ - `optim_args`: None
451
+ - `adafactor`: False
452
+ - `group_by_length`: False
453
+ - `length_column_name`: length
454
+ - `ddp_find_unused_parameters`: None
455
+ - `ddp_bucket_cap_mb`: None
456
+ - `ddp_broadcast_buffers`: False
457
+ - `dataloader_pin_memory`: True
458
+ - `dataloader_persistent_workers`: False
459
+ - `skip_memory_metrics`: True
460
+ - `use_legacy_prediction_loop`: False
461
+ - `push_to_hub`: False
462
+ - `resume_from_checkpoint`: None
463
+ - `hub_model_id`: None
464
+ - `hub_strategy`: every_save
465
+ - `hub_private_repo`: False
466
+ - `hub_always_push`: False
467
+ - `gradient_checkpointing`: False
468
+ - `gradient_checkpointing_kwargs`: None
469
+ - `include_inputs_for_metrics`: False
470
+ - `eval_do_concat_batches`: True
471
+ - `fp16_backend`: auto
472
+ - `push_to_hub_model_id`: None
473
+ - `push_to_hub_organization`: None
474
+ - `mp_parameters`:
475
+ - `auto_find_batch_size`: False
476
+ - `full_determinism`: False
477
+ - `torchdynamo`: None
478
+ - `ray_scope`: last
479
+ - `ddp_timeout`: 1800
480
+ - `torch_compile`: False
481
+ - `torch_compile_backend`: None
482
+ - `torch_compile_mode`: None
483
+ - `dispatch_batches`: None
484
+ - `split_batches`: None
485
+ - `include_tokens_per_second`: False
486
+ - `include_num_input_tokens_seen`: False
487
+ - `neftune_noise_alpha`: None
488
+ - `optim_target_modules`: None
489
+ - `batch_eval_metrics`: False
490
+ - `eval_on_start`: False
491
+ - `use_liger_kernel`: False
492
+ - `eval_use_gather_object`: False
493
+ - `batch_sampler`: batch_sampler
494
+ - `multi_dataset_batch_sampler`: round_robin
495
+
496
+ </details>
497
+
498
+ ### Training Logs
499
+ | Epoch | Step | Training Loss |
500
+ |:------:|:----:|:-------------:|
501
+ | 4.2017 | 500 | 1.9644 |
502
+
503
+
504
+ ### Framework Versions
505
+ - Python: 3.11.11
506
+ - Sentence Transformers: 3.1.1
507
+ - Transformers: 4.45.2
508
+ - PyTorch: 2.5.1+cu121
509
+ - Accelerate: 1.2.1
510
+ - Datasets: 3.2.0
511
+ - Tokenizers: 0.20.3
512
+
513
+ ## Citation
514
+
515
+ ### BibTeX
516
+
517
+ #### Sentence Transformers
518
+ ```bibtex
519
+ @inproceedings{reimers-2019-sentence-bert,
520
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
521
+ author = "Reimers, Nils and Gurevych, Iryna",
522
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
523
+ month = "11",
524
+ year = "2019",
525
+ publisher = "Association for Computational Linguistics",
526
+ url = "https://arxiv.org/abs/1908.10084",
527
+ }
528
+ ```
529
+
530
+ #### MultipleNegativesRankingLoss
531
+ ```bibtex
532
+ @misc{henderson2017efficient,
533
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
534
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
535
+ year={2017},
536
+ eprint={1705.00652},
537
+ archivePrefix={arXiv},
538
+ primaryClass={cs.CL}
539
+ }
540
+ ```
541
+
542
+ <!--
543
+ ## Glossary
544
+
545
+ *Clearly define terms in order to be accessible across audiences.*
546
+ -->
547
+
548
+ <!--
549
+ ## Model Card Authors
550
+
551
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
552
+ -->
553
+
554
+ <!--
555
+ ## Model Card Contact
556
+
557
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
558
+ -->
config.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "sentence-transformers/all-mpnet-base-v2",
3
+ "architectures": [
4
+ "MPNetModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "bos_token_id": 0,
8
+ "eos_token_id": 2,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 3072,
14
+ "layer_norm_eps": 1e-05,
15
+ "max_position_embeddings": 514,
16
+ "model_type": "mpnet",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 12,
19
+ "pad_token_id": 1,
20
+ "relative_attention_num_buckets": 32,
21
+ "torch_dtype": "float32",
22
+ "transformers_version": "4.45.2",
23
+ "vocab_size": 30527
24
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.1.1",
4
+ "transformers": "4.45.2",
5
+ "pytorch": "2.5.1+cu121"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": null
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39d250b93afc75c926e58f01cbf0f6eaefb7335f5235a7e06e931f6988234d68
3
+ size 437967672
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 128,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "[UNK]",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,72 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "104": {
36
+ "content": "[UNK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ },
43
+ "30526": {
44
+ "content": "<mask>",
45
+ "lstrip": true,
46
+ "normalized": false,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": true
50
+ }
51
+ },
52
+ "bos_token": "<s>",
53
+ "clean_up_tokenization_spaces": false,
54
+ "cls_token": "<s>",
55
+ "do_lower_case": true,
56
+ "eos_token": "</s>",
57
+ "mask_token": "<mask>",
58
+ "max_length": 128,
59
+ "model_max_length": 128,
60
+ "pad_to_multiple_of": null,
61
+ "pad_token": "<pad>",
62
+ "pad_token_type_id": 0,
63
+ "padding_side": "right",
64
+ "sep_token": "</s>",
65
+ "stride": 0,
66
+ "strip_accents": null,
67
+ "tokenize_chinese_chars": true,
68
+ "tokenizer_class": "MPNetTokenizer",
69
+ "truncation_side": "right",
70
+ "truncation_strategy": "longest_first",
71
+ "unk_token": "[UNK]"
72
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff